Protein-protein interactions (PPIs) are associated with various diseases; hence, they are important targets in drug discovery. However, the physicochemical empirical properties of PPI-targeted drugs are distinct from those of conventional small molecule oral pharmaceuticals, which adhere to the ”rule of five (RO5).” Therefore, developing PPI-targeted drugs using conventional methods, such as molecular generation models, is difficult. In this study, we propose a molecule generation model based on deep reinforcement learning, specialized for generating PPI inhibitors. By introducing a scoring function that can represent the properties of PPI inhibitors, we successfully generated potential PPI inhibitor compounds. The generated virtual compounds possess the desired properties for PPI inhibitors, and show similarity to commercially available PPI libraries. These generated virtual compounds are freely available as a virtual library.