Version 1
: Received: 15 March 2023 / Approved: 17 March 2023 / Online: 17 March 2023 (10:05:05 CET)
How to cite:
Castillo-Mendieta, K.; Agüero-Chapin, G.; Santiago Vispo, N.; Márquez, E.A.; Perez-Castillo, Y.; Barigye, S.J.; Marrero-Ponce, Y. Peptide Hemolytic Activity Analysis using Visual Data Mining of Similarity-based Complex Networks. Preprints.org2023, 2023030322. https://doi.org/10.20944/preprints202303.0322.v1
Castillo-Mendieta, K.; Agüero-Chapin, G.; Santiago Vispo, N.; Márquez, E.A.; Perez-Castillo, Y.; Barigye, S.J.; Marrero-Ponce, Y. Peptide Hemolytic Activity Analysis using Visual Data Mining of Similarity-based Complex Networks. Preprints.org 2023, 2023030322. https://doi.org/10.20944/preprints202303.0322.v1
Cite as:
Castillo-Mendieta, K.; Agüero-Chapin, G.; Santiago Vispo, N.; Márquez, E.A.; Perez-Castillo, Y.; Barigye, S.J.; Marrero-Ponce, Y. Peptide Hemolytic Activity Analysis using Visual Data Mining of Similarity-based Complex Networks. Preprints.org2023, 2023030322. https://doi.org/10.20944/preprints202303.0322.v1
Castillo-Mendieta, K.; Agüero-Chapin, G.; Santiago Vispo, N.; Márquez, E.A.; Perez-Castillo, Y.; Barigye, S.J.; Marrero-Ponce, Y. Peptide Hemolytic Activity Analysis using Visual Data Mining of Similarity-based Complex Networks. Preprints.org 2023, 2023030322. https://doi.org/10.20944/preprints202303.0322.v1
Abstract
Peptides are promising drug development frameworks thanks to their high target selectivity, tolerability and relatively low production cost. However, despite the fact that several thousand potentially therapeutic peptides reported, only sixty have arrived at the market. This concerning low proportion is partially explained by undesired properties such as peptide-induced hemolytic activity. Hence, we aim to get a better insight into the chemical space of hemolytic peptides using a novel approach based on network science and interactive data mining as an alternative to design more effective peptide drugs with low hemolytic activity. Metadata networks (METNs) were used to characterize and find general patterns associated to hemolytic peptides, whereas Half-Space Proximal Networks (HSPNs), created using five different two-way dissimilarity measures, represented the hemolytic peptide space. Then, using the best candidate HSPNs, we extracted various scaffolds that capture information of almost all the chemical space but avoiding peptide overrepresentation. Such scaffolds can have many applications, such as training accurate ML-based prediction models, constructing one-class multi-query similarity searching models and characterizing the diversity of hemolytic peptides using a manageable set of peptides. Finally, by means of an alignment-free approach, we reported 47 putative hemolytic motifs, which might provide hints about the mechanisms of hemolysis and can also be used as toxic signatures when developing novel peptide-based drugs.
Computer Science and Mathematics, Computer Science
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.