Submitted:
19 December 2025
Posted:
22 December 2025
You are already at the latest version
Abstract
This study presents the complete sequencing and comparative genomic analysis of Leishmania (Viannia) naiffi and Leishmania (Viannia) shawi, species of epidemiological relevance in the Brazilian Amazon. Genome assemblies yielded sizes of 32.13 Mb and 32.51 Mb, with 8,170 and 7,767 annotated genes, respectively. Predicted gene functions were primarily related to catalytic, binding, and ATP-dependent activities. Pangenome analysis revealed a core genome of 6,256 genes alongside notable species-specific differences, including 46 and 25 unique genes in L. naiffi and L. shawi. Functional screening identified pharmacologically promising proteins such as calpains, ABC transporters, and notably, GSK-3. Ploidy analysis indicated tetraploidy on chromosome 8 in L. naiffi and chromosome 2 in L. shawi. Genetic variability assessment detected 34,480 SNPs in L. naiffi and 26,562 in L. shawi, indicating greater genomic diversity in the former. Phylogenetic inference based on the polA1 gene confirmed the placement of both species within the Leishmania (Viannia) subgenus. These findings advance Leishmania genomics knowledge by highlighting unique genetic signatures, regions of high variability, and potential therapeutic targets. This work establishes a foundation for future research on evolution, pathogenicity, and drug development for leishmaniasis.

Keywords:
1. Introduction
2. Materials and Methods
2.1. DNA Extraction and Sequencing
2.2. Read Quality Assessment
2.3. Genome Assembly
2.4. Taxonomic Classification
2.5. Genome Annotation
2.6. Gene Ortholog Evaluation
2.7. Identification of Pharmacological Targets
2.8. Reference Mapping, Genomic Coverage, and Variant Calling
2.9. Phylogenetic Inference
3. Results
3.1. Quality Control, Genome Assembly, and Taxonomic Classification
3.2. Genome Annotation
3.3. Gene Ortholog Evaluation
3.4. Identification of Pharmacological Targets
3.5. Chromosomal Coverage, Ploidy, and Variant Calling
3.6. Phylogenetic Inference
4. Discussion
4.1. Genome Assembly, Taxonomic Classification, and Scaffold
4.2. Genome Annotation
4.3. Orthologs
4.4. Identification of Pharmacological Targets
4.5. Coverage, Ploidy, and Variant Calling
4.6. Phylogenetic Inference
5. Conclusions
Supplementary Materials
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ATP | Adenosine triphosphate |
| BLASTp | Basic Local Alignment Search Tool for proteins |
| GO | Gene Ontology |
| GSK-3 | Glycogen synthase kinase-3 |
| PDB | Protein Data Bank |
| polA1 | DNA polymerase alpha catalytic subunit 1 |
| SNP | Single nucleotide polymorphism |
References
- Steverding, D. The history of leishmaniasis. Parasites Vectors 2017, *10*, 82. [Google Scholar] [CrossRef]
- WHO. Control of Leishmaniasis (Resolution WHA60.13). 2007. Available online: https://apps.who.int/gb/ebwha/pdf_files/WHA60/A60_R13-en.pdf (accessed on 14 December 2025).
- Lainson, R. The Neotropical Leishmania species: a brief historical review of their discovery, ecology and taxonomy. Rev. Pan-Amazônica Saúde 2010, *1*, 13–32. [Google Scholar] [CrossRef]
- Galati, E.A.B. Phlebotominae (Diptera, Psychodidae): Classification, Morphology and Terminology of Adults and Identification of American Taxa. In Brazilian Sand Flies; Springer: Cham, Switzerland, 2018; pp. 9–212. [Google Scholar] [CrossRef]
- Zapata, S.; Galati, E.A.B.; Chaves, J.A.; Artigas, P.; Gantier, J.-C.; Bargues, M.D.; Mas-Coma, S.; Depaquit, J. Diversity and epidemiological relevance of Neotropical sand flies. Parasites Vectors 2023, *16*, 123. [Google Scholar] [CrossRef]
- Abadías-Granado, I; Diago, A; Cerro, PA; Palma-Ruiz, AM; Gilaberte, Y. Cutaneous and Mucocutaneous Leishmaniasis; Actas Dermosifiliogr (Engl Ed): English, Spanish, 27 Feb 2021. [Google Scholar] [CrossRef]
- WHO. Control of the Leishmaniases: Report of a Meeting of the WHO Expert Committee on the Control of Leishmaniases, Geneva, 22–26 March 2010. Available online: https://apps.who.int/iris/bitstream/handle/10665/44412/WHO_TRS_949_eng.pdf (accessed on 14 December 2025).
- Figueira, L.P; Soares, F.V.; Naiff, R.D.; Vinho te-Silva, A.C.; Silva, S.S.; Espir, T.T.; Naiff, M.F.; Gomes, L.H.M.; Moreira, F.R.A.C.N.; Franco, A.M.R. New human case reports of cutaneous leishmaniasis by Leishmania (Viannia) naiffi in the Amazon region, Brazil. Acta Amazon. 2017, *47*, pp. 47–52. Available online: https://repositorio.inpa.gov.br/handle/1/13127 (accessed on 14 December 2025).
- Brazil. Ministério da Saúde. Leishmanioses: Painel de Monitoramento. 2025. Available online: https://leishmanioses.aids.gov.br/app/dashboards?auth_provider_hint=anonymous1#/view/5fc3a7ec-22b9-4402-9a52-a5001ca659ad?embed=true&_g=()&show-top-menu=false (accessed on 11 December 2025).
- Lainson, R. The Neotropical Leishmania Species: A Brief Historical Review of Their Discovery, Ecology and Taxonomy. Rev. Pan-Amaz. Saúde 2010, *1*, 13–32. Available online: http://scielo.iec.gov.br/pdf/rpas/v1n2/v1n2a02.pdf (accessed on 11 December 2025).
- Lainson, R.; Shaw, J.J.; Silveira, F.T.; Braga, R.R.; Ishikawa, E.A. Cutaneous leishmaniasis of man due to Leishmania (Viannia) naiffi Lainson and Shaw, 1989. Ann. Parasitol. Hum. Comp. 1990, *65*, 282–284. Available online: https://patua.iec.gov.br/items/1bfb14ad-52e4-4e9f-a935-b0e1ef151e5a (accessed on 11 December 2025).
- Figueira, L.P; Soares, F.V.; Naiff, R.D.; Vinho te-Silva, A.C.; Silva, S.S.; Espir, T.T.; Naiff, M.F.; Gomes, L.H.M.; Moreira, F.R.A.C.N.; Franco, A.M.R. New Human Case Reports of Cutaneous Leishmaniasis by Leishmania (Viannia) naiffi in the Amazon Region, Brazil. Acta Amazon. 2017, *47*, pp. 47–52. Available online: https://repositorio.inpa.gov.br/handle/1/13127 (accessed on 14 December 2025).
- Sundar, S.; Singh, V.K.; Agrawal, N.; Singh, O.P.; Kumar, R. Investigational New Drugs for the Treatment of Leishmaniasis. Expert Opin. Investig. Drugs 2024, *33*, 1029–1046. [Google Scholar] [CrossRef]
- Antinori, S.; Schifanella, L.; Corbellino, M. Leishmaniasis: New Insights from an Old and Neglected Disease. Eur. J. Clin. Microbiol. Infect. Dis. 2012, *31*, 109–118. [Google Scholar] [CrossRef]
- Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 11 December 2025).
- Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 16 *34*, i884–i890. [Google Scholar] [CrossRef]
- Li, D.; Liu, C.-M.; Luo, R.; Sadakane, K.; Lam, T.-W. MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 2015, *31*, 1674–1676. [Google Scholar] [CrossRef]
- Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, *215*, 403–410. [Google Scholar] [CrossRef]
- Wood, D.E.; Lu, J.; Langmead, B.; 19. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019, *20*, 257. [Google Scholar] [CrossRef]
- Salzberg, F.P.; Breitwieser, S.L. Pavian: Interactive analysis of metagenomics data for microbiome studies and pathogen identification. Bioinformatics 2020, *36*, 1303–1304. [Google Scholar] [CrossRef]
- Boetzer, M.; Henkel, C.V.; Jansen, H.J.; Butler, D.; Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 2011, *27*, 578–579. [Google Scholar] [CrossRef] [PubMed]
- Stanke, M.; Diekhans, M.; Baertsch, R.; Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 2008, *24*, 637–644. [Google Scholar] [CrossRef] [PubMed]
- Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, *28*, 1647–1649. [Google Scholar] [CrossRef] [PubMed]
- Mi, H.; Muruganujan, A.; Ebert, D.; Huang, X.; Thomas, P.D. PANTHER version 14: More genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res. 2019, *47*, D419–D426. [Google Scholar] [CrossRef]
- Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, *10*, 421. [Google Scholar] [CrossRef]
- Chen, H. VennDiagram: Generate High-Resolution Venn and Euler Plots. R Package Version 1.6.17. 2016. Available online: https://cran.r-project.org/package=VennDiagram (accessed on 14 December 2025).
- Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, *10*, 421. [Google Scholar] [CrossRef]
- Pearson, W.R. An introduction to sequence similarity (“homology”) searching. Curr. Protoc. Bioinform. 2013, *42*, 3.1.1–3.1.8. [Google Scholar] [CrossRef]
- Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, *9*, 357–359. [Google Scholar] [CrossRef]
- Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. GigaScience 2021, *10*, giab008. [Google Scholar] [CrossRef]
- Schmidt, H.A.; Strimmer, K.; Vingron, M.; von Haeseler, A. TREE-PUZZLE: Maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 2002, *18*, 502–504. [Google Scholar] [CrossRef]
- Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, *37*, 1530–1534. [Google Scholar] [CrossRef]
- Coughlan, S.; Taylor, A.S.; Feane, E.; Sanders, M.; Schönian, G.; Cotton, J.A.; Downing, T. Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene evolution in the Viannia subgenus. R. Soc. Open Sci. 2018, *5*, 172212. [Google Scholar] [CrossRef]
- Rogers, M.B.; Hilley, J.D.; Dickens, N.J.; Wilkes, J.; Bates, P.A.; Depledge, D.P.; Harris, D.; Herr, Y.; Hermes, I.; Hirst, C.; et al. Chromosome and gene copy number variation allow major structural change between species and strains of leishmania. Genome Res. 2011, *21*, 2129–2142. [Google Scholar] [CrossRef]
- Llanes, A.; Restrepo, C.M.; Lleonart, R. Unraveling the Genome Diversity of Leishmania Parasites Using Next-Generation DNA Sequencing Strategies. Life 2025, *15*, 1590. [Google Scholar] [CrossRef]
- Valdivia, H.O.; Reis-Cunha, J.L.; Rodrigues-Luiz, G.F.; Baptista, R.P.; Baldeviano, G.C.; Gerbasi, R.V.; Dobson, D.E.; Pratlong, F.; Bastien, P.; Lescano, A.G.; et al. Comparative genomic analysis of Leishmania (Viannia) peruviana and Leishmania (Viannia) braziliensis. BMC Genomics 2015, *16*, 715. [Google Scholar] [CrossRef]
- Peacock, C.S.; Seeger, K.; Harris, D.; Murphy, L.; Ruiz, J.C.; Quail, M.A.; Peters, N.; Adlem, E.; Tivey, A.; Aslett, M.; et al. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat. Genet. 2007, *39*, 839–847. [Google Scholar] [CrossRef]
- Real, F.; Vidal, R.O.; Carazzolle, M.F.; Mondego, J.M.; Costa, G.G.; Herai, R.H.; Würtele, M.; de Carvalho, L.M.; Carmona e Ferreira, R.; Mortara, R.A.; et al. The genome sequence of Leishmania (Leishmania) amazonensis: Functional annotation and extended analysis of gene models. DNA Res. 2013, *20*, 567–581. [Google Scholar] [CrossRef]
- Gonzalez-Garcia, L.N.; Rodriguez, M.P.; Parra-Muñoz, M.; Clavijo, A.M.; Levy, L.; Ovalle-Bracho, C.; Colorado, C.; Camargo, C.; Quiceno, E.; Moncada, M.J.; et al. Genetic diversity and comparative genomics across Leishmania (Viannia) species. Commun. Biol. 2025, *8*, 925. [Google Scholar] [CrossRef]
- Diaz, A.P.; Canal, C.A.M.; Valdés, A.J.; Delgado, J.E.G.; Varela-M, R.E. GSK-3 kinase a putative therapeutic target in trypanosomatid parasites. Braz. J. Infect. Dis. 2024, *28*, 103736. [Google Scholar] [CrossRef]
- Efstathiou, A.; Smirlis, D.; Myrianthopoulos, V.; Fytas, C.; Galanis, A.; Papadopoulou, B. Leishmania Protein Kinases: Important Regulators of the Parasite Life Cycle and Promising Drug Targets. Microorganisms 2021, *9*, 691. [Google Scholar] [CrossRef]
- Ojo, K.K.; Gillespie, J.R.; Riechers, A.J.; Napuli, A.J.; Verlinde, C.L.M.J.; Buckner, F.S.; Van Voorhis, W.C.; Merritt, E.A. Structure determination of glycogen synthase kinase-3 from Leishmania major and comparison with Trypanosoma brucei. Mol. Biochem. Parasitol. 2011, *176*, 98–108. [Google Scholar] [CrossRef]
- Black, J.A.; Reis-Cunha, J.L.; Cruz, A.K.; Tosi, L.R.O. Life in plastic, it’s fantastic! How Leishmania exploit genome instability to shape gene expression. Front. Cell. Infect. Microbiol. 2023, *13*, 1102462. [Google Scholar] [CrossRef]







| Read quality control | Leishmania naiffi |
Leishmania shawi |
|---|---|---|
| Total reads before trimming | 16,008,124 | 10,855,510 |
| Total reads after trimming | 15,103,553 | 9,741,698 |
| Total base pairs before trimming | 2,401,218,600 | 1,628,326,500 |
| Total base pairs after trimming | 2,265,532,950 | 1,461,254,700 |
| De novo assembly | ||
| Total contigs | 20,542 | 13,849 |
| Genome coverage | 70.5x | 45x |
| Contig analysis after taxonomic classification | ||
| Total Leishmania contigs | 20,446 | 13,816 |
| Total base pairs | 32,129,546 | 32,505,670 |
| Number of scaffolds | 20,446 | 13,816 |
| Minimum contig length | 200 bp | 200 bp |
| Maximum contig length | 17,834 bp | 49,040 bp |
| Average contig length | 1,565.86 bp | 2,347.94 bp |
| N25 | 4,494 bp | 10,605 bp |
| N50 | 2,583 bp | 5,882 bp |
| N75 | 1,303 bp | 2,839 bp |
| GC content (bp) | 18,445,163 | 18,717,749 |
| GC content (%) | 57.41 | 57.58 |
| Molecular Function Classes (Gene Ontology—GO) | L. shawi | L. naiffi | ||
|---|---|---|---|---|
| No PANTHER category is assigned (UNCLASSIFIED) | 4,942 | 62.0% | 4,614 | 61.1% |
| Catalytic activity (GO:0003824) | 1,326 | 16.6% | 1,269 | 16.8% |
| Binding (GO:0005488) | 965 | 12.1% | 932 | 12.3% |
| ATP-dependent activity (GO:0140657) | 230 | 2.9% | 223 | 3.0% |
| Transporter activity (GO:0005215) | 166 | 2.1% | 163 | 2.2% |
| Molecular function regulator activity (GO:0098772) | 75 | 0.9% | 75 | 1.0% |
| Structural molecule activity (GO:0005198) | 76 | 1.0% | 84 | 1.1% |
| Cytoskeletal motor activity (GO:0003774) | 67 | 0.8% | 66 | 0.9% |
| Molecular adaptor activity (GO:0060090) | 42 | 0.5% | 40 | 0.5% |
| Translation regulator activity (GO:0045182) | 41 | 0.5% | 42 | 0.6% |
| Transcription regulator activity (GO:0140110) | 21 | 0.3% | 16 | 0.2% |
| Antioxidant activity (GO:0016209) | 14 | 0.2% | 15 | 0.2% |
| Molecular transducer activity (GO:0060089) | 6 | 0.1% | 6 | 0.1% |
| Electron transfer activity (GO:0009055) | 3 | 0.0% | 3 | 0.0% |
| Cargo receptor activity (GO:0038024) | 1 | 0.0% | 1 | 0.0% |
| Biological Processes (Gene Ontology—GO) | L. shawi | L. naiffi | ||
|---|---|---|---|---|
| No PANTHER category is assigned (UNCLASSIFIED) | 5,011 | 52.7% | 4,684 | 52% |
| Cellular process (GO:0009987) | 1,905 | 20.0% | 1,842 | 20.5% |
| Metabolic process (GO:0008152) | 1,388 | 14.6% | 1,326 | 14.7% |
| Localization (GO:0051179) | 476 | 5.0% | 469 | 5.2% |
| Biological regulation (GO:0065007) | 359 | 3.8% | 340 | 3.8% |
| Response to stimulus (GO:0050896) | 269 | 2.8% | 254 | 2.8% |
| Reproductive process (GO:0022414) | 30 | 0.3% | 28 | 0.3% |
| Reproduction (GO:0000003) | 30 | 0.3% | 28 | 0.3% |
| Homeostatic process (GO:0042592) | 30 | 0.3% | 29 | 0.3% |
| Developmental process (GO:0032502) | 2 | 0.0% | 2 | 0.0% |
| Rhythmic process (GO:0048511) | 1 | 0.0% | 1 | 0.0% |
| Multicellular organismal process (GO:0032501) | 1 | 0.0% | 1 | 0.0% |
| Gene | target | pident | length | evalue | bitscore | Target family | quality_score | Species |
|---|---|---|---|---|---|---|---|---|
| g2262.t1_1 | GSK-3 | 93.239 | 355 | 0 | 702 | GSK-3 | 0.969575937 | L. shawi |
| g6653.t1_1 | GSK-3 | 92.676 | 355 | 0 | 698 | GSK-3 | 0.967042056 | L. naiffi |
| g70.t1_1 | GSK-3 | 41.176 | 68 | 2.61E-10 | 50.4 | GSK-3 | 0.638724366 | L. naiffi |
| g6076.t1_1 | GSK-3 | 41.176 | 68 | 2.65E-10 | 50.4 | GSK-3 | 0.638592259 | L. shawi |
| g7580.t1_1 | GSK-3 | 52.593 | 135 | 0 | 144 | GSK-3 | 0.588149574 | L. shawi |
| g4889.t1_1 | Calpain | 40.278 | 72 | 2.96E-10 | 52.4 | Calpain | 0.577380499 | L. naiffi |
| g6708.t1_1 | Calpain | 38.889 | 72 | 3.67E-10 | 52 | Calpain | 0.569262623 | L. shawi |
| g5011.t1_1 | GSK-3 | 35.135 | 111 | 0 | 62.4 | GSK-3 | 0.568017545 | L. naiffi |
| g5438.t1_1 | GSK-3 | 35.135 | 111 | 0 | 61.2 | GSK-3 | 0.568017545 | L. shawi |
| g1346.t1_1 | GSK-3 | 35.294 | 102 | 0 | 59.3 | GSK-3 | 0.564705471 | L. naiffi |
| g7003.t1_1 | GSK-3 | 46.753 | 77 | 0 | 63.5 | GSK-3 | 0.556492643 | L. shawi |
| g5886.t1_1 | GSK-3 | 35.417 | 144 | 0 | 95.5 | GSK-3 | 0.555903944 | L. shawi |
| g3959.t1_1 | GSK-3 | 46.154 | 78 | 0 | 65.5 | GSK-3 | 0.553846692 | L. naiffi |
| g6215.t1_1 | GSK-3 | 36.905 | 84 | 0 | 56.2 | GSK-3 | 0.517262738 | L. shawi |
| g6018.t1_1 | GSK-3 | 36.905 | 84 | 0 | 56.2 | GSK-3 | 0.517262738 | L. naiffi |
| g5925.t1_1 | GSK-3 | 37 | 100 | 0 | 57.4 | GSK-3 | 0.4695 | L. shawi |
| g6586.t1_1 | GSK-3 | 36.41 | 195 | 0 | 112 | GSK-3 | 0.465896539 | L. naiffi |
| g627.t1_1 | GSK-3 | 36.774 | 155 | 0 | 80.1 | GSK-3 | 0.409676742 | L. naiffi |
| g4212.t1_1 | ABCG2 | 39.394 | 66 | 1.17E-06 | 36.2 | ABC | 0.381454677 | L. naiffi |
| g3791.t1_1 | GSK-3 | 36.522 | 115 | 0 | 56.6 | GSK-3 | 0.372174826 | L. naiffi |
| g4105.t1_1 | GSK-3 | 35.652 | 115 | 0 | 54.7 | GSK-3 | 0.368260261 | L. shawi |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).