Submitted:
15 June 2023
Posted:
16 June 2023
You are already at the latest version
Abstract
Keywords:
Introduction
Material and Methods
Downloading the Nucleotide Sequences
Microsatellite prediction
CpG Island prediction
Results:
Microsatellite and CpG prediction in the Avian group
Microsatellite prediction in the Avian group (W chromosome)
Microsatellite prediction in the Avian group (Z chromosome)
CpG island prediction in the Avian group (W chromosome):
CpG island prediction in the Avian group (Z chromosome):
Microsatellite and CpG prediction in the insect group
Microsatellite Prediction in the insect group(X chromosomes)
Microsatellite Prediction in the insect group ( Y chromosomes)
CpG Island prediction in Insect group (X chromosomes)
CpG Island prediction in Insect group (Y chromosomes)
Microsatellite and CpG prediction in Primates
Microsatellite prediction in Primates (X chromosomes)
Microsatellite prediction in Primates (Y chromosomes)
CpG island prediction in Primates(X chromosomes)
CpG island prediction in Primates(Y chromosomes)
Microsatellite and CpG prediction in Rodents
Microsatellite prediction in Rodents(X chromosome)
Microsatellite prediction in Rodents(Y chromosome)
CpG Island prediction in Rodents(X chromosome)
CpG Island prediction in Rodents(Y chromosome)
Microsatellite and CpG prediction in even-toed ungulates
Microsatellite prediction in Even-toed ungulates (X chromosome)
Microsatellite prediction in Even-toed ungulates (Y chromosome)
CpG Island prediction in Even-toed ungulates(X chromosome)
CpG Island prediction in Even-toed ungulates(Y chromosome)
Discussion
Data availability
| S.N. | Animal Name | Accession Number (Y Chr) | URL | Accession Number (X Chr) | URL |
| 1 | Chlorocebus sabaeus (Green Monkey) | CM001940.1 | https://www.ncbi.nlm.nih.gov/nuccore/CM001940.1 | CM001951.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM001951.2 |
| 2 | Homo sapiens (Human) | CM000686.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM000686.2 | CM000685.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM000685.2 |
| 3 | Callithrix jacchus (White-tufted-ear marmoset) | CM000879.1 | https://www.ncbi.nlm.nih.gov/nuccore/CM000879.1 | CM000878.1 | https://www.ncbi.nlm.nih.gov/nuccore/CM000878.1 |
| 4 | Rattus norvegicus (Norway rat) | CM002824.1 | https://www.ncbi.nlm.nih.gov/nuccore/CM002824.1 | CM000092.5 | https://www.ncbi.nlm.nih.gov/nuccore/CM000092.5 |
| 5 | Mus musculus (House mouse) | CM001014.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM001014.2 | CM001013.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM001013.2 |
| 6 | Sus scrofa (Pig) | CM001155.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM001155.2 | CM000830 | https://www.ncbi.nlm.nih.gov/nuccore/CM000830 |
| 7 | Anopheles gambiae (Mosquito) | KJ608153.1 | https://www.ncbi.nlm.nih.gov/nuccore/KJ608153.1 | CM000360.1 | https://www.ncbi.nlm.nih.gov/nuccore/CM000360.1 |
| 8 | Bos Taurus (Cow) | CM001061.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM001061.2 | GK000030.2 | https://www.ncbi.nlm.nih.gov/nuccore/GK000030.2 |
| 9 | Pan troglodytis (Common chimpanzee) | NC_006492.3 | https://www.ncbi.nlm.nih.gov/nuccore/NC_006492.3 | CM000336.2 | https://www.ncbi.nlm.nih.gov/nuccore/CM000336.2 |
| 10 | Gallus gallus (Chicken) | CM000122.3 (Z) | https://www.ncbi.nlm.nih.gov/nuccore/CM000122.3 | CM000121.3 (W) | https://www.ncbi.nlm.nih.gov/nuccore/CM000121.3 |
| 11 | Meleagris gallopavo (Wild turkey) | CM000993.2 (Z) | https://www.ncbi.nlm.nih.gov/nuccore/CM000993.2 | CM000992.2 (W) | https://www.ncbi.nlm.nih.gov/nuccore/CM000992.2 |
| 12 | Drosophila melanogaster (Fruit fly) | CP007106.1 | https://www.ncbi.nlm.nih.gov/nuccore/CP007106.1 | AE014298.5 | https://www.ncbi.nlm.nih.gov/nuccore/AE014298.5 |
References
- Antequera, F. (2003). Structure, function and evolution of CpG island promoters. Cellular and Molecular Life Sciences. 60(8): 1647-1658. [CrossRef]
- Antequera, F., & Bird, A. (1999). CpG islands as genomic footprints of promoters that are associated with replication origins. Current Biology. 9(17): R661-R667. [CrossRef]
- Bird A, Taggart M, Frommer M, Miller O J and Macleod D. (1985). A fraction of the mouse genome that is derived from islands of nonmethylated, CpGrich DNA. Cell 40(1): 9199. [CrossRef]
- Blackmon, H., Ross, L., &Bachtrog, D. (2017). Sex determination, sex chromosomes, and karyotype evolution in insects. Journal of Heredity. 108(1):78-93. [CrossRef]
- Borstnik B, Pumpernik D. (2002). Tandem repeats in protein coding regions of primate genes (2002). Genome Res. 12:909-915. [CrossRef]
- Cechova, M., & Miga, K. H. (2022, May). Satellite DNAs and human sex chromosome variation. In Seminars in Cell & Developmental Biology. Academic Press. [CrossRef]
- Cooper D N, Taggart M H and Bird A P. (1983). Unmethlated domains in vertebrate DNA. Nucleic acids research. 11(3): 647658. [CrossRef]
- Duncan, C. G., Grimm, S. A., Morgan, D. L., Bushel, P. R., Bennett, B. D., Roberts, J. D., & Wade, P. A. (2018). Dosage compensation and DNA methylation landscape of the X chromosome in mouse liver. Scientific reports. 8(1):1-17. [CrossRef]
- Gardiner-Garden, M., & Frommer, M. (1987). CpG islands in vertebrate genomes. Journal of molecular biology. 196(2):261-282. [CrossRef]
- Graves, J. A. M. (2006). Sex chromosome specialization and degeneration in mammals. Cell. 124(5) :901-914. [CrossRef]
- Hakki EE, Akkaya MS. (2000). Microsatellite isolation using amplified fragment lengthpolymorphism markers: no cloning, no screening. Molecular Ecology.9:2152-2154. [CrossRef]
- Hughes, J. F., Skaletsky, H., Pyntikova, T., Minx, P. J., Graves, T., Rozen, S.,& Page, D. C. (2005). Conservation of Y-linked genes during human evolution revealed by comparative sequencing in chimpanzee. Nature. 437(7055):100-103. [CrossRef]
- Kananen, L., & Marttila, S. (2021). Ageing-associated changes in DNA methylation in X and Y chromosomes. Epigenetics & chromatin. 14(1): 1-10. [CrossRef]
- Kapila N, Sharma A, Kishore A, Sodhi M, Tripathi P K, Mohanty A K and Mukesh M. (2016). Impact of heat stress on cellular and transcriptional adaptation of mammary epithelial cells in swine (Sus scrofa). PloS one. 11(9): e0157237. [CrossRef]
- Karagyozov L, Kalcheva ID, Chapman VM. (1993). Construction of random small-insertgenomic libraries highly enriched for simple sequence repeats. Nucleic Acids Research. 21:3911- 3912. [CrossRef]
- Kunzler P, Matsuo K, Schaffner W: Pathological, physiological, and evolutionary aspects of short unstable DNA repeats in the human genome. (1995). BiolChem Hoppe Seyler. 4:201-211.
- Lander E S, Linton L M, Birren B, Nusbaum C, Zody M C, Baldwin J and Proctor M J. (2001). Initial sequencing and analysis of the human genome.67: 209 213. [CrossRef]
- Larsen F, Gundersen G, Lopez R and Prydz H. (1992). CpG islands as gene markers in the human genome. Genomics. 13(4): 10951107. [CrossRef]
- Moxon ER, Wills C: DNA microsatellites: agents of evolution? (1999)Sci Am, 280:94-99. [CrossRef]
- Muyle, A., Bachtrog, D., Marais, G. A., & Turner, J. M. (2021). Epigenetics drive the evolution of sex chromosomes in animals and plants. Philosophical Transactions of the Royal Society B. 376(1826):20200124. [CrossRef]
- Okano M, Bell DW, Haber DA, Li E. (1999). DNA methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell. 99: 247–257. [CrossRef]
- Priolli RHG, Mendes-Junior CT, Arantes NE and Contel EPB. (2002). Characterization of Brazilian soybean cultivars using microsatellite markers. Genet Mol Biol. 25:185-193. [CrossRef]
- Robinson, P. N. et al. 2004. “Gene-Ontology analysis reveals association of tissue specific 5’ CpG-island genes with development and embryogenesis.” Human Molecular Genetics. 1969-78. [CrossRef]
- Romanenko, S. A., Perelman, P. L., Trifonov, V. A., &Graphodatsky, A. S. (2012). Chromosomal evolution in Rodentia. Heredity.108(1): 4-16. [CrossRef]
- Saghai-Maroof MA, Biyashev RM, Yang GP, Zang Q and Allard RW. (1994).Extraordinarily polymorphic microsatellites DNA in barley species diversity, chromosomal locations, and population dynamics. ProcNatlAcadSci USA.91:5466-6470. [CrossRef]
- Samuelsson T. (2010). Group project for Sequence Bioinformatics course. Chalmers University of Technology. [online]. Available at: http://bio.lundberg.gu.se/courses/ht10/bio2/group_projects_2010.pdf.
- Shyamala, N., Kongettira, C. L., Puranam, K., Kupsal, K., Kummari, R., Padala, C., & Hanumanth, S. R. (2022). In silico identification of single nucleotide variations at CpG sites regulating CpG island existence and size. Scientific reports. 12(1):1-17. [CrossRef]
- Stevens, L. (1997). Sex chromosomes and sex determining mechanisms in birds. Science Progress. 80:197-216.
- Takai, D., and Peter Jones. (2002). “Comprehensive analysis of CpG islands in human chromosomes 21 and 22.” PNAS. [CrossRef]
- Tautz D and Renz M. (1984). Simple sequence repeats are ubiquitous repetitive components of eukaryotic genomes. Nucl Acids Res. 12:4127-4137. [CrossRef]
- Thomson, J. P., Skene, P. J., Selfridge, J., Clouaire, T., Guy, J., Webb, S., & Bird, A. (2010). CpG islands influence chromatin structure via the CpG-binding protein Cfp1. Nature. 464(7291): 1082-1086. [CrossRef]
- Wang, H., Gao, S., Liu, Y., Wang, P., Zhang, Z., & Chen, D. (2022). A pipeline for effectively developing highly polymorphic simple sequence repeats markers based on multi-sample genomic data. Ecology and evolution.12(3):e8705. [CrossRef]
- Yu K, Park J, Poysa V and Gepts P. (2000). Integration of Simple Sequence Repeats (SSR) markers into a molecular linkage map of common bean (Phaseolus vulgaris). J Hered. 91:429-434. [CrossRef]
- Zhao, Z., & Han, L. (2009). CpG islands: algorithms and applications in methylation studies. Biochemical and biophysical research communications. 382(4):643645. [CrossRef]












| S.No. | Animal species | Common Name | Group | Order | Y_chr size* | X_chr size** |
|---|---|---|---|---|---|---|
| 1. | Gallus gallus | Red junglefowl | Avian | Galliformes | 82363669$ | 1248174$$ |
| Meleagris gallopavo | Wild turkey | Galliformes | 68461266$ | 260627$$ | ||
| 2. | Anopheles gambiae | Mosquitoes | Insects | Diptera | 10,429 | 24393108 |
| Drosophila melanogaster | Fruit fly | Diptera | 3667352 | 23542271 | ||
| 3. | Callithrix jacchus | New World monkey | Primates | Primates | 2,853,901 | 142,054,208 |
| Chlorocebus sabaeus | Green monkey | Primates | 6181219 | 130038232 | ||
| Homo sapiens | Humans | Primates | 57,227,415 | 156040895 | ||
| Pan troglodytes | Chimpanzee | Primates | 263,42,871 | 156848144 | ||
| 4. | Mus musculus | House mouse | Rodents | Rodentia | 91,744,698 | 171,031,299 |
| Rattus norvegicus | Brown rat | Rodentia | 3,310,458 | 159,970,021 | ||
| 5. | Bos taurus | Cattle | Even-toed ungulates | Artiodactyla | 433,00,181 | 148823899 |
| Sus scrofa | Wild boars | Artiodactyla | 1,637,716 | 144,288,218 |
| Features | Gallus gallus | Meleagris gallopavo |
|---|---|---|
| Average Island Length | 569.12 | 564.88 |
| The standard error (Island Length) | 4.38 | 17.82 |
| Island Number | 2433 | 83.00 |
| Average G+ C percent | 51.97 | 50.5 |
| Standard error G+ C percent | 0.06 | 0.14 |
| Average CpG percent | 4.7 | 5.25 |
| Standard error CpG percent | 0.01 | 0.12 |
| Average Ratio | 0.72 | 0.9 |
| Standard error Ratio | 0.00 | 0.03 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 5230.00 | 1653.00 |
| Features |
Gallus gallus |
Meleagris gallopavo |
|---|---|---|
| Average Island Length | 743.57 | 630.51 |
| The standard error (Island Length) | 7.1 | 5.63 |
| Island Number | 4635.00 | 1799.00 |
| Average G+ C percent | 55.3 | 51.74 |
| Standard error G+ C percent | 0.08 | 0.08 |
| Average CpG percent | 5.76 | 5.26 |
| Standard error CpG percent | 0.02 | 0.02 |
| Average Ratio | 0.76 | 0.81 |
| Standard error Ratio | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 6949.00 | 2330.00 |
| Features |
Anopheles gambiae |
Drosophila melanogaster |
|---|---|---|
| Average Island Length | 634.24 | 619.42 |
| The standard error (Island Length) | 1.78 | 1.94 |
| Island Number | 50388.00 | 31613.00 |
| Average G+ C percent | 50.46 | 50.69 |
| Standard error G+ C percent | 0.01 | 0.01 |
| Average CpG percent | 6.5 | 5.64 |
| Standard error CpG percent | 0.01 | 0.01 |
| Average Ratio | 1.03 | 0.89 |
| Standard error Ratio | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 9249.00 | 7881.00 |
| Features |
Anopheles gambiae |
Drosophila melanogaster |
|---|---|---|
| Average Island Length | 535.00 | 601.02 |
| The standard error (Island Length) | 34.00 | 7.03 |
| Island Number | 3.00 | 3270.00 |
| Average G+ C percent | 50.41 | 50.37 |
| Standard error G+ C percent | 0.21 | 0.02 |
| Average CpG percent | 6.07 | 5.73 |
| Standard error CpG percent | 0.13 | 0.02 |
| Average Ratio | 0.97 | 0.91 |
| Standard error Ratio | 0.03 | 0.00 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 603.00 | 7406.00 |
| Features | Callithrix jacchus |
Chlorocebus sabaeus |
Homo sapiens |
Pan troglodytes |
|---|---|---|---|---|
| Average Island Length | 634.76 |
628.83 | 611.57 | 608.97 |
| The standard error (Island Length) | 4.3 | 4.93 |
3.62 | 3.8 |
| Island Number | 4426.00 |
4232.00 |
6770.00 | 4379.00 |
| Average G+ C percent | 55.66 | 55.17 | 55.68 | 54.53 |
| Standard error G+ C percent | 0.09 | 0.1 | 0.08 | 0.09 |
| Average CpG percent | 5.38 | 5.26 | 5.32 | 5.24 |
| Standard error CpG percent | 0.02 | 0.02 | 0.02 | 0.02 |
| Average Ratio | 0.7 | 0.7 | 0.7 | 0.72 |
| Standard error Ratio | 0.00 | 0.00 | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 | 500.00 | 500.00 |
| Maximum Island length | 4007.00 | 4473.00 | 4472.00 | 2991.00 |
| Features |
Callithrix jacchus |
Chlorocebus sabaeus |
Homo sapiens |
Pan troglodytes |
|---|---|---|---|---|
| Average Island Length | 643.37 | 559.51 | 569.93 | 570.2 |
| The standard error (Island Length) | 20.72 | 10.69 | 5.41 | 6.38 |
| Island Number | 268.00 | 257.00 | 1756.00 | 997.00 |
| Average G+ C percent | 56.92 | 52.07 | 53.89 | 55.66 |
| Standard error G+ C percent | 0.39 | 0.24 | 0.14 | 0.19 |
| Average CpG percent | 5.58 | 4.78 | 4.89 | 5.29 |
| Standard error CpG percent | 0.08 | 0.05 | 0.03 | 0.04 |
| Average Ratio | 0.7 | 0.72 | 0.71 | 0.69 |
| Standard error Ratio | 0.00 | 0.01 | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 | 500.00 | 500.00 |
| Maximum Island length | 3242.00 |
1950.00 | 3420.00 | 1987.00 |
| Features | Mus musculus | Rattus norvegicus |
|---|---|---|
| Average Island Length | 588.17 | 596.19 |
| The standard error (Island Length) | 3.46 | 3.68 |
| Island Number | 4545.00 | 4465.00 |
| Average G+ C percent | 54.68 | 53.03 |
| Standard error G+ C percent | 0.09 | 0.07 |
| Average CpG percent | 5.26 | 4.94 |
| Standard error CpG percent | 0.02 | 0.01 |
| Average Ratio | 0.72 | 0.73 |
| Standard error Ratio | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 3476.00 | 4484.00 |
| Features | Mus musculus | Rattus norvegicus |
|---|---|---|
| Average Island Length | 548.88 | 560.46 |
| The standard error (Island Length) | 3.39 | 12.67 |
| Island Number | 1516.00 | 110.00 |
| Average G+ C percent | 52.21 | 52.7 |
| Standard error G+ Cpercent | 0.11 | 0.37 |
| Average CpG percent | 4.79 | 4.73 |
| Standard error CpG percent | 0.02 | 0.06 |
| Average Ratio | 0.71 | 0.7 |
| Standard error Ratio | 0.00 | 0.01 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 1568.00 | 1059.00 |
| Features | Bos taurus | Sus scrofa |
|---|---|---|
| Average Island Length | 701.35 | 580.44 |
| Standard error (Island Length) | 10.07 | 2.02 |
| Island Number | 1516.00 | 13539.00 |
| Average G+ C percent | 57.61 | 54.5 |
| Standard error G+C percent | 0.15 | 0.05 |
| Average CpG percent | 5.84 | 5.13 |
| Standard error CpG percent | 0.03 | 0.01 |
| Average Ratio | 0.72 | 0.7 |
| Standard error Ratio | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 3832.00 | 5639.00 |
| Features | Bos taurus | Sus scrofa |
|---|---|---|
| Average Island Length | 545.86 | 567.28 |
| Standard error (Island Length) | 2.92 | 4.55 |
| Island Number | 1994.00 | 1820.00 |
| Average G+ C percent | 53.36 | 53.86 |
| Standard error G+ C percent | 0.1 | 0.11 |
| Average CpG percent | 4.93 | 4.96 |
| Standard error CpG percent | 0.02 | 0.02 |
| Average Ratio | 0.7 | 0.7 |
| Standard error Ratio | 0.00 | 0.00 |
| Minimum Island length | 500.00 | 500.00 |
| Maximum Island length | 2360.00 | 2650.00 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).