Submitted:
24 May 2023
Posted:
26 May 2023
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Retrieval of protein-coding sequences
2.2. Identification of similarity between human CDS and other mammalian sequences
2.3. Comparison of conserved CDS and identification of SNPs and their associated diseases
2.4. Construction of phylogenetic tree
2.5. Transcriptomic analysis using tissue-specific data
3. Results
3.1. Identification of conserved CDS with human sequences
3.2. Mapping human disease-relevant SNPs in other species
3.3. Transcriptomic analysis of six different tissues
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References





| Comparison | Identified BLAST hits* | Average percentage identity | Range of percent identity | Average percentage identity for conserved CDS |
|---|---|---|---|---|
| Human vs. Rhesus macaque | 17,638 | 96.82 | 100-71.74 | 97.53 |
| Human vs. Marmoset | 17,787 | 94.65 | 100-71.63 | 95.76 |
| Human vs. Pig | 14,992 | 89.37 | 100-70.81 | 90.38 |
| Human vs. Mouse | 13,806 | 86.65 | 100-70.11 | 87.19 |
| Human vs. Rat | 13,222 | 86.53 | 100-68.93 | 87.04 |

| Human Chromosomes | Total CDS | Conserved CDS | Rhesus macaque* | Marmoset* | Pig* | Mouse* | Rat* |
|---|---|---|---|---|---|---|---|
| Chr1 | 2049 | 1088 | 1 | 7, 18, 19 | 6, 4, 9, 10, 14, 2, 7 | 4, 3, 1, 8 | 5, 2, 13, 19, 14, 10, 17, 4 |
| Chr2 | 1244 | 750 | 12, 13 | 6, 14 | 15, 3 | 1, 2, 6, 17, 12, 11 | 9, 6, 3, 4, 14, 13, 20, 18 |
| Chr3 | 1075 | 645 | 2 | 15, 17 | 13 | 9, 16, 3, 6, 14 | 8, 11, 2, 4, 16, 15 |
| Chr4 | 752 | 390 | 5 | 3 | 8, 15, 14 | 5, 3, 8 | 14, 2, 16, 19, 4 |
| Chr5 | 883 | 502 | 6 | 2 | 2, 16 | 13, 18, 11, 15 | 2, 18, 10, 17, 1, 9 |
| Chr6 | 1045 | 574 | 4 | 4 | 7, 1 | 17, 10, 13, 9, 4, 1 | 20, 1, 17, 9, 8, 5 |
| Chr7 | 919 | 470 | 3 | 8, 2 | 18, 9, 3 | 5, 6, 12, 11, 13 | 4, 12, 6, 14, 17 |
| Chr8 | 684 | 372 | 8 | 16, 13 | 4, 14, 17, 15 | 15, 8, 14, 4, 1, 3 | 7, 5, 16, 15, 2, 11 |
| Chr9 | 779 | 402 | 15 | 1 | 1, 10, 14, 3 | 4, 2, 19, 13 | 5, 3, 1, 17 |
| Chr10 | 1309 | 619 | 9 | 12, 7 | 14, 10 | 19, 14, 2, 10, 7, 18, 6, 13 | 1, 17, 20, 16, 15, 4 |
| Chr11 | 727 | 432 | 14 | 11 | 2, 9 | 7, 9, 19, 2 | 1, 8, 3 |
| Chr12 | 1033 | 582 | 11 | 9 | 5, 14 | 10, 5, 6, 15 | 7, 12, 4 |
| Chr13 | 321 | 182 | 17 | 1, 5 | 11 | 14, 8, 5, 3 | 15, 16, 12, 2, 9 |
| Chr14 | 610 | 360 | 7 | 10 | 7, 1 | 12, 14 | 6, 15 |
| Chr15 | 596 | 371 | 7 | 10, 6 | 1, 7 | 9, 2, 7 | 8, 3, 1 |
| Chr16 | 851 | 378 | 20 | 12, 20 | 6, 3 | 8, 7, 16, 17, 11 | 19, 1, 10 |
| Chr17 | 1182 | 637 | 16 | 5 | 12 | 11 | 10 |
| Chr18 | 269 | 157 | 18 | 13 | 1, 6 | 18, 17, 1 | 18, 9, 3 |
| Chr19 | 546 | 282 | 19 | 22 | 6, 2 | 7, 8, 10, 17, 9 | 1, 7, 16, 8, 19, 9, 12 |
| Chr20 | 1469 | 457 | 10 | 5 | 17 | 2 | 3 |
| Chr21 | 234 | 76 | 3 | 21 | 13 | 16, 10, 17 | 11, 20 |
| Chr22 | 444 | 202 | 10 | 1 | 5, 14 | 15, 11, 16, 5, 10 | 7, 14, 11, 12, 20 |
| ChrX | 853 | 381 | X | X | X | X | X |
| ChrY | 46 | 7 | Y | Y, X | Y, X | Y, X | Y, X |
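
The per-chromosome counts in the table above can be turned into conserved fractions with a few lines of Python. This is only an illustrative sketch: the numbers are copied from the Total CDS and Conserved CDS columns, while the names `cds_counts` and `conserved_fraction` are hypothetical and not part of the original analysis.

```python
# (total CDS, conserved CDS) per human chromosome, copied from the table above.
cds_counts = {
    "Chr1": (2049, 1088), "Chr2": (1244, 750), "Chr3": (1075, 645),
    "Chr4": (752, 390), "Chr5": (883, 502), "Chr6": (1045, 574),
    "Chr7": (919, 470), "Chr8": (684, 372), "Chr9": (779, 402),
    "Chr10": (1309, 619), "Chr11": (727, 432), "Chr12": (1033, 582),
    "Chr13": (321, 182), "Chr14": (610, 360), "Chr15": (596, 371),
    "Chr16": (851, 378), "Chr17": (1182, 637), "Chr18": (269, 157),
    "Chr19": (546, 282), "Chr20": (1469, 457), "Chr21": (234, 76),
    "Chr22": (444, 202), "ChrX": (853, 381), "ChrY": (46, 7),
}


def conserved_fraction(counts):
    """Return {chromosome: conserved / total} for every row."""
    return {chrom: conserved / total for chrom, (total, conserved) in counts.items()}


total_cds = sum(total for total, _ in cds_counts.values())
total_conserved = sum(conserved for _, conserved in cds_counts.values())
fractions = conserved_fraction(cds_counts)

print(f"Conserved CDS overall: {total_conserved} of {total_cds}")
# List chromosomes from the highest to the lowest conserved fraction.
for chrom, frac in sorted(fractions.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{chrom}: {frac:.1%}")
```

The Conserved CDS column sums to 10,316, which agrees with the CDS count quoted in the headers of the SNP and expression tables.
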

| Organisms | Total SNPs in 10,316 CDS with RS number | SNPs associated with disease | SNPs identified in genes | No. of identified diseases | Species-specific diseases* |
|---|---|---|---|---|---|
| Human vs. Rhesus macaque | 63,449 | 2198 | 1074 | 1255 | 77 (77) |
| Human vs. Marmoset | 40,181 | 1533 | 867 | 1039 | 4 (12) |
| Human vs. Pig | 145,715 | 4011 | 1428 | 1597 | 117 (96) |
| Human vs. Mouse | 221,383 | 5824 | 1630 | 1798 | 58 (44) |
| Human vs. Rat | 220,631 | 5739 | 1646 | 1787 | 81 (54) |
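
As a rough way to compare the rows above, the share of SNPs that carry a disease association can be computed per species pair. The sketch below uses only the first two numeric columns of the table; the variable names are hypothetical, and the ratio is an illustrative summary rather than a statistic reported by the authors.

```python
# (total SNPs with RS number, SNPs associated with disease), from the table above.
snp_counts = {
    "Rhesus macaque": (63449, 2198),
    "Marmoset": (40181, 1533),
    "Pig": (145715, 4011),
    "Mouse": (221383, 5824),
    "Rat": (220631, 5739),
}

# Fraction of identified SNPs that are disease-associated, per comparison.
disease_share = {
    species: disease / total for species, (total, disease) in snp_counts.items()
}

for species, share in disease_share.items():
    print(f"Human vs. {species}: {share:.2%} of SNPs are disease-associated")
```
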

| Chromosome | Rat* | Diseases | Mouse* | Diseases | Pig* | Diseases | Marmoset* | Diseases | Rhesus macaque* | Diseases |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ABCA4 | 41 | MUTYH | 44 | SPTA1 | 34 | NLRP3 | 15 | SPTA1 | 18 |
| 2 | MSH6 | 115 | MSH6 | 97 | MSH6 | 78 | APOB | 22 | APOB | 31 |
| 3 | MLH1 | 36 | BAP1 | 34 | MLH1 | 34 | ITIH3 | 21 | ITIH3 | 21 |
| 4 | WFS1 | 23 | PDGFRA | 27 | KIT | 29 | KIT | 13 | PDGFRA | 18 |
| 5 | SDHA | 55 | SDHA | 75 | SDHA | 35 | VCAN | 21 | SDHA | 14 |
| 6 | DSP | 40 | DSP | 52 | SLC22A7 | 12 | CFB | 6 | DSP | 28 |
| 7 | CFTR | 39 | CFTR | 38 | GARS1 | 19 | GARS1 | 13 | RELN | 14 |
| 8 | NBN | 17 | NBN | 17 | NBN | 23 | KCNQ3 | 7 | FGFR1 | 7 |
| 9 | NOTCH1 | 95 | NOTCH1 | 102 | PTCH1 | 55 | COL5A1 | 13 | NOTCH1 | 25 |
| 10 | RET | 57 | RET | 55 | RET | 65 | RET | 25 | CUBN | 14 |
| 11 | ATM | 125 | ATM | 117 | ATM | 116 | ATM | 43 | ATM | 37 |
| 12 | POLE | 107 | POLE | 118 | POLE | 78 | POLE | 24 | POLE | 24 |
| 13 | RB1 | 15 | RB1 | 14 | RB1 | 8 | RB1 | 5 | RB1 | 8 |
| 14 | DYNC1H1 | 43 | DYNC1H1 | 44 | DICER1 | 24 | C14orf39 | 8 | DICER1 | 10 |
| 15 | FBN1 | 140 | FBN1 | 152 | FBN1 | 120 | FBN1 | 41 | FBN1 | 36 |
| 16 | TSC2 | 209 | TSC2 | 215 | TSC2 | 112 | TSC2 | 38 | TSC2 | 46 |
| 17 | SCN4A | 53 | SCN4A | 58 | NF1 | 33 | AC004223.3 | 17 | AC004223.3 | 21 |
| 18 | LAMA3 | 13 | LOXHD1 | 15 | LAMA3 | 14 | LAMA3 | 4 | LOXHD1 | 6 |
| 19 | LDLR | 72 | STK11 | 59 | LDLR | 56 | LDLR | 22 | LDLR | 25 |
| 20 | COL9A3 | 15 | SLC2A10 | 18 | JAG1 | 11 | MYH7B | 9 | MYH7B | 9 |
| 21 | COL6A1 | 19 | COL6A1 | 19 | CBS | 13 | CBS | 4 | CBS | 13 |
| 22 | NF2 | 18 | DEPDC5 | 17 | TMPRSS6 | 29 | TMPRSS6 | 26 | TMPRSS6 | 28 |
| X | FLNA | 110 | FLNA | 118 | FLNA | 30 | FLNA | 14 | FLNA | 26 |

| Tissues | Expressed genes (Human) | Expressed genes (Mouse) | Expressed genes (Rat) | Expressed genes (Pig) | Conserved genes in four organisms based on 10,316 CDS |
|---|---|---|---|---|---|
| Spleen | 10,873 | 9,984 | 16,697 | 17,647 | 4,121 |
| Skeletal muscle | 10,590 | 10,650 | 12,770 | 15,152 | 3,963 |
| Lung | 11,285 | 11,213 | 18,171 | 12,943 | 4,034 |
| Colon | 11,125 | 10,755 | 16,949 | 15,603 | 4,128 |
| Heart | 10,967 | 10,369 | 14,821 | 13,995 | 4,116 |
| Kidney | 11,325 | 10,706 | 16,400 | 16,410 | 4,401 |
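
The tissue table can be summarised as the share of human expressed genes that fall in the conserved set. The sketch below assumes that the first numeric column is the human expressed-gene count and the last column is the conserved-gene count; that column reading, and the names used in the code, are assumptions made for illustration.

```python
# (human expressed genes, conserved genes across the four organisms) per tissue.
# Column assignment is an assumption based on the table layout above.
tissue_counts = {
    "Spleen": (10873, 4121),
    "Skeletal muscle": (10590, 3963),
    "Lung": (11285, 4034),
    "Colon": (11125, 4128),
    "Heart": (10967, 4116),
    "Kidney": (11325, 4401),
}

# Conserved genes as a fraction of the human expressed genes in each tissue.
conserved_share = {
    tissue: conserved / human for tissue, (human, conserved) in tissue_counts.items()
}

for tissue, share in sorted(conserved_share.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{tissue}: {share:.1%} of human expressed genes are in the conserved set")
```
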

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).