Submitted:
18 September 2025
Posted:
19 September 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Data Collection
2.2. SARS-CoV-2 Spike Protein Domains
2.3. Evolutionary Macro-Lineage Definition of SARS-CoV-2
2.4. Definition of Charge Increment
2.5. Phylogenetic Tree Construction Scheme
2.5.1. Based on Full-Length S Protein Sequences
2.5.2. Based on Mutated Sites of the S Protein
3. Results
3.1. Relationship Between Spike Protein Charge Increment and Lineage Divergence
3.2. Correlation Analysis of Spike Protein Charge Increment, Immune Escape, Affinity, and Expression Levels Across Different Macro-Lineages
3.3. Analysis of the Correlation Between Codon Usage Characteristics of SARS-CoV-2 Variants and Spike Protein Charge Increment
3.4. Phylogenetic Tree Construction Based on the Charge Properties of Mutation Sites in the Spike Protein Sequence
4. Discussion
4.1. Evolutionary Trend of Spike Protein Charge Increment
4.2. Evolution of the Balance Between SARS-CoV-2 Viral Infectivity and Immune Escape
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef] [PubMed]
- Cocherie, T.; Zafilaza, K.; Leducq, V.; Marot, S.; Calvez, V.; Marcelin, A. G.; Todesco, E. Epidemiology and characteristics of SARS-CoV-2 variants of concern: The impacts of the spike mutations. Microorganisms 2022, 11, 30. [Google Scholar] [CrossRef]
- Huang, Y.; Yang, C.; Xu, X.F.; Xu, W.; Liu, S.W. Structural and functional properties of SARS-CoV-2 spike protein: Potential antivirus drug development for COVID-19. Acta Pharmacol Sin. 2020, 41, 1141–1149. [Google Scholar] [CrossRef]
- Yan, R.; Zhang, Y.; Li, Y.; Xia, L.; Guo, Y.; Zhou, Q. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020, 367, 1444–1448. [Google Scholar] [CrossRef]
- Adamczyk, Z.; Batys, P.; Barbasz, J. SARS-CoV-2 virion physicochemical characteristics pertinent to abiotic substrate attachment. Curr Opin Colloid Interface Sci. 2021, 55, 101466. [Google Scholar] [CrossRef]
- Kucherova, A.; Strango, S.; Sukenik, S.; Theillard, M. Computational modeling of protein conformational changes—Application to the opening SARS-CoV-2 spike. J Comput Phys. 2021, 444, 110591. [Google Scholar] [CrossRef]
- Zhang, Z.; Zhang, J.; Wang, J. Surface charge changes in spike RBD mutations of SARS-CoV-2 and its variant strains alter the virus evasiveness via HSPGs: A review and mechanistic hypothesis. Front Public Health. 2022, 10, 952916. [Google Scholar] [CrossRef]
- Cotten, M.; Phan, M.V.T. Evolution of increased positive charge on the SARS-CoV-2 spike protein may be adaptation to human transmission. iScience. 2023, 26, 106230. [Google Scholar] [CrossRef]
- Jawad, B.; Adhikari, P.; Podgornik, R.; Ching, W.Y. Binding interactions between receptor-binding domain of spike protein and human angiotensin converting enzyme-2 in Omicron variant. J Phys Chem Lett. 2022, 13, 3915–3921. [Google Scholar] [CrossRef]
- Božič, A.; Podgornik, R. Evolutionary changes in the number of dissociable amino acids on spike proteins and nucleoproteins of SARS-CoV-2 variants. Virus Evol. 2023, 9, vead040. [Google Scholar] [CrossRef] [PubMed]
- Javidpour, L.; Božič, A.; Naji, A.; Podgornik, R. Electrostatic interactions between the SARS-CoV-2 virus and a charged electret fibre. Soft Matter. 2021, 17, 4296–4303. [Google Scholar] [CrossRef]
- Arbeitman, C.R.; Rojas, P.; Ojeda-May, P.; Garcia, M.E. The SARS-CoV-2 spike protein is vulnerable to moderate electric fields. Nat Commun. 2021, 12, 5407. [Google Scholar] [CrossRef]
- Lu, X.; Chen, Y.; Zhang, G. Functional evolution of SARS-CoV-2 spike protein: Maintaining wide host spectrum and enhancing infectivity via surface charge of spike protein. Comput Struct Biotechnol J. 2023, 21, 2068–2074. [Google Scholar] [CrossRef] [PubMed]
- Adhikari, P.; Jawad, B.; Podgornik, R.; Ching, W.Y. Mutations of Omicron variant at the interface of the receptor domain motif and human angiotensin-converting enzyme-2. Int J Mol Sci. 2022, 23, 2870. [Google Scholar] [CrossRef] [PubMed]
- Barroso da Silva, F.L.; Giron, C.C.; Laaksonen, A. Electrostatic features for the receptor binding domain of SARS-CoV-2 wildtype and its variants. Compass to the severity of the future variants with the charge-rule. J Phys Chem B. 2022, 126, 6835–6852. [Google Scholar] [CrossRef]
- Nie, C.; Sahoo, A.K.; Netz, R.R.; Herrmann, A.; Ballauff, M.; Haag, R. Charge matters: Mutations in Omicron variant favor binding to cells. Chembiochem. 2022, 23, e202100681. [Google Scholar] [CrossRef]
- Kim, S.H.; Kearns, F.L.; Rosenfeld, M.A.; Votapka, L.; Casalino, L.; Papanikolas, M.; Amaro, R.E. SARS-CoV-2 evolved variants optimize binding to cellular glycocalyx. Cell Rep Phys Sci. 2023, 4, 101346. [Google Scholar] [CrossRef]
- Gangavrapu, K.; Latif, A.A.; Mullen, J.L.; Alkuzweny, M.; Hufbauer, E.; Tsueng, G.; Haag, E.; Zeller, M.; et al. Outbreak.info genomic reports: Scalable and dynamic surveillance of SARS-CoV-2 variants and mutations. Nat Methods. 2023, 20, 512–522. [Google Scholar] [CrossRef] [PubMed]
- Starr, T.N.; Greaney, A.J.; Hilton, S.K.; Ellis, D.; Crawford, K.H.D.; Dingens, A.S.; Navarro, M.J.; Bowen, J.E.; Tortorici, M.A.; Walls, A.C.; et al. Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding. Cell. 2020, 182, 1295–1310. [Google Scholar] [CrossRef]
- Starr, T.N.; Greaney, A.J.; Stewart, C.M.; Walls, A.C.; Hannon, W.W.; Veesler, D.; Bloom, J.D. Deep mutational scans for ACE2 binding, RBD expression, and antibody escape in the SARS-CoV-2 Omicron BA.1 and BA.2 receptor-binding domains. PLoS Pathog. 2022, 18, e1010951. [Google Scholar] [CrossRef]
- Padhiar, N.H.; Ghazanchyan, T.; Fumagalli, S.E.; DiCuccio, M.; Cohen, G.; Ginzburg, A.; Rikshpun, B.; Klein, A.; Santana-Quintero, L.; Smith, S.; Komar, A. A.; Kimchi-Sarfaty, C. SARS-CoV-2 CoCoPUTs: Analyzing GISAID and NCBI data to obtain codon statistics, mutations, and free energy over a multiyear period. Virus Evol. 2025, 11, veae115. [Google Scholar] [CrossRef]
- Luo, L.; Lv, J. An evolutionary theory on virus mutation in COVID-19. Virus Res. 2024, 344, 199358. [Google Scholar] [CrossRef]
- Nei, M.; Kumar, S. Molecular Evolution and Phylogenetics; Oxford University Press: 198 Madison Avenue, New York, 2000; pp. 87–113. [Google Scholar]
- R Core Team. R: A Language and Environment for Statistical Computing. 2025, R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org.
- Quaranta, M.; Via, A.; Pascarella, S. Structural analysis of the SARS-CoV-2 spike N-terminal domain across wild-type and recent variants: a comparative study. Proteins 2025, 10.1002/prot.26855. [CrossRef]
- Ma, W.; Fu, H.; Jian, F.; Cao, Y.; Li, M. Immune evasion and ACE2 binding affinity contribute to SARS-CoV-2 evolution. Nat Ecol Evol. 2023, 7, 1457–1466. [Google Scholar] [CrossRef] [PubMed]
- Caputo, E.; Mandrich, L. Structural and phylogenetic analysis of SARS-CoV-2 spike glycoprotein from the most widespread variants. Life. 2022, 12, 1245. [Google Scholar] [CrossRef] [PubMed]
- Yuan, M.; Huang, D.; Lee, C.D.; Wu, N.C.; Jackson, A.M.; Zhu, X.; Liu, H.; Peng, L.; van Gils, M.J.; Sanders, R.W.; et al. Structural and functional ramifications of antigenic drift in recent SARS-CoV-2 variants. Science. 2021, 373, 818–823. [Google Scholar] [CrossRef] [PubMed]
- Xue, S.; Han, Y.; Wu, F.; Wang, Q. Mutations in the SARS-CoV-2 spike receptor binding domain and their delicate balance between ACE2 affinity and antibody evasion. Protein Cell. 2024, 15, 403–418. [Google Scholar] [CrossRef]
- Yang, S.; Yu, Y.; Xu, Y.; Jian, F.; Song, W.; Yisimayi, A.; Wang, P.; Wang, J.; Liu, J.; Yu, L.; et al. Fast evolution of SARS-CoV-2 BA.2.86 to JN.1 under heavy immune pressure. Lancet Infect Dis. 2024, 24, e70–e72. [Google Scholar] [CrossRef]
- Peng, C.; Zhu, Z.; Shi, Y.; Wang, X.; Mu, K.; Yang, Y.; Zhang, X.; Xu, Z.; Zhu, W. Computational insights into the conformational accessibility and binding strength of SARS-CoV-2 spike protein to human angiotensin-converting enzyme 2. J Phys Chem Lett. 2020, 11, 10482–10488. [Google Scholar] [CrossRef]






| Macro-lineage | Variant | NMS1 on spike protein | Lineage divergence | Earliest date2 | Charge increment | |||
| B-RBD3 | RBD4 | A-RBD5 | Spike | |||||
| N-lineage | B.1 | 1 | 7 | 2020.01.15 | — | — | 1 | 1 |
| B.1.617.2 | 9 | 17 | 2020.10.15 | 1 | 2 | 3 | 6 | |
| B.1.351 | 10 | 19 | 2020.07.09 | 2 | 0 | 1 | 3 | |
| B.1.525 | 9 | 25 | 2020.12.11 | 1 | 2 | 2 | 5 | |
| B.1.1.7 | 10 | 26 | 2020.05.14 | 1 | -1 | 3 | 3 | |
| B.1.526 | 4 | 27 | 2020.11.15 | 1 | — | 1 | 2 | |
| B.1.429 | 4 | 28 | 2020.07.06 | 0 | 1 | 1 | 2 | |
| AY.4 | 10 | 30 | 2020.10.27 | 1 | 2 | 3 | 6 | |
| AY.103 | 9 | 30 | 2021.01.02 | 1 | 2 | 3 | 6 | |
| P.1 | 12 | 32 | 2020.09.11 | -1 | 0 | -1 | -2 | |
| B.1.351.3 | 11 | 33 | 2020.11.04 | 2 | 0 | 1 | 3 | |
| P.1.14 | 12 | 34 | 2020.11.03 | -1 | 0 | -1 | -2 | |
| AZ.2 | 6 | 36 | 2021.02.05 | 1 | 2 | 4 | 7 | |
| B.1.1.529 | 7 | 64 | 2021.04.15 | — | 1 | 2 | 3 | |
| O-lineage | BA.1 | 33 | 51 | 2021.01.27 | 0 | 4 | 6 | 10 |
| BA.1.1.2 | 37 | 57 | 2021.11.23 | 0 | 4 | 6 | 10 | |
| BA.2 | 31 | 59 | 2021.03.25 | -1 | 4 | 4 | 7 | |
| BA.5 | 34 | 62 | 2021.12.09 | -2 | 4 | 4 | 6 | |
| BA.5.5 | 35 | 63 | 2022.01.10 | -2 | 4 | 4 | 6 | |
| BA.4 | 34 | 68 | 2022.01.06 | -2 | 4 | 4 | 6 | |
| BF.7 | 35 | 70 | 2022.01.24 | -2 | 3 | 4 | 5 | |
| BA.2.12.1 | 33 | 71 | 2021.09.28 | -1 | 4 | 4 | 7 | |
| BA.4.6 | 36 | 72 | 2022.01.03 | -2 | 3 | 4 | 5 | |
| BQ.1 | 36 | 72 | 2022.01.11 | -2 | 4 | 4 | 6 | |
| DY.1 | 35 | 74 | 2022.11.25 | -2 | 4 | 4 | 6 | |
| BA.4.1 | 35 | 74 | 2021.12.14 | -2 | 4 | 4 | 6 | |
| BA.2.75 | 30 | 86 | 2021.12.31 | 0 | 6 | 4 | 10 | |
| CH.1.1 | 41 | 89 | 2022.05.12 | -2 | 5 | 3 | 6 | |
| XBB.1.5 | 42 | 91 | 2022.06.12 | -3 | 5 | 4 | 6 | |
| BN.1.1 | 41 | 93 | 2022.07.27 | -2 | 4 | 4 | 6 | |
| XBB.1.9 | 40 | 94 | 2022.10.12 | -3 | 5 | 4 | 6 | |
| XBB.2.3 | 43 | 95 | 2022.12.21 | -2 | 5 | 4 | 7 | |
| BJ.1 | 36 | 96 | 2022.06.15 | -3 | 7 | 3 | 7 | |
| GE.1 | 45 | 97 | 2023.03.08 | -2 | 5 | 4 | 7 | |
| BN.1 | 40 | 98 | 2022.01.24 | -2 | 4 | 4 | 6 | |
| FL.1.5.1 | 44 | 101 | 2023.01.03 | -3 | 5 | 4 | 6 | |
| XBB.1.16 | 43 | 101 | 2023.01.04 | -2 | 5 | 4 | 7 | |
| EG.5.1 | 44 | 103 | 2023.01.31 | -2 | 5 | 3 | 6 | |
| HV.1 | 46 | 103 | 2023.01.29 | -2 | 6 | 4 | 8 | |
| JB.2 | 44 | 105 | 2023.05.29 | -3 | 6 | 4 | 7 | |
| FE.1 | 41 | 105 | 2023.01.26 | -3 | 7 | 4 | 8 | |
| HK.1 | 44 | 107 | 2023.04.12 | -2 | 5 | 4 | 7 | |
| GA.4.1 | 47 | 108 | 2023.05.09 | -3 | 4 | 6 | 7 | |
| JE.1 | 45 | 111 | 2023.08.08 | -3 | 4 | 4 | 5 | |
| XBC.1.6 | 41 | 121 | 2023.02.10 | -1 | 3 | 4 | 6 | |
| DV.7.1 | 45 | 117 | 2023.05.29 | -3 | 4 | 4 | 5 | |
| P-lineage | BA.2.86 | 58 | 114 | 2023.03.11 | -5 | 7 | 6 | 8 |
| JN.1 | 60 | 120 | 2023.01.13 | -5 | 7 | 6 | 8 | |
| JN.1.4 | 60 | 120 | 2023.10.20 | -5 | 7 | 6 | 8 | |
| KQ.1 | 62 | 123 | 2024.01.10 | -5 | 6 | 6 | 7 | |
| KP.1 | 63 | 125 | 2024.02.01 | -5 | 6 | 6 | 7 | |
| KS.1 | 64 | 126 | 2024.02.15 | -6 | 6 | 6 | 6 | |
| KP.2 | 59 | 126 | 2024.01.02 | -5 | 6 | 6 | 7 | |
| KR.1 | 62 | 129 | 2024.02.08 | -5 | 6 | 6 | 7 | |
| XDK.1 | 55 | 131 | 2023.12.22 | -5 | 7 | 6 | 8 | |
| XEC | 65 | 154 | 2024.06.28 | -5 | 6 | 6 | 7 | |
| XDV.1 | 61 | 161 | 2024.02.26 | -5 | 7 | 6 | 8 | |
| Variable | Median | ||
| N-lineage | O-lineage | P-lineage | |
| Charge increment in B-RBD | 1.0 | -2.0 | -5.0 |
| Charge increment in RBD | 0.5 | 4.0 | 6.0 |
| Charge increment at the RBD-ACE2 binding interface | 0.0 | 1.0 | 1.0 |
| Charge increment in A-RBD | 1.5 | 4.0 | 6.0 |
| RBD-ACE2 binding affinity | 0.02 | -0.12 | -0.12 |
| Immune escape | 0.0028 | 0.034 | 0.031 |
| Expression level of RBD | 0.0 | -0.21 | -0.15 |
| Lineage | CAI | ENC | GC1 | GC2 | GC3 | GC |
| All | -0.04 (0.78) | 0.49 (<0.01) | -0.08 (0.58) | -0.60 (<0.01) | 0.02 (0.90) | -0.40 (<0.01) |
| N | -0.56 (0.04) | -0.23 (0.44) | 0.74 (<0.01) | 0.30 (0.32) | -0.13 (0.68) | 0.65 (0.02) |
| O | 0.01 (0.95) | 0.10 (0.58) | -0.13 (0.48) | -0.50 (<0.01) | -0.19 (0.27) | -0.49 (<0.01) |
| P | -0.22 (0.54) | -0.62 (0.05) | 0.37 (0.29) | -0.19 (0.59) | -0.69 (0.03) | -0.44 (0.20) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).