Submitted:
08 April 2026
Posted:
14 April 2026
You are already at the latest version
Abstract
Keywords:
1. Motivation & Introduction

2. Point-Set Representation
2.1. Exon-Intron Junctions

2.2. Branch Points
2.3. Normal Exon-Exon Junctions vs. Cancer Fusions


2.4. Point-Sets to Visualize Correlations
3. Discussion
References
- M. Goldberg, J. Fischer, L. Hood, L. Hartwell, C. Aquardro, L. Silver, and A. E. Reynolds. Genetics:From Genes to Genomes, 7th Edition. McGraw-Hill Publishing, 2021.
- X. Roca, A. R. Krainer, and I. C. Eperon. Pick one, but be quick: 59 splice sites and the problems of too many choices. Genes and Development, 27:129–144, 2013. [CrossRef]
- T. D. Schneider and R. M. Stephens. Sequence logos: a new way to display consensus sequences. Nucleic Acids Research, 18(20):6097–6100, 1990. [CrossRef]
- I. B. Rogozin and L. Milanesi. Analysis of donor splice sites in different eukaryotic organisms. Journal of Molecular Evolution, 45:50–59, 1997. [CrossRef]
- C. R. Sibley, L. Blazquez, and J. Ule. Lessons from non-canonical splicing. Nature Review Genetics, 17(7):407–421, 2016. [CrossRef]
- S. Hümmer, S. Borao, s. Guerra-Moreno, L. Cozzuto, E. Hidalgo, and J. Ayte. Cross talk between the upstream exon-intron junction and Prp2 facilitates splicing of non-consensus introns. Cell Reports, 37:109893, 2021. [CrossRef]
- C. T. Workman, Y. Yin, D. L. Corcoran, T. Ideker, G. D. Stormo, and P. V. Benos. enoLOGOS: a versatile web tool for energy normalized sequence logos. Nucleic Acids Research, 33:W389–W392, 2005. [CrossRef]
- Y.-K. Yu, J. A. Capra, A. Stojmirović, D. Landsman, and S. F. Altschul. Log-odds sequence logos. Bioinformatics, 31:324–331, 2014.
- J. Sponer, J. Leszczynski, and P. Hobza. Nature of nucleic acid-base stacking: Nonempirical ab initio and empirical potential characterization of 10 stacked base dimers. comparison of stacked and h-bonded base pairs. Journal of Chemical Physics, 100:5590–5596, 1996. [CrossRef]
- P. Jurecka, J. Sponer, J. Cerny, and P. Hobza. Benchmark database of accurate (MP2 and CCSD(T) complete basis set limit) interaction energies of small model complexes, DNA base pairs, and amino acid pairs. Physical Chemistry Chemical Physics, 8:1985–1993, 2006. [CrossRef]
- R. Olivia, L. Cavallo, and A. Tramontano. Accurate energies of hydrogen bonded nucleic acid base pairs and triplets in tRNA tertiary interactions. Nucleic Acids Research, 34:865–879, 2006. [CrossRef]
- C. A. Johnson, R. J. Bloomingdale, V. E. Ponnusamy, C. A. Tillinghast, B. M. Znosko, and M. Lewis. A computational model for predicting experimental RNA and DNA nearest-neighbor free energy rankings. Journal of Chemical Physics, 115:9244–9251, 2011. [CrossRef]
- E. A. Jolley, M. Lewis, and B. M. Znosko. A computational model for predicting experimental RNA nearest-neighbor free energy rankings: Inosine–Uridine pairs. Chemical Physics Letters, 639:157–160, 2015. [CrossRef]
- S. C. Leon, M. Prentiss, and M. Fyta. Binding energies of nucleobase complexes: Relevance to homology recognition of DNA. Physical Review E, 93:06210, 2016. [CrossRef]
- M.C. Hopfinger, C. C. Kirkpatrick, and B. M. Znosko. Predictions and analyses of RNA nearest neighbor parameters for modified nucleotides. Nucleic Acids Research, 48:8901–8913, 2020. [CrossRef]
- M.T. Parker, B.K. Soanes, J. Kusakina, A. Larrieu, and K. et al. Knop. m6A modification of U6 snRNA modulates usage of two major classes of pre-mRNA 5′ splice site. eLife, 11:e78808, 2022.
- M. C. Wahl, C. L. Will, and R. Lührmann. The spliceosome: design principles of a dynamic RNP machine. Cell, 136(4):701–718, 2009. [CrossRef]
- C. L Will and R. Lührmann. Spliceosome structure and function. Cold Spring Harbor Perspectives in Biology, 3(7):a003707, 2011.
- Klemens J. Hertel. Spliceosomal Pre-mRNA Splicing Methods and Protocols. Methods in Molecular Biology, 1126. Humana Press, Totowa, NJ, 1st ed. 2014. edition, 2014.
- A. G. Matera and Z. Wang. A day in the life of the spliceosome. Nature Reviews Molecular Cell Biology, 15(2):108–121, 2014.
- E. C. Merkhofer, P. Hu, and T. L. Johnson. Introduction to co-transcriptional RNA splicing. Spliceosomal Pre-mRNA Splicing: Methods and Protocols, pages 83–96, 2014.
- W. Gilbert. Why genes in pieces? Nature, 271(5645):501–501, 1978. [CrossRef]
- N. K. Kadri, X. M. Mapel, and H. Pausch. The intronic branch point sequence is under strong evolutionary constraint in the bovine and human genome. Communications Biology, 4(1):1206, 2021. [CrossRef]
- E. L Lasda and T. Blumenthal. Trans-splicing. Wiley Interdisciplinary Reviews: RNA, 2(3):417–434, 2011.
- M. Hiller, Z. Zhang, R. Backofen, and S. Stamm. Pre-mRNA secondary structures influence exon recognition. PLoS Genetics, 3(11):e204, 2007. [CrossRef]
- M. Long and M. Deutsch. Intron exon structures of eukaryotic model organisms. Nucleic Acids Research, 27(15):3219–3228, 1999. [CrossRef]
- L. Zhu, Y. Zhang, W. Zhang, S. Yang, J.-Q. Chen, and D. Tian. Patterns of exon-intron architecture variation of genes in eukaryotic genomes. BMC Genomics, 10:1–12, 2009. [CrossRef]
- Y. Wang, J. Liu, B. O. Huang, Y.-M. Xu, J. Li, L.-F. Huang, J. Lin, J. Zhang, Q.-H. Min, and W.-M. et al. Yang. Mechanism of alternative splicing and its regulation. Biomedical Reports, 3(2):152–158, 2015.
- N. Stepankiw, M. Raghavan, E. A. Fogarty, A. Grimson, and J. A. Pleiss. Widespread alternative and aberrant splicing revealed by lariat sequencing. Nucleic Acids Research, 43(17):8488–8501, 2015. [CrossRef]
- J. Ule and B. J. Blencowe. Alternative splicing regulatory networks: Functions, mechanisms, and evolution. Molecular Cell, 76:329–345, 2019. [CrossRef]
- L. E. Marasco and A. R. Kornblihtt. The physiology of alternative splicing. Nature Reviews Molecular Cell Biology, 24:242–254, 2023.
- C. E. Walsh. New paradigm for gene transfer: RNA trans-splicing and small interfering RNA as therapeutic strategies. Semin. Hematol., 41:297–302, 2004. [CrossRef]
- Y. Yang and C. E. Walsh. Spliceosome-mediated RNA trans-splicing. Molecular Therapy, 12(6):1006–1012, 2005. [CrossRef]
- T. A. Cooper, L. Wan, and G. Dreyfuss. RNA and disease. Cell, 136(4):777–793, 2009.
- C. J. Mcmanus, M. O. Duff, J. Eipper-Mains, and B. R. Graveley. Global analysis of trans-splicing in Drosophila. PNAS, 107(29):12975–12979, 2010. [CrossRef]
- M. M. Scotti and M. S. Swanson. RNA mis-splicing in disease. Nature Reviews Genetics, 17(1):19–32, 2016. [CrossRef]
- Wei Jiang and Liang Chen. Alternative splicing: Human disease and quantitative analysis from high-throughput sequencing. Computational and Structural Biotechnology Journal, 19:183–195, 2021. [CrossRef]
- R. Aebersold, J. N. Agar, I. J. Amster, M. S. Baker, C. R. Bertozzi, E. S. Boja, C. E. Costello, B. F. Cravatt, C. Fenselau, and B. A . et al. Garcia. How many human proteoforms are there? Nature Chemical Biology, 14(3):206–214, 2018. [CrossRef]
- M. G. Reese, F. H. Eeckman, D. Kulp, and D. Haussler. Improved splice site detection in genie. Journal of Computational Biology, 4(3):311–323, 1997. [CrossRef]
- J. Zuallaert, F. Godin, M. Kim, A. Soete, Y. Saeys, and W. De Nerve. SpliceRover: interpretable convolutional neural networks for improved splice site prediction. Bioinformatics, 34:4180–4188, 2018. [CrossRef]
- K. Jaganathan, S. K. Panagiotopoulou, J. F. McRae, and S. F. et al. Darbandi. Predicting splicing from primary sequence with deep learning. Cell, 176:535–548, 2019. [CrossRef]
- C. Janiesch, P. Zschech, and K. Heinrich. Machine learning and deep learning. Electronic Markets, 31:685–695, 2021.
- W. Jang, J. Park, H. Chae, and M. Kim. Comparison of in silico tools for splice-altering variant prediction using established spliceogenic variants: An end-users point of view. International Journal of Genomics, 2022:5265686, 2022. [CrossRef]
- C. van der Feltz and A. A. Hoskins. Structural and functional modularity of the U2 snRNP in pre-mRNA splicing. Critical Reviews in Biochemistry and Molecular Biology, 54:443–365, 2019.
- C. Burge and S. Karlin. Prediction of complete gene structures in human genomic DNA. Journal of Molecular Biology, 268:78–94, 1997.
- I. Carmel, S. Tai, I. Vig, and G. Ast. Comparative analysis detects dependencies among the 5′ splice-site positions. RNA, 10:828–840, 2004. [CrossRef]
- O.V. Artemyeva-Isman and A.C.G. Porter. Predicting splicing from primary sequence with deep learning. Fronters in Genetics, 12:676971, 2021.
- E. Speakman and G. H. Gunaratne. On a kneading theory for gene-splicing. CHAOS, 34:043125, 2024. [CrossRef]
- Y. Iida and F. Sasaki. Recognition patterns for exon-intron junctions in higher organisms as revealed by a computer search. Journal of Biochemistry, 94:1731–1738, 1983. [CrossRef]
- M. Kramárek, P. Soucek, K. Réblova, L. K. Grodecká, and T. Freiberger. Splicing analysis of STAT3 tandem donor suggests non-canonical binding registers for U1 and U6 snRNAs. Nucleic Acids Research, 52:5959–5974, 2024. [CrossRef]
- G. Perez, G. P. Barber, A. Benet-Pages, J. Casper, H. Clawson, M. Diekhans, C. Fischer, A. S. Gonzalez, J. N.and Hinrichs, C. M. Lee, L. R. Nassar, B. J. , Raney, M. L. Speir, M. J. van Baren, C. J. Vaske, D. Haussler, W. J. Kent, and M. Haeussler. The UCSC Genome Browser database: 2025 update. Nucleic Acids Research, 53:D1243–D1249, 2025.
- M.T. Parker, S.M. Fica, and G.G. Simpson. RNA splicing: a split consensus reveals two major 5’ splice site classes. Open Biology, 15:240293, 2025. [CrossRef]
- A. Anna and G. Monika. Splicing mutations in human genetic disorders: examples, detection, and confirmation. Journal of Applied Genetics, 59:253–268, 2018. [CrossRef]
- T. R. Mercer, M. B. Clark, S. B. Andersen, M. E. Brunck, W. Haerty, J. Crawford, R. J. Taft, L. K. Nielsen, M. E. Dinger, and J. S. Mattick. Genome-wide discovery of human splicing branchpoints. Genome Research, 25:290–303, 2015. [CrossRef]
- N. A. Faustino and T. A. Cooper. Pre-mRNA splicing and human disease. Genes. Dev., 17:419–437, 2003.
- B. R. Graveley. The haplo-spliceo-transcriptome: common variations in alternative splicing in the human population. Trends in Genetics, 24:5–7, 2007. [CrossRef]
- R.-H. Fu, S.-P. Liu, H.-J. Huang, S.-J.and Chen, P.-R. Chen, Y.-H. Lin, Y.-C. Ho, W.-L. Chang, C.-H. Tsai, and W.-C. et al. Shyu. Aberrant alternative splicing events in parkinson’s disease. Cell Transplantation, 22(4):653–661, 2013. [CrossRef]
- Katarzyna Chwalenia, Loryn Facemire, and Hui Li. Chimeric rnas in cancer and normal physiology. Wiley Interdisciplinary Reviews: RNA, 8(6):e1427, 2017. [CrossRef]
- M. Montes, B. L. Sanford, D. F Comiskey, and D. S. Chandler. Rna splicing and disease: animal models to therapies. Trends in Genetics, 35:68–87, 2018. [CrossRef]
- Y. Zhang, J. Qian, C. Gu, and Y. Yang. Alternative splicing and cancer: a systematic review. Signal Transduction and Targeted Therapy, 6(1):78, 2021. [CrossRef]
- H. Sun, K. Liu, and C. Yi. Regulation and functions of non-m6A mRNA modifications. Nature Reviews Molecular Cell Biology, 24:714–731, 2023. [CrossRef]
- P. Kim, S. Yoon, N. Kim, S. Lee, M. Ko, H. Lee, H. Kang, J. Kim, and S. Lee. ChimerDB 2.0 - a knowledge-base for fusion genes updated. Nucleic Acids Research, 38:D81–D85, 2010. [CrossRef]
- P. Kim and X. Zhou. FusionGDB: fusion gene annotation DataBase. Nucleic Acids Research, 47:D994–D1004, 2019. [CrossRef]
- P. Kim, H. Tan, J. Liu, H. Lee, H. Jung, H. Kumar, and H. Zhou. FusionGDB 2.0: fusion gene annotation updates aided by deep learning. Nucleic Acids Research, 50:D1221–D1230, 2022. [CrossRef]
- R. Wan, R. Bai, X. Zhan, and Y. Shi. How is precursor messenger RNA spliced by the spliceosome? Annual Review of Biochemistry, 89:333–358, 2020. [CrossRef]
- N. H. Gehring and J.-Y. Roignant. Anything but ordinary - emerging splicing mechanisms in eukaryotic gene regulation. Trends in Genetics, 37(4):355–372, 2021. [CrossRef]
- I. Beusch, B. Rao, M. K. Studer, T. Luhovska, V. Sukyte, S. Lei, J. Oses-Prieto, E. SeGraves, A. Burlingame, S. Jonas, and H. G. Madhani. Targeted high-throughput mutagenesis of the human spliceosome reveals its in vivo operating principles. Molecular Cell, 83:2578–2594, 2023. [CrossRef]
- M. E. Rogalska, C Vivor, and J. Valcárcel. Regulation of pre-mRNA splicing: roles in physiology and disease, and therapeutic prospects. Nature Reviews Genetics, 24:251–269, 2023. [CrossRef]
- H. Shenasa and D. L. Bentley. Pre-mRNA splicing and its co-transcriptional connections. Trends in Genetics, 39(9):672–685, 2023. [CrossRef]
- X. Zhan, Y. Lu, and Y. Shi. Molecular basis for the activation of human spliceosome. Nature Communications, 15:6348–6357, 2024. [CrossRef]
- B. Mandlebrot. The Fractal Geometry of Nature. W. H. Freeman and Company, New York, 1977. [CrossRef]
- H. G. E. Hentschel and I. Procaccia. The infinite number of generalized dimensions of fractals and strange attractors. Physica D, 8:435–444, 1983. [CrossRef]
- T. C. Halsey, M. H. Jensen, L. P. Kadanoff, I. Procaccia, and B. I. Shraiman. Fractal measures and their singularities: The characterization of strange sets. Physical Review A, 33:1141–1151, 1986. [CrossRef]
- E. Speakman. Point Set Identification of Genetic Sequences. Doctoral Thesis, University of Houston, 2025.
- E. Alpaydin. Introduction to Machine Learning. The MIT Press, 2020.
- T. R. Cech. The chemistry of self-splicing RNA and RNA enzymes. Science, 236:1532–1539, 1987. [CrossRef]
- T. R. Cech. Self-splicing and enzymatic activity of an intervening sequence RNA from Tetrahymena. Bioscience Reports, 10:239–261, 1990. [CrossRef]
- A. M. Pyle. Group II intron self-splicing. Annual Reviews of Biophysics, 45:183–205, 2016. [CrossRef]




Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).