Submitted:
02 May 2025
Posted:
06 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Methodology
1.2. Manual Evaluation of Data
Dynamical generation of 4d gauge theories and gravity at low energy from the 3d ones at high energy is studied, based on the fermion condensation mechanism recently proposed by Arkani-Hamed, Cohen and Georgi. For gravity, 4d Einstein gravity is generated from the multiple copy of the 3d ones, by referring to the two form gravity. Since the 3d Einstein action without matter coupling is topological, ultraviolet divergences are less singular in our model. In the gauge model, matter fermions are introduced on the discrete lattice following Wilson. Then, the 4d gauge interactions are correctly generated from the 3d theories even in the left-right asymmetric theories of the standard model. In order for this to occur, the Higgs fields as well as the gauge fields of the extra dimension should be generated by the fermion condensates. Therefore, the generation of the 4d standard model from the multiple copy of the 3d ones is quite promising. To solve the doubling problem in the weak interaction sector, two kinds of discrete lattices have to be introduced separately for L- and R-handed sectors, and the two types of Higgs fields should be generated.
In this paper, we investigate the relationship between 4d gauge theory and gravity that arises from 3d ones at high energy. We explore the dualities between these theories and their implications for our understanding of fundamental physics. Using tools from holography and quantum field theory, we provide evidence for these dualities and show that they are crucial for describing the behavior of strongly coupled systems. Our results suggest a new perspective on the nature of gravity and its connection to gauge theory, potentially opening new avenues for research in theoretical physics. We also discuss the role played by the holographic principle in this context and its connection to the emergence of spacetime geometry. Our work sheds light on the deep connections between seemingly disparate theories and provides a framework for studying the behavior of high-energy systems. Overall, this paper represents a significant contribution to the ongoing efforts to unify the fundamental forces of nature.
2. Vietoris Rips-Based Evaluation
2.1. Text Embeddings
2.2. Metric Selection
2.2.1. Cosine Similarity and Angular Distance
2.2.2. Experimental Composite Metric
2.2.3. Smoothing
2.3. Method and Implementation
3. Results
3.1. Metric Values
3.2. Simplicial Complex Outputs and Their Transformations
3.2.1. Persistence Diagrams
3.2.2. Persistence Images
3.3. Statistical Modeling and Test Case Analysis
3.3.1. Regression Models
- A linear model does capture the statistical relationship between the topological structures of human-created and AI-generated texts under the chosen embeddings and metrics, and
- Overlap in the bell-shaped PI representations of a homology for the two classes probably corresponds to errors in classification for any model
- No assertion can be made, based on these results, about the effectiveness of a linear model on similar data under different choices of embeddings and metrics.
3.3.2. Binning


3.3.3. Test Case Classification Results
4. Discussion
5. Transformer Investigation

5.1. Why Do We Care About Embedding?


5.2. Challenges
5.2.1. Text Vector Representation and Tool Effectiveness

5.2.2. Non English Text
5.2.3. Shorter Text (1000 Characters or Less) Fails Detection - Analysis of Features Using Explanable Models from SHAP [26]

5.2.4. Fine-Tuned Generative Models
5.2.5. Curse of Recursion

5.3. Methods - Coaxing Data to Our Small Complexes

5.4. Alpha Complexes
5.5. Alpha Complex from Distance Matrix
5.6. Theoretical Underpinnings
5.7. Locally Lipschitz
5.8. Stability

5.9. Computational Complexity
5.10. Observations
5.10.1. Vietoris-Rips Does Not Capture Scale Accurately
| Filtrations | Vietoris-Rips | Alpha |
| 1.5 | h1 = 0 | h1 = 0 |
| 2.5 | h1 = 5 | h1 = 21 |
| 3.5 | h1 = 0 | h1 = 75 |
| 3.9 | h1 = 0 | h1 = 12 |
| 5.0 | h1 = 0 | h1 = 3 |
5.10.2. Vietoris-Rips Grows Exponentially With Points
5.10.3. Reductions Play a Key Role In Reducing Complexity
5.11. Implementation

5.11.1. Computation over Z2
6. Results
6.1. Working Transformer Model





7. Conclusions
Appendix A. Embeddings
Appendix A.1. Bag Of Words


Appendix A.2. SBERT


Appendix B. Persistence for Vietoris-Rips
Appendix B.1. Persistence Diagrams




Appendix B.2. Persistence Images


References
- Gerlich, M. AI Tools in Society: Impacts on Cognitive Offloading and the Future of Critical Thinking. Societies 2025, 15. [Google Scholar] [CrossRef]
- Bataineh, A.A.; Sickler, R.; Kurcz, K.; Pedersen, K. AI-Generated vs. Human Text: Introducing a New Dataset for Benchmarking and Analysis. IEEE Transactions on Artificial Intelligence 2025, pp. 1–11. [CrossRef]
- Ma, Y.; Liu, J.; Yi, F.; Cheng, Q.; Huang, Y.; Lu, W.; Liu, X. AI vs. Human – Differentiation Analysis of Scientific Content Generation, 2023, [arXiv:cs.CL/2301.10416].
- Draganov, O.; Skiena, S. The Shape of Word Embeddings: Quantifying Non-Isometry with Topological Data Analysis. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024. Association for Computational Linguistics; 2024; pp. 12080–12099. [Google Scholar] [CrossRef]
- Zhu, X. Persistent homology: An introduction and a new text representation for natural language processing. 08 2013, pp. 1953–1959.
- Uchendu, A.; Le, T. Unveiling Topological Structures in Text: A Comprehensive Survey of Topological Data Analysis Applications in NLP, 2024, [arXiv:cs.CL/2411.10298].
- Solberg. Text Classification via Topological Data Analysis. Master’s thesis, 2023.
- Tymochko, S.; Chaput, J.; Doster, T.; Purvine, E.; Warley, J.; Emerson, T. Con Connections: Detecting Fraud from Abstracts using Topological Data Analysis. In Proceedings of the 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), 2021, pp. 403–408. [CrossRef]
- Nicolai Thorer Sivesind, A. Human-vs-Machine. https://huggingface.co/datasets/NicolaiSivesind/human-vs-machine, 2023.
- Fröhling, L.; Zubiaga, A. Feature-based detection of automated language models: tackling GPT-2, GPT-3 and Grover. PeerJ Computer Science 2021, 7, e443. [Google Scholar] [CrossRef] [PubMed]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers); Burstein, J.; Doran, C.; Solorio, T., Eds., Minneapolis, Minnesota, 2019; pp. 4171–4186. [CrossRef]
- Reimers, N.; Gurevych, I. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks, 2019, [arXiv:cs.CL/1908.10084].
- sentence-transformers/all-MiniLM-L6-v2. https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2.
- Strehl, A.; Ghosh, J.; Mooney, R. Impact of Similarity Measures on Web-page Clustering. In Proceedings of the Proceedings of the Workshop on Artificial Intelligence for Web Search (AAAI 2000). AAAI Press, 2000.
- Deisenroth, M.P.; Faisal, A.A.; Ong, C.S. Mathematics for Machine Learning; Cambridge University Press, 2020.
- Gholizadeh, S.; Seyeditabari, A.; Zadrozny, W. A Novel Method of Extracting Topological Features from Word Embeddings. CoRR 2020, abs/2003.13074, [2003.13074].
- Zomorodian, A.J. Spaces and Filtrations. In Topology for Computing; Cambridge Monographs on Applied and Computational Mathematics, Cambridge University Press, 2005; p. 13–40.
- Edelsbrunner, H.; Harer, J. Computational topology: An introduction; American Mathematical Society, 2010.
- Tralie, C.; Saul, N.; Bar-On, R. Ripser.py: A Lean Persistent Homology Library for Python. The Journal of Open Source Software 2018, 3, 925. [Google Scholar] [CrossRef]
- Cohen-Steiner, Edelsbrunner, H. Stability of Persistence Diagrams. Discrete & Computational Geometry 2006, 37, 103–120. [Google Scholar] [CrossRef]
- Adams, e.a. Persistence Images: A Stable Vector Representation of Persistent Homology. Journal of Machine Learning Research 2017, 18, 1–35. [Google Scholar]
- Bethany, M.; Wherry, B.; Bethany, E.; Vishwamitra, N.; Rios, A.; Najafirad, P. Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text. In Proceedings of the 33rd USENIX Security Symposium (USENIX Security 24), Philadelphia, PA, 2024; pp. 5805–5822.
- Stephen Fitz, Peter Romero, J.J.S. Hidden Holes topological aspects of language models, 2024, [arXiv:cs.CL/2406.05798v1].
- Luis Balderas 1, M.L.; Ben´ıtez, J.M. Can persistent homology whiten Transformer-based black-box models? A case study on BERT compression, 2023, [arXiv:cs.LG/2312.10702].
- Nanda, V.; Sazdanovi´c, R. Simplicial Models and Topological Inference in Biological Systems. Discrete and Topological Models in Molecular Biology, Springer-Verlag 2014. [CrossRef]
- Lundberg, S.; Lee, S.I. A Unified Approach to Interpreting Model Predictions, 2017, [arXiv:cs.AI/1705.07874].
- Shumailov, I.; Shumaylov, Z.; Zhao, Y.; Gal, Y.; Papernot, N.; Anderson, R. The Curse of Recursion: Training on Generated Data Makes Models Forget, 2024, [arXiv:cs.LG/2305.17493].
- Liu, Z.; Jin, W.; Mu, Y. Subspace embedding for classification. Neural Computing and Applications 2022, 34, 18407–18420. [Google Scholar] [CrossRef]
- Clarkson, K.L.; Woodruff, D.P. Low Rank Approximation and Regression in Input Sparsity Time, 2013, [arXiv:cs.DS/1207.6365].
- Bauer, U. Ripser: efficient computation of Vietoris–Rips persistence barcodes. Journal of Applied and Computational Topology 2021, 5, 391–423. [Google Scholar] [CrossRef]
- Erik Carlsson, J.C. Computing the alpha complex using dual active set quadratic programming. Nature Sci Rep 2024, 14. [Google Scholar] [CrossRef] [PubMed]
- Talha Bin Masood a, Tathagata Ray b, V.N.c. Parallel computation of alpha complexes for biomolecules. Elsevier Science Direct 2020. [CrossRef]
- Ulrich Bauer a, Michael Kerber b, F.R.a.A.R.a. A unified view on the functorial nerve theorem and its variations. Expositiones Mathematicae 2023, 41. [CrossRef]
- Hurtado, F., N.M..U.J. Flipping edges in triangulations. In Proceedings of the 12th Annual Symposium on Computational Geometry 1996. https://doi.org/https://dl.acm.org/doi/pdf/10.1145/237218.237367.
- Bauer, U. Ripser: efficient computation of Vietoris-Rips persistence barcodes. arXiv:1908.02518 2019. [CrossRef]





| Type | AI | Human | AI | Human | AI | Human |
| BoW | 7.861 | 7.555 | 0.869 | 0.883 | 0.043 | 0.057 |
| BoW, smoothed | 7.861 | 7.557 | 0.982 | 0.874 | 0.021 | 0.019 |
| SBERT | 7.86 | 7.555 | 0.814 | 1.086 | 0.032 | 0.053 |
| SBERT, smoothed | 7.861 | 7.558 | 1.102 | 0.992 | 0.016 | 0.032 |
| Composite, | 8.358 | 8.184 | 0.847 | 1.06 | 0.026 | 0.062 |
| Composite, | 8.358 | 8.184 | 0.882 | 1.038 | 0.042 | 0.055 |
| Composite, | 8.358 | 8.184 | 0.961 | 0.976 | 0.05 | 0.059 |
| H0 | H1 | H2 | ||||
| Type | Linear | Binning | Linear | Binning | Linear | Binning |
| BoW | 0.6497 | 0.6078 | 0.5359 | 0.5359 | 0.5359 | 0.4671 |
| BoW, smoothed | 0.5688 | 0.5494 | 0.5329 | 0.5329 | 0.5359 | 0.4701 |
| SBERT | 0.7814 | 0.7171 | 0.5479 | 0.5659 | 0.5359 | 0.4641 |
| SBERT, smoothed | 0.7485 | 0.6617 | 0.6108 | 0.7036 | 0.5359 | 0.4551 |
| Composite, | 0.705 | 0.665 | 0.5392 | 0.7010 | 0.6875 | 0.5 |
| Composite, | 0.6725 | 0.6725 | 0.5121 | 0.6976 | 0.6111 | 0.7778 |
| Composite, | 0.6213 | 0.5913 | 0.4975 | 0.6281 | 0.5 | 0.6667 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).