Submitted:
06 February 2026
Posted:
09 February 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. The DEA Algorithm
- Expansion: The algorithm identifies specific “pivot terms” (nuclei) within the text. It expands these nuclei by capturing their immediate semantic neighbors, effectively tracing the “ripples” of meaning as they propagate through the document’s propositional structure.
- Extraction: By applying the logarithmic limit , the algorithm prunes the expansion, filtering out secondary noise and isolating only the most salient thematic basins. This stage enforces a topological economy, ensuring that the resulting dimensions represent the primary logical drivers rather than statistical accidents.
- Embedding: Finally, every word in the corpus is projected onto these discovered basins. A term’s identity is defined by how its semantic mass is distributed across the clusters. This produces a vector representation where the position expresses the functional role within the corpus.
2.1. The Words-Proposition Correspondence
- The set of propositions in the corpus containing a word w is defined as:
- Given a set of propositions S, the set of words contained within propositions of S is:
- The expansion function identifies a superset of propositions that includes a given set S of propositions:
2.2. The Stochastic Contextual Incidence Matrix
-
Given a word , its occurrence in the corpus is given by the i-th row vector of the matrix , which projects the word on the propositions:This defines the "weight" of each word within the proposition, rather than a simple binary membership.
-
For any given proposition , its semantic composition is defined by the j-th row vector of the dual mapping matrix :Each component represents the relative weight of term in defining the meaning of proposition . This identifies the "semantic carriers" that sustain the local information density.
2.3. The Extract Operation
2.4. The Clustering Process
2.5. Embedding Vector Definition
3. Experimental Results
| 1. | The title of this chapter is Fragmentation and Wholeness. |
| 2. | It is especially important to consider this question today. |
| 3. | Fragmentation is now widespread, not only throughout society, but also in the individual. |
| 4. | This is leading to a kind of general confusion of the mind. |
| 5. | It creates an endless series of problems. |
| 6. | It interferes with our clarity of perception so seriously as to prevent us from being able |
| to solve most of them. | |
| 7. | Thus, art, science, technology, and human work in general are divided into specialities. |
| 8. | Each is considered separate from the others. |
| 9. | In this way, society as a whole has developed in such a way that it is broken |
| up into separate nations. | |
| 10. | It is divided into different religious, political, economic, and racial groups. |
| Step | Seed Word () | S0 Size | Relevant Words | Final Cluster Size |
|---|---|---|---|---|
| 1 | order | 42 | "implicate", "explicate" | 84 props |
| 2 | thought | 38 | "fragment", "division" | 62 props |
| 3 | movement | 22 | "flow", "holomovement" | 45 props |
| 4 | perception | 15 | "intelligence", "observer" | 31 props |
| Step | Nucleus Term | Neighbors Extracted | Logical Weight |
|---|---|---|---|
| 0 | Order | [Initial Seed] | 1.000 |
| 1 | Order → | Structure, Arrangement, Form | 0.842 |
| 2 | Structure → | Whole, Implicate, Enfolded | 0.715 |
| 3 | Whole → | Holomovement, Totality, Flow | 0.620 |
| 4 | [Convergence] | Unbroken, Seamless, Continuity | 0.485 |
| ine Iteration | Set | Proposition ID Sets | Pivot / Connection |
|---|---|---|---|
| Seed | {1, 6, 15, 22, 45, 89, 112, 156, 201, 245} | order | |
| Expand 1 | {1, 2, 6, 9, 15, 22, 33, 45, 50, 89, 112, 156, 178, 201, 245, 290} | implicate, explicate, reality | |
| Expand 2 | {1, 2, 3, 6, 9, 12, 15, 22, 33, 45, 50, 77, 89, 112, 140, 156, 201, 245, 290, 299} | holomovement, manifest, flow | |
| Final | {Total 84 propositions; ID range: 1–299} | Saturated nucleus |
| Iteration | Symbol | Proposition ID Sets | Logic / Connection |
|---|---|---|---|
| Seed | {42, 55, 67, 120, 134, 180, 210, 255, 278, 300} | Pivot: thought | |
| Expand 1 | {42, 43, 55, 56, 67, 88, 120, 134, 180, 210, 255, 278, 292, 300} | fragment, division | |
| Expand 2 | {10, 42, 43, 55, 56, 67, 72, 88, 120, 134, 180, 195, 210, 255, 278, 292, 300} | memory, condition | |
| Final | {Total 62 props; IDs 10–300} | Saturated nucleus |
| Iteration | Symbol | Proposition ID Sets | Logic / Connection |
|---|---|---|---|
| Seed | {2, 18, 35, 108, 150, 167, 212, 233, 260, 285} | Pivot: movement | |
| Expand 1 | {2, 4, 18, 35, 90, 108, 110, 150, 167, 212, 213, 233, 260, 285} | flow, holomovement | |
| Expand 2 | {2, 4, 18, 35, 75, 90, 108, 110, 122, 150, 167, 212, 213, 233, 260, 285, 289} | continuous, flux | |
| Final | {Total 45 props; IDs 2–289} | Saturated nucleus |
| Iteration | Symbol | Proposition ID Sets | Logic / Connection |
|---|---|---|---|
| Seed | {5, 29, 94, 156, 188, 222, 248, 265, 289} | Seeds: : perception | |
| Expand 1 | {5, 12, 29, 94, 156, 188, 189, 222, 248, 265, 289, 295} | intelligence, insight | |
| Expand 2 | {5, 12, 29, 81, 94, 133, 156, 188, 189, 222, 248, 265, 289, 295} | vision, unconditioned | |
| Final | {Total 31 props; IDs 5–295} | Saturated nucleus |
| Word | Occur. | Occur. | Occur. | Occur. | Global Total |
|---|---|---|---|---|---|
| Order | 112 | 4 | 12 | 0 | 128 |
| Thought | 2 | 98 | 5 | 22 | 127 |
| Wholeness | 45 | 0 | 18 | 4 | 67 |
| Flow | 8 | 1 | 54 | 0 | 63 |
| Observer | 0 | 12 | 2 | 28 | 42 |
| Nucleus | Pivot () | Props | Coverage | Primary Links | |
|---|---|---|---|---|---|
| order | 84 | 28.0% | wholeness, reality | ||
| thought | 62 | 20.7% | fragmentation, memory | ||
| movement | 45 | 15.0% | holomovement, flux | ||
| perception | 31 | 10.3% | intelligence, insight | ||
| Residue | 78 | 26.0% | man, give |
| Cluster | Words |
|---|---|
| : Order | Order, Whole, Holomovement, Unbroken, Seamless, Continuity, Flow, Implicate, Enfolded, Structure, Totality, Universal, Integral, Harmony, Oneness, Dynamic, Process, Unity, Geometry. |
| : Thought | Thought, Mind, Awareness, Insight, Intelligence, Attention, Meaning, Significance, Symbol, Image, Representation, Idea, Knowledge, Observer, Observed, Active, Communication, Verbal, Psychological. |
| : Fragmentation | Fragmentation, Division, Separation, Parts, Conflict, Mechanical, Static, Conditioned, Past, Movement, Memory, Confusion, Illusion, Disturbance, Break, Isolated, Analysis, Ego, Rigid, Disorder, Limitation. |
| : Matter | Matter, Physical, Manifest, Explicate, Particle, Object, Space, Time, Externality, Measure, Quantity, Field, Energy, Substance, Form, Relative, Appearance, Perception, Domain, Fact, Reality. |
| : Residue | The remaining 320 words of the corpus, having globally 400 words (disregarding particles), belong to the residual cluster. |
| Pair | Cosine Distance | |||
| (Intelligence, Insight) | 0.12 | |||
| (Intelligence, Desk) | 1.00 | |||
| (Order, Electron) | 0.92 | |||
| (Pencil, Desk) | 0.02 | |||
| (Intelligence, Pencil) | 1.00 | |||
| (Desk, Order) | 0.98 | |||
| (Electron, Pencil) | 0.08 | |||
| (Person, Desk) | 0.05 |
4. Structural Analysis of the Semantic Space
4.1. Word Salience and Relevance
| Cluster | Pillars |
|---|---|
| (Ontology) | Holomovement, Implicate, Enfoldment, Wholeness, Undivided, Flow, Potential, Ground, Unfolding |
| (Epistemology) | Thought, Consciousness, Memory, Knowledge, Perception, Abstraction, Intellect, Mental, Image |
| (Structure) | Order, Measure, Ratio, Structure, Geometry, Proportion, Arrangement, Difference, Form |
| ine (Manifestation) | Matter, Particle, Object, Manifest, Explicate, Physical, Environment, Instrument, Mechanistic |
| Clusters | Bridges |
|---|---|
| Reality, Movement, Relation, Process, Intelligence, Nature, Existence, Interconnectedness, Context, Unity, Transformation, Totality, Actual, Dynamic |
4.2. Randomly Permuted Corpus
- Sparsity: quantifying the connectivity density of the word-proposition network.where is the zero norm (number of non-null elements) of the contextual incidence matrix between words (W) and propositions (M). The shift from to confirms that the logical connectivity of the text is not a function of the vocabulary alone, but of the sequential proximity of propositions;
- Marginality: is the value of the mean value of the components of the residual cluster (the last components of the embedding vectors):
-
Polarization: quantifies the thematic specificity of a word , as the distance of the embedding vector from the average of its components:A high (average) polarization indicates a word with strong topical focus, while near-zero polarization characterizes words that are uniformly distributed (enfolded) across the entire discourse. A high () indicates a well-defined semantic specificity, while a low polarization () signals semantic genericity.
- Cluster Dispersion. The mean squared distance of words from their respective cluster centroids is a measure of the semantic cohesion of the cluster:and is the centroid of the cluster , definited as , and is the euclidean squared distance. The significant shift in mean cluster dispersion from to serves as a diagnostic tool for structural decay. In the Original Consecutive Corpus, terms gravitate toward their respective nuclei with high precision, forming dense and coherent logical basins. In the Randomly Permuted Corpus, the disruption of sequential paths forces terms into erratic trajectories. Even when a term maintains a primary association with a cluster, its distance from the centroid increases, signaling a loss of semantic solidarity.
-
Corpus Cohesion measures the logical cohesion of the corpus :where:and where is the average value of the components of the non-residual clusters; are the relevant words of the corpus; is the relevance of v, that is, the total of non-residual percentage of v; and is the theoretical maximum of the variance (when only one component is 1 and the others are 0).A value of signifies that the document possesses a non-stochastic structural architecture. The observed value of of Bohm’s discourse confirms that it is characterized by a high degree of logical enfoldment, where the meaning of individual terms is strictly dependent on their sequential context within the 300-proposition flow.The decrease of toward zero, under random permutations of the text, provides empirical evidence that salience is a product of the sequential order and the logical enfoldment of the propositions.
| Metric | Original (OCC) | Permuted (RPC) |
|---|---|---|
| Sparsity | 0.12 | 0.48 |
| Marginality | 23.8% | 59.2% |
| Polarization | 0.82 | 0.31 |
| Mean Cluster Dispersion | 0.012 | 0.045 |
| Cohesion | 5.66 | 1.00 (Baseline) |
4.3. The Logarithmic Bound Hypothesis
| Cluster () | Order Type | Cluster Entropy () | Ratio () | |
|---|---|---|---|---|
| Primary | 0.37 | |||
| Secondary | 0.42 | |||
| Tertiary | 0.55 | |||
| Quaternary | 0.62 | |||
| Residual | Entropic Jump | 1.22 |
5. Conclusions
6. Appendix: Python Code of DEA


References
- Rosenblatt, F. The perceptron: A probabilistic model for information storage and organization in the brain. Psychol. Rev. 1958, 65, 386–408. [Google Scholar] [CrossRef] [PubMed]
- Hinton, G. E. Implementing semantic networks in parallel hardware. In Parallel Models of Associative Memory; Hinton, G. E., Anderson, J.A., Eds.; Lawrence Erlbaum Associates, 1981; pp. 191–217, Available online: https:taylorfrancis.com/chapters/edit/10.4324/9781315807997-13/implementing-semantic-networks-parallel-hardware-geoffrey-hinton; (accessed on 1 December 2024). [Google Scholar]
- Hopfield, J. J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci. USA 1982, 79, 2554–2558. [Google Scholar] [CrossRef] [PubMed]
- Rumelhart, D.; Hinton, G.; Williams, R. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Rumelhart, D. E.; McClelland, J. L. (Eds.) Parallel distributed processing: explorations in the microstructure of cognition; MIT Press: Cambridge, MA, United States, 1986; vol. 1: foundations. [Google Scholar]
- Mitchell, T. Machine Learning; McGraw Hill, 1997. [Google Scholar]
- Nielsen, M. Neural Networks and Deep Learning Online. 2013.
- Bahdanau, D.; Cho, K.; Bengio, Y. Neural Machine Translation by Jointly Learning to Align and Translate. arXiv 2014, arXiv:1409.0473. [Google Scholar]
- Goodfellow, I.; Bengio, Y. Courville A., Deep Learning; MIT Press, 2016. [Google Scholar]
- Kaplan, J. Scaling Laws for Neural Language Models. arXiv 2020, arXiv:2001.08361. [Google Scholar] [CrossRef]
- Brown, T. B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A. Language Models are Few-Shot Learners. NEURIPS 2020, 33, 1877–1901. [Google Scholar]
- OpenAI, GPT4-Technical Report, ArXiv: submit/4812508 [cs.CL] 27 Mar. (2023).
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient Estimation of Word Representations in Vector Space. arXiv 2013, arXiv:1301.3781. [Google Scholar] [CrossRef]
- Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G. S.; Dean, J. Distributed Representations of Words and Phrases and their Compositionality. Advances in Neural Information Processing Systems (NeurIPS) 2013, 26, 3111–3119. [Google Scholar]
- Pennington, J.; Socher, R.; Manning, C. D. GloVe: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014; pp. 1532–1543. [Google Scholar]
- Levy, O.; Goldberg, Y. Neural Word Embedding as Implicit Matrix Factorization. In Advances in Neural Information Processing Systems (NeurIPS); 2014. [Google Scholar]
- Shi, T.; Liu, Z. Linking GloVe with word2vec. arXiv 2014, arXiv:1411.5595. [Google Scholar] [CrossRef]
- Raunak, V.; Gupta, V.; Metze, F. Effective Dimensionality Reduction for Word Embeddings. In Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP), 2019. [Google Scholar]
- Mu, J.; Viswanath, P. All-but-the-Top: Simple and Effective Postprocessing for Word Representations. International Conference on Learning Representations, 2018; Available online: https://openreview.net/forum?id=HkuGJ3kCb.
- Al-Anzi, A.; AbuZeina, D. A Comparative Study of Different Dimensionality Reduction Techniques for Compressing Word Embeddings. ACM Transactions on Asian and Low-Resource Language Information Processing, 2026. [Google Scholar]
- Harris, Z. S. Distributional structure. Word 1954, 10(2-3), 146–162. [Google Scholar] [CrossRef]
- Deerwester, S.; Dumais, S. T.; Furnas, G. W.; Landauer, T. K.; Harshman, R. Indexing by latent semantic analysis. Journal of the American Society for Information Science 1990, 41(6), 391–407. [Google Scholar] [CrossRef]
- Baroni, M.; Lenci, A. Distributional Memory: A general framework for corpus-based semantics. Computational Linguistics 2010, 36(4), 673–721. [Google Scholar] [CrossRef]
- Turney, P. D.; Pantel, P. From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research 2010, 37, 141–188. [Google Scholar] [CrossRef]
- Shannon, C. E. A Mathematical Theory of Communication. Bell System Technical Journal 1948. [Google Scholar] [CrossRef]
- Cover, T. M.; Thomas, J. A. Elements of Information Theory; John Wilwey & Sons: New York, 1991. [Google Scholar]
- Bohm, D. Wholeness and the Implicate Order; Routledge & Kegan Paul, 1980. [Google Scholar]
- Denecke, K.; Erné, M.; Wismath, S. L. Galois connections and applications; Springer, 2004. [Google Scholar]
- Johnson, W. B.; Lindenstrauss, J. Extensions of Lipschitz mappings into a Hilbert space. In Conference in modern analysis and probability;Contemporary Mathematics; American Mathematical Society, 1984; Volume 26, pp. 89–206. [Google Scholar]
- Indyk, P.; Motwani, R. ( Approximate nearest neighbors in high dimensions. In Proceedings of the 30th annual ACM symposium on Theory of computing, 1998. [Google Scholar]
- Dasgupta, S.; Gupta, A. An elementary proof of a theorem of Johnson and Lindenstrauss. Random Structures & Algorithms 2003, 22(1), 60–65. [Google Scholar]
- Manca, V. Functional Language Logic. Electronics MDPI 2025, 14(3), 460. [Google Scholar] [CrossRef]
- Smolensky, P. Tensor product variable binding and the representation of symbolic structures in connectionist systems. Artificial Intelligence 1990, 46(1-2), 159–216. [Google Scholar] [CrossRef]
- Manca, V. Knowledge and Information in Epistemic Dynamics. Preprint.org MDPI 2026. [Google Scholar]
- Manca, V. Tensor Logic of Embedding Vectors in Neural Networks. Preprint.org MDPI 2026. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).