Submitted:
17 August 2025
Posted:
20 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Reader guide. This paper introduces a new way to think about sentence comprehension, not as predicting the next word, but as retrieving meaning over time. Most current models measure how surprising a word is, assuming all readers process it in the same way. ODER shows that is not the case: comprehension speed and difficulty depend on each reader’s attention, memory, and background knowledge. Using a controlled test language, we measure how different kinds of readers recover meaning, and then test the same model on English sentences without changing the parameters. The results match known points where understanding tends to break down and predict when and why this happens for different readers.
- defining entropy retrieval as a joint function of hierarchical syntactic complexity and information-transfer efficiency;
- mapping these constructs to measurable cognitive signatures in EEG, fMRI, and pupillometry;
- providing a replicable benchmarking framework that reports , , and for each observer class.
1.1. Contributions
- A unified mathematical framework for observer-dependent entropy retrieval.
- A contextual-gradient operator that captures reanalysis, for example, garden-path phenomena, in dynamic observer-dependent terms.
- A benchmarking protocol that compares ODER with existing cognitive models and raises stress flags when flattens or diverges.
- A demonstration that quantum-formalism constructs can model ambiguity and interference without implying literal quantum computation in the brain.
1.2. Relationship to Existing Models
- ACT-R parsing frameworks [22] simulate incremental working-memory constraints but treat prediction and retrieval as separate stages, leaving coherence effects unexplained.
- Hierarchical prediction-error accounts [13] model multi-level expectations but do not specify observer-class parameters that modulate collapse timing.
- Transformer language models excel at prediction and generation, yet their weight vectors obscure observer dynamics and reveal little about why or how observers differ in processing.
1.2.1. The ODER Innovation: A Conceptual Map
1.3. Theoretical Positioning of ODER
| Approach | Primary Focus | Treatment of Observer | Key Limitations |
|---|---|---|---|
| Surprisal Models | Input statistics and probability | Uniform processor with idealized capacity | Cannot explain individual differences in processing difficulty |
| Resource-Rational | Bounded rationality and capacity limits | Variable capacity, uniform mechanisms | Lack explicit reanalysis mechanisms; treat processing as passive |
| ACT-R Parsing | Procedural memory retrieval | Slot-limited buffer with decay | Prediction and retrieval treated separately; no coherence term |
| Hierarchical Prediction-Error | Multi-level expectation tracking | Implicit observer; scalar precision weights | No explicit collapse point or observer parameters |
| Optimal Parsing | Strategy selection | Uniform processor with idealized strategies | Cannot explain observer-specific strategy choices |
| ODER (this model) | Observer-relative entropy retrieval (not generative modeling) | Parameterized by attention, memory, and knowledge | Designed partial-fit; requires empirical calibration of observer parameters |
1.4. Forward Retrieval Law and Inverse Decoder
1.5. Implementation Algorithm
| Algorithm 1 ODER Entropy Retrieval. |
|
Require: sentence S, observer parameters Ensure: observer-dependent entropy |
2. Benchmarking Methodology
2.1. Comparative Metrics
- Entropy-reduction rate (ERR)
- – First-derivative slope of ; hypothesized to scale with the N400 slope in centro-parietal EEG.
- Retrieval-collapse point ()
- – Time at which enters a 95% confidence band around zero; anchors the onset of P600 activity and post-disambiguation fixation drops.
- Model-to-trace fit (, AIC, BIC)
- – Overall goodness-of-fit and parsimony; higher predicts tighter coupling between simulated and observed P600 latency.
- Observer-class divergence ()
- – Cohen’s d for between O1 and O3; relates to between-group differences in frontal-theta power (high vs. low working memory).
- Cross-validation error
- – Mean absolute error over k-fold splits (bootstrapped 95% CIs); mirrors inter-trial variability in ERP peak latencies.
- Reanalysis latency
- – Reaction-time variance in garden-path tasks; behavioral proxy for spikes.
- Pupillometric load
- – Peak dilation normalized by baseline; tracks integrated (working-memory demand).
- Eye-movement patterns
- – Fixation count and regression length during disambiguation; fine-grained correlate of local ERR fluctuations.
2.2. Protocol
- Compute baseline entropy with Equation (A1) for all Aurian stimuli.
2.3. Neurophysiological Correlates
- Contextual-gradient spikes () predict P600 amplitude in the window ms [27].
- Information-transfer efficiency () predicts N400 magnitude in the window ms [19].
- Working-memory load () is expected to modulate frontal-midline theta (4–7 Hz) across the same post-collapse interval, consistent with memory-maintenance accounts of theta power [3].
2.4. Distinguishing Retrieval Failure from Prediction Failure
- EEG: sustained P600 with attenuated resolution when retrieval failure persists.
- Pupillometry: plateau in low-capacity observers.
- Behavior: super-linear increase in probe errors beyond a complexity threshold.
3. Empirical Calibration
3.1. Aurian as an Initial Testbed
| Note on Aurian Scope |
| Aurian is used here solely as a controlled corpus for entropy benchmarking. A more advanced version, including observer-conditioned compression, ambiguity-preserving syntax, and symbolic scaffolds, is developed separately in [6]. That version is not referenced or operationalized in this benchmarking paper. |
3.1.1. Aurian Grammar Specification
Core syntactic rules
Lexicon with increments
- kem (subject pronoun, )
- vora (simple verb, )
- sul (complementizer, )
- daz (embedding verb, )
- fel (object noun, )
- ren (modifier, )
- tir (determiner, )
- mek (conjunction, )
- poli (adverb, )
- zul (negation, )
Illustrative sentences
- Low entropy (): Kem vora fel (“He/She sees the object”)
- Medium entropy (): Kem vora fel ren (“He/She sees the object quickly”)
- High entropy (): Kem daz sul tir fel vora (“He/She thinks that the object falls”)
- Very high entropy (): Kem daz sul tir fel sul ren vora poli zul (“He/She thinks that the object that quickly falls does not move”)
| Sentence class | Tokens | Cumulative |
|---|---|---|
| Low | 3 | 2 |
| Medium | 4 | 3 |
| High | 6 | 7 |
| Very High | 9 | 11 |
3.1.2. Clarifying the Metric
Ecological Rationale
3.2. Confidence, Sensitivity, and Parameter Variance
- Report 95% confidence intervals for and , estimated from n-back and reading-span tasks.
- Run sensitivity sweeps; log a stress flag when shifts by more than 50 ms.
4. Results
4.1. Model–Fit Quality
| Sentence | Observer | CI | (t) CI | AIC | |
|---|---|---|---|---|---|
| eng_1 | O1 | 0.871 | |||
| eng_1 | O3 | 0.709 | |||
| aur_1 | O1 | 0.810 | |||
| aur_complex_1 | O1 | 0.759 | |||
| aur_complex_2 | O1 | 0.661 |
4.2. Empirical Generalization Check
- Timing accuracy. ODER predicted collapse within s for 9 of the 12 sentences. Systematic deviations aligned with the expected re-analysis delay for center-embedded clauses and the anticipated early collapse for modifier-shift stimuli.
- Shape fidelity. The median root-mean-square error across full trajectories was , with the largest residuals occurring at re-analysis junctures, which is precisely where the canonical retrieval law is not expected to hold.

4.3. Parameter–Sensitivity Analysis
4.4. Interpreting the 31% Convergence Rate
| Why a 31% Convergence Rate Is a Feature, Not a Flaw |
| Overfitting all traces would render the model unfalsifiable. Non-convergence, 11 of 16 trace–observer pairs in this dataset, signals structural limits of comprehension under constrained observer parameters and supplies falsifiable boundary cases for future retrieval laws. |
4.5. Failure Taxonomy
- Garden-path sentences (gpath_1, gpath_2): Non-monotonic retrieval spikes violate the sigmoidal assumption; and become non-identifiable.
- Flat-anomaly or highly ambiguous items (flat_1, ambig_1): Sustained high and negligible flatten the trace, leading to under-fit ().
| Symptom | Frequency | Provisional Remedy |
|---|---|---|
| pegging | 6/11 | Extending trace length; adding hierarchical priors |
| AIC shortfall () | 3/11 | Using adaptive learning rates in the optimizer |
| inversion (O3 > O1) | 2/11 | Testing a mixed-effects retrieval law |
4.6. Sentence–Level Retrieval Dynamics
4.7. Representative Trace Comparison


4.8. Self-Audit Note
4.9. Predictive Outlook
5. Discussion
5.1. Theoretical Contributions

5.2. ERP Anchoring and Observer Diversity
ERP– Variability and Observer Drift.
5.3. Parameter Diversity and Observer-Class Variation
5.4. Failure Taxonomy
- (a)
- Garden-path spikes: highly non-monotonic traces overshoot the sigmoidal retrieval law, producing low , an AIC shortfall, and stress flags.
- (b)
- Flat-ambiguity plateaus: sentences with persistent semantic superposition yield near-constant and stall entropy growth, causing parameter inversion ().
Toward a Multi-Phase Retrieval Kernel
5.5. Known Limitations and Boundary Conditions
5.6. Open Questions and Future Experiments
- Can and be inferred in vivo from behavioral or neurophysiological streams?
- How do individual profiles evolve across tasks or genres?
- Do –aligned ERP windows replicate in EEG or MEG after O1 versus O3 calibration?
- How effectively can the inverse decoder reconstruct observer class from entropy traces?3
- Can ODER guide adaptive reading interventions, second-language diagnostics, or literary ambiguity modeling?
6. Cross-Domain Applications of ODER
6.1. Tier 1 — Adaptive Interfaces and Reading Diagnostics
6.1.1. Human–Machine Interaction
- On-the-Fly Simplification. When a rising forecasts reanalysis overload, the UI rephrases subordinate clauses into shorter main-clause paraphrases.
- Retrieval-Difficulty Prompts. Sustained combined with ocular regressions initiates a micro-tutorial or offers a chunked information display.
6.1.2. Linguistic Retrieval Diagnostics
- Entropy-Aligned Difficulty Curves. ODER predicts that garden-path items with the steepest slopes will coincide with probe-error spikes in low- readers.
- EEG Convergence Mapping. Public N400/P600 datasets (e.g., ERP-CORE [7]) can be realigned to each observer’s collapse time to test whether P600 amplitude covaries with only in low-working-memory cohorts.
6.2. Tier 2 — Pilot-Ready Extensions
6.2.1. Clinical and Accessibility Contexts
- Assistive Communication. An AAC prototype that caps syntactic depth when rises above a user-specific threshold is expected to support more efficient message access for users with structured retrieval limits.
6.2.2. Translation and Cross-Linguistic Semantics
- Idiomatic Divergence. For idioms whose literal and figurative readings diverge, ODER predicts larger spikes and a delayed collapse ( tokens) in bilinguals with low prior-knowledge parameter .
6.3. Summary Table
| Construct | Interpretation | Support Application | Testable Outcome |
|---|---|---|---|
| Attentional focus | Interface simplification | Drop in regressions () | |
| Working-memory load | Reading-diagnostic clustering | fixation variance by WM group | |
| Semantic superposition | Idiom-translation stress test | Decrease in correlates with RT recovery | |
| Reanalysis gradient | AAC overload detector | Peak vs. error rate |
7. Conclusions and Future Directions
- Near-Term: Deploy ODER in adaptive educational tools, cognitively adaptive user interfaces, and linguistic-assessment platforms. Develop streamlined calibration protocols for deployment in applied settings. Empirical validation of metrics such as , , and can proceed with existing eye-tracking and EEG corpora (e.g., ZuCo and ERP-CORE) rather than requiring new data collection.
- Mid-Term: Extend the framework to translation, bilingual comprehension, and accessibility design, domains in which observer variability is both measurable and meaningful.
- Long-Term: Investigate observer-relative semantics, entropy superposition (), and reanalysis dynamics in philosophical, epistemological, and artificial-intelligence contexts.
Author Contributions
Funding
Data Availability Statement
- ODER_Linguistic_Framework.ipynb: reproduces every figure and table reported in the manuscript.
- ODER_Interactive_Playground.ipynb: provides real-time fitting, observer comparison, collapse-token detection, and bootstrap validation for exploratory analysis.
- ODER_Natural_Stories_Validation.ipynb: applies fixed Aurian parameters to Natural Stories sentences and stress-test items, reporting collapse-time error and fit diagnostics.
Conflicts of Interest
Appendix A. Mathematical Formalism
Appendix A.1. Core Retrieval Equation
Appendix A.2. Density-Matrix Initialization
Appendix A.3. Entropy-Update Function
Appendix A.4. State-Transition Operator
Appendix A.5. Variable Definitions
- — constant retrieval-rate coefficient for the current sentence.
- — characteristic time (in seconds) at which retrieval accelerates before saturating.
- — maximum retrievable entropy (set to 1 in all simulations).
- — entropy retrieved up to time .
- — collapse time at which .
- — observer state (density matrix) encoding interpretation probabilities and coherence.
- — hierarchical syntactic depth of the current token.
- — information-transfer efficiency for the current token.
- — contextual gradient indicating integration/reanalysis cost.
Appendix A.6. Derivation Outline
- (a)
- Start from logistic growth: .
- (b)
- Replace the constant proportionality with to capture a two-phase regime: rapid early acceleration followed by slowdown.
- lcbel=()
- For constant , Equation (A1) has no elementary closed-form solution; numerical integration and curve fitting are therefore used.
Appendix A.7. ERP Alignment via Collapse Point τres
- N400 window: to .
- P600 window: to .
Appendix A.8. Implementation Algorithm
| Algorithm 2 ODER Entropy Retrieval |
|
Require: sentence S, observer parameters Ensure: observer-specific entropy |
Appendix B. Corpus and Entropy Trace Generation
Appendix B.1. Sentence Inventory
| Sentence ID | Observers | Tokens | Complexity | Mode |
|---|---|---|---|---|
| eng_1 | O1, O3 | 9 | low | normal |
| gpath_1 | O1, O3 | 8 | high | gpath |
| gpath_2 | O1, O3 | 9 | very_high | gpath |
| ambig_1 | O1, O3 | 10 | medium | ambig |
| aur_1 | O1, O3 | 9 | medium | aurian |
| aur_complex_1 | O1, O3 | 10 | high | aurian |
| aur_complex_2 | O1, O3 | 12 | very_high | aurian |
| flat_1 | O1, O3 | 8 | anomalous | flat |
Appendix B.2. Entropy Generation Modes and Empirical Grounding
- aurian: Decay modulated by hierarchical complexity , delaying convergence for deeper embeddings. Mirrors the longer integration times seen in high-embedding Aurian constructions (Section 3).
- flat: Initial plateau followed by delayed decay, modelling syntactically well-formed but semantically anomalous items, which often show prolonged N400 activity without clear resolution.
- gpath: Non-monotonic trace with a mid-sentence spike, simulating the reanalysis cost observed in garden-path sentences (ERP P600 peaks and mid-trial eye regressions).
- ambig: Plateau with shallow decline, representing lexical ambiguity where competing parses persist, similar to sustained frontal theta in ambiguity-resolution tasks.
- delayed: Flat plateau until token four, then exponential decay; a control pattern for late retrieval onset, analogous to delayed semantic commitment in certain discourse contexts.
- normal: Monotonic exponential decay with slope set by and mild Gaussian noise, corresponding to straightforward comprehension without reanalysis.
Appendix B.3. Observer Class Bias and Parameter Justification
| Parameter | O1 (high context) | O3 (low context) |
| Baseline entropy at token 1 | 0.60 | 0.60 |
| Early decay constant | 0.25 | 0.15 |
| Late decay constant | 0.12 | 0.08 |
| Noise standard deviation | 0.02 | 0.04 |
Appendix B.4. Trace Generator: Logic Summary
Purpose

Key Points.
- observer_class is explicitly mapped to , , and through OBSERVER_PARAMS.
- The optional lhier_score modulates delay only in aurian mode.
- Output values are clipped to to respect entropy bounds and avoid unphysical values.
- Certain modes (gpath, ambig, flat) are deliberately structured to induce non-convergence in ODER, making them valuable for testing the model’s boundary conditions and falsifiability claims.
Appendix C. Stress Test Summary and Retrieval-Failure Log
Appendix C.1. Failure Matrix
| Sentence | Observer | Stress Flags | Method | ||||
|---|---|---|---|---|---|---|---|
| gpath_1 | O1 | R; A; P | 0.00 | — | — | 5 | 90% |
| gpath_1 | O3 | R; A; P | 0.00 | — | — | 4 | 90% |
| gpath_2 | O1 | R; A; P | 0.00 | — | — | 6 | 90% |
| ambig_1 | O1 | R | 0.07 | 0.375 | 0.05 | 10 | 90% |
| aur_1 | O3 | R | 0.37 | 0.424 | 0.05 | 8 | 90% |
| aur_complex_1 | O3 | R | 0.21 | 0.368 | 0.05 | 9 | 90% |
| aur_complex_2 | O3 | R; A; P | 0.00 | — | — | 12 | 90% |
| flat_1 | O1 | R | 0.00 | 0.254 | 0.05 | 1 | 90% |
| flat_1 | O3 | R | 0.00 | 0.254 | 0.05 | 1 | 90% |
Appendix C.2. Parameter-Surface Illustration

Appendix C.3. Threshold Criteria
- flag “R”: any fit with .
- pegging flag “P”: estimate at lower bound (), especially when combined with inversion.
- AIC under-performance flag “A”: .
- Parameter inversion flag “P”: on O1-favored sentences, or any negative .
Appendix C.4. Root-Cause Notes and Proposed Remedies
-
Non-monotonicity defeats tanh formSymptom: low on garden-path traces (gpath_1, gpath_2); corresponds to P600 spikes and mid-sentence eye regressions.Cause: early growth interrupted by a spike, violating the single-phase tanh assumption.Remedy: allow piecewise or spline-based kernels to capture multi-phase retrieval dynamics.
-
pegging at lower boundSymptom: fixed at , especially on very short sentences (flat_1); mirrors early collapse in shallow-processing cases.Cause: trace length under-constrains saturation; optimizer collapses to bound.Remedy: increase minimum input length or add a weak hierarchical prior on .
-
AIC under-performanceSymptom: AIC despite visually plausible fit (aur_complex_2, O3).Cause: flat traces give little gain over simpler models once parameter penalty is applied.Remedy: introduce attention-gated transitions that reduce to a linear model when .
-
Parameter inversionSymptom: on ambig_1; reflects prolonged semantic superposition (high ) overriding memory-constraint effects.Cause: lexical ambiguity drives more than working-memory limits, reversing expected order.Remedy: couple to or separate lexical and syntactic retrieval-rate parameters; test against ambiguity-resolution ERP corpora.
Appendix C.5. Divergence Taxonomy Across Natural Sentences
| id | type | (s) | RMSE | shape_mismatch | note |
|---|---|---|---|---|---|
| ns_001 | declarative | 4.0 | 0.318 | T | — |
| ns_002 | garden-path | –0.4 | 0.510 | T | — |
| ns_003 | garden-path | –0.4 | 0.414 | T | — |
| stress_001 | garden-path | –0.4 | 0.624 | T | old/NP reanalysis |
| stress_002 | center-embed | 3.6 | 0.408 | T | deep embed delay |
| stress_003 | coord-ambig | –2.0 | 0.284 | T | coordination scope |
| stress_004 | passive | 0.0 | 0.568 | T | reversible passive |
| stress_005 | modifier-shift | –3.2 | 0.372 | T | early collapse on mis-parse |
| stress_006 | idiom | –0.8 | 0.529 | F | literal→idiom switch |
| stress_007 | pronoun | –0.8 | 0.358 | T | referent lag |
| stress_008 | obj-relative | 0.0 | 0.568 | T | object-relative load |
| stress_009 | pp-attach | 0.0 | 0.568 | T | PP-attachment ambiguity |
Appendix D. Interactive Playground Notebook Interface
Appendix D.1. Core Functions
- Real-time entropy-trace fitting using nonlinear least squares or bootstrap resampling, with plots updating as parameters change.
- Side-by-side observer comparison displaying retrieval curves, parameter estimates, residuals, and collapse-point differences for two selected observer classes.
- Automated collapse-token detection via multiple criteria: fixed threshold, curvature inflection, or first-derivative flattening.
- ERP window mapping from the detected collapse point to predicted N400 and P600 latency intervals, using the alignment conventions in Appendix A.
- Bootstrap validation returning confidence intervals for , , and across repeated resamples.
Appendix D.2. Usage Notes
- The notebook operates only within a designated sandbox directory and does not overwrite or alter the publication dataset, protecting the integrity of the archived analysis.
- Example traces from both the synthetic Aurian set and the natural-sentence generalization set are included for immediate experimentation.
Appendix D.3. Access
Appendix E. Glossary and Interpretive Variable Mapping
E.1 Variable Glossary
| Symbol | Description | Interpretation |
|---|---|---|
| Entropy-retrieval rate | Speed of comprehension for an observer; higher values = faster retrieval | |
| Characteristic saturation time | Temporal scale of processing effort before saturation; lower values = quicker approach to plateau | |
| Cumulative entropy retrieved | Portion of meaning resolved up to | |
| Maximum retrievable entropy | Upper bound on sentence information; set to 1 in all simulations | |
| Collapse time | Point of interpretive convergence; ERP anchor | |
| Contextual gradient | Instantaneous slope of reanalysis load; spikes indicate instability or high integration cost | |
| Semantic superposition (off-diagonals in ) | Degree of unresolved ambiguity or competing interpretations | |
| Attentional-focus parameter | Allocation of cognitive resources during processing | |
| Working-memory constraint | Capacity to maintain unresolved structure in active memory | |
| Prior-knowledge exponent | Modulation of retrieval speed based on background familiarity |
E.2 Cross-Domain Interpretive Map
| Term | Linguistics | Cognitive Science | AI / NLP |
|---|---|---|---|
| Parsing velocity (tokens/sec) | Retrieval speed; memory-search efficiency | Token-alignment accuracy or decoding speed | |
| Span of reanalysis in garden-paths | Processing-time constant; transition to late-stage comprehension | Hidden-state decay or context window persistence | |
| Disruption signal in syntactic ambiguity | Neural surprise or integration cost | Attention-weight gradient spike in transformer models | |
| Persistent lexical ambiguity | Interpretive drift or competing schema activation | Blended latent representation of multiple interpretations | |
| ERP timing anchor for N400/P600 | Resolution threshold in task performance | Collapse point in model confidence or output distribution |
Appendix F. Hypothesized Parameter Profiles for Neurodivergent Retrieval
| Neurotype | Range | (s) | Notes | Trace Pattern | ERP Signature |
|---|---|---|---|---|---|
| Autism | 0.9–1.1 | 0.12–0.18 | Steep ; stable | Extended reanalysis plateau | Delayed P600 latency [23] |
| ADHD | 0.7–1.3† | 0.08–0.16 (high variance) | Fluctuating , variable | Irregular ERR, wide variance | Reduced LPP stability [20] |
| Dyslexia | 0.5–0.8 | 0.10–0.15 | Elevated (WM load) | Dampened ERR, retrieval stalls | Attenuated N400 amplitude [4] |
Identifiability

Appendix H: Inverse Retrieval Classification (Toy Example)
Feature set
Toy classifier

Interpretation
References
- Busemeyer, J. R.; Bruza, P. D. Quantum Models of Cognition and Decision; Cambridge University Press, 2012. [Google Scholar] [CrossRef]
- Bruza, P. D.; Wang, Z.; Busemeyer, J. R. Quantum cognition: A new theoretical approach to psychology. Trends in Cognitive Sciences 2015, 19(7), 383–393. [Google Scholar] [CrossRef]
- Cavanagh, J. F.; Frank, M. J. Frontal theta as a mechanism for cognitive control. Trends in Cognitive Sciences 2014, 18(8), 414–421. [Google Scholar] [CrossRef] [PubMed]
- Chang, A.; Zhang, Y.; Ding, H.; Goswami, U. Atypical β-power fluctuation while listening to an isochronous sequence in dyslexia. Clinical Neurophysiology 2021, 132(10), 2384–2390. [Google Scholar] [CrossRef]
- Christianson, K.; Williams, C. C.; Zacks, R. T.; Ferreira, F. Younger and older adults’ “good-enough” interpretations of garden-path sentences. Discourse Processes 2006, 42(2), 205–238. [Google Scholar] [CrossRef]
- Cooper, E. Aurian: A Cognitive-Adaptive Language for Observer-Dependent Communication; Zenodo, 2025. [Google Scholar] [CrossRef]
- Kappenman, E. S.; Farrens, J. L.; Zhang, W.; Stewart, A. X.; Luck, S. J. ERP CORE: An open resource for human event-related potential research. NeuroImage 2021, 225, 117465. [Google Scholar] [CrossRef]
- Ferreira, F.; Henderson, J. M. Recovery from misanalyses of garden-path sentences. Journal of Memory and Language 1991, 30(6), 725–745. [Google Scholar] [CrossRef]
- Futrell, R.; Gibson, E.; Tily, H. J.; Blank, I.; Vishnevetsky, A.; Piantadosi, S. T.; Fedorenko, E. The Natural Stories Corpus. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018); European Language Resources Association (ELRA), 2018; pp. 76–82. [Google Scholar] [CrossRef]
- Futrell, R.; Gibson, E.; Tily, H. J.; Blank, I.; Vishnevetsky, A.; Piantadosi, S. T.; Fedorenko, E. The Natural Stories corpus: A reading-time corpus of English texts containing rare syntactic constructions. Language Resources & Evaluation 2021, 55(1), 63–77. [Google Scholar] [CrossRef]
- Gershman, S. J.; Horvitz, E. J.; Tenenbaum, J. B. Computational rationality: A converging paradigm for intelligence in brains, minds, and machines. Science 2015, 349(6245), 273–278. [Google Scholar] [CrossRef]
- Hale, J. A probabilistic Earley parser as a psycholinguistic model. In Proceedings of NAACL 2001; 2001; Vol. 2, pp. 1–8. [Google Scholar] [CrossRef]
- Heilbron, M.; Armeni, K.; Schoffelen, J. M.; Hagoort, P.; de Lange, F. P. A hierarchy of linguistic predictions during natural language comprehension. Proceedings of the National Academy of Sciences 2022, 119(32), e2201968119. [Google Scholar] [CrossRef]
- Hollenstein, N.; Rotsztejn, J.; Tröndle, M.; Pedroni, A.; Zhang, C.; Langer, N. ZuCo: A simultaneous EEG and eye-tracking resource for natural sentence reading. Scientific Data 2018, 5, 180291. [Google Scholar] [CrossRef]
- Just, M. A.; Carpenter, P. A. A capacity theory of comprehension: Individual differences in working memory. Psychological Review 1992, 99(1), 122–149. [Google Scholar] [CrossRef] [PubMed]
- Demberg, V.; Keller, F. Data from eye-tracking corpora as evidence for theories of syntactic processing complexity. Cognition 2008, 109(2), 193–210. [Google Scholar] [CrossRef]
- Kennedy, A.; Hill, R. L.; Pynte, J. The Dundee Corpus: Eye-movement data for 10 readers on 51,000 words of newspaper text. Poster presented at the 12th European Conference on Eye Movements, Dundee, Scotland; 2003. Available online: https://www.ling.ohio-state.edu/golddundee.
- Kennedy, A.; Pynte, J.; Murray, W. S.; Paul, S. A. Frequency and predictability effects in the Dundee Corpus: An eye-movement analysis. Quarterly Journal of Experimental Psychology 2013, 66(3), 601–618. [Google Scholar] [CrossRef]
- Kutas, M.; Federmeier, K. D. Thirty years and counting: Finding meaning in the N400 component of the event-related brain potential. Annual Review of Psychology 2011, 62, 621–647. [Google Scholar] [CrossRef] [PubMed]
- Lenartowicz, A.; Mazaheri, A.; Jensen, O.; Loo, S. K. Aberrant modulation of brain oscillatory activity and attentional impairment in ADHD. In Cognitive Neuroscience and Neuroimaging; Biological Psychiatry, 2018; Volume 3, 1, pp. 19–29. [Google Scholar] [CrossRef]
- Levy, R. Expectation-based syntactic comprehension. Cognition 2008, 106(3), 1126–1177. [Google Scholar] [CrossRef]
- Lewis, R. L.; Vasishth, S. An activation-based model of sentence processing as skilled memory retrieval. Cognitive Science 2005, 29(3), 375–419. [Google Scholar] [CrossRef] [PubMed]
- Li, J.; Roberts, L.; Smith, E.; Brown, M. Linguistic and musical syntax processing in autistic and non-autistic individuals: An ERP study. Autism Research 2025, 18(6), 1245–1256. [Google Scholar] [CrossRef] [PubMed]
- Lieder, F.; Griffiths, T. L. Resource-rational analysis: Understanding human cognition as the optimal use of limited computational resources. Behavioral and Brain Sciences 2020, 43, e1. [Google Scholar] [CrossRef]
- Lison, P.; Tiedemann, J. OpenSubtitles2016: Extracting large parallel corpora from movie and TV subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016); 2016; pp. 923–929. Available online: https://aclanthology.org/L16-1147/.
- Nieuwland, M. S.; Politzer-Ahles, S.; Heyselaar, E.; Segaert, K.; Darley, E.; Kazanina, N.; et al. Large-scale replication study reveals a limit on probabilistic prediction in language comprehension. eLife 2018, 7, e33468. [Google Scholar] [CrossRef]
- Osterhout, L.; Holcomb, P. J. Event-related brain potentials elicited by syntactic anomaly. Journal of Memory and Language 1992, 31(6), 785–806. [Google Scholar] [CrossRef]
- Piantadosi, S. T. A rational analysis of the approximate number system. Psychonomic Bulletin & Review 2016, 23(3), 877–886. [Google Scholar] [CrossRef]
- Pothos, E. M.; Busemeyer, J. R. Can quantum probability provide a new direction for cognitive modeling? Behavioral and Brain Sciences 2013, 36(3), 255–274. [Google Scholar] [CrossRef] [PubMed]
- Rasmussen, N. E.; Schuler, W. Left-corner parsing with distributed associative memory produces surprisal and locality effects. Cognitive Science 2018, 42(S4), 1009–1042. [Google Scholar] [CrossRef] [PubMed]
- Rello, L.; Ballesteros, M. Detecting readers with dyslexia using machine learning with eye tracking measures. In Proceedings of the 12th Web for All Conference (Article 16); Association for Computing Machinery, 2015. [Google Scholar] [CrossRef]
- Shannon, C. E. A mathematical theory of communication. Bell System Technical Journal 1948, 27(3), 379–423. [Google Scholar] [CrossRef]
- Simon, H. A. McGuire, C. B., Radner, R., Eds.; Theories of bounded rationality. In Decision and Organization; North-Holland, 1972; pp. 161–176. [Google Scholar]
- Snowling, M. J.; Hulme, C. Dyslexia: A Very Short Introduction; Oxford University Press, 2021. [Google Scholar] [CrossRef]
| 1 | An ICP is the final word position at which retrieval resolves to a single interpretation. |
| 2 | Collapse tokens are the final word positions where retrieval resolves to a single interpretation. |
| 3 | See Appendix F for a toy implementation illustrating a minimal, testable approach. |
| 4 | Comparable variability is observed in real-world comprehension studies, where accuracy on certain complex constructions often falls well below ceiling, especially for low working-memory or high-ambiguity cases. |
| 5 | The 31% ceiling reflects falsifiability: it spotlights lawful divergences rather than indicating model failure. |
| Metric | Interpretation |
|---|---|
| ERR | Entropy-reduction rate (slope of ) |
| Retrieval-collapse point (resolution time) | |
| Overall model-to-trace fit quality | |
| AIC | Parsimony advantage over baselines |
| Contextual gradient (reanalysis effort) | |
| Entropy-retrieval rate coefficient | |
| CV Error | Mean absolute error across k folds |
| Observer-class divergence in |
| Sentence | Observer | Stress Flag(s) | Root–cause commentary |
|---|---|---|---|
| gpath_1 | O1 | Low , AIC , pegging | Non-monotonic spike defeats tanh shape; optimizer stalls. |
| gpath_1 | O3 | Low , AIC , pegging | Same as above plus early-noise plateau. |
| gpath_2 | O1 | Fit fail, parameter pegging | Extreme garden-path yields negative gradient. |
| gpath_2 | O3 | Fit fail, parameter pegging | Identical to O1; inversion of expected . |
| ambig_1 | O1 | Low | Lexical ambiguity generates flat . |
| ambig_1 | O3 | Low | Same; retrieval never saturates. |
| aur_1 | O3 | Low | High WM load and short trace under-constrain fit. |
| aur_complex_1 | O3 | Low | Same pattern as aur_1. |
| aur_complex_2 | O3 | Fit fail, inversion | Excessively long trace; optimizer exits at local minimum. |
| flat_1 | O1 | Low | Anomalous semantics keeps high; tanh under-fits tail. |
| flat_1 | O3 | Low | Same; observer divergence negligible. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 1996 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).