Structure-Aware Temporal Modeling for Chronic Disease Progression Prediction

Jiacheng Hu; Bo Zhang; Ting Xu; Haifeng Yang; Min Gao

doi:10.20944/preprints202508.1506.v1

Submitted:

20 August 2025

Posted:

20 August 2025

You are already at the latest version

Abstract

This study addresses the challenges of symptom evolution complexity and insufficient temporal dependency modeling in Parkinson's disease progression prediction. It proposes a unified prediction framework that integrates structural perception and temporal modeling. The method leverages graph neural networks to model the structural relationships among multimodal clinical symptoms and introduces graph-based representations to capture semantic dependencies between symptoms. It also incorporates a Transformer architecture to model dynamic temporal features during disease progression. To fuse structural and temporal information, a structure-aware gating mechanism is designed to dynamically adjust the fusion weights between structural encodings and temporal features, enhancing the model's ability to identify key progression stages. To improve classification accuracy and stability, the framework includes a multi-component modeling pipeline, consisting of a graph construction module, a temporal encoding module, and a prediction output layer. The model is evaluated on real-world longitudinal Parkinson's disease data. The experiments involve comparisons with mainstream models, sensitivity analysis of hyperparameters, and graph connection density control. Results show that the proposed method outperforms existing approaches in AUC, RMSE, and IPW-F1 metrics. It effectively distinguishes progression stages and improves the model's ability to capture personalized symptom trajectories. The overall framework demonstrates strong generalization and structural scalability, providing reliable support for intelligent modeling of chronic progressive diseases such as Parkinson's disease.

Keywords:

structural modeling

;

time perception

;

Parkinson’s disease

;

symptom evolution map

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

1. Introduction

Parkinson’s disease is a common neurodegenerative disorder characterized by bradykinesia, tremor, and muscular rigidity [1]. These symptoms severely impair patients’ quality of life and independence. With the acceleration of global population aging, the prevalence of Parkinson’s disease continues to rise, making it a major concern in public health. The disease progresses slowly and exhibits significant individual differences. Its clinical manifestations are highly heterogeneous, making it difficult for clinicians to accurately predict its future course in the early stages. Traditional assessment methods rely heavily on questionnaires and periodic clinical follow-ups. These methods are subjective, vulnerable to external interference, and fail to capture the dynamic changes in disease progression. Therefore, developing an efficient and intelligent framework for Parkinson’s disease progression prediction is crucial for disease staging, personalized treatment planning, and the rational allocation of healthcare resources [2].

With the rapid advancement of artificial intelligence, more research has begun to explore how multimodal medical data can enhance early diagnosis and progression prediction of neurological disorders. Data from Parkinson’s patients include various structured and unstructured modalities, such as imaging, gait signals, clinical scales, and behavioral logs. These data contain complex temporal patterns and cross-modal semantic correlations, providing an opportunity for modeling disease trajectories more precisely. However, integrating temporal dependencies and structural representations across these heterogeneous sources into a unified and generalizable predictive model remains challenging [3,4,5]. Most existing methods focus on unimodal time series modeling, ignoring the influence of structural information on symptom evolution. Conversely, structural modeling often lacks temporal awareness, making it difficult to capture the continuity and stage-wise characteristics of disease progression [6].

During the progression of Parkinson’s disease, symptom changes do not occur in isolation but follow certain structural dependencies and temporal patterns. For example, motor degradation is often accompanied by declines in speech and cognitive abilities. These changes show complex interactions and coupling across different patient groups. Static modeling tends to overlook such structure-aware signals, reducing predictive accuracy and robustness. There is a pressing need to introduce structural modeling mechanisms that capture multi-dimensional pathological trajectories, including entity relations and behavioral graphs. At the same time, integrating temporal modeling helps detect trends, abrupt shifts, and nonlinear variations in symptoms across time windows, thereby strengthening the foundation for fine-grained risk assessment [7].

In recent years, deep modeling techniques such as graph neural networks and Transformers have demonstrated strong representation capabilities in various tasks [8,9,10]. Graph-based models effectively capture semantic associations between symptoms and uncover latent progression pathways and critical nodes. Temporal models excel at describing time-evolving patterns in data and tracking trends in behavioral and clinical indicators [11,12,13,14,15]. Combining these approaches for Parkinson’s disease progression prediction enables multi-scale modeling from individuals to populations. This allows the discovery of pathological rules in dynamic environments. The integration of structural perception and temporal modeling enhances interpretability, precision, and adaptability. It also brings new momentum to the development of intelligent diagnostics and treatment of neurological diseases [16].

In summary, constructing a Parkinson’s disease progression prediction framework that integrates structural perception and temporal modeling is both a practical requirement and a key breakthrough for applying AI in healthcare. This direction aims not only to improve diagnostic efficiency and accuracy but also to enhance patient quality of life and reduce social and familial burdens. In scenarios such as early screening, stage recognition, treatment evaluation, and long-term monitoring, the synergy of structural and temporal modeling will play a central role [17,18,19,20]. It will drive the shift from reactive intervention to proactive management in intelligent Parkinson’s disease care.

2. Method

This paper proposes a Parkinson’s disease progression prediction framework that integrates structure perception and time series modeling, aiming to mine the dynamic characteristics and potential structural dependencies of disease evolution from multidimensional medical data. The overall framework consists of two core modules: the structure modeling module and the time series modeling module. Among them, the structure modeling part constructs a disease map based on a graph neural network, and explicitly represents the correlation between multimodal clinical indicators and symptoms as a graph structure; the time series modeling part uses an improved Transformer architecture to model the evolution trend of symptoms in the time dimension, to achieve accurate prediction of future disease status. The entire framework is trained in an end-to-end manner, so that the structure and time series features can be optimized synergistically and complement each other. Its model architecture is shown in Figure 1.

First, for the structural modeling part, a graph structure is constructed to represent the semantic relationship between patient states. The feature vector of each patient at a certain moment is represented as a node set

V = {v_{1}, v_{2}, ..., v_{N}}

, and the dependency relationship between nodes constitutes an edge set

ε \subseteq V \times V

. The graph structure is represented by the adjacency matrix

A \in R^{N \times N}

, and the input feature is

X \in R^{N \times d}

. The graph convolution operation uses the following update formula:

H^{(l + 1)} = σ ({\tilde{D}}^{- 1 / 2} \tilde{A} {\tilde{D}}^{- 1 / 2} H^{(l)} W^{(l)})

(1)

Where

\tilde{A} = A + I

is the adjacency matrix after adding the self-loop,

\tilde{D}

is its corresponding degree matrix,

W^{(l)}

is the learnable weight of the lth layer, and

σ

represents the activation function. By stacking multiple layers of graph convolution, local structural information can be effectively fused, and potential inter-disease dependency patterns can be captured.

Next, for the temporal modeling part, the improved Transformer module is used to model the changes of structural features over time. Assume that the graph structure of each patient at T consecutive time steps is encoded as

{H_{t}}_{t = 1}^{T}

, which is stacked into a three-dimensional tensor

H \in R^{T \times N \times d}

. First, the temporal position information is introduced through the position encoding function

P E (t)

:

\begin{array}{l} P E (t, 2 i) = \sin (\frac{t}{10000^{2 i / d}}) \\ P E (t, 2 i + 1) = \cos (\frac{t}{10000^{2 i / d}}) \end{array}

(2)

Then, input the Transformer encoder for temporal dependency modeling and calculate the self-attention weight as follows:

A t t e n t i o n (Q, K, V) = softmax (\frac{Q K^{T}}{\sqrt{d_{k}}}) V

(3)

Q, K, V

is the query, key, and value vectors obtained by linear transformation of the structural encoding

H_{t}

, and

d_{k}

is the attention head dimension. This mechanism can capture high-order dependencies between different time steps, and is particularly suitable for modeling the stage-by-stage and nonlinear characteristics of Parkinson’s disease progression.

To achieve a more effective integration of structural features and temporal dynamics, this study introduces a structure-aware gating mechanism that dynamically adjusts the contribution of each type of information to the final model representation. In developing this gating mechanism, we build on the adaptive fusion principles proposed by Zhang and Wang, who showed that incorporating gating units within the SegFormer architecture enables clinical imaging models to flexibly couple domain-specific structural features, thereby improving segmentation accuracy in complex environments [21].Similarly, Yan et al. showed that dynamically weighting different data sources in neural networks enhances survival prediction across diverse cancer types [22]. Xiao et al. further emphasized the advantages of flexible information fusion for medical image classification, particularly in cytopathology analysis [23]. Building on these findings, our gating mechanism allows the model to adaptively determine the optimal balance between structural and temporal information for each patient case, leading to a more refined and robust predictive framework. In this context, the fusion output is defined as:

Z_{t} = γ_{t} \cdot H_{t}^{G C N} + (1 - γ_{t}) \cdot H_{t}^{T r a n s f o r m e r}

(4)

γ_{t} = σ (W_{γ} [H_{t}^{G C N}; H_{t}^{T r a n s f o r m e r}]) \in {[0, 1]}^{d}

is the gating weight, which is obtained by concatenating the output of GCN and the output of Transformer, and inputting it into the fully connected layer and then activating it with Sigmoid. This fusion mechanism maintains the structural expression ability while enhancing the response ability to time dynamics, making the model more generalizable and personalized.

Finally, the entire framework is optimized in a supervised learning manner, to minimize the difference between the predicted value and the true disease score (such as UPDRS). Assuming the final output of the model is

{\overset{⌢}{y}}_{t}

and the true label is

y_{t}

, the overall loss function can be expressed as the mean square error (MSE):

L = \frac{1}{T} \sum_{t = 1}^{T} | | {\overset{⌢}{y}}_{t} - y_{t} | |_{2}^{2}

(5)

Through joint structural modeling, temporal perception, and dynamic fusion, this method can understand the evolution path of Parkinson’s disease symptoms from multiple perspectives, thereby providing basic support for accurate prediction and intelligent intervention.

3. Dataset

This study uses the Parkinson’s Progression Markers Initiative (PPMI) dataset as the research subject. The dataset is collected through multi-center collaboration and is widely used for early diagnosis and progression analysis of Parkinson’s disease. The PPMI dataset contains a large volume of multimodal clinical data from patients with Parkinson’s disease and healthy controls. It includes neuroimaging, biomarkers, behavioral scales, genetic information, and longitudinal follow-up records, providing a solid foundation for modeling disease progression. Especially in the temporal dimension, the dataset offers longitudinal data across multiple time points, capturing dynamic changes in disease status over time.

In this study, structured clinical scale information and selected gait signals from sensors in the PPMI dataset are used as input features. The scale data include the Unified Parkinson’s Disease Rating Scale (UPDRS), the Montreal Cognitive Assessment (MoCA), and questionnaires related to mood and sleep. These reflect changes in patient status across different functional domains. The gait signals are time series collected from wearable devices. They exhibit strong temporal dependencies and help capture subtle fluctuations in disease progression.

To ensure data quality and modeling consistency, this study selects samples with complete and continuous follow-up records. All features are standardized, and missing values are imputed. These steps ensure stable model training under high-dimensional, heterogeneous, and imbalanced data conditions. The final dataset subset retains the multimodal characteristics of the original data while providing sufficient temporal coverage and sample size. It meets the experimental requirements for jointly modeling structural and temporal patterns.

4. Experimental Results

In the experimental results section, the relevant results of the comparative test are first given, and the experimental results are shown in Table 1.

As shown in Table 1, the proposed model, which integrates structural perception and temporal modeling, achieves the best performance on Parkinson’s disease progression prediction, attaining an AUC of 0.879, significantly higher than competing methods and demonstrating stronger stage discrimination. It also records the lowest RMSE of 3.12, representing a 16.1% and 11.9% reduction compared to AdaMedGraph and TFT, respectively, indicating improved trend fitting of clinical scores through the synergy of graph neural networks for global symptom dependencies and Transformer encoders for fine-grained temporal dynamics. Moreover, the model achieves the highest IPW-F1 score of 91.7, outperforming methods such as KAN and XGBoost, which highlights its robustness to imbalanced data and superior predictive accuracy for mid-to-late-stage patients, aided by the structure-aware gating mechanism. Overall, the results confirm that combining structural and temporal modeling is essential to capture the complexity of Parkinson’s progression, enabling more accurate and clinically meaningful predictions than existing methods, while the impact of different structure-aware gating ratios is further illustrated in Figure 2.

As shown in the results of Figure 2, the structure-aware gating ratio has a significant impact on model performance. In the AUC curve, as the gating ratio increases from 0 to 1.0, the AUC value shows a general upward trend. This indicates that introducing the structure-aware mechanism can effectively enhance the model’s ability to distinguish different disease progression states. It also improves the model’s classification stability and global perception.

The RMSE curve further supports this trend. As the gating ratio increases, the prediction error decreases continuously from 3.54 to 3.12. This demonstrates that incorporating structural information enhances the model’s expressive power. It also helps fit the nonlinear trends in disease progression, improving the model’s approximation of continuous clinical targets. This confirms the importance of structural modeling in tasks involving interactive symptom evolution, such as Parkinson’s disease.

The IPW-F1 results show that the structure-aware mechanism significantly improves the identification of critical stages, such as mid-to-late-stage patients. As the gating ratio increases from 0 to 1.0, the F1 score continues to rise. This indicates that the model demonstrates stronger robustness and responsiveness when handling imbalanced samples or weighted importance scenarios. The gating mechanism builds flexible pathways between temporal and structural features, enabling the model to dynamically adjust its focus across different stages.

This paper also gives an analysis of the structural modeling capabilities under different graph connection densities, and the experimental results are shown in Figure 3.

As shown in the results of Figure 3, graph connection density has a significant impact on structural modeling capability, particularly in distinguishing different stages of Parkinson’s disease progression. As the connection density increases from 0.1 to 1.0, the AUC value rises steadily from 0.831 to 0.879. This indicates that richer structural information allows the model to better capture potential dependencies and semantic associations between symptoms. The ability of structural modeling to understand complex symptom evolution is greatly enhanced.

When the connection density is low, such as 0.1 or 0.3, the graph neural network captures only sparse structural information. It is limited to simple local relations between symptoms, which restricts the model’s structural perception and leads to unstable prediction performance. As the density increases to 0.5 or higher, especially at 0.7 and 0.9, the model gradually develops a more comprehensive structural awareness. It becomes more effective at identifying interaction patterns among multi-dimensional features, thereby improving modeling accuracy for complex disease dynamics.

It is worth noting that the model achieves optimal performance when the connection density reaches 1.0. This shows that complete structural information significantly benefits the modeling of disease progression pathways. However, in practical applications, it is also important to balance model complexity and computational cost. A suitable connection strategy should be chosen to achieve better structural modeling performance. Dense structures should not imply disordered aggregation but rather selective enhancement of key symptom connections to form more discriminative pathological graphs.

5. Conclusions

This study focuses on Parkinson’s disease, a complex neurodegenerative disorder, and proposes a progression prediction framework that integrates structural perception and temporal modeling. The goal is to improve the accuracy and generalization of disease trajectory modeling. By introducing graph neural networks to capture structural dependencies among multi-dimensional symptoms and applying a Transformer architecture to model temporal dynamics, the framework demonstrates strong adaptability in clinical scenarios with high variability and heterogeneous features. The structure-aware gating mechanism further enhances the model’s ability to adaptively regulate the importance of information across time steps, offering a more interpretable solution for disease course recognition and stage prediction.

Experimental results show that the proposed framework significantly outperforms existing methods across multiple key metrics. It maintains stable performance even under practical challenges such as imbalanced samples, missing features, and frequent stage transitions. This indicates that structural and temporal information are not isolated in chronic disease modeling tasks like Parkinson’s disease. Instead, they are interwoven and jointly drive disease progression. The fusion of these two dimensions is essential for improving prediction accuracy and robustness. Moreover, the introduced gating mechanism and graph construction strategy exhibit good scalability and modularity. They can be flexibly adapted to various types of spatiotemporal medical data.

This study not only offers a new technical path for Parkinson’s disease modeling but also provides theoretical support for broader applications such as clinical decision support, rehabilitation monitoring, and personalized intervention planning. In areas like smart healthcare, remote care, and chronic disease management for the elderly, the proposed method can serve as a foundational framework. It promotes a shift from static modeling to dynamic perception and proactive prediction in intelligent medical systems. The structural perception design can also be extended to other chronic neurological disorders, such as Alzheimer’s disease and depression, providing a technological basis for long-term disease management.

Future research can further enhance the framework’s expressive power and adaptability through multimodal graph modeling, cross-institutional data transfer, and self-supervised pretraining. In addition, how to effectively integrate the model with clinical expertise to achieve a closed-loop diagnostic system of model decision, expert intervention, and feedback optimization will be key to translating this research into practical applications. With the continued growth of medical data and computational resources, structurally and temporally integrated intelligent prediction models are expected to play an increasingly central role in disease warning, risk screening, and treatment optimization.

References

Lian, J.; Luo, X.; Shan, C.; Han, D.; Zhang, C.; Vardhanabhuti, V.; Li, D.; Qiu, L. Personalized progression modelling and prediction in Parkinson’s disease with a novel multi-modal graph approach. npj Park. Dis. 2024, 10, 1–10. [Google Scholar] [CrossRef] [PubMed]
Pei, X.; Fan, H.; Tang, Y. Temporal pyramid attention-based spatiotemporal fusion model for Parkinson's disease diagnosis from gait data. IET Signal Process. 2021, 15, 80–87. [Google Scholar] [CrossRef]
R. Wang, Y. Luo, X. Li, Z. Zhang, J. Hu and W. Liu, “A hybrid recommendation approach integrating matrix decomposition and deep neural networks for enhanced accuracy and generalization,” Proceedings of the 2025 5th International Conference on Neural Networks, Information and Communication Engineering (NNICE), pp. 1778-1782, 2025.
M. Wei, H. Xin, Y. Qi, Y. Xing, Y. Ren and T. Yang, “Analyzing data augmentation techniques for contrastive learning in recommender models,” 2025.
H. Zheng, Y. Xing, L. Zhu, X. Han, J. Du and W. Cui, “Modeling multi-hop semantic paths for recommendation in heterogeneous information networks,” arXiv preprint 2025. arXiv:2505.05989, 2025.
Huo, H.; Zhang, C.; Liu, W.; Zhao, C.; Ma, L.; Wang, J.; Xu, N. Early detection of Parkinson’s disease using a multi area graph convolutional network. Sci. Rep. 2025, 15, 1–14. [Google Scholar] [CrossRef] [PubMed]
Mohanraj, P.; Raman, V.; Ramanathan, S. Deep Learning for Parkinson’s Disease Diagnosis: A Graph Neural Network (GNN) Based Classification Approach with Graph Wavelet Transform (GWT) Using Protein–Peptide Datasets. Diagnostics 2024, 14, 2181. [Google Scholar] [CrossRef] [PubMed]
N. Jiang, W. Zhu, X. Han, W. Huang and Y. Sun, “Joint graph convolution and sequential modeling for scalable network traffic estimation,” arXiv preprint 2025. arXiv:2505.07674.
J. Zhan, “MobileNet compression and edge computing strategy for low-latency monitoring,” Journal of Computer Science and Software Applications, vol. 4, no. 4, 2024.
Y. Lou, J. Liu, Y. Sheng, J. Wang, Y. Zhang and Y. Ren, “Addressing class imbalance with probabilistic graphical models and variational inference,” Proceedings of the 2025 5th International Conference on Artificial Intelligence and Industrial Technology Applications (AIITA), pp. 1238-1242, 2025.
Patel, S.B.; FitzGerald, J.J.; A Antoniades, C. Decoding dynamic brain networks in Parkinson’s disease with temporal attention. Sci. Rep. 2025, 15, 1–20. [Google Scholar] [CrossRef]
X. Sun, Y. Duan, Y. Deng, F. Guo, G. Cai and Y. Peng, “Dynamic operating system scheduling using double DQN: A reinforcement learning approach to task optimization,” Proceedings of the 2025 8th International Conference on Advanced Algorithms and Control Engineering (ICAACE), pp. 1492-1497, 2025.
Y. Wang, T. Tang, Z. Fang, Y. Deng and Y. Duan, “Intelligent task scheduling for microservices via A3C-based reinforcement learning,” arXiv preprint 2025. arXiv:2505.00299.
L. Dai, W. Zhu, X. Quan, R. Meng, S. Chai and Y. Wang, “Deep probabilistic modeling of user behavior for anomaly detection via mixture density networks,” arXiv preprint 2025. arXiv:2505.08220.
Y. Peng, Y. Wang, Z. Fang, L. Zhu, Y. Deng and Y. Duan, “Revisiting LoRA: A smarter low-rank approach for efficient model adaptation,” Proceedings of the 2025 5th International Conference on Artificial Intelligence and Industrial Technology Applications (AIITA), pp. 1248-1252, 2025.
M. Z. U. Alam, M. S. Islam, E. Hoque et al., “PULSAR: Graph based positive unlabeled learning with multi stream adaptive convolutions for Parkinson’s disease recognition,” arXiv preprint 2023. arXiv:2312.05780.
W. Wang, Y. W. Wang, Y. Li, X. Yan, M. Xiao and M. Gao, “Breast cancer image classification method based on deep transfer learning,” Proceedings of the International Conference on Image Processing, Machine Learning and Pattern Recognition, pp. 190-197, 2024.
Y. Cheng, “Multivariate time series forecasting through automated feature extraction and transformer-based modeling,” Journal of Computer Science and Software Applications, vol. 5, no. 5, 2025.
Y. Zhang, J. Liu, J. Wang, L. Dai, F. Guo and G. Cai, “Federated learning for cross-domain data privacy: A distributed approach to secure collaboration,” arXiv preprint 2025. arXiv:2504.00282.
Y. Ma, G. Cai, F. Guo, Z. Fang and X. Wang, “Knowledge-informed policy structuring for multi-agent collaboration using language models,” Journal of Computer Science and Software Applications, vol. 5, no. 5, 2025.
X. Zhang and X. Wang, “Domain-adaptive organ segmentation through SegFormer architecture in clinical imaging,” Transactions on Computational and Scientific Methods, vol. 5, no. 7, 2025.
X. Yan, W. Wang, M. Xiao, Y. Li and M. Gao, “Survival prediction across diverse cancer types using neural networks,” Proceedings of the 2024 7th International Conference on Machine Vision and Applications, pp. 134-138, 2024.
M. Xiao, Y. Li, X. Yan, M. Gao and W. Wang, “Convolutional neural network classification of cancer cytopathology images: Taking breast cancer as an example,” Proceedings of the 2024 7th International Conference on Machine Vision and Applications, pp. 145-149, Singapore, Singapore, 2024.
J. Lian, X. Luo, C. Shan et al., “AdaMedGraph: Adaboosting graph neural networks for personalized medicine,” arXiv preprint 2023. arXiv:2311.14304.
S. Bengesi, “Enhanced Parkinson’s disease detection using hybrid deep learning and transformer model,” Ph.D. dissertation, Bowie State University, 2025.
Z. Yang, J. Zhang, X. Luo et al., “Medkan: An advanced Kolmogorov-Arnold network for medical image classification,” arXiv preprint 2025. arXiv:2502.18416.
Zhou, L.; Zhu, Q.; Chen, Q.; Wang, P.; Huang, H. Predicting hospital outpatient volume using XGBoost: a machine learning approach. Sci. Rep. 2025, 15, 1–13. [Google Scholar] [CrossRef]

Figure 1. Overall model architecture.

Figure 2. The impact of structure-aware gating ratio on prediction performance.

Figure 3. Analysis of structural modeling capabilities under different graph connection densities.

Table 1. Comparative experimental results.

Method	AUC	RMSE	IPW-F1
AdaMedGraph [24]	0.812	3.72	85.6
TFT [25]	0.834	3.54	87.1
KAN [26]	0.846	3.41	88.3
XGBoost meta-predictor [27]	0.857	3.35	88.9
Ours	0.879	3.12	91.7

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.