Submitted:
03 June 2026
Posted:
04 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. Sentiment Analysis in Public Health Emergencies
2.2. Online Feature Selection for Streaming Data
2.3. Multimodal Feature Fusion Strategies
3. Problem Definition and Overall Framework
3.1. Problem Formulation
3.2. Overall Framework
4. Multimodal Online Divide-And-Conquer Markov Blanket Learning Algorithm
4.1. Markov Blanket and Mutual Information
4.2. M-O-DC Algorithm
4.2.1. Problem Setup and Multimodal Input Definition
4.2.2. Decoupled Multimodal Processing
4.3. Online Mutual Information Estimation for Text and Image
4.3.1. Text Features
4.3.2. Image Features
4.3.3. Cross-modal Conditional MI Approximation
4.4. Multimodal Dynamic Adaptation Mechanisms
4.4.1. Modality-Adaptive Thresholds
4.4.2. Multimodal Concept Drift Detection
5. Cross-Modal Interactive Enhanced Fusion Network
5.1. Deep Encoder Design
5.1.1. Formalization of Multimodal Feature Encoding
5.2. Two-Stage Cross-Modal Interactive Attention:
5.2.1. Stage 1: Bidirectional Cross-Modal Attention
5.2.2. Stage 2: Deep Interactive Normalization
5.3. Adaptive Modal Weighting
5.3.1. Enhanced Cross-Modal Features
5.3.2. Gated Weight Generation Network
5.3.3. Weighted Fusion
5.4. Residual Fusion and Classification
5.4.1. Residual-Enhanced Feature Fusion
5.4.2. Self-Attentive Pooling
5.4.3. Sentiment Classification
5.5. Loss Function and Optimization
5.5.1. Multi-Task Loss
5.5.2. Optimization
6. Experiments and Results Analysis
6.1. Experimental Data
6.2. Main Results and Analysis
6.2.1. The Classification Results
6.2.2. Baseline Models
6.2.3. Results and Discussion
6.2.4. Ablation Study Analysis
7. Conclusion
References
- Che, S. P.; Wang, X.; Zhang, S. N.; et al. Effect of daily new cases of COVID-19 on public sentiment and concern: Deep learning-based sentiment classification and semantic network analysis. J. Public Health 2024, 32(3), 509–528. [Google Scholar] [CrossRef]
- Mumuni, A.; Mumuni, F. Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods. Knowl. Inf. Syst. 2025, 67(5), 1–11. [Google Scholar] [CrossRef]
- Tang, H.; Wang, Y.; Zhang, Y.; et al. TS-Mixer: A lightweight text representation model based on context awareness. Expert Syst. 2025, 42(2). [Google Scholar] [CrossRef]
- Zhang, L.; Wang, X.; Wang, J.; Liao, G. Research on emergency decision quality evaluation and optimization basing on public sentiment big data analysis. Comput. Ind. Eng. 2024, 193(7), 109452. [Google Scholar] [CrossRef]
- Han, P.; Zhang, W.; Zhang, Z. Sentiment Analysis of Weibo Posts on Public Health Emergency with Feature Fusion and Multi-Channel. Data Anal. Knowl. Discov. 2021, 5(11), 68–79. [Google Scholar]
- Gariboldi, M.I.; Lin, V.; Bland, J.; et al. Foresight in the time of COVID-19. Lancet Reg. Health West. Pac. 2021, 6, 100049. [Google Scholar] [CrossRef]
- Ahelegbey, D.F.; Celani, A.; Cerchiello, P. Measuring the impact of the EU health emergency response authority on the economic sectors and the public sentiment. Socioecon. Plan. 2024, 92, 101842. [Google Scholar] [CrossRef]
- Chen, X.; Zhang, W.; Xu, X.; Cao, W. A public and large-scale expert information fusion method and its application: Mining public opinion via sentiment analysis and measuring public dynamic reliability. Inf. Fusion 2022, 78, 71–85. [Google Scholar] [CrossRef]
- An, L.; Xu, M. Measuring online trust in government microblogs in public health emergencies. Data Anal. Knowl. Discov. 2022, 6(1), 55–68. [Google Scholar]
- Cai, Y.; Yang, K.; Huang, D.; Zhou, Z.; Lei, X.; Xie, H.; Wong, T. A hybrid model for opinion mining based on domain sentiment dictionary. Int. J. Mach. Learn. Cybern. 2019, 10(8), 2131–2142. [Google Scholar] [CrossRef]
- Xu, G.; Yu, Z.; Yao, H.; Li, F.; Meng, Y.; Wu, X. Chinese text sentiment analysis based on extended sentiment dictionary. IEEE Access 2019, 7, 43749–43762. [Google Scholar] [CrossRef]
- Yang, S.; Chen, F. Analyzing sentiments of Micro-blog posts based on support vector machine. Data Anal. Knowl. Discov. 2017, 1(2), 73–79. [Google Scholar]
- Fan, H.; Li, P. Sentiment analysis of short text based on FastText word vector and bidirectional GRU recurrent neural network. Inf. Sci. 2021, 39(4), 15–22. [Google Scholar]
- Liu, J.; Gu, F. Unbalanced text sentiment analysis of network public opinion based on BERT and BiLSTM hybrid method. J. Intell. 2022, 41(04), 104–110. [Google Scholar]
- Hyun, D.; Park, C.; Yang, M.; Song, I.; Lee, J.; Yu, H. Target-aware convolutional neural network for target-level sentiment analysis. Inf. Sci. 2019, 491, 166–178. [Google Scholar] [CrossRef]
- Guo, X.; Zhao, N.; Cui, S. Consumer reviews sentiment analysis based on CNN-BILSTM. Syst. Eng. Theory Pract. 2020, 40, 653–663. [Google Scholar]
- Lai, X.; Tang, H.; Chen, H.; Li, S. Multimodal sentiment analysis based on feature fusion of attention mechanism-bidirectional gated recurrent unit. J. Comput. Appl. 2021, 41(5), 1268–1274. [Google Scholar]
- Duan, W. J.; Deng, J. K.; Zhang, S. X.; et al. Aspect-based sentiment analysis model based on multilevel knowledge enhancement. CAAI Trans. Intell. Syst. 2024, 19(5), 1287–1297. [Google Scholar]
- Liu, J. H.; Li, l.; Wu, R. W.; et al. Mutli-prompt learning based aspect-category sentiment analysis. J. Front. Comput. Sci. Technol. 2025, 19(05), 1334–1341. [Google Scholar]
- Zhang, W.; Chu, Z. Y.; Chen, X. Q.; et al. Aspect-based sentiment analysis with syntactic prompt. J. Comput. Appl. 2024, 44, 35–43. [Google Scholar]
- Li, Z.; Xu, H.; Duan, B. Research on image emotion feature extraction based on deep learning CNN model. Libr. Inf. Serv. 2019, 63(11), 96–107. [Google Scholar]
- Cai, G.; He, X.; Chu, Y. Visual sentiment analysis by combining global and local regions of image. J. Comput. Appl. 2019, 39(8), 2181–2185. [Google Scholar]
- Alamoodi, A.H.; Zaidan, B.B.; Zaidan, A.A.; et al. Sentiment analysis and its applications in fighting COVID-19 and infectious diseases: A systematic review. Expert Syst. Appl. 2021, 167, 114155. [Google Scholar] [CrossRef]
- Arbane, M.; Benlamri, R.; Brik, Y.; Alahmar, A.D. Social media-based COVID-19 sentiment classification model using Bi-LSTM. Expert Syst. Appl. 2023, 212, 118710. [Google Scholar] [CrossRef] [PubMed]
- Blanco, G.; Lourenço, A. Optimism and pessimism analysis using deep learning on COVID-19 related twitter conversations. Inf. Process. Manag. 2022, 59(3), 102918. [Google Scholar] [CrossRef]
- Tan, H.; Peng, S.; Zhu, C.; et al. Long-term effects of the COVID-19 pandemic on public sentiments in mainland China: Sentiment analysis of social media posts. J. Med. Internet Res. 2021, 23(8), e29150. [Google Scholar] [CrossRef] [PubMed]
- Kumar, A.; Garg, G. Sentiment analysis of multimodal twitter data. Multimed. Tools Appl. 2019, 78(17), 24103–24119. [Google Scholar] [CrossRef]
- Gan, Z. H.; Miao, Y. Q.; Liu, T. L. Multimodal aspect-level sentiment analysis based on cross-modal interaction Transformer. Appl. Res. Comput. 2025, 42(9), 2707–2713. [Google Scholar]
- Perkins, S.; Theiler, J. Online feature selection using grafting[C]. the Twentieth International Conference on Machine Learning, 2003; pp. 592–599. [Google Scholar]
- Ungar, L. Streaming feature selection using alpha-investing[C]. the Eleventh International Conference on Knowledge Discovery and Data Mining, 2005; pp. 384–393. [Google Scholar]
- Wu, X.; Yu, K.; Ding, W.; et al. Online feature selection with streaming features. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35(5), 1178–1192. [Google Scholar]
- Lin, Y.; Hu, Q.; Zhang, J. Multi-label feature selection with streaming labels. Inf. Sci. 2016, 372, 256–275. [Google Scholar] [CrossRef]
- Lin, Y.; Hu, Q.; Liu, J.; et al. Streaming feature selection for multi-label learning based on fuzzy mutual information. IEEE Trans. Fuzzy Syst. 2017, 25(6), 1491–1507. [Google Scholar] [CrossRef]
- Liu, J.; Lin, Y.; Li, Y.; et al. Online multi-label streaming feature selection based on neighborhood rough set. Pattern Recognit. 2018, 84, 273–287. [Google Scholar] [CrossRef]
- Li, H.; Wu, X.; Li, Z.; et al. Group feature selection with streaming features[C]. 13th International Conference on Data Mining, 2013; pp. 1109–1114. [Google Scholar]
- Jing, W.; Meng, W.; Li, P.; et al. Online feature selection with group structure analysis[J]. IEEE Trans. Knowl. Data Eng. 2015, 27(11), 3029–3041. [Google Scholar]
- Pearl, J.; Mackenzie, D. The book of why: the new science of cause and effect. Science 2018, 361(6405), 852–855. [Google Scholar] [CrossRef]
- Jake, H.; Amit, S.; Duncan, W. Prediction and explanation in social systems. Science 2017, 355(6324), 486–488. [Google Scholar] [CrossRef]
- Gao, T.; Ji, Q. Efficient markov blanket discovery and its application. IEEE Trans. Cybern. 2016, 47(5), 1169–1179. [Google Scholar] [CrossRef] [PubMed]
- Ling, Z.; Yu, K.; Wang, H.; et al. Bamb: A balanced markov blanket discovery approach to feature selection. ACM Trans. Intell. Syst. Technol. (TIST) 2019, 10(5), 1–25. [Google Scholar] [CrossRef]
- Wang, H.; Ling, Z.; Yu, K.; et al. Towards efficient and effective discovery of markov blankets for feature selection. Inf. Sci. 2020, 509, 227–242. [Google Scholar] [CrossRef]
- Wu, X.; Jiang, B.; Yu, K.; et al. Accurate markov boundary discovery for causal feature selection. IEEE Trans. Cybern. 2019, 50(12), 4983–4996. [Google Scholar] [CrossRef]
- Liu, C.; Wang, Y.; Yang, J. A transformer-encoder-basedmultimodalmulti-attention fusion network for sentiment analysis. Appl. Intell. 2024, 54, 8415–8441. [Google Scholar] [CrossRef]
- Chen, X.; Zhang, W.; Xu, X.; Cao, W. A public and large-scale expert information fusion method and its application: Mining public opinion via sentiment analysis and measuring public dynamic reliability. Inf. Fusion 2022, 78, 71–85. [Google Scholar] [CrossRef]
- Wang, Y.; Xie, J.; Chen, B.; Xu, X. Multi-modal sentiment analysis based on cross-modal context-aware attention. Data Anal. Knowl. Discov. 2021, 5(4), 49–59. [Google Scholar]
- Harish, A. B.; Sadat, F. Trimodal Attention Module for Multimodal Sentiment Analysis. Assoc. Adv. Artif. Intell. (AAAI) 2020, 1–10. [Google Scholar]
- Majumder, N.; Hazarika, D.; Gelbukh, A. Multimodal sentiment analysis using hierarchical fusion with context modeling. Knowl. Based Syst. 2018, 161, 124–133. [Google Scholar] [CrossRef]
- Sun, H.; Chen, Y. W.; Lin, L. Tensor Former: A Tensor-Based Multimodal Transformer for Multimodal Sentiment Analysis and Depression Detection. IEEE Trans. Affect. Comput. 2023, 14(4), 2776–2786. [Google Scholar] [CrossRef]
- Tsai, Y. H. H.; Bai, S. J.; Liang, P. P.; et al. Multimodal Transformer for Unaligned Multimodal Language Sequences. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019; pp. 6558–6569. [Google Scholar]
- Yang, K. C.; Xu, H.; Gao, K. CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis. In Proceedings of the 28th ACM International Conference on Multimedia, 2020; pp. 521–528. [Google Scholar]
- Yuzhu, Wang; Jun, Xie; Bo, Chen; et al. Multi-modal Sentiment Analysis Based on Cross modal Context-aware Attention. Data Anal. Knowl. Discov. 2021, 5(4), 49–59. [Google Scholar]
- Yansong, Chen; Le, Zhang; Leihan, Zhang. Multimodal Sentiment Analysis Method Based on Cross-Modal Attention and Gated Unit Fusion Network. Data Anal. Knowl. Discov. 2024, 8(7), 67–76. [Google Scholar]
- Bengong, Yu; Zhongyu, Shi. Deep Attention and Two-Stage Fusion of Image-Text Sentiment Contrastive Learning Method. Comput. Eng. Appl. 2025, 61(3), 223–233. [Google Scholar]
- Alahmadi, K.; Alharbi, S.; Wang, X. Integrating dense layers with residual connections into transformers for enhanced sentiment classification. J. Supercomput. 2025, 81, 1542. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; et al. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016; pp. 770–778. [Google Scholar]
- Islam, M.S.; Xiangdong, L.; Ahmed, J. BERT: advancements in language understanding for different NLP tasks: challenges and future perspectives. J. Electr. Syst. Inf. Technol. 2026, 13, 49. [Google Scholar] [CrossRef]
- Yanxiao, Han; Jing, Ma. The RCHFN Model: A Multimodal Feature Fusion Approach for Sentiment Classification. Data Anal. Knowl. Discov. 2024, 8(12), 18–29. [Google Scholar]
- Su, Y. Y.; Han, C. J.; Li, A. M.; et al. Research on Image-text Multimodal Sentiment Recognition Driven by Large Model Enhancement and Multi-feature Cross-fusion. Inf. Stud. Theory Appl. 2025, 9, 1–16. [Google Scholar]
- Shahid, S. D.; Mohammad, Z.; Karan, B.; et al. A social context-aware graph-based multi-modal attentive learning framework for disaster content classification during emergencies. Expert Syst. With Appl. 2025, 259, 125337–125360. [Google Scholar]
- Zeng, Z. M.; Sun, S. Q.; Li, Q. Q. Multimodal negative sentiment recognition of online public opinion on public health emergencies based on graph convolutional networks and ensemble learning. Inf. Process. Manag. 2023, 60(4), 103378–103395. [Google Scholar] [CrossRef]



| Public Health Emergency Names | Total Sample Size | Positive | Neutral | Negative |
|---|---|---|---|---|
| COVID-19 | 10,245 | 2,377 | 4,252 | 3,616 |
| Avian Influenza | 3,210 | 803 | 1445 | 962 |
| Monkeypox | 2,222 | 489 | 934 | 799 |
| Total | 15,677 | 3,669 | 6631 | 5377 |
| Stage | Parameters Value |
|---|---|
| O-DC thresholds | |
| Optimizer | Learning_rate=5×10⁻⁵ |
| β=(0.9, 0.999) | |
| Training: Online learning | batch size=1 |
| update every 100 samples |
| Events | Sentiment Category | Precision (P) | Recall (R) | F1-score (F) | Accuracy (A) |
|---|---|---|---|---|---|
| COVID-19 | positive | 91.15% | 82.54% | 86.64% | |
| Neutral | 88.45% | 82.78% | 85.52% | 86.62% | |
| Negative | 90.31% | 85.35% | 87.75% | ||
| Avian Influenza | positive | 91.37% | 89.29% | 90.30% | |
| Neutral | 84.76% | 81.78% | 83.27% | ||
| Negative | 90.48% | 88.12% | 89.25% | 87.15% | |
|
Monkeypox |
positive | 91.23% | 86.72% | 88.92% | |
| Neutral | 83.97% | 80.34% | 82.12% | 87.47% | |
| Negative | 92.09% | 90.75% | 91.38% | ||
| Average of the three events | 89.31% | 85.30% | 87.24% | 87.08% |
| Model | Precision (P) | Recall (R) | F1-Score (F) | Accuracy (A) |
|---|---|---|---|---|
| Online SVM (Text) | 72.2% | 65.5% | 70.8% | 72.5% |
| Online BERT | 75.5% | 66.3% | 76.5% | 78.2% |
| EF + OGL | 64.7% | 79.2% | 78.9% | 80.1% |
| CMA | 77.3% | 81.3% | 80.8% | 82.3% |
| MARN | 82.7% | 91.3% | 81.6% | 83.1% |
| CLIP (finetuned) | 88.1% | 83.9% | 85.9% | 86.2% |
| O-DC + CMA | 85.6% | 71.8% | 83.2% | 84.7% |
| O-DC + Our Fusion | 89.3% | 85.3% | 87.2% | 87.1% |
| Model Variant | Accuracy (%) | ΔAcc | F1-Score (%) | ΔF1 | Processing Time (ms) | |
|---|---|---|---|---|---|---|
| Full Model | 87.1 | – | 85.4 | – | 68 | |
| w/o O-DC (using all features) | 84.1 | -2.8 | 82.7 | -2.7 | 82 | |
| w/o 2nd-stage Attention | 85.6 | -1.3 | 84.0 | -1.4 | 64 | |
| w/o Adaptive Weighting | 85.9 | -1.0 | 84.3 | -1.1 | 66 | |
| w/o Residual Connections | 86.1 | -0.8 | 84.7 | -0.7 | 67 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).