Submitted:
25 November 2025
Posted:
26 November 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
3. Materials & Methods
3.1. Frontier Topic Mining Method via Generative Agent (G-Agent)
3.2. Frontier Topics Mining Method via Dual Agents (D-Agents)
4. Data Sets and Evaluation Metrix
4.1. Dataset
4.2. Evaluation Metrix
5. Experimental and Results
5.1. Baseline Methods
5.2. Results and Analysis
6. Discussion
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A




References
- Zhang Hui,Yang Xiaoyan,Zhao Xujian, et al. Subject Frontiers Hot Spots Mining Based on Social Network Attention[J]. Journal of Zhengzhou University (Natural Science Edition), 2018, 50(03):46-52. https://link.oversea.cnki.net/doi/10.13705/j.issn.1671-6841.2017201.
- Prapobratanakul, Chariya. Frequency Analysis, Distribution, and Coverage of Academic Words in Materials Science Research Articles: A Corpus-Based Study [J]. LEARN Journal: Language Education and Acquisition Research Network 17.2, 2024: 793-813. [CrossRef]
- Kyebambe, Moses Ntanda, et al. Forecasting emerging technologies: A supervised learning approach through patent analysis [J]. Technological Forecasting and Social Change 125, 2017: 236-244. https://doi.org/10.1016/j.techfore.2017.08.002. [CrossRef]
- Klarin, Anton. How to conduct a bibliometric content analysis: Guidelines and contributions of content co-occurrence or co-word literature reviews [J]. International Journal of Consumer Studies 48.2, 2024: e13031. [CrossRef]
- Ge, Bin, et al. Chinese news hot subtopic discovery and recommendation method based on key phrase and the LDA model [J]. DEStech Transactions on Engineering and Technology Research, ECAR, 2018. [CrossRef]
- Lalk, Christopher, et al. Measuring alliance and symptom severity in psychotherapy transcripts using bert topic modeling [J]. Administration and Policy in Mental Health and Mental Health Services Research 51.4, 2024: 509-524. [CrossRef] [PubMed]
- He, Chunhui, Bin Ge, and Chong Zhang. Chinese Text Open Domain Tag Generation Method via Large Language Model [C].10th International Conference on Big Data and Information Analytics. IEEE, 2024, 183-188. [CrossRef]
- Jin, Yiqiao, et al. AGENTREVIEW: Exploring Peer Review Dynamics with LLM Agents [C]. //In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 1208–1226, Miami, Florida, USA. Association for Computational Linguistics. [CrossRef]
- Giray, L. Prompt Engineering with ChatGPT: A Guide for Academic Writers [J]. Ann Biomed Eng 51, 2023, 2629–2633. [CrossRef] [PubMed]
- Hu H, Xue W, Jiang P, et al. Bibliometric analysis for ocean renewable energy: An comprehensive review for hotspots, frontiers, and emerging trends [J]. Renewable and Sustainable Energy Reviews. 2022, 167:112739. [CrossRef]
- Gao Y, XU Y, ZHU Y, et al. An analysis of the hotspot and frontier of mine eco-environment restoration doul big data visualization of VOSviewer and CiteSpace [J]. Geological Bulletin of China. 2018, 37(12):2144-53. http://dx.chinadoi.cn/10.12097/j.issn.1671-2552.2018.12.004.
- Zhang Y, Zhao D, Liu H, et al. Research hotspots and frontiers in agricultural multispectral technology: Bibliometrics and scientometrics analysis of the Web of Science [J]. Frontiers in Plant Science. 2022, 13:955340. [CrossRef] [PubMed]
- Han Q, Li Z, Fu Y, et al. Analyzing the research landscape: mapping frontiers and hot spots in anti-cancer research using bibliometric analysis and research network pharmacology [J]. Frontiers in Pharmacology. 2023, 14:1256188. [CrossRef] [PubMed]
- Zhang N, Feng G. Analysis of Big Data Research Hotspots Based on Keyword Co-occurrence [C]. In IEEE 9th International Conference on Data Science in Cyberspace, 2024, 464-471. [CrossRef]
- Zhang B, Zhang S. Analysis of the Sci-tech Industry Research Hot Spots Based on Keywords Co-occurrence Analysis and Social Network Analysis[J]. Science and Technology Management Research, 2016(20): 93-98. http://dx.chinadoi.cn/10.3969/j.issn.1000-7695.2016.20.018.
- Nie J. Study on the Law and Mechanism of Tibetan Medicine's Prevention and Treatment of Plateau Disease Based on Knowledge Discovery [D]. Chengdu University of Traditional Chinese Medicine, 2017.
- Mi L, Zhang W, Yu H, et al. Knowledge mapping analysis of pro-environmental behaviors: research hotspots, trends and frontiers [J]. Environment, Development and Sustainability. 2024:1-35. [CrossRef]
- Li B, Qin Y, Xu Z. Dynamic Tracking of Research Status and Frontiers in the Field of Management Science [J]. Chinese Journal of Management Science, 2023, 31(07):276-286. https://link.oversea.cnki.net/doi/10.16381/j.cnki.issn1003-207x.2021.1830.
- Xu J, Sun S, Zhao Y, et al. Knowledge domain, research hotspots and frontiers in physiology teaching reforms from 2012 to 2021: A bibliometric and knowledge-map analysis [J]. Frontiers in Medicine. 2023, 10:1031713. [CrossRef] [PubMed]
- Shao B, Li X, Bian G. A survey of research hotspots and frontier trends of recommendation systems from the perspective of knowledge graph [J]. Expert Systems with Applications. 2021, 165:113764. [CrossRef]
- Cheng J, Lu D, Sun L, et al. Development Trends, Current Hotspots, and Research Frontiers of Oyster Reefs: A Bibliometric Analysis Based on CiteSpace [J]. Water. 2023, 15(20):3619. [CrossRef]
- Zhang P, Zhang J, Wang M, et al. Research hotspots and trends of neuroimaging in social anxiety: a CiteSpace bibliometric analysis based on Web of Science and Scopus database [J]. Frontiers in Behavioral Neuroscience. 2024, 18:1448412. [CrossRef] [PubMed]
- Lewis P, Perez E, Piktus A, et al. Retrieval-augmented generation for knowledge-intensive nlp tasks[J]. Advances in neural information processing systems, 2020, 33: 9459-9474.
- Baidu. Agent Builder [EB/OL]. https://agents.baidu.com/agent, 2024-04-19/2025-02-25.
- Baidu. ERNIE-4.0-8k [EB/OL]. https://cloud.baidu.com/doc/WENXINWORKSHOP/s/clntwmv7t, 2025-02-20/2025-02-25.
- Baidu.ERNIE-3.5-128k [EB/OL]. https://cloud.baidu.com/doc/WENXINWORKSHOP/s/dlw4ptsq7, 2025-02-21/20025-02-25.
- Deng Z, Ma W, Han Q L, et al. Exploring DeepSeek: A Survey on Advances, Applications, Challenges and Future Directions[J]. IEEE/CAA Journal of Automatica Sinica, 2025, 12(5): 872-893.



| Agent module name | Module specific functions or corresponding instructions |
| User module | Guide users to enter frontier topic mining prompt instructions or give feedback instructions in specified fields. |
| Brain of the G-Agent | ERNIE 4.0 |
| Role of the G-Agent | Senior experts in the field of intelligence. |
| Perception of the G-Agent | Automatically capture domain frontier topic mining goals and requirements from user-input prompt instructions. |
| Memory of the G-Agent | Long-term memory |
| Action of the G-Agent | ① Default question ② Automatic questioning ③ Tool call ④ Feedback learning ⑤ Clear historical conversation |
| Planning of the G-Agent | ① Automatically complete the search and learning of knowledge according to memory and user feedback. ② Collect and interpret the corresponding authoritative literature according to the captured frontier topics mining goals and requirements. ③ Generate no more than K candidate frontier topics after in-depth interpretation of all data in chronological order. ④ The literature collected comes from published academic papers and patent documents, research reports, and conference call for papers. ⑤ Ensure that frontier topics are highly relevant and accurate to the field input by the user. ⑥ Ensure that the number of words in a single frontier topic is strictly prohibited from exceeding 10 words, and it is not repeated. ⑦ Only use the list form to return topics one by one, without any explanatory text. ⑧ The output format refers to the style of the knowledge base, and the output results are automatically aligned with the frontier topics in the knowledge base as much as possible. |
| Optional tool set for G-Agent | ① Online search ② Micro single domain knowledge base |
| Result generation of the G-Agent | Candidate frontier topics are returned to users one by one in strict accordance with the list format. |
| Agent module name | Module specific functions or corresponding instructions |
| User module | Guide the user to input verification prompt instructions or give feedback instructions according to the candidate frontier topic list. |
| Brain of the V-Agent | ERNIE 3.5 |
| Role of the V-Agent | Senior experts in the field of intelligence. |
| Perception of the V-Agent | Automatically capture a list of candidate frontier topics and domains to which frontier topics belong from an input prompt instruction. |
| Memory of the V-Agent | Long-term memory |
| Action of the V-Agent | ① Default question ② Automatic questioning ③ Tool call ④ Feedback learning ⑤ Clear historical conversation |
| Planning of the V-Agent | ① Automatically complete the search and learning of knowledge according to memory and user feedback. ② Call knowledge to complete reasoning and verification according to the capture list, domain mining goals and requirements. ③ Strictly verify each frontier topics to make it novel, complete, accurate and authoritative. ④ Ensure that each verified frontier topic is independent and complete, with concise word count and strictly abiding by academic norms. ⑤ Ensure that the frontier topics that have passed the verification cannot be repeated, and automatically filter the frontier topics that have not passed the verification. ⑥ Allows to expand and supplement undiscovered but closely related and authentic frontier topics in the above list. ⑦ The output format strictly refers to the style of the local knowledge base, and the frontier topics covered by the knowledge base should be strictly consistent with it. ⑧ Ensure that the verified frontier topics are directly output one by one without explanatory text. |
| Optional tool set for V-Agent | ① Online search ② Large cross-domain knowledge base |
| Result generation of the V-Agent | The verified frontier topics are uniformly returned to the user in strict accordance with the specified format. |
| Dataset name | Data Source | Top 20 frontier topics |
| CV-DataSet | CVPR ICCV ECCV |
Image 3D, Autonomous driving, Adversarial attack and defense, Biometrics and Biometric Vision, Posture recognition, Computational imaging, Datasets and evaluation, Deep learning architecture innovation, Explainable computer vision, Image and video synthesis, Low level vision, Representation learning, Scene analysis and understanding, Optimization methods, Vision applications and systems, Vision + other modalities, Transfer learning, Vision ethics, Embodied vision, Segmentation grouping and shape analysis |
| Dataset name | Data Source | Top 20 frontier topics |
| NLP-DataSet | ACL EMNLP NAACL CoNLL |
Computational social sciences and sociolinguistics, Dialogue and interaction systems, Efficient/low resource methods for NLP, NLP model interpretability and analysis, Multilingualism and linguistic diversity, Question answering systems, Information retrieval and text mining, Machine translation, Machine learning for NLP, Multimodality and linguistic foundations, Ethical bias and fairness, Information extraction, Pre-trained Models, Resources and evaluation, Semantics and syntax, Sentiment analysis, Text summary, Language understanding and generation, Linguistic theory, Speech processing |
| Dataset name | Data source | Top 20 frontier topics |
| ML-DataSet | ICML NeurIPS COLT |
General machine learning, Game theory and statistical learning theory, Deep learning, Neuroscience and cognitive science, Large language models, Reinforcement learning, Online learning, Time series analysis, Optimization methods, Probabilistic methods, Algorithm design and analysis, Active and interactive learning, Data geometry and topology learning, Kernel methods, Complex data learning, Trusted machine learning, Machine learning applications, High dimensional & nonparametric statistics, Bayesian methods, Supervised learning |
| Dataset name | Method | Accuracy | RC |
| CV-DataSet | G-Agent | ||
| D-Agents | |||
| NLP-DataSet | G-Agent | ||
| D-Agents | |||
| ML-DataSet | G-Agent | ||
| D-Agents |
| Domain Frontier Topic Mining |
Expert Mining Results |
DeepSeek-R1-671B Mining Results |
D-Agents Mining Results |
D-Agents Accuracy | D-Agents RC |
| Mining Top6 frontier topics in the Field of "Altitude Sickness" from 2006 to 2016 | Altitude polycythemia, Acute altitude sickness, Chronic altitude sickness, Apoptosis, Altitude hypoxia, Pulmonary hypertension [16] | Molecular mechanisms of altitude hypoxia adaptation, Characteristics and prevention of acute altitude sickness, Prevention and control of echinococcosis in high altitude areas, Altitude hypoxia, Altitude cardiovascular diseases, Altitude ecology and health | Altitude polycythemia, Acute altitude sickness, Chronic altitude sickness, Altitude hypoxia, Pulmonary hypertension, Preventive measures of altitude sickness, Pathogenesis of altitude sickness | ||
| Mining Top7 frontier topics in the field of "Recommendation System" from 2009 to 2018 | Collaborative filtering and matrix factorization, Information technology and recommendation system, Recommendation algorithm and performance evaluation, User feature representation technology, Cold start and data sparsity, Personalized recommendation, privacy protection [20] |
Explainable recommendation, Fairness & diversity, Reinforcement learning for recommendation system, Multi-modal recommendation, Cold start & data sparsity, Knowledge graph for recommendation system, Privacy & federated learning |
Collaborative filtering and matrix factorization, Information technology and recommendation system, User feature representation technology, Cold start and data sparsity, Personalized recommendation, privacy protection, AI recommendation system | ||
| Mining Top3 frontier topics in the field of "Oyster Reef Ecosystem" from 1981 to 2022 | Habitat protection and restoration, Ecosystem services, Climate change [21] | Ecosystem service, Ecological degradation and remediation techniques, Ecological conservation and policy support | Habitat protection and restoration, Ecosystem services, Ecological diversity conservation |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
