Submitted:
03 October 2025
Posted:
15 October 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Requirements Analysis
3. Pinecone Vector Retrieval and RAG Architecture Design
3.1. Overall System Architecture Design
3.2. Data Processing and Vectorization Strategy
3.3. Pinecone Vector Database Construction
3.4. Retrieval-Augmented Generation (RAG) Workflow Design
4. Model Construction and System Implementation
4.1. Generative Recommendation Model Design
4.2. Modular System Development and Interface Implementation
4.3. Model Training and Optimization
5. Experimental Results and Analysis
5.1. Experimental Environment Configuration
5.2. Vector Retrieval Performance Testing
5.3. Recommendation System Performance Comparison
6. Conclusion
References
- Shan S, Li Y. Research on the Application Framework of Generative AI in Emergency Response Decision Support Systems for Emergencies [J]. International Journal of Human–Computer Interaction, 2025, 41 (15): 9191-9208.
- Wang, L. Research on Teaching Reform of Generative AI-Empowered New Media Data Analysis and Application Course [J]. Journal of Education and Educational Research, 2025, 14 (1): 109-112.
- Zhou Y, Yang X, Xu W, et al. Rag GTPases control lysosomal acidification by regulating v-ATPase assembly in Drosophila. [J]. The Journal of biological chemistry, 2025, 301 (7): 110400.
- Yang W, Sun Y, Yang F, et al. Preparation and properties of slow-release fertilizer containing urea encapsulated by pinecone biochar and cellulose acetate. [J]. International journal of biological macromolecules, 2025, 315 (P2): 144448.
- He Y, Zhu X, Li D, et al. Enhancing Large Language Models for Specialized Domains: A Two-Stage Framework with Parameter-Sensitive LoRA Fine-Tuning and Chain-of-Thought RAG [J]. Electronics, 2025, 14 (10): 1961-1961.
- Zhong X, Bai J, Deng C, et al. Pinecone-Structured ZnO microparticle coatings: A superhydrophobic approach for scale prevention on steel surfaces [J]. Surface Engineering, 2025, 41 (4): 463-470.
- He X, Islam A M, Zhao T, et al. KOH-Activated Pinecone Biochar for Efficient Chloramphenicol Removal From Aqueous Solutions [J]. CleanMat, 2025, 2 (1): 72-84.
- Haowei Yang, Yu Tian, Zhongheng Yang, Zhao Wang, Chengrui Zhou, and Dannier Li. 2025. Research on Model Parallelism and Data Parallelism Optimization Methods in Large Language Model-Based Recommendation Systems. arXiv preprint arXiv:2506.17551. arXiv:2506.17551. [CrossRef]
- Huang L, Lu H. Design of intelligent financial data management system based on higher-order hybrid clustering algorithm. [J]. PeerJ. Computer science, 2024, 10 e1799.
- Feiyun Sha, Changxu Ding, Xiaoyu Zheng, Jun Wang, and Yafang Tao. 2025. Weathering the Policy Storm: How Trade Uncertainty Shapes Firm Financial Performance through Innovation and Operations. International Review of Economics & Finance 102 (2025), 104274. [CrossRef]
- Zhongheng Yang, Aijia Sun, Yushang Zhao, Yinuo Yang, Dannier Li, and Chengrui Zhou. 2025. RLHF Fine-Tuning of LLMs for Alignment with Implicit User Feedback in Conversational Recommenders. arXiv:2508.05289. [CrossRef]
- Feiyun Sha, Changxu Ding, Xiaoyu Zheng, Jun Wang, and Yafang Tao. 2025. Weathering the Policy Storm: How Trade Uncertainty Shapes Firm Financial Performance through Innovation and Operations. International Review of Economics & Finance (2025), 104274. [CrossRef]
- Feiyun Sha, Jiawei Meng, Xiaoyu Zheng, and Yaqi Jiang. 2025. Sustainability Under Fire: How China-US Tensions Impact Corporate ESG Performance?. Finance Research Letters (2025), 107882. [CrossRef]
- Xiaoyu Deng. 2025. Cooperative Optimization Strategies for Data Collection and Machine Learning in Large-Scale Distributed Systems. In Proceedings of the 2025 4th International Symposium on Computer Applications and Information Technology (ISCAIT ’25), Xi’an, China, 2025. IEEE, Piscataway, NJ, USA, 2151–2154. [CrossRef]
- Jing Yang, Yuangui Wu, Yuping Yuan, Haozhong Xue, Sami Bourouis, Mahmoud Abdel-Salam, Sunil Prajapat and Lip Yee Por. 2025. LLm-AE-MP: Web attack detection using a large language model with autoencoder and multilayer perceptron. Expert Systems with Applications 274 (2025), 126982. [CrossRef]
- Wei Yang, Yuzhen Lin, Haozhong Xue, and Jun Wang. 2025. Research on stock market sentiment analysis and prediction method based on convolutional neural network. In Proceedings of the 2025 International Conference on Machine Learning and Neural Networks (MLNN '25). Association for Computing Machinery, New York, NY, USA, 91-96. [CrossRef]
- Wei Yang, Bochen Zhang, and Jun Wang. 2025. Research on AI economic cycle prediction method based on big data. In Proceedings of the 2025 International Conference on Digital Economy and Intelligent Computing (DEIC '25). Association for Computing Machinery, New York, NY, USA, 13-17. [CrossRef]
- Yuping Yuan and Haozhong Xue. 2025. Cross-Media Data Fusion and Intelligent Analytics Framework for Comprehensive Information Extraction and Value Mining. International Journal of Innovative Research in Computer Science and Technology 13, 1 (2025), 50-57.




| Shards | ef_search | p50 Latency (ms) | p95 Latency (ms) | Recall@100 | QPS | CPU Util.(%) |
|---|---|---|---|---|---|---|
| 2 | 64 | 32 | 84 | 0.913 | 3,400 | 62 |
| 128 | 49 | 121 | 0.951 | 3,100 | 68 | |
| 256 | 78 | 186 | 0.972 | 2,500 | 74 | |
| 4 | 64 | 36 | 92 | 0.912 | 4,200 | 58 |
| 128 | 54 | 129 | 0.953 | 3,900 | 64 | |
| 256 | 85 | 197 | 0.973 | 3,200 | 71 |
| Concurrency (VU) | p50 (ms) | p95 (ms) | p99 (ms) | QPS | Availability(%) | Error Rate(%) |
|---|---|---|---|---|---|---|
| 50 | 41 | 74 | 108 | 1,200 | 99.98 | 0.02 |
| 200 | 69 | 129 | 181 | 3,900 | 99.94 | 0.05 |
| 500 | 118 | 214 | 289 | 5,600 | 99.82 | 0.12 |
| 1000 | 205 | 378 | 512 | 6,100 | 99.21 | 0.31 |
| Model | Precision@10 | Recall@50 | NDCG@10 | MRR |
|---|---|---|---|---|
| ItemCF | 0.121 | 0.291 | 0.319 | 0.226 |
| MF-BPR | 0.137 | 0.332 | 0.361 | 0.267 |
| DeepFM | 0.149 | 0.361 | 0.389 | 0.288 |
| BERT4Rec | 0.158 | 0.392 | 0.421 | 0.301 |
| RAG-GenRec | 0.177 | 0.436 | 0.463 | 0.334 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).