Submitted:
22 March 2025
Posted:
26 March 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
3. Discussions
3.1. Comparative Analysis with Other LLMs
3.2. Technical Capabilities and Innovations
3.3. Multimodal and Domain-Specific Applications
3.4. Economic and Market Impact
3.5. Technological Innovation and Education
3.6. Safety and Comparative Performance
3.7. Academic Writing and Content Generation
3.8. Implications for AI Research
3.9. Applications in Industry
3.10. Limitations and Future Work
4. Societal Impact Applications and User Perceptions
4.1. Societal Impact and Applications
4.2. Ethical Considerations and Challenges
4.3. User Perceptions and Feedback
4.4. User Perceptions and Comparative Evaluations
4.5. Safety and Ethical Considerations
4.6. User Perceptions and Feedback
5. Methodology
5.1. Model Architecture
5.2. Training and Optimization
5.3. Evaluation Metrics
5.4. Results
5.4.1. Performance on Benchmarks
5.4.2. Efficiency and Scalability
5.5. Architecture and Training
5.6. Model Comparison
6. DeepSeek’s Architecture
6.1. Multi-Head Latent Attention (MLA)
6.2. DeepSeekMoE
6.3. Mixture-of-Experts (MoE) Framework
6.4. Multi-Head Latent Attention
6.5. Reinforcement Learning and Fine-Tuning
6.6. Open-Source Nature
6.7. Performance and Scalability
6.8. Conclusion
7. Comparative Analysis with Other LLMs
7.1. Performance in Writing and Communication
7.2. Reasoning and Language Acquisition
7.3. Academic Writing and Content Generation
7.4. Predictive Power in Finance
7.5. Safety and Ethical Considerations
7.6. Overall Performance and Efficiency
7.7. Performance on Benchmarks
7.8. Efficiency and Scalability
7.9. Open-Source vs. Proprietary Models
7.10. Limitations and Trade-Offs
7.11. Conclusion
8. DeepSeek Applications in Healthcare, Finance, Business, and Risk Management
8.1. Applications in Different Domains
8.2. Applications in Industry
8.3. Applications in Healthcare and Finance
8.4. Financial Technology (FinTech) and Democratization of AI
8.5. Healthcare and Financial Risk Management
8.6. Applications in Business and Industry
8.7. Challenges and Future Directions
8.8. Conclusion
9. DeepSeek and Gen AI Applications in Finance
9.1. Predictive Power and Economic Analysis
9.2. Limitations and Language Training
9.3. Potential and Future Directions
- Sentiment Analysis of Financial News: Analyzing financial news articles and social media to gauge market sentiment.
- Financial Report Summarization: Extracting key information from lengthy financial reports.
- Risk Assessment and Modeling: Assisting in the development of risk models by analyzing historical data and market trends.
- Automated Financial Reporting: Generating reports and summaries for financial stakeholders.
10. Technical Capabilities of DeepSeek
10.1. Efficient Model Architecture
10.2. Scaling and Performance
10.3. Contextual Understanding and Language Processing
10.4. Domain-Specific Applications
10.5. Safety and Ethical Considerations
10.6. Continuous Development and Updates
10.7. Model Architecture
10.8. Training and Optimization
10.9. Performance Benchmarks
10.10. Applications in Specialized Domains
10.11. Challenges and Future Directions
10.12. Conclusion
11. Quantitative Findings and Performance Metrics
11.1. Comparative Performance Benchmarks
11.2. Efficiency Metrics in Model Architecture
11.3. Sentiment Analysis and User Feedback
11.4. Safety and Unsafe Response Rates
11.5. Plagiarism and Semantic Similarity
11.6. Predictive Accuracy in Financial Markets
11.7. Performance on Language Understanding and Reasoning Tasks
11.8. Efficiency and Scalability
11.9. Multimodal Performance
11.10. Comparative Analysis with Other Models
11.11. Energy Efficiency and Environmental Impact
11.12. Conclusion
12. Possible Applications of Deepseek Generative AI in Financial Risk Management
12.1. Overview of Generative AI in Finance
12.2. DeepSeek in Financial Risk Applications
12.3. Prompt Engineering for Financial Risk
12.4. Data Engineering and Generative AI
12.5. Recent Work on Generative AI in Finance
12.6. Challenges and Future Directions
12.7. Case Studies
13. Gap Analysis and Proposals from the Literature
13.1. Enhancing Language-Specific Training and Performance
13.2. Addressing Safety and Ethical Concerns
13.3. Improving Readability and Academic Writing Quality
13.4. Refining Contextual Understanding in Language Acquisition
13.5. Optimizing User Experience and Interface Design
13.6. Expanding Domain-Specific Customization and Applications
13.7. Enhancing Model Adaptability and Flexibility
13.8. Gaps in Current Capabilities
13.8.1. Limited Multimodal Integration
13.8.2. Challenges in Ethical Alignment
13.8.3. Scalability and Resource Constraints
13.9. Proposals for Future Research
13.9.1. Enhancing Multimodal Capabilities
13.9.2. Strengthening Ethical Alignment
13.9.3. Improving Scalability and Accessibility
13.9.4. Expanding Domain-Specific Applications
13.10. Section Conclusion
14. Funding, Costs, and Economic Considerations
14.1. Training Cost Reductions
14.2. Economic Impact and Market Prediction
14.3. Open-Source Development and Accessibility
14.4. Potential Cost Savings in Specialized Domains
14.5. Development Costs
14.6. Market Impact and Valuation
14.7. Operational and Deployment Costs
14.8. Funding Sources and Economic Implications
14.9. Section Conclusion
14.10. Future Directions and Challenges
15. Conclusion
References
- Al-Garaady, J.; Albuhairy, M.M. Understanding User Perceptions of DeepSeek: A Mixed-Methods Sentiment and Thematic Analysis, 2025, [5172367]. [CrossRef]
- AlAfnan, M.A. DeepSeek Vs. ChatGPT: A Comparative Evaluation of AI Tools in Composition, Business Writing, and Communication Tasks. Journal of Artificial Intelligence and Technology 2025. [CrossRef]
- DeepSeek-AI.; Bi, X.; Chen, D.; Chen, G.; Chen, S.; Dai, D.; Deng, C.; Ding, H.; Dong, K.; Du, Q.; et al. DeepSeek LLM: Scaling Open-Source Language Models with Longtermism, 2024, [arXiv:cs/2401.02954]. [CrossRef]
- DeepSeek-AI.; Liu, A.; Feng, B.; Wang, B.; Wang, B.; Liu, B.; Zhao, C.; Dengr, C.; Ruan, C.; Dai, D.; et al. DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model, 2024, [arXiv:cs/2405.04434]. [CrossRef]
- DeepSeek . http://www.kjdb.org/CN/10.3981/j.issn.1000-7857.2025.02.00183.
- Guo, D.; Zhu, Q.; Yang, D.; Xie, Z.; Dong, K.; Zhang, W.; Chen, G.; Bi, X.; Wu, Y.; Li, Y.K.; et al. DeepSeek-Coder: When the Large Language Model Meets Programming – The Rise of Code Intelligence, 2024, [arXiv:cs/2401.14196]. [CrossRef]
- Hayder, W.A. Highlighting DeepSeek-R1: Architecture, Features and Future Implications. International Journal of Computer Science and Mobile Computing 2025, 14, 1–13. [CrossRef]
- Gao, T.; Jin, J.; Ke, Z.T.; Moryoussef, G. A Comparison of DeepSeek and Other LLMs, 2025, [arXiv:cs/2502.03688]. [CrossRef]
- Gupta, R. Comparative Analysis of DeepSeek R1, ChatGPT, Gemini, Alibaba, and LLaMA: Performance, Reasoning Capabilities, and Political Bias.
- Jiang, Q.; Gao, Z.; Karniadakis, G.E. DeepSeek vs. ChatGPT vs. Claude: A Comparative Study for Scientific Computing and Scientific Machine Learning Tasks, 2025, [arXiv:cs/2502.17764]. [CrossRef]
- Manik, M.M.H. ChatGPT vs. DeepSeek: A Comparative Study on AI-Based Code Generation, 2025, [arXiv:cs/2502.18467]. [CrossRef]
- Shakya, R.; Vadiee, F.; Khalil, M. A Showdown of ChatGPT vs DeepSeek in Solving Programming Tasks, 2025, [arXiv:cs/2503.13549]. [CrossRef]
- Fernandes, D.; Matos-Carvalho, J.P.; Fernandes, C.M.; Fachada, N. DeepSeek-V3, GPT-4, Phi-4, and LLaMA-3.3 Generate Correct Code for LoRaWAN-related Engineering Tasks, 2025, [arXiv:cs/2502.14926]. [CrossRef]
- Huang, D.; Wang, Z. Explainable Sentiment Analysis with DeepSeek-R1: Performance, Efficiency, and Few-Shot Learning, 2025, [arXiv:cs/2503.11655]. [CrossRef]
- Hussain, Z.S.; Delsoz, M.; Elahi, M.; Jerkins, B.; Kanner, E.; Wright, C.; Munir, W.M.; Soleimani, M.; Djalilian, A.; Lao, P.A.; et al. Performance of DeepSeek, Qwen 2.5 MAX, and ChatGPT Assisting in Diagnosis of Corneal Eye Diseases, Glaucoma, and Neuro-Ophthalmology Diseases Based on Clinical Case Reports, 2025. [CrossRef]
- Mondillo, G.; Colosimo, S.; Perrotta, A.; Frattolillo, V.; Masino, M. Comparative Evaluation of Advanced AI Reasoning Models in Pediatric Clinical Decision Support: ChatGPT O1 vs. DeepSeek-R1, 2025. [CrossRef]
- de Paiva, L.F.; Luijten, G.; Puladi, B.; Egger, J. How Does DeepSeek-R1 Perform on USMLE?, 2025. [CrossRef]
- Habib Lantyer, V. How U.S. Trade Sanctions Fueled Chinese Innovation in AI: The DeepSeek Case, 2025, [5112973]. [CrossRef]
- Krause, D. DeepSeek and FinTech: The Democratization of AI and Its Global Implications, 2025, [5116322]. [CrossRef]
- Krause, D. DeepSeek’s Potential Impact on the Magnificent 7: A Valuation Perspective, 2025, [5117909]. [CrossRef]
- Sallam, M.; Al-Mahzoum, K.; Sallam, M.; Mijwil, M.M. DeepSeek: Is It the End of Generative AI Monopoly or the Mark of the Impending Doomsday? Mesopotamian Journal of Big Data 2025, 2025, 26–34. [CrossRef]
- Wu, J. The Rise of DeepSeek: Technology Calls for the “Catfish Effect”. Journal of Thoracic Disease 2025, 17, 1106–1108. [CrossRef]
- Parmar, M.; Govindarajulu, Y. Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies, 2025, [arXiv:cs/2501.17030]. [CrossRef]
- Olson, M.L.; Ratzlaff, N.; Hinck, M.; Luo, M.; Yu, S.; Xue, C.; Lal, V. Semantic Specialization in MoE Appears with Scale: A Study of DeepSeek R1 Expert Specialization, 2025, [arXiv:cs/2502.10928]. [CrossRef]
- Performance Optimization of DeepSeek MoE Architecture in Multi-Scale Prediction of Stock Returns. World Journal of Information Technology 2025, 3. [CrossRef]
- Wang, C.; Kantarcioglu, M. A Review of DeepSeek Models’ Key Innovative Techniques, 2025, [arXiv:cs/2503.11486]. [CrossRef]
- Xin, H.; Ren, Z.Z.; Song, J.; Shao, Z.; Zhao, W.; Wang, H.; Liu, B.; Zhang, L.; Lu, X.; Du, Q.; et al. DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search, 2024, [arXiv:cs/2408.08152]. [CrossRef]
- Lu, H.; Liu, W.; Zhang, B.; Wang, B.; Dong, K.; Liu, B.; Sun, J.; Ren, T.; Li, Z.; Yang, H.; et al. DeepSeek-VL: Towards Real-World Vision-Language Understanding, 2024, [arXiv:cs/2403.05525]. [CrossRef]
- Piplani, T.; Bamman, D. DeepSeek: Content Based Image Search & Retrieval, 2018, [arXiv:cs/1801.03406]. [CrossRef]
- Thoughts on the DeepSeek Triggered Path of AI Development. http://www.kjdb.org/EN/abstract/article/1000-7857/17790.
- Thoughts on the DeepSeek Triggered Path of AI Development. http://www.kjdb.org/EN/10.3981/j.issn.1000-7857.2025.02.00183.
- (PDF) Grok, Gemini, ChatGPT and DeepSeek: Comparison and Applications in Conversational Artificial Intelligence. https://www.researchgate.net/publication/389065042_Grok_Gemini_ChatGPT_and_DeepSeek_ Comparison_and_Applications_in_Conversational_Artificial_Intelligence.
- Ziba-Kulawik, K. Generative AI: New Framework of Using Large Language Models for Analysing Descriptive Qualitative Data, 2025. [CrossRef]
- Puspitasari, F.D.; Zhang, C.; Dam, S.K.; Zhang, M.; Kim, T.H.; Hong, C.S.; Bae, S.H.; Qin, C.; Wei, J.; Wang, G.; et al. DeepSeek Models: A Comprehensive Survey of Methods and Applications.
- Neha, F.; Bhati, D. A Survey of DeepSeek Models.
- Piastou, M. Efficiency and safety of the DeepSeek R1 model compared to OpenAI models. CYRP 2025.
- Maiti, A.; Adewumi, S.; Tikure, T.A.; Wang, Z.; Sengupta, N.; Sukhanova, A.; Jana, A. Comparative Analysis of OpenAI GPT-4o and DeepSeek R1 for Scientific Text Categorization Using Prompt Engineering, 2025, [arXiv:cs/2503.02032]. [CrossRef]
- DeepSeek-AI.; Liu, A.; Feng, B.; Xue, B.; Wang, B.; Wu, B.; Lu, C.; Zhao, C.; Deng, C.; Zhang, C.; et al. DeepSeek-V3 Technical Report, 2025, [arXiv:cs/2412.19437]. [CrossRef]
- Chen, J.; Zhang, Q. DeepSeek Reshaping Healthcare in China’s Tertiary Hospitals, 2025, [arXiv:cs/2502.16732]. [CrossRef]
- Albuhairy, M.M.; Algaraady, J. DeepSeek vs. ChatGPT: Comparative Efficacy in Reasoning for Adults’ Second Language Acquisition Analysis. Irani Studies 2025, pp. 864–883. [CrossRef]
- Chowdhury, M.N.U.R.; Haque, A.; Ahmed, I. DeepSeek vs. ChatGPT: A Comparative Analysis of Performance, Efficiency, and Ethical AI Considerations.
- Okaiyeto, S.A.; Bai, J.; Wang, J.; Mujumdar, A.S.; Xiao, H. Success of DeepSeek and Potential Benefits of Free Access to AI for Global-Scale Use. International Journal of Agricultural and Biological Engineering 2025, 18, 304–306.
- Allen, R. DeepSeek and AI Innovation : How Chinese Universities Broke through the Glass Ceiling of Technological Advancement. American Journal of STEM Education 2025, 6, 1–10. [CrossRef]
- Arrieta, A.; Ugarte, M.; Valle, P.; Parejo, J.A.; Segura, S. O3-Mini vs DeepSeek-R1: Which One Is Safer?, 2025, [arXiv:cs/2501.18438]. [CrossRef]
- Aydin, O.; Karaarslan, E.; Erenay, F.S.; Bacanin, N. Generative AI in Academic Writing: A Comparison of DeepSeek, Qwen, ChatGPT, Gemini, Llama, Mistral, and Gemma, 2025, [arXiv:cs/2503.04765]. [CrossRef]
- Arabiat, O. DeepSeek AI in Accounting: Opportunities and Challenges in Intelligent Automation, 2025, [5116945]. [CrossRef]
- Chen, Y.; Shen, J.; Ma, D. DeepSeek’s Impact on Thoracic Surgeons’ Work Patterns—Past, Present and Future. Journal of Thoracic Disease 2025, 17, 1114–1117. [CrossRef]
- China’s AI Revolution: How DeepSeek Is Changing the Game. | EBSCOhost. https://openurl.ebsco.com.
- Katta, K. Analyzing User Perceptions of Large Language Models (LLMs) on Reddit: Sentiment and Topic Modeling of ChatGPT and DeepSeek Discussions, 2025, [arXiv:cs/2502.18513]. [CrossRef]
- Chen, J.; Tang, G.; Zhou, G.; Zhu, W. ChatGPT and Deepseek: Can They Predict the Stock Market and Macroeconomy?, 2025, [arXiv:econ/2502.10008]. [CrossRef]
- Sapkota, R.; Raza, S.; Karkee, M. Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And Other SoTA Large Language Models, 2025, [arXiv:cs/2502.18505]. [CrossRef]
- Islam, C.M.; Chacko, S.J.; Horne, P.; Liu, X. DeepSeek on a Trip: Inducing Targeted Visual Hallucinations via Representation Vulnerabilities, 2025, [arXiv:cs/2502.07905]. [CrossRef]
- Joshi Satyadhar. Enhancing Structured Finance Risk Models (Leland-Toft and Box-Cox) Using GenAI (VAEs GANs). IJSRA 2025, 14, 1618–1630.
- Satyadhar, J. Gen AI for Market Risk and Credit Risk [Ebook ISBN: 9798230094388]. Draft2Digital Publications Ebook ISBN: 9798230094388 2025.
- Joshi Satyadhar. Using Gen AI Agents With GAE and VAE to Enhance Resilience of US Markets. The International Journal of Computational Science, Information Technology and Control Engineering (IJCSITCE) 2025, 12, 23–38.
- Joshi, Satyadhar. Leveraging prompt engineering to enhance financial market integrity and risk management. World Journal of Advanced Research and Reviews WJARR 2025, 25, 1775–1785. [CrossRef]
- Joshi Satyadhar. Quantitative Foundations for Integrating Market, Credit, and Liquidity Risk with Generative AI. https://www.preprints.org/ 2025.
- Satyadhar, J. Review of Gen AI Models for Financial Risk Management. International Journal of Scientific Research in Computer Science, Engineering and Information Technology ISSN : 2456-3307 2025, 11, 709–723.
- Satyadhar Joshi. The synergy of generative AI and big data for financial risk: Review of recent developments. IJFMR-International Journal For Multidisciplinary Research 2025, 7. [CrossRef]
- Satyadhar, J. ADVANCING FINANCIAL RISK MODELING: VASICEK FRAMEWORK ENHANCED BY AGENTIC GENERATIVE AI. International Research Journal of Modernization in Engineering Technology and Science 2025, 7, 4413–4420.
- Joshi Satyadhar. Implementing gen AI for increasing robustness of US financial and regulatory system. International Journal of Innovative Research in Engineering and Management 2024, 11, 175–179.
- Satyadhar Joshi. Agentic Generative AI and the Future US Workforce: Advancing Innovation and National Competitiveness. International Journal of Research and Review 2025, 12, 102–113. [CrossRef]
- Joshi, .S. A Literature Review of Gen AI Agents in Financial Applications: Models and Implementations. International Journal of Science and Research (IJSR) 2025, 12, 1094–1100.
- Satyadhar Joshi . The Transformative Role of Agentic GenAI in Shaping Workforce Development and Education in the US. Iconic Research And Engineering Journals 2025, 8, 199–206.
- Joshi, S. A Comprehensive Review of Data Pipelines and Streaming for Generative AI Integration: Challenges, Solutions, and Future Directions.
- Satyadhar, J. Retraining US Workforce in the Age of Agentic Gen AI: Role of Prompt Engineering and Up-Skilling Initiatives. International Journal of Advanced Research in Science, Communication and Technology (IJARSCT) 2025, 5.
- Joshi Satyadhar. Generative AI: Mitigating Workforce and Economic Disruptions While Strategizing Policy Responses for Governments and Companies. International Journal of Advanced Research in Science, Communication and Technology (IJARSCT) ISSN (Online) 2581-9429 2025, 5, 480–486.
- Satyadhar, J. Training US Workforce for Generative AI Models and Prompt Engineering: ChatGPT, Copilot, and Gemini. International Journal of Science, Engineering and Technology ISSN (Online): 2348-4098 2025, 13.
- Satyadhar Joshi. Introduction to Vector Databases for Generative AI: Applications, Performance, Future Projections, and Cost Considerations. International Advanced Research Journal in Science, Engineering and Technology ISSN (O) 2393-8021, ISSN (P) 2394-1588 2025, 12, 79–93. [CrossRef]
- Satyadhar Joshi. Bridging the AI Skills Gap: Workforce Training for Financial Services. International Journal of Innovative Science and Research Technology 2025, 10, 1023–1030.
- Joshi, S. Introduction to Generative AI and DevOps: Synergies, Challenges and Applications. [CrossRef]
| Model | Parameters | Training Data | Performance Metrics |
|---|---|---|---|
| DeepSeek | XXB | Diverse Web Corpus | 95% Accuracy |
| GPT-4 | XXB | OpenAI Curated Dataset | 94% Accuracy |
| Gemini | XXB | Multimodal Enhanced Data | 92% Accuracy |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).