Submitted:
01 May 2025
Posted:
02 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Publication Year Analysis
| Year | Count |
|---|---|
| 2022 | 1 |
| 2023 | 4 |
| 2024 | 10 |
| No Date | 3 |
| Total | 18 |
- Recent Dominance: 77.8% of citations (14/18) are from 2023-2024
- Acceleration: 2024 alone accounts for 55.6% of references (10/18)
3. Quantitative Findings and Gap Analysis
3.1. Quantitative Findings Summary
3.2. Gap Analysis
- Comparative Benchmarks: Limited head-to-head comparisons exist between Gen AI and traditional methods across financial tasks.
3.3. Literature Synthesis
- 1.
- 2.
- 3.

4. Literature Review
4.0.1. AI Agent Frameworks in Finance
4.1. Prompt Engineering and Model Optimization
4.2. Implementation Frameworks
4.3. Use Case Maturity
4.4. Technical Challenges
| Challenge | Prevalence |
|---|---|
| Data quality issues | 93% of cases [24] |
| Model interpretability | 87% of institutions [25] |
| Regulatory alignment | 79% of deployments [26] |
4.5. Emerging Best Practices
- 1.
- Phased Rollouts: TCS advocates starting with low-risk areas like document processing before core risk systems [27]
- 2.
- Human-in-the-Loop: Deloitte’s case studies show 60% better outcomes when combining AI with expert oversight [24]
- 3.
- Proprietary Data Leverage: IBM demonstrates 40% accuracy boosts from domain-specific fine-tuning [14]
- 4.
- Regulatory Sandboxes: ISDA highlights successful derivatives market testing environments [28]
4.6. Future Directions
- Real-world implementation metrics
- Organizational change management insights
- Regulatory compliance roadmaps

-
Cloud Platforms:
- –
- AWS Bedrock for foundation models [19]
- –
- Azure OpenAI Service for enterprise deployment
- –
- GCP Vertex AI for MLOps integration
-
Python Ecosystem:
- –
- Transformers 4.40+ for latest LLMs
- –
- LangChain 0.1+ for agent orchestration
- –
- Financial-specific libraries (FinBERT, RiskLabAI)
-
Theoretical Foundations:
- Implementation Flow:with regulatory constraints from [6]
5. Additional Relevant Literature
5.1. Industry White Papers and Frameworks
5.2. Technical Implementation Guides
5.3. Economic and Policy Perspectives
5.4. Complementary Technical Works
- [36] offers practical Python implementations that could supplement our Algorithm 1.
- [37] demonstrates text-to-chart applications relevant to our visualization discussions.
- [38] presents feature selection methods that predate but inform our LLM-based approaches.

6. Generative AI in Risk Management
6.1. Related Work with Applications in Financial Risk (Market and Credit Risk)
7. Credit Assessment Applications
8. Feature Engineering and Model Selection
9. Macroeconomic Simulation
10. Implementation Challenges
11. Future Directions
- Development of specialized financial LLMs (FinLLMs) as surveyed by [31]
- Improved interpretability methods for financial applications
- Robust evaluation frameworks for Gen AI in finance
- Ethical guidelines and regulatory standards
- Integration with traditional quantitative methods
12. Quantitative Foundations and Methods
12.1. Statistical Foundations
12.2. Feature Engineering Mathematics
12.3. Credit Scoring Models
12.4. Optimization Frameworks
12.5. Agent-Based Simulation
12.6. Performance Metrics
13. Technical Architecture
13.1. Proposed System Architecture
13.1.1. Component View

- Data Sources: Raw data ingestion via S3, APIs, and SQL databases
- LLM Feature Selection: Automated feature engineering using modern GenAI techniques
- Risk Models: Integration of Vasicek models and GANs for financial forecasting
13.1.2. Block-and-Flow Design

- Flow Labels: Explicit “Raw Data → Cleaned Data” pipeline stages
- Cloud Services: AWS/SageMaker deployment
- Automated Reporting: LLM-generated dashboards
13.2. Figure Descriptions
13.2.1. Figure 1: Proposed GenAI Architecture for Financial Risk Management
- Data Layer (blue): Handles heterogeneous data sources, preprocessing, and feature storage
- Processing Layer (red): Core LLM orchestrator with risk engine and credit scoring modules
- Application Layer (green): Delivers business applications through API endpoints
- Control Layer (gray): Provides monitoring, governance, and compliance oversight
13.2.2. Figure 2: Modern GenAI Implementation Stack for Financial Risk
- Cloud Platforms (orange): Shows major providers (AWS, Azure, GCP) and their AI services
- Python Ecosystem (blue): Lists essential libraries for transformers, financial ML, and monitoring
- Theoretical Foundations (green): Highlights key concepts like RAG and agentic AI
- Implementation Flow (purple): Illustrates the data processing pipeline from raw inputs to risk insights
13.3. Figure 3: Proposed Architecture for GenAI Financial Analytics System
- Data flow from raw sources through preprocessing to LLM processing
- Parallel feature engineering and risk analysis pathways
- Integration points for human oversight and regulatory compliance
- Clear separation between data, processing, application, and oversight layers
13.3.1. Figure 6: Technical Architecture with Mathematical Formulations
- Mathematical notations from key references
- Explicit formulas for risk calculations and credit scoring
- Implementation details from cited works
- Color-coded legend explaining layer semantics
13.3.2. Figure 4: Modular Architecture of Cloud-Based Financial Risk System
- Cloud-native data ingestion through S3, APIs, and SQL databases
- LLM-powered feature selection modules
- Integration of advanced risk models (Vasicek, GANs)
- Automated reporting capabilities
13.3.3. Figure 5: Block-and-Flow Architecture with GenAI Integration
- Explicit data transformation stages
-
AWS/SageMaker deployment specificsitem Flow labels showing data progression
- LLM-generated dashboard components

14. Algorithmic Implementations
14.1. Core Algorithms
| Algorithm 1 LLM-Augmented Feature Selection [8] |
|
Require: Raw financial data D, pretrained LLM M Ensure: Selected feature set
|
14.2. Optimization Process
| Algorithm 2 LLM Regulatory Code Generation |
|
Require: Regulatory text R, test cases T Ensure: Compliant code
|
14.3. Data Engineering for GenAI
14.4. Python Implementation

14.5. Macroeconomic Simulation

15. Technical Implementation Landscape
| Reference | Cloud/Platform | Languages | Models | Libraries/Frameworks |
|---|---|---|---|---|
| [1] | Microsoft 365 | Python | Copilot 365 | Pandas, NumPy, Power BI |
| [9] | AWS/GCP | Python | BERT, FinBERT | Transformers, PyTorch |
| [6] | Azure ML | Python | GPT-4 | OpenAI API, NumPy |
| [12] | Simulation Env | Python | GPT-4 | Mesa, NumPy |
| [8] | QuantConnect | Python | GPT-4 | Pandas, scikit-learn |
| [7] | Private Cloud | Python | BERT, GPT | HuggingFace, TensorFlow |
| [5] | IBM Cloud | Python | GPT-3.5 | LangChain, PyMC3 |
- Dominant Language: Python is used in 100% of implementations
- Cloud Adoption: 71% (5/7) utilize major cloud platforms (AWS/GCP/Azure/IBM)
- Model Variety: Mix of proprietary (GPT series) and open-source (BERT) models
15.1. Cloud Solutions
15.2. Implementation Code Example

16. Implementation Gaps
16.1. Data-Related Challenges
- Proprietary Data Integration: While noa [14] emphasizes the value of proprietary data, practical methods for securely integrating sensitive financial data with LLMs remain underdeveloped.
- Data Scarcity: Few studies address the cold-start problem for financial institutions with limited historical data, despite its prevalence in emerging markets.
16.2. Model Limitations
| Limitation | Impact | References |
|---|---|---|
| Mathematical reasoning gaps | 30-40% error rate in VaR calculations | [13] |
| Context window constraints | Limits document processing capability | [6] |
| Computational inefficiency | 5-10x cost premium vs traditional models | [7] |
16.3. Operational Gaps
- Real-time Processing: Only [2] discusses latency requirements for trading applications, with most implementations focusing on batch processing.
- Model Drift Monitoring: Current architectures ([47]) lack standardized approaches for detecting financial concept drift in LLM outputs.
- Audit Trails: Regulatory compliance requirements from [6] are not fully operationalized in most technical implementations.
16.4. Proposed Solutions
- 1.
- Development of financial-specific tokenization methods to improve proprietary data utilization
- 2.
- Hybrid architectures combining LLMs with symbolic reasoning engines for mathematical tasks
- 3.
- Standardized benchmarking frameworks for real-time financial applications
- 4.
- Integration of explainability tools from [10] into production pipelines
16.5. Workforce and Policy Implications
17. Conclusions
- Understanding model capabilities and limitations
- Addressing data quality and privacy concerns
- Developing appropriate evaluation frameworks
- Ensuring regulatory compliance
References
- Koskula, J. Generative artificial intelligence in support of analytics : Copilot 365 2024. Accepted: 2024-10-29T09:41:08Z.
- Aldasoro, I.; Gambacorta, L.; Korinek, A.; Shreeti, V.; Stein, M. Intelligent financial system: how AI is transforming finance. BIS Working Papers 2024. Number: 1194 Publisher: Bank for International Settlements.
- Aljaloudi, O.; Thiam, M.; Qader, M.; Al-Mhdawi, M.S.; Qazi, A.; Dacre, N. Examining the Integration of Generative AI Models for Improved Risk Management Practices in the Financial Sector 2024.
- Babaei, G.; Giudici, P. GPT classifications, with application to credit lending. Machine Learning with Applications 2024, 16, 100534. [CrossRef]
- Teixeira, A.C.; Marar, V.; Yazdanpanah, H.; Pezente, A.; Ghassemi, M. Enhancing Credit Risk Reports Generation using LLMs: An Integration of Bayesian Networks and Labeled Guide Prompting. In Proceedings of the Proceedings of the Fourth ACM International Conference on AI in Finance, New York, NY, USA, 2023; ICAIF ’23, pp. 340–348. [CrossRef]
- Fazlija, B.; Ibraimi, M.; Forouzandeh, A.; Fazlija, A. Implementing Financial Regulations Using Large Language Models, 2024.
- Sharkey, E.; Treleaven, P. BERT vs GPT for financial engineering, 2024. arXiv:2405.12990 [q-fin], . [CrossRef]
- Wang, Y.; Zhao, J.; Lawryshyn, Y. GPT-Signal: Generative AI for Semi-automated Feature Engineering in the Alpha Research Process, 2024. arXiv:2410.18448 [cs], . [CrossRef]
- Sanz-Guerrero, M.; Arroyo, J. Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending, 2024. arXiv:2401.16458 [q-fin], . [CrossRef]
- Hinterleitner, A.; Bartz-Beielstein, T.; Schulz, R.; Spengler, S.; Winter, T.; Leitenmeier, C. Enhancing Feature Selection and Interpretability in AI Regression Tasks Through Feature Attribution, 2024. arXiv:2409.16787 [cs], . [CrossRef]
- Li, D.; Tan, Z.; Liu, H. Exploring Large Language Models for Feature Selection: A Data-centric Perspective, 2024. arXiv:2408.12025 [cs], . [CrossRef]
- Li, N.; Gao, C.; Li, M.; Li, Y.; Liao, Q. EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities. In Proceedings of the Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers); Ku, L.W.; Martins, A.; Srikumar, V., Eds., Bangkok, Thailand, 2024; pp. 15523–15536. [CrossRef]
- Hofert, M. Assessing ChatGPT’s Proficiency in Quantitative Risk Management. Risks 2023, 11, 166. Number: 9 Publisher: Multidisciplinary Digital Publishing Institute, . [CrossRef]
- Proprietary data, your competitive edge in generative AI | IBM, 2024.
- Inala, J.P.; Wang, C.; Drucker, S.; Ramos, G.; Dibia, V.; Riche, N.; Brown, D.; Marshall, D.; Gao, J. Data Analysis in the Era of Generative AI, 2024. arXiv:2409.18475 [cs], . [CrossRef]
- Joshi, S. Advancing Innovation in Financial Stability: A Comprehensive Review of Ai Agent Frameworks, Challenges and Applications. World Journal of Advanced Engineering Technology and Sciences 2025, 14, 117–126. [CrossRef]
- Satyadhar, J. Review of Autonomous Systems and Collaborative AI Agent Frameworks. International Journal of Science and Research Archive 2025, 14, 961–972. [CrossRef]
- Satyadhar Joshi. Leveraging Prompt Engineering to Enhance Financial Market Integrity and Risk Management. World Journal of Advanced Research and Reviews 2025, 25, 1775–1785. [CrossRef]
- Generative AI for Financial Services - AWS.
- Generative AI Controls Framework Safe, Secure, and Compliant AI Adoption Approach Whitepaper - Financial Services Cloud Council and Forum.
- Adopting generative AI in banking – Corporates and Institutions.
- The rise of generative artificial intelligence in lending, 2024.
- How to Train Generative AI Using Your Company’s Data.
- Financial Services: scaling GenAI for maximum impact.
- Generative AI in Banking and Finance I White Paper.
- The Financial Stability Implications of Artificial Intelligence, 2024.
- Generative AI in Finance: Opening up a Sea of Possibilities.
- GenAI in the Derivatives Market: a Future Perspective – International Swaps and Derivatives Association.
- What is the World Economic Forum saying about artificial intelligence? Must-read research on the impact of genAI, 2024.
- Transforming to an AI-powered finance function | McKinsey.
- Lee, J.; Stevens, N.; Han, S.C.; Song, M. A Survey of Large Language Models in Finance (FinLLMs), 2024. arXiv:2402.02315 [cs], . [CrossRef]
- Generative AI Controls Framework Safe, Secure, and Compliant AI Adoption Approach Whitepaper - Financial Services Cloud Council and Forum.
- The Transformative Impact of AI and GenAI on Financial Services.
- Nguyen, Era Dabla-Norris, R.d.M.D.G.M.T.H.L.L.A.D.M.F.B. Broadening the Gains from Generative AI: The Role of Fiscal Policies.
- The revolution arrives: How gen AI is poised to transform banking.
- Campesato, O. Python 3 and Machine Learning Using ChatGPT/GPT-4; Walter de Gruyter GmbH & Co KG, 2024. Google-Books-ID: Mm8NEQAAQBAJ.
- NLP in FinTech: Developing a Lightweight Text-to-Chart Application for Financial Analysis | IEEE Conference Publication | IEEE Xplore.
- Jomthanachai, S.; Wong, W.P.; Khaw, K.W. An application of machine learning regression to feature selection: a study of logistics performance and economic attribute. Neural Computing and Applications 2022, 34, 15781–15805. [CrossRef]
- Joshi, S. Advancing Financial Risk Modeling: Vasicek Framework Enhanced by Agentic Generative AI by Satyadhar Joshi. Advancing Financial Risk Modeling: Vasicek Framework Enhanced by Agentic Generative AI by Satyadhar Joshi 2025, Volume 7. [CrossRef]
- Satyadhar, J. Enhancing Structured Finance Risk Models (Leland-Toft and Box-Cox) Using GenAI (VAEs GANs). International Journal of Science and Research Archive 2025, 14, 1618–1630. [CrossRef]
- Khoja, M. AI and Bond Values: How Large Language Models Predict Default Signals, 2024. [CrossRef]
- Jeong, D.P.; Lipton, Z.C.; Ravikumar, P. LLM-Select: Feature Selection with Large Language Models, 2024. arXiv:2407.02694 [cs], . [CrossRef]
- Nascimento, N.; Tavares, C.; Alencar, P.; Cowan, D. GPT in Data Science: A Practical Exploration of Model Selection. In Proceedings of the 2023 IEEE International Conference on Big Data (BigData), 2023, pp. 4325–4334. [CrossRef]
- Li, N.; Gao, C.; Li, Y.; Liao, Q. Large Language Model-Empowered Agents for Simulating Macroeconomic Activities, 2023. [CrossRef]
- Joshi, S. Review of Data Engineering and Data Lakes for Implementing GenAI in Financial Risk A Comprehensive Review of Current Developments in GenAI Implementations, 2025, [5123081]. [CrossRef]
- Satyadhar, J. Review of Data Engineering Frameworks (Trino and Kubernetes) for Implementing Generative AI in Financial Risk. International Journal of Research Publication and Reviews 2025, 6, 1461–1470. [CrossRef]
- Joshi, S. Agentic Generative AI and the Future U.S. Workforce: Advancing Innovation and National Competitiveness, 2025, [5126922]. [CrossRef]
- Satyadhar, J. Generative AI: Mitigating Workforce and Economic Disruptions While Strategizing Policy Responses for Governments and Companies, 2025, [5135229]. [CrossRef]
| Study | Metric | Result | Application |
|---|---|---|---|
| [4] | Classification accuracy | Comparable to logistic regression | Credit decisions |
| [5] | Human preference rate | 60-90% preferred over human reports | Credit risk analysis |
| [6] | Code generation accuracy | 75.38% pass@1, 91.67% pass@10 | Regulatory compliance |
| [7] | F1-score improvement | 10% over GPT-4, 16% over CopGPT | Financial sentiment analysis |
| [8] | Alpha generation | Automated formulaic alphas | Feature engineering |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).