Submitted:
09 November 2025
Posted:
11 November 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Early Foundations (1940s–1970s)
2.1. Logic Theorist (1956)
2.2. ELIZA (1966)
2.3. SHRDLU (1970)
2.4. DENDRAL (1965–1970)
2.5. MYCIN (1972)
2.6. PROLOG (1972)
2.7. LISP (1958)
3. The Rise of Machine Learning (1980s–2000s)
3.1. SOAR (1983)
3.2. ALVINN (1989)
3.3. Backpropagation Algorithm (1986)
3.4. WEKA (1993)
3.5. Support Vector Machines (SVM, 1995)
3.6. OpenCV (2000)
3.7. MATLAB AI Toolbox (2000s)
3.8. Scikit-learn (2007–2010)
4. Deep Learning & Neural Network Revolution (2010s)
4.1. Theano (2010)
4.2. Caffe (2013)
4.3. TensorFlow (2015)
4.4. Keras (2015)
4.5. PyTorch (2016)
4.6. CNTK (2016)
4.7. MXNet (2015)
4.8. DeepMind AlphaGo (2016)
4.9. OpenAI Gym (2016)
4.10. H2O.ai (2015)
5. Generative & Pretrained Model Era (2017–2020)
5.1. Transformer Architecture (2017)
5.2. BERT (2018)
- Masked Language Modeling (MLM): Randomly masks words in a sentence and trains the model to predict them using the surrounding context, fostering deep bidirectional understanding [69].
- Next Sentence Prediction (NSP): Trains the model to predict whether a given sentence logically follows another, enhancing its ability to grasp sentence relationships and discourse structure.
5.3. GPT-2 (2019)
5.4. StyleGAN (2019)
5.5. T5 (2019)
5.6. FastAI (2018)
- High-level: Functions for common deep learning tasks (vision, text, tabular, time series, collaborative filtering) that minimize code and domain-specific knowledge requirements.
- Mid-level/Low-level: Modular building blocks that let advanced users customize model architectures, training strategies, and data preprocessing pipelines.
- DataBlock/DataLoaders: Cleanly structured tools for scalable, flexible data preprocessing and loading.
- Learner: Encapsulates a complete model training pipeline—bringing together data, network architecture, training/evaluation logic, and reporting.
- Built-in Best Practices: Automated data augmentation, mixed precision training, transfer learning integration, and state-of-the-art optimizers simplify robust model development.
5.7. AllenNLP (2018)
- PyTorch Foundation: Leveraging PyTorch’s dynamic computation graphs, enabling flexible model design and intuitive debugging.
- Modular Components: Reusable modules for tokenization, data reading, embedding, encoding, and pre/post-processing, which allow rapid prototyping and efficient pipeline construction.
- Declarative Configurations: JSON or Python-based experiment configurations, making it easy to define, reproduce, and share experimental workflows, models, and hyperparameters.
- Reference Implementations: High-quality models for a variety of NLP tasks, such as semantic role labeling, textual entailment, question answering, and named entity recognition, help users benchmark and extend cutting-edge research methods.
- Flexible Data API: A “Field” and “Instance” abstraction allows unified and efficient handling of diverse NLP data structures, such as sequences, spans, and trees, with automatic sorting, batching, and padding.
5.8. Hugging Face Transformers (2019)
6. Multimodal & Generative AI Explosion (2020–2023)
6.1. GPT-3 (2020)
6.2. CLIP (2021)
6.3. DALL·E (2021)
6.4. Stable Diffusion (2022)
6.5. Midjourney (2022)
6.6. Whisper (2022)
6.7. ChatGPT (2022)
6.8. BLOOM (2022)
6.9. LLaMA (2023)
6.10. Gemini (Bard) (2023)
6.11. Claude (2023)
7. The Current Multimodal & Agentic Era (2024–2025)
7.1. GPT-4 / GPT-4o (2024)
7.2. Sora (2024)
7.3. Gemini 1.5 (2024)
7.4. Mistral & Mixtral (2024)
7.5. Claude 3 (2024)
7.6. DeepSeek (2025)
7.7. Runway Gen-2 (2024–2025)
- Text-to-Video: Generate original videos using descriptive natural language.
- Image-to-Video: Animate a given image, bringing static scenes and objects to motion.
- Text + Image-to-Video: Combine textual instructions and images for nuanced, controllable video output.
- Stylization and Render: Transfer the style of a provided image or prompt to video frames, or turn untextured renders into realistic scenes.
- Storyboard: Convert a sequence of mockups into fully animated video narratives.
7.8. Perplexity AI (2025)
- Direct citations and clickable sources on every answer, supporting instant verification.
- Multi-model switching, so users can refine queries with different LLMs for varied perspectives.
- Contextual memory for follow-up Q&A within a single conversational session.
- Advanced filtering and custom search parameters (especially via API) for enterprise, research, or developer integrations.
- Rapid trend analysis, market insight, academic meta-reviews, and technical troubleshooting—all in natural language.
References
- (2006) — Newell and Simon’s Logic Theorist: Historical Background and Impact on Cognitive Modeling. [CrossRef]
- (1964) — Experiments with a theorem-utilizing program. [CrossRef]
- (2022) — What is Mathematics and What Should it Be https://arxiv.org/pdf/1704.05560. [CrossRef]
- (2008) — Principia Mathematica and the Development of Automated Theorem Proving. [CrossRef]
- (2006) — Newell and Simon’s Logic Theorist: Historical Background and Impact on Cognitive Modeling. [CrossRef]
- (2023) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s00146-023-01793-z.pdf. [CrossRef]
- (2024) — Representing Rule-based Chatbots with Transformers https://arxiv.org/pdf/2407.10949. [CrossRef]
- (2024) — ELIZA Reinterpreted: The world’s first chatbot was not intended as a chatbot at all https://arxiv.org/pdf/2406.17650. [CrossRef]
- (2023) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s00146-023-01793-z.pdf. [CrossRef]
- (1988) — SHRDLU, Procedures, Mini-World. [CrossRef]
- (2005) — SHRDLU. [CrossRef]
- (2019) — Survey on frontiers of language and robotics https://www.tandfonline.com/doi/pdf/10.1080/01691864.2019.1632223?needAccess=true. [CrossRef]
- (1970) — Applications of “Artificial Intelligence” for Chemical Inference, VI. Approach to a General Method of Interpreting Low Resolution Mass Spectra with a Computer. [CrossRef]
- (1974) — Inference of molecular structure. [CrossRef]
- (2025) — Challenges for artificial cognitive systems https://arxiv.org/pdf/2505.20339. [CrossRef]
- (1977) — Production Rules as a Representation for a Knowledge-Based Consultation Program. [CrossRef]
- (1974) — A rule-based computer program for advising physicians regarding antimicrobial therapy selection https://dl.acm.org/doi/pdf/10.1145/1408800.1408906.
- (2018) — Studying microbial functionality within the gut ecosystem by systems biology https://genesandnutrition.biomedcentral.com/track/pdf/10.1186/s12263-018-0594-6. [CrossRef]
- (2024) — Logic Programming with PROLOG. [CrossRef]
- (2018) — WAM for everyone: a virtual machine for logic programming. [CrossRef]
- (2022) — Fifty Years of Prolog and Beyond https://arxiv.org/pdf/2201.10816. [CrossRef]
- (1978) — History of LISP. [CrossRef]
- (1980) — LISP - notes on its past and future https://dl.acm.org/doi/pdf/10.1145/800087.802782. [CrossRef]
- (1978) — History of LISP https://dl.acm.org/doi/pdf/10.1145/960118.808387. [CrossRef]
- (2022) — Introduction to Soar https://arxiv.org/pdf/2205.03854. [CrossRef]
- (1986) — Soar—A General Problem-Solving Architecture. [CrossRef]
- (2022) — Introduction to Soar https://arxiv.org/pdf/2205.03854. [CrossRef]
- (1986) — Soar—A General Problem-Solving Architecture. [CrossRef]
- (2024) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s44267-024-00065-8.pdf. [CrossRef]
- (2025) — Evidence of sensory error threshold in triggering locomotor adaptations in humans. [CrossRef]
- (2024) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s00348-024-03814-z.pdf. [CrossRef]
- (1986) — Learning representations by back-propagating errors. [CrossRef]
- (1986) — Learning representations by back-propagating errors. [CrossRef]
- (2016) — Introducing Machine Learning Concepts with WEKA https://hdl.handle.net/10289/13170. [CrossRef]
- (2016) — Introducing Machine Learning Concepts with WEKA https://hdl.handle.net/10289/13170. [CrossRef]
- (2025) — Bayesian Workflow for Generative Modeling in Computational Psychiatry. [CrossRef]
- (2011) — Soviet Archaeological Expedition as a Research Object. [CrossRef]
- (2020) — Robust Predictive Models in Clinical Data—Random Forest and Support Vector Machines https://link.springer.com/content/pdf/10.1007%2F978-3-030-47994-7_13.pdf. [CrossRef]
- (2024) — Image Processing Using OpenCV. [CrossRef]
- (2023) — A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS https://arxiv.org/pdf/2304.00501. [CrossRef]
- (2022) — Object Detection in 20 Years: A Survey https://arxiv.org/pdf/1905.05055. [CrossRef]
- (2020) — Machine Learning in Python: Main Developments and Technology Trends in Data Science, Machine Learning, and Artificial Intelligence https://www.mdpi.com/2078-2489/11/4/193/pdf?version=1587379966. [CrossRef]
- (2022) — Neural Networks and Learning Algorithms in MATLAB. [CrossRef]
- (2023) — Quantifying the Benefit of Artificial Intelligence for Scientific Research https://arxiv.org/pdf/2304.10578. [CrossRef]
- (2024) — Implementing a Hierarchical Deep Learning Approach for Simulating Multilevel Auction Data. [CrossRef]
- (2022) — Caffe con Troll: Shallow Ideas to Speed Up Deep Learning https://arxiv.org/pdf/1504.04343. [CrossRef]
- (2014) — Caffe. [CrossRef]
- (2022) — Comparative Study of Deep Learning Software Frameworks https://arxiv.org/pdf/1511.06435. [CrossRef]
- (2022) — TensorFlow: A system for large-scale machine learning https://arxiv.org/pdf/1605.08695. [CrossRef]
- (2023) — Artificial Intelligence Index Report 2023 https://arxiv.org/pdf/2310.03715. [CrossRef]
- (2017) — kerasR: R Interface to the Keras Deep Learning Library https://joss.theoj.org/papers/10.21105/joss.00296.pdf. [CrossRef]
- (2021) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s10586-021-03240-4.pdf. [CrossRef]
- (2024) — Practical machine learning with PyTorch https://jose.theoj.org/papers/10.21105/jose.00239.pdf. [CrossRef]
- (2025) — PyG 2.0: Scalable Learning on Real World Graphs https://arxiv.org/pdf/2507.16991. [CrossRef]
- (2022) — A detailed comparative study of open source deep learning frameworks https://arxiv.org/pdf/1903.00102. [CrossRef]
- (2022) — Benchmarking State-of-the-Art Deep Learning Software Tools https://arxiv.org/pdf/1608.07249. [CrossRef]
- (2022) — MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems https://arxiv.org/pdf/1512.01274. [CrossRef]
- (2021) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s10586-021-03240-4.pdf. [CrossRef]
- (2016) — Mastering the game of Go with deep neural networks and tree search. [CrossRef]
- (2016) — Mastering the game of Go with deep neural networks and tree search. [CrossRef]
- (2022) — Benchmarking Deep Reinforcement Learning for Continuous Control https://arxiv.org/pdf/1604.06778. [CrossRef]
- (2024) — Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning https://arxiv.org/pdf/2402.03046. [CrossRef]
- (2023) — h2oGPT: Democratizing Large Language Models https://arxiv.org/pdf/2306.08161. [CrossRef]
- (2024) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s10462-024-10726-1.pdf. [CrossRef]
- (2022) — Attention Is All You Need https://arxiv.org/pdf/1706.03762. [CrossRef]
- (2023) — Introduction to Transformers: an NLP Perspective https://arxiv.org/pdf/2311.17633. [CrossRef]
- (2023) — Introduction to Transformers: an NLP Perspective https://arxiv.org/pdf/2311.17633. [CrossRef]
- (2025) — Research and Implementation of Text Classification Based on BERT Model https://dl.acm.org/doi/pdf/10.1145/3746709.3746721. [CrossRef]
- (2021) — Encoder-Attention-Based Automatic Term Recognition (EA-ATR). [CrossRef]
- (2024) — BERT: A Paradigm Shift in Natural Language Processing. [CrossRef]
- (2022) — Language Models are Few-Shot Learners https://arxiv.org/pdf/2005.14165.
- (2020) — GPT-3: Its Nature, Scope, Limits, and Consequences https://link.springer.com/content/pdf/10.1007/s11023-020-09548-1.pdf. [CrossRef]
- (2025) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s43621-025-00815-8.pdf. [CrossRef]
- (2022) — Language Models are Few-Shot Learners https://arxiv.org/pdf/2005.14165.
- (2022) — StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets https://arxiv.org/pdf/2202.00273. [CrossRef]
- (2022) — State-of-the-Art in the Architecture, Methods and Applications of StyleGAN https://arxiv.org/pdf/2202.14020. [CrossRef]
- (2022) — Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer https://arxiv.org/pdf/1910.10683. [CrossRef]
- (2022) — Attention Is All You Need https://arxiv.org/pdf/1706.03762. [CrossRef]
- (2025) — A Comprehensive Overview of Large Language Models. [CrossRef]
- (2022) — Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer https://arxiv.org/pdf/1910.10683. [CrossRef]
- (2025) — Foundations of Large Language Models https://arxiv.org/pdf/2501.09223. [CrossRef]
- (2024) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s12599-024-00851-0.pdf. [CrossRef]
- (2023) — Artificial Intelligence Index Report 2023 https://arxiv.org/pdf/2310.03715. [CrossRef]
- (2018) — AllenNLP: A Deep Semantic Natural Language Processing Platform https://www.aclweb.org/anthology/W18-2501.pdf. [CrossRef]
- (2023) — Catwalk: A Unified Language Model Evaluation Framework for Many Datasets https://arxiv.org/pdf/2312.10253. [CrossRef]
- (2022) — HuggingFace’s Transformers: State-of-the-art Natural Language Processing https://arxiv.org/pdf/1910.03771. [CrossRef]
- (2023) — Transformer models: an introduction and catalog https://arxiv.org/pdf/2302.07730. [CrossRef]
- (2024) — Large language models (LLMs): survey, technical frameworks, and future challenges. [CrossRef]
- (2023) — To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing https://arxiv.org/pdf/2310.07715. [CrossRef]
- (2024) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s12599-024-00851-0.pdf. [CrossRef]
- (2020) — GPT-3: Its Nature, Scope, Limits, and Consequences https://link.springer.com/content/pdf/10.1007/s11023-020-09548-1.pdf. [CrossRef]
- (2022) — Language Models are Few-Shot Learners https://arxiv.org/pdf/2005.14165.
- (2023) — Sparks of Artificial General Intelligence: Early experiments with GPT-4 https://arxiv.org/pdf/2303.12712. [CrossRef]
- (2024) — Large Language Models: A Survey https://arxiv.org/pdf/2402.06196. [CrossRef]
- (2022) — Reproducible scaling laws for contrastive language-image learning https://arxiv.org/pdf/2212.07143. [CrossRef]
- (2022) — Learning Transferable Visual Models From Natural Language Supervision https://arxiv.org/pdf/2103.00020. [CrossRef]
- (2023) — Demystifying CLIP Data https://arxiv.org/pdf/2309.16671. [CrossRef]
- (2023) — CLIP in Medical Imaging: A Survey https://arxiv.org/pdf/2312.07353. [CrossRef]
- (2022) — Learning Transferable Visual Models From Natural Language Supervision https://arxiv.org/pdf/2103.00020. [CrossRef]
- (2022) — Zero-Shot Text-to-Image Generation https://arxiv.org/pdf/2102.12092. [CrossRef]
- (2022) — Large-scale Text-to-Image Generation Models for Visual Artists’ Creative Works https://arxiv.org/pdf/2210.08477. [CrossRef]
- (2023) — Text-to-image Diffusion Models in Generative AI: A Survey https://arxiv.org/pdf/2303.07909. [CrossRef]
- (2023) — SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis https://arxiv.org/pdf/2307.01952. [CrossRef]
- (2022) — Learning Transferable Visual Models From Natural Language Supervision https://arxiv.org/pdf/2103.00020. [CrossRef]
- (2023) — Art and the science of generative AI: A deeper dive https://arxiv.org/pdf/2306.04141. [CrossRef]
- (2025) — Role of tectonic stress and topography on repeated lateral dikes: application to the 1975-1984 Krafla and 2023-2025 Svartsengi rifting episodes in Iceland. https://link.springer.com/content/pdf/10.1007/s10462-025-11110-3.pdf. [CrossRef]
- (2024) — Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models https://arxiv.org/pdf/2409.10695. [CrossRef]
- (2023) — AI Art and its Impact on Artists https://dl.acm.org/doi/pdf/10.1145/3600211.3604681. [CrossRef]
- (2022) — AI Art in Architecture https://arxiv.org/pdf/2212.09399. [CrossRef]
- (2023) — AI Art and its Impact on Artists https://dl.acm.org/doi/pdf/10.1145/3600211.3604681. [CrossRef]
- (2023) — Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages https://arxiv.org/pdf/2303.01037. [CrossRef]
- (2023) — Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling https://arxiv.org/pdf/2311.00430. [CrossRef]
- (2024) — Fine-tuning Whisper on Low-Resource Languages for Real-World Applications https://arxiv.org/pdf/2412.15726. [CrossRef]
- (2024) — OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer https://arxiv.org/pdf/2401.16658. [CrossRef]
- (2024) — Transforming Conversations with AI—A Comprehensive Study of ChatGPT. [CrossRef]
- (2024) — Transforming Conversations with AI—A Comprehensive Study of ChatGPT. [CrossRef]
- (2024) — Systematic exploration and in-depth analysis of ChatGPT architectures progression. [CrossRef]
- (2024) — The Educational Affordances and Challenges of ChatGPT: State of the Field. [CrossRef]
- (2023) — ChatGPT and large language models in academia: opportunities and challenges. [CrossRef]
- (2024) — BLOOM: A 176B-Parameter Open-Access Multilingual Language Model https://arxiv.org/pdf/2211.05100. [CrossRef]
- (2024) — BLOOM: A 176B-Parameter Open-Access Multilingual Language Model https://arxiv.org/pdf/2211.05100. [CrossRef]
- (2024) — Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence https://arxiv.org/pdf/2405.15802. [CrossRef]
- (2024) — Near to Mid-term Risks and Opportunities of Open-Source Generative AI https://arxiv.org/pdf/2404.17047. [CrossRef]
- (2023) — Code Llama: Open Foundation Models for Code https://arxiv.org/pdf/2308.12950. [CrossRef]
- (2023) — LLaMA: Open and Efficient Foundation Language Models https://arxiv.org/pdf/2302.13971. [CrossRef]
- (2023) — Llama 2: Open Foundation and Fine-Tuned Chat Models https://arxiv.org/pdf/2307.09288. [CrossRef]
- (2024) — On the Societal Impact of Open Foundation Models https://arxiv.org/pdf/2403.07918. [CrossRef]
- (2024) — Google Gemini as a next generation AI educational tool: a review of emerging educational technology. [CrossRef]
- (2023) — Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/pdf/2312.11805. [CrossRef]
- (2024) — Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context https://arxiv.org/pdf/2403.05530. [CrossRef]
- (2025) — Comparative performance of neurosurgery-specific, peer-reviewed versus general AI chatbots in bilingual board examinations: evaluating accuracy, consistency, and error minimization strategies. [CrossRef]
- (2023) — Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/pdf/2312.11805. [CrossRef]
- (2023) — Frontier AI Regulation: Managing Emerging Risks to Public Safety https://arxiv.org/pdf/2307.03718. [CrossRef]
- (2024) — The Ethics of Advanced AI Assistants https://arxiv.org/pdf/2404.16244. [CrossRef]
- (2024) — Open Problems in Technical AI Governance https://arxiv.org/pdf/2407.14981. [CrossRef]
- (2025) — GPT-4o: The Cutting-Edge Advancement in Multimodal LLM. [CrossRef]
- (2024) — GPT-4o System Card https://arxiv.org/pdf/2410.21276. [CrossRef]
- (2025) — GPT-4o: The Cutting-Edge Advancement in Multimodal LLM. [CrossRef]
- (2024) — Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities https://arxiv.org/pdf/2410.11190. [CrossRef]
- (2024) — From Efficient Multimodal Models to World Models: A Survey https://arxiv.org/pdf/2407.00118. [CrossRef]
- (2024) — Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models https://arxiv.org/pdf/2402.17177. [CrossRef]
- (2024) — Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models https://arxiv.org/pdf/2402.17177. [CrossRef]
- (2024) — Sora OpenAI’s Prelude: Social Media Perspectives on Sora OpenAI and the Future of AI Video Generation https://arxiv.org/pdf/2403.14665. [CrossRef]
- (2024) — Sora OpenAI’s Prelude: Social Media Perspectives on Sora OpenAI and the Future of AI Video Generation https://arxiv.org/pdf/2403.14665. [CrossRef]
- (2023) — Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/pdf/2312.11805. [CrossRef]
- (2023) — Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/pdf/2312.11805. [CrossRef]
- (2024) — Gemma: Open Models Based on Gemini Research and Technology https://arxiv.org/pdf/2403.08295. [CrossRef]
- (2024) — Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context https://arxiv.org/pdf/2403.05530. [CrossRef]
- (2024) — Mixtral of Experts https://arxiv.org/pdf/2401.04088. [CrossRef]
- (2023) — Mistral 7B https://arxiv.org/pdf/2310.06825. [CrossRef]
- (2024) — Linq-Embed-Mistral Technical Report https://arxiv.org/pdf/2412.03223. [CrossRef]
- (2023) — A Comprehensive Overview of Large Language Models https://arxiv.org/pdf/2307.06435. [CrossRef]
- (2023) — MemGPT: Towards LLMs as Operating Systems https://arxiv.org/pdf/2310.08560. [CrossRef]
- (2024) — The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use https://arxiv.org/pdf/2411.10323. [CrossRef]
- (2024) — Clio: Privacy-Preserving Insights into Real-World AI Use https://arxiv.org/pdf/2412.13678. [CrossRef]
- (2024) — DeepSeek-V3 Technical Report https://arxiv.org/pdf/2412.19437. [CrossRef]
- (2024) — DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model https://arxiv.org/pdf/2405.04434. [CrossRef]
- (2025) — DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models https://arxiv.org/pdf/2402.03300. [CrossRef]
- (2023) — VideoPoet: A Large Language Model for Zero-Shot Video Generation https://arxiv.org/pdf/2312.14125. [CrossRef]
- (2023) — VideoPoet: A Large Language Model for Zero-Shot Video Generation https://arxiv.org/pdf/2312.14125. [CrossRef]
- (2023) — Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/pdf/2312.11805. [CrossRef]
- (2024) — GPT-4o System Card https://arxiv.org/pdf/2410.21276. [CrossRef]
- (2024) — A Survey on Retrieval-Augmented Text Generation for Large Language Models https://arxiv.org/pdf/2404.10981. [CrossRef]
- (2025) — Singh, G. — *The Multiple Approaches for Drug-Drug Interaction Extraction using Machine Learning and Transformer-Based Model* bioRxiv, 2025-10. [CrossRef]
- (2025) — Singh, G. — *A Review of Multimodal Vision–Language Models: Foundations, Applications, and Future Directions* Preprints. [CrossRef]
- (2024) — Singh, G., Singh, S., Rehmani, N., Kumari, P., & Varshini, S. V. — *The Role of Data Analytics in Driving Business Innovation and Economic Growth: A Comparative Study Across Industries* International Journal of Innovative Research in Engineering and Management (IJIREM), Vol. 11, Issue 4, pp. 33–38. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).