Submitted:
18 December 2025
Posted:
19 December 2025
You are already at the latest version
Abstract

Keywords:
I. Introduction
A. Limitations of Existing Approaches

B. Proposed Approach
C. Applications
D. Related Work
II. Methods
A. System Architecture
B. Tool Ingestion and Indexing
C. Retrieval and Ranking
D. Dynamic Expansion Pattern
E. Implementation Details
F. Evaluation
III. Results
A. Selection Accuracy
B. Latency and Token Savings
IV. Discussion
V. Conclusions
References
- Schick, T.; et al. Toolformer: language models can teach themselves to use tools. In Proceedings of the 37th International Conference on Neural Information Processing Systems, in NIPS ’23, 2023; Curran Associates Inc.: Red Hook, NY, USA. [Google Scholar]
- Yao, S.; et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv:2210.03629. 2023. [Google Scholar] [CrossRef]
- Lewis, P.; et al. Retrieval-augmented generation for knowledge-intensive NLP tasks. In Proceedings of the 34th International Conference on Neural Information Processing Systems, in NIPS ’20, 2020; Curran Associates Inc.: Red Hook, NY, USA. [Google Scholar]
- Guu, K.; et al. REALM: retrieval-augmented language model pre-training. In Proceedings of the 37th International Conference on Neural Information Processing Systems, in NIPS ’23. Red Hook, NY, USA: Curran Associates Inc., 2020. [Google Scholar]
- Karpukhin, V.; et al. Dense Passage Retrieval for Open-Domain Question Answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP); Webber, B., Cohn, T., He, Y., Liu, Y., Eds.; Association for Computational Linguistics: Online, Nov 2020; pp. 6769–6781. [Google Scholar] [CrossRef]
- Reimers, N.; et al. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP); Inui, K., Jiang, J., Ng, V., Wan, X., Eds.; Association for Computational Linguistics: Hong Kong, China, Nov 2019; pp. 3982–3992. [Google Scholar] [CrossRef]
- Johnson, J.; Douze, M.; Jégou, H.; Johnson, J.; Douze, M.; Jégou, H. Billion-Scale Similarity Search with GPUs. IEEE Transactions on Big Data 2021, vol. 7(no. 3), 535–547. [Google Scholar] [CrossRef]
- Shi, J.; et al. Prompt Injection Attack to Tool Selection in LLM Agents. arXiv:2504.19793. doi: 10.48550/arXiv.2504.19793. 2025. [Google Scholar] [CrossRef]
- Dao, T.; et al. FLASHATTENTION: fast and memory-efficient exact attention with IO-awareness. In Proceedings of the 36th International Conference on Neural Information Processing Systems, in NIPS ’22, 2022; Curran Associates Inc.: Red Hook, NY, USA. [Google Scholar]
- Munkhdalai, T.; Faruqui, M.; Gopal, S.; Munkhdalai, T.; Faruqui, M.; Gopal, S. Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention. arXiv:2404.07143. 2024. [Google Scholar] [CrossRef]
- [11]A. J. Oche et al., “A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions,” July 25, 2025. arXiv:2507.18910. [CrossRef]
- Malkov, Y. A.; Yashunin, D. A.; Malkov, Y. A.; Yashunin, D. A. Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs. IEEE Trans. Pattern Anal. Mach. Intell. 2020, vol. 42(no. 4), 824–836. [Google Scholar] [CrossRef] [PubMed]
- Lumer, E.; et al. Toolshed: Scale Tool-Equipped Agents with Advanced RAG-Tool Fusion and Tool Knowledge Bases. arXiv:2410.14594. 2024. [Google Scholar] [CrossRef]
- Khattab, M. Zaharia; Khattab, O.; Zaharia, M. ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, in SIGIR ’20. New York, NY, USA: Association for Computing Machinery, 2020; 2020; pp. 39–48. [Google Scholar] [CrossRef]
- Nogueira, R.; Cho, K.; Nogueira, R.; Cho, K. Passage Re-ranking with BERT. arXiv:1901.04085. 2020. [Google Scholar] [CrossRef]
- DeepEval, “Tool Correctness,” DeepEval. Available online: https://deepeval.com/docs/metrics-tool-correctness.
- Huang, Y.; et al. “MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use,” in The Twelfth International Conference on Learning Representations, 2024. Available online: https://openreview.net/forum?id=R0c2qtalgG.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).