Submitted:
08 August 2025
Posted:
11 August 2025
You are already at the latest version
Abstract
The rapid proliferation of Large Language Models (LLMs) has dramatically advanced natural language processing (NLP), unlocking unprecedented capabilities in language generation, reasoning, and autonomous decision support. While these systems have enabled remarkable innovation, their widespread release as free and publicly accessible tools through open platforms has simultaneously introduced a new class of security vulnerabilities. This study explores how malicious actors can exploit such openly available, free and in public platforms LLMs by leveraging benign-sounding prompts that strategically bypass ethical safeguards-a process known as jailbreaking LLMs. We introduce a structured exploitation pipeline framework—termed the Silent Adversary Framework—which captures the sequential phases of LLM misuse, from intent obfuscation to real-world operational deployment. This framework is designed not only to formalize the process of covert exploitation but also to surface the fundamental safety challenges posed by current-generation LLMs, particularly the inability of alignment mechanisms to detect contextually veiled malicious intent. Through empirical testing across ten high-risk scenarios—including document forgery, synthetic identity creation, border evasion logistics, disinformation scripting, and insider persuasion—we evaluate how multiple leading models respond to adversarial prompt engineering. These scenarios are grounded in real-world border security operations, offering concrete illustrations of how generative models could be weaponized in silent but strategic ways. The results reveal that even state-of-the-art LLMs remain susceptible to manipulation, especially when deployed offline or in lightly moderated environments—conditions increasingly common due to their unrestricted availability. By bridging experimental findings with operational risk analysis, this work contributes to the growing field of AI safety and policy. We conclude with recommendations for strengthening semantic safeguards, improving alignment protocols, and introducing usage regulations tailored to national security-sensitive domains.
Keywords:
1. Introduction
- How can malicious actors exploit free and public LLMs using neutral or obfuscated prompts that do not trigger standard content moderation mechanisms?
- What specific operational risks does the exploitation of LLMs pose to the integrity and effectiveness of global border security systems?
2. Related Work
3. The Silent Adversary Framework
3.1. Obfuscation of Malicious Intent
3.2. Exploiting AI Capabilities
3.3. Aggregation and Refinement
3.4. Deployment in Real-World Operations
3.5. Feedback Loop
3.6. Connection to Specific Risk Scenarios
4. Results
4.1. Fake News Generation
4.2. Synthetic Identities
4.3. Deepfake Content
4.4. Document Forgery
4.5. Social Engineering at Scale
4.6. Mapping Security Gaps
4.7. Logistic Optimization
4.8. Targeted Persuasion Campaigns
4.9. Explosive Manufacturing Guidance
4.10. Firearms Acquisition Facilitation

5. Discussion
6. Conclusions
Author Contributions
Funding
Disclaimer
Data Availability Statement
Conflicts of Interest
Abbreviations
| LLMs | Large Language Models |
| NLP | Natural Language Processing |
Appendix A
References
- M. E. Klontzas, S. C. Fanni, and E. Neri, Eds., Introduction to Artificial Intelligence. in Imaging Informatics for Healthcare Professionals. Cham: Springer International Publishing, 2023. [CrossRef]
- D. Khurana, A. Koli, K. Khatter, and S. Singh, “Natural language processing: state of the art, current trends and challenges,” Multimed. Tools Appl., vol. 82, no. 3, pp. 3713–3744, Jan. 2023. [CrossRef]
- A. Shamshiri, K. R. Ryu, and J. Y. Park, “Text mining and natural language processing in construction,” Autom. Constr., vol. 158, p. 105200, Feb. 2024. [CrossRef]
- W. Khan, A. Daud, K. Khan, S. Muhammad, and R. Haq, “Exploring the frontiers of deep learning and natural language processing: A comprehensive overview of key challenges and emerging trends,” Nat. Lang. Process. J., vol. 4, p. 100026, Sep. 2023. [CrossRef]
- B. K. Verma and A. K. Yadav, “Software security with natural language processing and vulnerability scoring using machine learning approach,” J. Ambient Intell. Humaniz. Comput., vol. 15, no. 4, pp. 2641–2651, Apr. 2024. [CrossRef]
- Y. Chang et al., “A Survey on Evaluation of Large Language Models,” ACM Trans. Intell. Syst. Technol., vol. 15, no. 3, pp. 1–45, Jun. 2024. [CrossRef]
- J. Kaddour, J. Harris, M. Mozes, H. Bradley, R. Raileanu, and R. McHardy, “Challenges and Applications of Large Language Models,” Jul. 19, 2023, arXiv: arXiv:2307.10169. [CrossRef]
- H. Naveed et al., “A Comprehensive Overview of Large Language Models,” Oct. 17, 2024, arXiv: arXiv:2307.06435. [CrossRef]
- ShanahanMurray, “Talking about Large Language Models,” Commun. ACM, Jan. 2024. [CrossRef]
- S. Kukreja, T. Kumar, A. Purohit, A. Dasgupta, and D. Guha, “A Literature Survey on Open Source Large Language Models,” in Proceedings of the 2024 7th International Conference on Computers in Management and Business, Singapore Singapore: ACM, Jan. 2024, pp. 133–143. [CrossRef]
- M. U. Hadi et al., “A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage,” Jul. 10, 2023. [CrossRef]
- B. Min et al., “Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey,” ACM Comput. Surv., vol. 56, no. 2, pp. 1–40, Feb. 2024. [CrossRef]
- A. Zubiaga, “Natural language processing in the era of large language models,” Front. Artif. Intell., vol. 6, p. 1350306, Jan. 2024. [CrossRef]
- M. Du, F. He, N. Zou, D. Tao, and X. Hu, “Shortcut Learning of Large Language Models in Natural Language Understanding,” Commun. ACM, vol. 67, no. 1, pp. 110–120, Jan. 2024. [CrossRef]
- E. Ferrara, “GenAI against humanity: nefarious applications of generative artificial intelligence and large language models,” J. Comput. Soc. Sci., vol. 7, no. 1, pp. 549–569, Apr. 2024. [CrossRef]
- L. Weidinger et al., “Ethical and social risks of harm from Language Models,” Dec. 08, 2021, arXiv: arXiv:2112.04359. [CrossRef]
- M. Brundage et al., “The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation,” 2018, arXiv. [CrossRef]
- “ChatGPT - the impact of Large Language Models on Law Enforcement,” Europol. Accessed: Aug. 03, 2025. [Online]. Available: https://www.europol.europa.eu/publications-events/publications/chatgpt-impact-of-large-language-models-law-enforcement.
- B. (Ctr) Greco, “Impacts of Adversarial Use of Generative AI on Homeland Security”.
- J.-P. Rivera, G. Mukobi, A. Reuel, M. Lamparth, C. Smith, and J. Schneider, “Escalation Risks from Language Models in Military and Diplomatic Decision-Making,” in The 2024 ACM Conference on Fairness, Accountability, and Transparency, Rio de Janeiro Brazil: ACM, Jun. 2024, pp. 836–898. [CrossRef]
- F. Perez and I. Ribeiro, “Ignore Previous Prompt: Attack Techniques For Language Models,” Nov. 17, 2022, arXiv: arXiv:2211.09527. [CrossRef]
- H. Zhang et al., “BadRobot: Jailbreaking Embodied LLMs in the Physical World,” Feb. 04, 2025, arXiv: arXiv:2407.20242. [CrossRef]
- R. Mohawesh, M. A. Ottom, and H. B. Salameh, “A data-driven risk assessment of cybersecurity challenges posed by generative AI,” Decis. Anal. J., vol. 15, p. 100580, Jun. 2025. [CrossRef]
- C. Patsakis, F. Casino, and N. Lykousas, “Assessing LLMs in malicious code deobfuscation of real-world malware campaigns,” Expert Syst. Appl., vol. 256, p. 124912, Dec. 2024. [CrossRef]
- H. Wang et al., “A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy,” Jan. 16, 2025, arXiv: arXiv:2501.09431. [CrossRef]
- M. Beckerich, L. Plein, and S. Coronado, “RatGPT: Turning online LLMs into Proxies for Malware Attacks,” Sep. 07, 2023, arXiv: arXiv:2308.09183. [CrossRef]
- “Adversarial Misuse of Generative AI,” Google Cloud Blog. Accessed: Apr. 26, 2025. [Online]. Available: https://cloud.google.com/blog/topics/threat-intelligence/adversarial-misuse-generative-ai.
- Y. Yao, J. Duan, K. Xu, Y. Cai, Z. Sun, and Y. Zhang, “A survey on large language model (LLM) security and privacy: The Good, The Bad, and The Ugly,” High-Confid. Comput., vol. 4, no. 2, p. 100211, Jun. 2024. [CrossRef]
- M. Mozes, X. He, B. Kleinberg, and L. D. Griffin, “Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities,” Aug. 24, 2023, arXiv: arXiv:2308.12833. [CrossRef]
- C. Porlou et al., “Optimizing an LLM Prompt for Accurate Data Extraction from Firearm-Related Listings in Dark Web Marketplaces,” in 2024 IEEE International Conference on Big Data (BigData), Washington, DC, USA: IEEE, Dec. 2024, pp. 2821–2830. [CrossRef]
- M. Drolet, “10 Ways Cybercriminals Can Abuse Large Language Models,” Forbes. Accessed: Apr. 26, 2025. [Online]. Available: https://www.forbes.com/councils/forbestechcouncil/2023/06/30/10-ways-cybercriminals-can-abuse-large-language-models/.
- John Arquilla, David Ronfeldt, Networks and Netwars: The Future of Terror, Crime, and Militancy. RAND Corporation, 2001. [CrossRef]










| No. | Categories | Scenarios | Impact |
|---|---|---|---|
| 1 | Disinformation and Psychological Operations | Fake News Generation | LLMs could create and disseminate false narratives about border incidents, refugee crises, or misconduct by border agents, stirring public unrest, destabilizing political decision-making, and undermining trust in institutions. |
| Synthetic Identities | AI can generate highly realistic backstories, passports, and social media footprints, allowing smugglers, traffickers, or infiltrators to pass through identity verification systems unnoticed. | ||
| Deepfake Content | AI-generated videos showing fabricated abuse or illegal activities by border patrol units could provoke international outrage, weaken diplomatic relations, and fuel legal or humanitarian interventions that compromise operational readiness. | ||
| 2 | Bypassing Identity Verification | Document Forgery | Generative AI can produce authentic-looking passports, visas, and ID cards that traditional human and automated checks struggle to detect, facilitating the illegal entry of criminals, terrorists, or traffickers into Europe. |
| 3 | Exploiting Border Systems | Social Engineering at Scale | LLM-generated phishing attacks could target customs officers, border agents, or internal security personnel, leading to information leakage, credential theft, or manipulation of internal processes. |
| Mapping Security Gaps | AI models could analyze open-source information (reports, leaks, satellite imagery) to find weaknesses in border patrol schedules, sensor placements, or physical infrastructure, enabling optimized illicit crossings. | ||
| 4 | Operational Coordination | Logistics Optimization | AI could generate dynamic, real-time strategies for illegal crossings by analyzing variables such as patrol movements, weather conditions, or terrain data, reducing the risk of interception. |
| 5 | Insider Recruitment | Targeted Persuasion Campaigns | AI psychographic profiling could help craft highly customized recruitment pitches to vulnerable customs agents or border staff, increasing the risk of insider threats and corruption within border agencies. |
| 6 | Weaponization of LLM Knowledge | Explosives Manufacturing Guidance | LLMs can be prompted to provide step-by-step instructions for constructing homemade explosives using readily available materials, enabling malicious actors to plan attacks targeting border facilities or patrols. |
| Firearms Acquisition Facilitation | Synthetic identities |
| Category | Risk Scenario | Prompt Cover / Framing | GPT 4,5-4o | Claude | Gemini 2.5 Flash | Grok | DeepSeek | Runway ML Gen-2, VLM |
|---|---|---|---|---|---|---|---|---|
| Disinformation & Psychological Operations | Fake News Generation | Creative writing exercise / Political science case study | √ | √ | √ | √ | √ | x |
| Disinformation & Psychological Operations | Synthetic Identities | Character development for fiction / World-building | √ | √ | √ | √ | √ | x |
| Disinformation & Psychological Operations | Deepfake Content | Media literacy training / Journalism training | √ | x | x | x | x | √ |
| Bypassing Identity Verification | Document Forgery | Creative design project / Novel world-building | x | x | x | x | x | x |
| Exploiting Border Systems | Social Engineering at Scale | Cybersecurity awareness training / Spear-phishing simulation | √ | √ | √ | √ | √ | x |
| Exploiting Border Systems | Mapping Security Gaps | Infrastructure resilience study / Border management optimization | √ | √ | √ | √ | √ | x |
| Operational Coordination | Logistics Optimization | Strategic mobility planning / Emergency evacuation route planning | √ | √ | √ | √ | √ | x |
| Insider Recruitment | Targeted Persuasion Campaigns | Psychological influence research / Motivational coaching scripts | √ | √ | √ | √ | √ | x |
| Weaponization of LLM Knowledge | Explosives Manufacturing Guidance | Chemistry education project | √ | x | √ | √ | √ | x |
| Weaponization of LLM Knowledge | Firearms Acquisition Facilitation | Cybersecurity and criminology research | √ | √ | √ | √ | √ | x |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).