Language-Guided Segmentation of Medical Images: A Review of Foundation Models

Saqib Qamar

doi:10.20944/preprints202606.1606.v1

Submitted: 19 June 2026

Posted: 22 June 2026

Abstract

Vision-language foundation models have transformed medical image segmentation over the past three years. These models pair large image encoders with text prompts, so a single model can segment many anatomical structures, lesion types, and imaging modalities through natural language. This survey reviews vision-language foundation models designed for medical image segmentation. We describe the technical background from contrastive vision-language pretraining to the Segment Anything Model and its medical variants. We propose a three-part taxonomy that covers text-prompt guided models, large language model embedded architectures, and hybrid frameworks. We examine adaptation strategies such as full fine-tuning, Low-Rank Adaptation, adapters, and prompt engineering. We organize the literature by modality and cover computed tomography, magnetic resonance imaging, pathology, chest radiography, and ultrasound. We discuss clinical uses such as organ segmentation, tumor delineation, and radiotherapy planning. We summarize evaluation metrics and benchmark datasets. We identify four open challenges: prompt dependence, mask hallucination, slow volumetric inference, and limited annotated data. We close with a research roadmap for trustworthy deployment, multimodal pretraining, and clinical integration.

Keywords: vision-language models; medical image segmentation; foundation models; Segment Anything Model; text-prompted segmentation

Subject: Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
