4. Hybrid Quantum–Classical Models in COVID-19 Diagnosis
Hybrid quantum–classical learning has been actively investigated for COVID-19 imaging as a pragmatic strategy to exploit quantum representations while preserving the robustness of classical deep learning. In all reported studies, quantum components are not used as standalone learners. However, they are integrated into classical pipelines, typically either as quantum convolutional operators or as variational quantum classifiers operating on low-dimensional features extracted from chest X-ray (CXR) or computed tomography (CT) images. This section critically reviews the existing hybrid quantum–classical COVID-19 imaging studies by analyzing their methodological design, datasets, experimental outcomes, and reported limitations. A comprehensive hybrid quantum convolutional neural network (QCNN) for COVID-19 chest X-ray classification was presented in [
44]. In this work, a classical CNN is augmented with a quanvolutional layer constructed from parameterized quantum circuits, which processes local image patches before classical feature aggregation. The model was evaluated on publicly available CXR datasets from the COVID-19 Radiography Database, in both binary (COVID-19 vs. normal) and multi-class (COVID-19, viral pneumonia, normal) scenarios. The reported results indicate high accuracy in binary classification and competitive performance relative to deeper classical CNNs, while using fewer trainable parameters. However, a noticeable decline in performance was observed for multi-class classification, highlighting the difficulty of distinguishing COVID-19 from other pneumonia types using current hybrid architectures. For CT-based diagnosis, a quantum neural network framework for clinical prognostic analysis of COVID-19 patients was proposed in [
48]. Their approach used variational quantum circuits as classifiers and was trained on a large chest CT dataset comprising several thousand slices. Compared with a classical CNN baseline, the quantum model demonstrated improved diagnostic accuracy and reduced training time, suggesting potential efficiency benefits of quantum-assisted optimization. Despite these promising results, the study relied primarily on slice-level evaluation and lacked extensive external validation, limiting its direct clinical applicability.
To mitigate the limited availability of labeled CT data, a hybrid quantum machine learning architecture combining conditional generative adversarial networks (cGANs) with quantum neural networks was introduced in [
49]. In this pipeline, synthetic CT images were generated to augment both a public COVID-CT dataset and a private hospital dataset prior to classification. The augmented datasets enabled the quantum classifier to achieve competitive performance compared with classical counterparts, particularly in low-data regimes. While this work highlights the potential of data augmentation in hybrid quantum learning, it also raises concerns regarding the reliability of synthetic samples and the risk of inflated performance if patient-level independence is not strictly maintained. Hybrid quantum transfer learning for multi-class CT diagnosis was investigated in [
47]. The proposed framework combined pretrained classical feature extractors with variational quantum classifiers to distinguish COVID-19, community-acquired pneumonia, and normal CT scans. Experiments on public CT datasets demonstrated that hybrid quantum classifiers can effectively operate on compressed feature representations. However, the results also indicated that classification performance is highly sensitive to feature embedding strategies and circuit depth, and that multi-class CT discrimination remains challenging under current qubit limitations. A fully integrated hybrid quantum–classical convolutional architecture for CT imaging was proposed in [
68]. The model embeds quantum processing blocks within a classical CNN and was evaluated on publicly available chest CT datasets for COVID-19 diagnosis. The reported results show strong diagnostic performance and parameter efficiency, supporting the feasibility of hybrid quantum convolution in CT analysis. Nevertheless, the evaluation was limited to a small number of datasets, and the absence of cross-institutional testing constrains conclusions regarding generalization.
Transfer learning–based hybrid quantum models were further explored in [
46], where a pretrained quantum convolutional neural network was proposed for COVID-19 CT classification. In this approach, classical deep networks were employed to extract transferable features, which were subsequently processed by a quantum convolutional classifier. The model was evaluated across multiple CT datasets with varying sizes and class distributions, demonstrating stable performance across heterogeneous data sources. A key strength of this study is its multi-dataset evaluation strategy, while a notable limitation is that quantum execution was conducted primarily in simulation rather than on real quantum hardware.
Beyond binary diagnosis, COVID-19 severity classification from chest CT images was investigated within a hybrid quantum–classical framework in [
69]. The proposed model formulated severity assessment as a multi-level classification problem, aiming to support clinical stratification rather than simple disease detection. Experimental results suggest that hybrid quantum models can capture severity-related patterns in CT images; however, the study is constrained by the limited availability of large, standardized datasets with reliable severity annotations.
On the chest X-ray side, hybrid quantum convolutional neural network models have been proposed for COVID-19 detection, notably by Houssein et al. [
44,
45], who integrated shallow quantum circuits as quanvolutional layers within classical CNN pipelines. In these frameworks, classical preprocessing is first applied to reduce input dimensionality and normalize image intensities, after which local image patches are encoded into variational quantum circuits. When evaluated on public COVID-19 CXR datasets, these models achieved high performance in binary detection tasks, with reported accuracies exceeding 98% in some experimental settings. However, performance in multi-class classification scenarios exhibited noticeable degradation, indicating limited robustness when distinguishing COVID-19 from other pneumonia types. The studies further highlighted that classification accuracy is sensitive to circuit depth, qubit count, and encoding strategies, underscoring the importance of careful quantum architecture design.
Similarly, hybrid quantum transfer learning approaches have been explored, combining pretrained classical encoders with variational quantum classifier heads operating on compressed feature embeddings [
46]. These frameworks demonstrated the feasibility of integrating quantum decision modules into established deep learning pipelines and reported competitive results on public CXR and CT datasets. Nevertheless, the observed performance gains were typically obtained under controlled experimental settings with limited dataset diversity and without external validation.
More recently, quantum-assisted deep learning architectures have been proposed in preprint studies, integrating strong classical backbones such as EfficientNet-B4 with deeper variational quantum circuits for COVID-19 imaging classification [
70]. These works report strong classification performance on the COVIDx chest X-ray dataset and auxiliary benchmarks, including PneumoniaMNIST and OrganAMNIST, alongside systematic ablation analyses examining circuit depth and entanglement patterns. However, as these findings are currently available only as preprints and primarily validated in simulation environments, independent peer-reviewed confirmation and hardware-aware benchmarking remain necessary.
Overall, these studies demonstrate that hybrid quantum–classical models can achieve competitive performance for COVID-19 imaging tasks across both CXR and CT modalities. Nevertheless, the literature consistently reports limitations, including dataset bias, lack of external validation, sensitivity to preprocessing choices, and reliance on simulator-based quantum evaluations. These challenges underscore the need for more rigorous validation protocols, multi-institutional datasets, and hardware-aware benchmarking to establish the true clinical value of hybrid quantum–classical approaches.
4.1. Archetype A: Quanvolution-Based Hybrid Architectures ***
Quanvolution-based hybrid architectures integrate quantum computation into the early stages of the visual processing pipeline, where the quantum module functions as a local nonlinear feature transformation operator. Instead of learning convolutional filters in the classical sense, shallow quantum circuits process small image patches individually, and the resulting measurement statistics are interpreted as feature channels. The downstream classical neural network is then responsible for hierarchical feature aggregation and final decision making. This architectural paradigm is particularly compatible with near-term quantum hardware constraints, as it limits quantum computation to small-scale, patch-level operations, thereby controlling qubit count, circuit depth, and execution overhead. At the same time, it preserves the expressive power and scalability of classical convolutional neural networks (CNNs).
Given an input medical image
(e.g., chest X-ray or CT slice), Classical preprocessing is applied first, including resizing, intensity normalization, and optional denoising. The processed image
is then partitioned into a set of non-overlapping (or weakly overlapping) patches:
Each patch is vectorized into a low-dimensional real-valued feature vector.
which serves as the classical–quantum interface.
Quantum Encoding and Circuit Execution: Each patch vector
is encoded into a quantum state using angle encoding, the most widely adopted scheme in hybrid medical imaging due to its simplicity and numerical stability. Specifically, each component of
is mapped to the rotation angle of a single-qubit gate:
where
qubits are used, and
are scaling parameters ensuring valid angular ranges.
Quantum circuit transformation: After encoding, a shallow quantum circuit
is applied:
Two principal circuit configurations are commonly employed:
Random Quantum Circuits (RQC): circuit parameters are fixed and not optimized, acting as stochastic nonlinear feature maps.
Parameterized Quantum Circuits (PQC): circuit parameters are trainable and optimized jointly with classical network parameters.
Entangling gates (e.g., CNOT) may be introduced to capture intra-patch correlations.
The quantum state
is measured using expectation values of Pauli observables:
producing a feature vector
, where
denotes the number of measured observables.
The vectors
are spatially reassembled to form a quantum-generated feature tensor:
with
and
in the non-overlapping case. This tensor plays a role analogous to the output of classical convolutional filters.
The quantum feature tensor is processed by a conventional CNN consisting of convolutional, pooling, and fully connected layers. All global context modeling, high-level abstraction, and decision boundary learning are performed in this classical stage. Consequently, the quantum module serves as a feature-transformation front-end, while the classical network retains full responsibility for classification performance.
Two training regimes are typically considered:
Fixed quanvolution (RQC): Only classical parameters
are optim
Trainable quanvolution (PQC): Both quantum parameters
and classical parameters
are optimized:
Trainable quanvolution (PQC): Both quantum parameters and classical parameters are optimized:
where gradients with respect to are computed using parameter-shift rules.
The RQC configuration offers superior stability and computational efficiency, whereas PQC-based quanvolution provides greater task adaptivity at the cost of increased optimization complexity.
Quanvolution-based hybrid architectures are particularly effective in scenarios where local texture patterns dominate, such as binary COVID-19 screening from chest X-ray images. Their modular design allows straightforward integration with classical deep learning pipelines and ensures scalability under quantum resource constraints. However, because quantum processing is limited to local patches, the burden of global reasoning and class discrimination remains with the classical network, especially in multi-class and fine-grained diagnostic tasks.
4.2. Archetype B: Classical Feature Extractor with Quantum Classifier Head
In this archetype, the hybrid coupling is explicitly decision-centric: a strong classical backbone performs high-capacity representation learning from medical images, while the quantum module serves as a final classifier head, replacing or augmenting conventional dense layers. This placement reflects a practical constraint of near-term quantum computing: quantum circuits are currently better suited for operating on compact, low-dimensional embeddings rather than high-resolution image tensors. Consequently, the classical network is responsible for transforming raw pixels into a semantically meaningful latent space, after which the quantum head performs a nonlinear decision mapping under a small qubit budget.
Let the input image be
. After standard classical preprocessing, a pretrained convolutional backbone
produces a high-level embedding:
Because
typically exceeds feasible quantum input sizes, an intermediate classical adapter is used to compress the embedding into a small vector compatible with a quantum circuit:
The vector
forms the classical-to-quantum interface. It is encoded into a quantum state through an encoding map
:
where
is the number of qubits, and
is typically chosen such that
under angle encoding, or
is set by available hardware limits.
A parameterized quantum circuit
then acts as the classifier head:
Finally, measurement operators
produce a classical measurement vector:
The vector
is mapped to class logits using either a direct linear layer or a lightweight classical post-processing head
:
where
denotes the number of classes.
Two implementation styles are commonly used within this archetype:
Variational quantum classifier head: The quantum head is directly optimized, and the measurement vector is interpreted as class evidence or as features for a final linear classifier. This configuration emphasizes compactness and direct decision mapping.
Dressed quantum circuit head: A small classical layer is placed before and after the quantum circuit. The pre-quantum adapter improves trainability by shaping the embedding distribution, and the post-quantum layer stabilizes the formation and calibration of logits. Although this increases classical involvement, it typically improves the stability of optimization under limited circuit depth.
Let be the ground-truth label and the supervised loss. In this archetype, the classical backbone parameters , adapter parameters , and quantum parameters are trained either jointly or partially:
compactness and direct decision mapping.
Common strategies include freezing the backbone and training only the adapter and quantum head when data is limited or fine-tuning the backbone when sufficient labeled data is available. Gradients with respect to are computed using standard differentiable quantum programming methods such as parameter-shift rules, enabling end-to-end optimization.
This archetype is most appropriate when:
The imaging modality requires strong global context modeling, such as CT slices.
A reliable pretrained backbone is available and can produce robust embeddings.
Quantum resources are limited, and the quantum module must be compact and shallow.
A key implication is that the quantum head’s contribution is tightly coupled to the quality of the classical embedding. Therefore, fair evaluation requires carefully matched baselines and ablations that isolate the effect of the quantum classifier head from that of classical feature extraction.
4.3. Archetype C: Quantum Feature Extractor with Classical Classifier
Archetype C places the quantum module in the role of a feature extractor operating on compact, classically prepared representations, while a lightweight classical classifier implements the final decision function. This design is motivated by two practical considerations. First, near-term quantum devices cannot process high-dimensional medical images directly; therefore, the input to a quantum circuit must be a low-dimensional vector. Second, using the quantum circuit as a feature generator rather than as a classifier head can provide a structured nonlinear transformation that produces quantum-enhanced features, which may be more discriminative for downstream classical classification, particularly when the classical classifier is intentionally kept simple to reduce overfitting.
Unlike quanvolution-based hybrids, the quantum processing here is not patch-wise. Unlike transfer-learning hybrids with quantum decision heads, the quantum circuit is used primarily to construct a feature representation, and the classification burden is shifted to a classical model operating on the quantum-derived feature space.
Given an input image
, classical preprocessing is applied to obtain
. A compact representation is then constructed through a lightweight classical mapping
, such as dimensionality reduction, shallow feature extraction, or a small embedding network:
Since the quantum circuit requires a small input dimension, a compression step is applied:
The vector defines the classical-to-quantum interface.
Quantum feature extraction module: The quantum module encodes
into a quantum state and applies a parameterized circuit to generate a measurement vector that is interpreted as a learned feature embedding:
Here, serves as the quantum-derived feature vector, typically with chosen to match the needs of the downstream classifier rather than the number of classes.
Classical classifier: A classical classifier
then maps
to class logits and predictions:
The classifier may be a small MLP, logistic regression, SVM, or a custom lightweight decision module.
The coupling mechanism is sequential and can be summarized as:
Classical preprocessing and compact feature construction generate
Quantum processing transforms into a quantum-derived feature vector .
Classical classification maps
to the final prediction .
In this archetype, the quantum circuit is optimized to produce features useful for classification rather than directly producing final class logits. This separation makes the role of the quantum module more explicit and can facilitate interpretability through feature-space analyses.
Given labels
, the training objective minimizes a supervised loss:
where
denotes quantum circuit parameters and
denotes classifier parameters. The classical front-end parameters can be fixed or lightly tuned depending on the design of
and available data. Quantum gradients are typically computed via parameter-shift rules, enabling end-to-end optimization of the quantum feature extractor and the classical classifier.
Archetype C is most appropriate when the goal is to exploit quantum circuits as feature generators under strict resource constraints, while maintaining flexibility in the decision stage via classical classifiers. It is particularly attractive for moderately sized datasets, where full fine-tuning of deep backbones may increase the risk of overfitting. However, performance is sensitive to the quality of the compact representation , since excessive compression can remove clinically relevant patterns before quantum processing. Therefore, careful design of the classical front-end and rigorous reporting of feature dimensionality and circuit configuration are essential for reproducible comparisons.
4.4. Instantiation of the Three Hybrid Architectural Archetypes in Reviewed Studies
The taxonomy introduced in
Section 4.1,
Section 4.2 and
Section 4.3 separates hybrid quantum–classical (HQC) imaging pipelines based on
where quantum processing is injected and
what function it serves (patch-level transformation, decision refinement, or latent feature generation). In survey studies, such a taxonomy is only valuable if it is operational—i.e., if it can consistently map real-world studies onto archetypes in a way that explains observed differences in performance, computational scaling, and validation behavior. Accordingly, this subsection anchors the taxonomy in the seven peer-reviewed studies retained for comparative analysis, explicitly identifying the classical component, the functional role of the quantum module, and the coupling point that determines system-level behavior.
Table 2 exposes a pronounced concentration in Archetypes A and B, indicating that current HQC research largely explores two extremes: (i) maximizing quantum exposure by applying circuits to image patches early in the pipeline (Archetype A), or (ii) minimizing quantum footprint by restricting the circuit to the final decision stage (Archetype B). This polarization is not just a stylistic choice—it has direct consequences for (a) scaling laws (patch-wise circuits scale with image resolution), (b) noise sensitivity (measurement noise accumulates over many circuit calls), and (c) attribution clarity (decision-stage circuits make it easier to isolate what the quantum block contributes).
In contrast, Archetype C is represented by only one study. This is notable because intermediate feature generation is arguably the most “balanced” strategy under NISQ constraints: the quantum module can act as a nonlinear representation shaper without incurring patch-wise multiplicative costs. The limited adoption of Archetype C suggests an open methodological gap: the community has not yet systematically evaluated whether intermediate quantum-feature generation can offer a better trade-off between expressivity, stability, and computational feasibility than the two dominant extremes.
4.5. Comparative Summary of Hybrid Quantum–Classical Models
After structurally mapping studies into archetypes (
Table 2), the next step is to quantify how design decisions translate into measurable outcomes. In hybrid pipelines, reported accuracy cannot be interpreted in isolation, because it is confounded by task formulation (binary vs. multi-class), encoding stability, qubit budget, circuit depth, and the degree of classical dominance. Therefore, this subsection provides an integrated comparison that jointly reports diagnostic outcomes and quantum resource characteristics, enabling a survey-level assessment of feasibility under NISQ constraints and identifying which architectural patterns are consistently associated with strengths or failure modes.
Three survey-level insights are reinforced by
Table 3:
Binary screening is structurally “easier” for current hybrids than multi-class diagnosis. Across studies that evaluate both, binary performance is consistently higher. This suggests that today’s HQC design, especially patch-level feature transforms—primarily amplify coarse separability (COVID vs. non-COVID) rather than extracting representations that remain stable under fine-grained semantic overlap (e.g., COVID vs. pneumonia). In a review context, this is a crucial distinction: many high accuracies reported in HQC work are likely tied to binary formulations and do not necessarily generalize to clinically realistic differential diagnosis.
Quanvolution-based gains may be real, but they are coupled with a robustness penalty.
Archetype A models dominate headline binary performance, but their multi-class degradation suggests that patch-level quantum nonlinearities can enhance local texture cues without guaranteeing global discriminability. This is consistent with a system interpretation: local quantum feature maps may enrich early representations, but downstream classical aggregation is still required to learn global decision boundaries—and those boundaries become fragile under multi-class overlap.
Scaling qubits upward is not a free lunch under NISQ-era training dynamics.
QCNN-CT shows non-monotonic behavior (4 qubits outperform 8). From a survey perspective, this is exactly the type of pattern that should temper overly optimistic “more qubits → better performance” narratives. Larger circuits expand parameterization and can exacerbate optimization pathologies and measurement variance, offsetting any representational benefit.
Collectively,
Table 3 supports a conservative yet technically grounded conclusion for this survey section: current HQC models are best supported for binary screening in shallow, angle-encoded, small-qubit regimes, while multi-class robustness and scaling behavior remain unsettled.