Preprint
Review

This version is not peer-reviewed.

Impact of Preprocessing Parameters in Multimodality Imaging-Based Radiomic Studies: A Review

A peer-reviewed article of this preprint also exists.

Submitted:

24 June 2024

Posted:

25 June 2024

You are already at the latest version

Abstract
Lately, radiomic studies featuring the development of a signature to use in prediction models in diagnosis or prognosis outcomes are being published increasingly. While the results are shown to be promising, these studies still have many pitfalls and limitations. One of the main issues of this methodology is that radiomic features depend on how the images are preprocessed before their computation. Since, in widely known and used software for radiomic features calculation, it is possible to set these preprocessing parameters before the calculation of the radiomic feature, there are ongoing studies assessing the stability and repeatability of radiomic features to find the most suitable preprocessing parameters for every used imaging modality. This review examines the impact of preprocessing parameters on the reproducibility and reliability of radiomic features extracted from multimodality imaging (CT, MRI, CBCT, PET/CT). Standardized preprocessing is crucial for consistent radiomic feature extraction. Key preprocessing steps include voxel resampling, normalization, and discretization, which influence feature robustness and reproducibility. From our work, we highlighted the need to harmonize the use of preprocessing parameters and their values, especially in light of future studies of prospective studies, which are still lacking in the current literature.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

In recent years, there has been a notable increase in research examining the potential of quantitative imaging features to reveal additional information from medical images beyond what is perceptible to the human eye [1,2,3]. Handcrafted radiomics systematically extracts quantitative imaging features from medical images to interpret biological information [4,5]. The term "radiomics" yields over 11,100 studies on PubMed. This approach takes manually delineated regions of interest, such as tumors, on medical images and extracts a high number of quantitative features with pre-determined mathematical formulas [6]. One of the biggest advantages of radiomics is that those values can be extracted from medical images which can be normally acquired from the clinical pathway of the patient. In radiomics studies, the hypothesis is that radiomic features (RFs) can act as clinical biomarkers, individually or in combination, even with clinical parameters [4].
Numerous investigations have explored and documented RFs' ability to predict clinical outcomes, including both a prognostic aim (overall survival, treatment response) as well as diagnostic aim (e.g., differential diagnosis) [7,8,9,10,11,12,13,14,15,16]. These studies highlight the potential clinical application of radiomics, offering non-invasive, reliable, accessible, and cost-effective alternatives to invasive procedures like tissue biopsies. Moreover, with proper implementation, radiomics could provide useful predictions to help the clinician and reduce their workload [17,18].
As the field of radiomics advanced, there has been increasing attention to its limitations [5,19]. The primary identified limitation is the sensitivity of RFs to variations in image acquisition and reconstruction parameters [20,21,22,23,24,25]. To be effective clinical biomarkers, RFs must demonstrate reproducibility across different imaging parameters for generalization [25]. However, numerous studies have documented differences in imaging acquisition and reconstruction parameters [25,26,27,28,29,30,31,32,33,34,35,36,37]. Moreover, the impact of variation in a single acquisition or reconstruction parameter on RF reproducibility depends on the individual feature [31,38]. Two works also highlight that reconstruction kernels notably affect RF reproducibility [21,37]. There are several works exploring various methods to address RF reproducibility across differently acquired scans, as well as how to standardize them.
Image preprocessing occurs between the image segmentation and feature extraction stages. Preprocessing steps involve pixel spacing, grey-level intensities, bins of the grey-level histogram, and similar parameters, ensuring a lighter computational burden when extracting radiomic features. Preliminary findings indicate that the repeatability of extracted radiomic features is significantly influenced by the image preprocessing settings employed [39,40,41,42,43,44]. Usually, the acquisition and reconstructed voxels are not isotropic, especially if no voxel re-segmentation is applied (e.g., CT scans are usually not isotropic at acquisition). However, it is possible to achieve isotropic ones using an interpolation typically available in the software used for radiomic feature calculation.
Normalization or range re-segmentation is used to remove voxels in the segmented volume of interest (VOI) that fall outside a specific grey-level range. The normalization technique is commonly used to ensure data consistency, especially for imaging modalities with calibrated units. In fact, normalization is typically necessary for CT and PET to exclude voxels representing air or bone within a tumor VOI. However, this method does not apply to MRI data, which employs arbitrary intensity units instead. In this case, intensity outlier filtering is applied by calculating the mean (µ) and the standard deviation (σ) of grey levels within the VOI and excluding grey levels outside the range of µ ± n σ, where n is usually chosen to 3 [44,45,46,47]. Furthermore, other relative normalizations have been proposed to determine MRI radiomic feature robustness [48,49]. Discretization involves grouping the original gray-level values into specific range intervals (bins), similar to creating a histogram. This step makes the calculation of features manageable [44].
This work focused on gathering information about voxel resampling, normalization, and discretization preprocessing parameters from our included studies. We also investigated the study aim, anatomic district, and comparison metrics to better understand the up-to-date overview of the radiomic studies. This review aims to provide an up-to-date and comprehensive overview of the most commonly used preprocessing techniques.

2. Materials and Methods

This systematic review followed the recommendations of the PRISMA-P (Preferred Reporting Items for Systematic Review and Meta-Analysis Protocols) 2020 [50]. The inclusion criteria for the records in the review process were the presence of specific preprocessing parameters evaluated in CBCT, CT, MRI, and PET/CT radiomic studies.
We performed a comprehensive literature search using four electronic databases: PubMed, Cochrane Library, Embase, and Scopus. Since radiomic studies are relatively recent, we conducted the literature search from January 2008 until December 2023. Mesh terms and free text were modeled in search strategies for databases using the one designed for PubMed: (((("Positron-Emission Tomography" [Mesh]) OR "Tomography, X-Ray Computed" [Mesh]) OR "Magnetic Resonance Imaging" [Mesh] OR PET OR Positron emission tomography OR computed tomography OR CT OR MRI OR magnetic resonance) AND (radiomics)) AND (preprocessing OR preprocessing). The study was exclusively confined to English-language research without publication status restrictions. The authors did not contact other institutions or researchers to uncover additional studies.
Two reviewers independently screened records by reviewing their titles and abstracts and extracting general study characteristics (such as study title, first author's name, publication year, journal, abstract, corresponding reviewer, and research keywords) for records meeting the inclusion criteria, using a customized data extraction form. In this first step, the authors independently used the ASReview tool [40] to speed up the article selection. The authors compared their decisions, resolving disagreements until a consensus was reached. Records lacking information on CT parameters were excluded, and any eligible studies with full-text versions were included in the review process, while conference proceedings and monographs were disregarded. Additional articles not meeting the inclusion criteria upon full-text reading were identified and excluded at this stage. Reviewers independently selected full-text articles, resolving any uncertainties by reaching a consensus.
From each selected full-text article, the following parameters were identified and extracted:
  • acquisition modality (CBCT, CT, MRI, PET/CT);
  • number of patients or phantom;
  • disease/s name (if appropriate);
  • equipment vendor and model;
  • presence of acquisition parameters;
  • the total number of features;
  • type of features subsampled in FO (first order), SM (shape metric), and TA textural features;
  • type of software used in the radiomic feature extraction;
  • image filtering used (Y/N, if Y the type was reported);
  • voxel resampling;
  • normalization process;
  • discretization technique;
  • retrospective study (Y/N or NA);
  • statistical analysis: intraclass correlation coefficient (ICC), concordance correlation coefficient (CCC), area under the receiver operation curve (AUC), Mean, Average percentage difference, Relative difference, Spearman correlation, Kolmogorov Smirnov, double sample test, and two-way ANOVA;
  • type of study (reproducibility/repeatability/both or best performance);
  • main findings.
No contact with the authors of the records for complementary information was necessary.
Quality scoring for study selection was not utilized. The reviewers reviewed articles in duplicate, resolving disagreements by achieving an agreement. Data from the studies were standardized to mitigate potential biases. Data extracted from studies that did not meet the criteria for standardization were excluded.
This review aimed to describe the most used preprocessing pipeline in the radiomic workflow of the CBCT, CT, MRI, and PET/CT studies. It also sought to find possible standard radiomic preprocessing setups and describe a potential evolution over time, aiming to uncover any increased awareness of the technique's standardization.
Regarding voxel resampling, we counted the number of works that used: single isotropic, multiple isotropic, and non-isotropic voxel interpolation. When the original voxel size was used, we put "N.A.", and when the information was not indicated, "None".
Concerning the normalization strategies, we counted the number of papers that chose absolute, relative, or combination strategies. When this information was not applied, we set "None". A normalization strategy enhances the robustness and reliability of the radiomic analysis. The absolute normalization strategy involves standardizing the intensity values of imaging by transforming them to a common scale based on absolute reference points. Conversely, relative normalization concerns adjusting the intensity values of images relative to a certain reference within each individual image or dataset; this technique often includes scaling the pixel values based on the mean (µ), median, or a specific percentile of the intensity values in the image. This approach accounts for differences in overall intensity levels between images by normalizing the intensities relative to their own distribution.
In terms of discretization strategies, we identified the number of studies that utilized: a bin number (BN), a bin width (BW), or both the bin number and width (BN + BW) approach. When this information was not applied or not specified, we set "None". The discretization strategy involves adjusting values to standardize the feature extraction process across different images or imaging modalities. The BN discretization ensures that the radiomic features are comparable and consistent by transforming the intensity values into a fixed number of discrete bins. The BW method assures that each bin represents a fixed range of intensity values. The BN + BW strategy is a dual approach that guarantees the simultaneous adjustment of the number of bins and the width of each bin used to discretize image intensity values.
In addition, we investigated the study aims, the anatomical districts considered, and the metric used within the works. Concerning the study aims, we counted how many studies used best performance, repeatability, or both strategies. The anatomical districts were categorized into "abdomen", "brain", "thorax", and "pelvis" for simplicity's sake. When no anatomical part was specified, e.g., for phantom studies, we set "N.A.". Moreover, the metric indexes were grouped into ICC, CCC, AUC, or "Other" when different statistical metrics were used.
We used the QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies 2) tool to investigate the risk of bias [51]. The tool evaluates four key domains: patient selection, index test, reference standard, and flow and timing. Each domain is evaluated for the risk of bias and applicability. We used this tool only for the first two domains because reference standards and flow and timing are not defined in this review's field. Regarding patient selection's risk of bias, we checked the method in doing it (lower marks were given in case of inappropriate exclusions or not consecutive/random selection), while the applicability was evaluated if the included patients matched the typical clinical scenario.
We considered the index test the metric used in the result comparison. In this context, the risk of bias consisted of consistently evaluating the calculation/use of the aforementioned metric, and the applicability was referred to the methodology of its use in practice.

3. Results

3.1. Literature Search

The computer-assisted search across PubMed, Cochrane Library, Embase, and Scopus yielded 546 records (as shown in Table 1). Following the elimination of duplicate entries, 459 unique records remained. Subsequently, a screening process based on the coherence of the title and abstract reduced the number of articles to 72 (refer to Figure 1).
Reviews and conference proceedings were excluded from further consideration. Consequently, 53 full-text articles underwent evaluation by the reviewers to determine their eligibility. Ten articles were excluded from the database for the following reasons: six lacked preprocessing results, one focused on a modality outside the scope of this review, two utilized only a digital phantom, and one was unfound.

3.2. Data Collection and Elaboration

The selected papers were grouped by considering the acquisition modality: one about CBCT, 12 about CT, 26 about MRI, and five about PET/CT.
CT studies were the most represented from 2013 to 2023. MRI works denoted an increment of publications over the years, starting from 1 in 2017 to 9 in 2023. PET/CT studies were published in 2019 (2) and 2021 (3).
In 2017, CT phantom studies were published to study the stability and reproducibility of radiomic features using phantoms made of different materials compared to those traditionally used in quality control (QC) [39,52,53].
The number of patients has increased, although not monotonically, from 256 in 2013 to 3,000 in 2023. The modality that saw the greatest increase was MRI, demonstrating the growing interest of the scientific community in this type of imaging.

3.2.1. Acquisition Parameters Presence and Voxel Resampling

Table 2 shows the studies reporting the acquisition parameters and the voxel resampling information. Only in PET and CBCT did we find that most papers used isotropic voxel resampling, while in CT and MRI, only 41.7% and 41.3% used it, respectively.
For CT and CBCT, the most studied voxel resampling interpolation was 1 x 1 x 1 mm3. Furthermore, a significant number of studies used 1 x 1 x1 mm3 in MRI, but more voxel sizes were investigated in the range of 0.9 – 4.8 mm3. This result is expected because in MRI the image characteristics are strongly influenced by the acquisition protocols.
For PET/CT, voxel resampling dimensions were in the range of 1 – 4 mm3. This imaging modality, which employs a small resampling dimension, might introduce biases due to its intrinsic resolution (around non-isotropic 3 - 5 mm3).

3.2.2. Normalization Strategies

Table 3 reports the normalization strategies adopted by the authors, considering the different modalities. In CT and PET/CT, most works did not use a normalization strategy, while in MRI, relative normalization was the most used, as expected. In CBCT, a relative strategy was used because the image grey levels are not in Hounsfield but relative units.

3.2.3. Discretization Strategies

Most studies opted for a discretization strategy: 75% in CT, 100% in CBCT, 92.3% in MRI, and 100% in PET/CT. The combination of BN + BW was the most represented in this latter modality (Table 4).
In CT, the most investigated BNs were 32, 64, and 128, while BW ranged between 5 and 50 Hounsfield units (HU). In CBCT, the only work selected used BN equal to 64, 128, and 256.
In MRI, the BN = 32 was the most used bin number, and eleven works studied it (42.3%), followed by BN = 64 and 128 (38.5% of the papers).
In PET/CT, the most studied BN were 32 and 64, while BW typically ranged from 0.01 to 0.5 SUV.

3.2.4. Study Aims

Table 5 shows the count of study aims for each imaging modality.
Most works employing CT and MRI aimed to study reproducibility and its association with repeatability. It is worth noting that, since 2021, best performance has also been well investigated. Moreover, the CBCT work studied the aim of repeatability and reproducibility. PET/CT works consisted of 3 works about repeatability published in 2019 and 2021 and two works about best performance issued in 2021.

3.2.5. Anatomic District

Table 6 reports the anatomic districts categorized in the main four body regions. The CT studies focused on the trunk's anatomical regions (chest, abdomen, and pelvis), while MRI studies were polarized between the brain and pelvis. In addition, CBCT and PET/CT investigations were centered on the pelvis or chest.

3.2.5. Comparison Metrics

Table 7 illustrates the most common metrics used in the result comparison. In absolute terms, the most used metric was ICC, which is used predominantly in MRI, with 15 out of 26 works. Some works used more than one metric, so the number of papers reported in Tab. 7 is more than the number of works included.

3.3. Risk of Bias Analysis

The risk of bias assessment revealed that about 35% of the studies had a low risk of bias across the patient selection domain, while the remainder did not specify or were not clearly specified. In addition, 60% had a low risk of bias across the index test. The patient selection and index test applicability was higher than 72% (Figure 2).

4. Discussion

The volume of published studies on predictive modeling using radiomic features has been proliferating. However, no global consensus exists on which features are consistently repeatable and reproducible. This lack of agreement could potentially hinder future discussions on clinical applicability and the feasibility of prospective multi-institutional external validation trials.
This review's main objective was to identify preprocessing strategies for radiomic features in the four primary 3D imaging modalities: CBCT, CT, MRI, and PET. It also aimed to identify common preprocessing parameter settings to evaluate the current state of the art in this field and raise awareness, thereby promoting more stable and reproducible results. In fact, the preprocessing technique has the greatest impact on feature reproducibility [57].
In 2017, CT phantom studies were published to study the stability and reproducibility of radiomic features using phantoms made of materials different from those traditionally used in QC [39,52,53]. Moreover, Palani et al. [62] published a phantom study in 2023, highlighting the scientific interest in this topic.
This review showed how the choice of 1 x 1 x 1 mm3 could be a feasible starting point for CT and MRI examinations if the original voxel size parameters are not too different from these values. On the other hand, for CT, the dimension related to slice thickness (the third dimension) is notably larger compared to the other two, which in turn depend only on the FOV and reconstruction matrix size (typically 512 x 512). Also, for this reason, Larue et al. [52] and Shafiq-ul Hassan [39] studied 1 x 1 x 3 and 1 x 1 x 2 mm3, respectively. In MRI, however, the discussion is broader. From a radiomic analysis perspective, a volumetric acquisition might be the best choice to introduce minimal bias in voxel resampling. However, this choice is not necessarily optimal for all anatomical regions/lesions, and in any case, it does not apply to retrospective studies. For PET/CT, it seems clear that an isotropic voxel of approximately 3 x 3 x 3 or 4 x 4 x 4 mm3 is a widely accepted choice and a good compromise for obtaining comparable results. The challenges in isotropic reconstruction are similar to those previously described for CT, even though they relate to different physical processes.
A limitation of this review is that it did not delve into the issue of the resampling function used (e.g., linear, bicubic, nearest neighbor interpolation, etc.) because the studies were too varied to obtain a meaningful result. Larue et al. [52] demonstrated that linear interpolation resulted in the narrowest range of feature values for nearly half of the features, and cubic interpolation did so for a smaller portion. In contrast, nearest neighbor interpolation produced the broadest range for most CT study features.
Different techniques have been considered for normalization strategies. Starting from CT, Larue et al. [52] studied the influence of BW from 5 to 50 HU spaced by 5 HU. Fave et al. [60] preprocessed the CT images with 8-bit depth resampling using a 16 HU bin width. Kolossváry et al. [59] compared two equally sized bins and equally probable bins where each bin contains a proportion of the data, finding that all GLRLM features were significantly affected by binning type; besides, BNs significantly affected the values for all GLCM and GLRLM. Gray level resampling affects only second and higher-order radiomics features, while voxel size variation can influence first-, second-, and higher-order features [39]. The optimal discretization methods for feature inter- and intra-sample reproducibility depend on the imaging modality. The choice of discretization significantly affects intensity distributions, feature values, and reproducibility [45]. Defining only bin number or width is insufficient, as these two quantities are related to the maximum number of gray levels used. The IBSI group mentions preprocessing recommendations in their work [45]. In modalities that use calibrated imaging intensity units (e.g., CT and PET/CT), when a re-segmentation range is defined, either fixed bin size or fixed bin width can be defined. If no re-segmentation range is used in calibrated imaging modalities, then fixed bin number is recommended. When non-calibrated imaging modality (e.g., MRI) is employed, they recommend using fixed bin number method without an intensity re-segmentation. The issue when using BN is that it introduces a normalization effect, directly comparing feature values across patients, whereas BW preserves the relationship between PET units and their physical meaning. Xu et al. [90] indicate that the BN discretization scheme achieves relatively high stability compared to BW in line with IBSI recommendation [45], contrasting with Pfaehler et al. [28].
Most CT and MRI studies have focused on reproducibility and its relationship with repeatability. Notably, since 2021, there has been significant investigation into achieving optimal performance. Additionally, CBCT studies have examined both repeatability and reproducibility. PET/CT research includes three studies on repeatability published in 2019 and 2021, and two studies on optimal performance published in 2021. The selection of the best performance as the criterion for evaluating the quality of a model (including the choice of preprocessing parameters) should be limited, as this choice could introduce biases.
About the anatomical part investigated, CT studies focused on the torso's anatomical regions (chest, abdomen, and pelvis), while MRI studies regarded the brain and pelvis regions. Additionally, CBCT and PET/CT investigations were centered on the pelvis or chest. The distribution of the anatomical regions per imaging modality aligns with what is expected in a therapeutic pathway (e.g., using MRI for further investigation of brain or prostate/cervix due to its capability to distinguish between soft tissues compared to CT).
Concerning comparison metrics, the most commonly used metric was ICC, predominantly applied in MRI studies, with 15 out of 26 works. ICC is extensively studied in radiomics research due to its role in quantitative assessment, reliability, wide applicability, and standard practice. It serves as a quantitative measure to gauge agreement or consistency between measurements or observations, crucial for evaluating the reproducibility of radiomic features. ICC evaluates measurement reliability by accounting for systematic and random variations, making it effective for assessing feature stability across different scans or observers. Its versatility extends to various data types, including continuous and categorical variables, enabling the evaluation of different radiomic features across diverse imaging modalities and patient groups. Overall, ICC is favored in radiomics for its comprehensive and standardized approach to assessing feature reproducibility and stability, thereby supporting the development and validation of radiomics as a quantitative imaging biomarker. CCC is another essential statistical metric used in radiomics research to evaluate measurement agreement and reliability. It quantifies the agreement between two sets of continuous measurements, assessing precision (how closely measurements cluster around the line of perfect agreement) and accuracy (how close measurements are to the line of identity), offering a comprehensive measure of agreement that is easy to interpret. Like ICC, CCC accounts for systematic bias (shift) and random variation (spread) between measurements. This makes it suitable for evaluating the reproducibility and stability of radiomic features across different scans or observers. AUC quantifies the overall discriminatory ability of a diagnostic test or biomarker. In radiomics, this metric assesses the ability of radiomic features or models to distinguish between different clinical conditions or outcomes based on imaging data. Higher AUC values indicate the better discriminatory performance of the radiomic feature or model.
From our investigated studies, it was not possible to identify a recommended set of preprocessing parameters or a subset of stable features due to the variability in the experimental setups. However, higher-order class features (e.g., GLRM, GLCM-based) are more affected by preprocessing parameters as well as acquisition and reconstruction parameters, so they should be used with caution when dealing with retrospective multicentric studies. When building a predictive model, first-order class features should be preferred for inclusion should they show a correlation with the chosen outcome, especially if a preprocessing strategy is not employed.
Regarding the risk of bias analysis, it was not possible to assess it for the majority of the studies for patient selection. Since most of the radiomic studies are retrospective, it is not possible to understand how the patient selection process might affect the results. Many works showed low concern for risk of bias in the index test, due to the widely accepted choice of well-known statistical indexes, already broadly used in literature to test repeatability even in other scientific fields. Regarding applicability, most studies raised low concern for both patient selection and index test.
From our analysis, we also found out that a significant number of our included studies pursued best model performance in order to find the optimal preprocessing parameters. While this is still a feasible approach given the early stage of literature and research in this field, it would be advised to strive toward a standardization that is based on the feature stability rather than individual model performances as these are influenced by other variables (such as training/validation datasets).
From our analysis, we also found out that a significant number of our included studies pursued best model performance in order to find the optimal preprocessing parameters. While this is still a feasible approach given the early stage of literature and research in this field, it would be advised to strive toward a standardization that is based on the feature stability rather than individual model performances as these are influenced by other variables (such as training/validation datasets).
The problem of generalizability remains open, but using a preprocessing strategy improves the robustness of the analysis, especially when multiple vendors and machine models are included.

5. Conclusions

To date, it does not seem possible to establish a standardized recipe for the unequivocal selection of preprocessing parameter combinations for radiomic analysis. However, it is noteworthy that certain combinations of voxel resampling, normalization, and digitalization are more commonly used than others.
For reproducibility, a study must report the sequence of preprocessing parameters used so that the methodology employed is evident and transparent.
From our work, we highlighted the need to harmonize the use of preprocessing parameters and their values, especially considering future studies of prospective studies, which are still lacking in the current literature.

Author Contributions

Conceptualization, MB and VT; methodology, MB and VT; formal analysis, MB and TV; investigation, MCB; resources, MCB; data curation, MB, TV, and LV; writing—original draft preparation, VT; writing—review and editing, MB, VT and LV; visualization, MB; supervision, MB; project administration, VT. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable

Informed Consent Statement

Not applicable.

Data Availability Statement

The data were obtained from a computer-assisted search of PubMed, Cochrane Library, Embase, and Scopus.

Acknowledgments

This study was partially supported by Italian Ministry of Health – Ricerca Corrente Annual Program 2024.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gillies, R.J.; Kinahan, P.E.; Hricak, H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016, 278, 563–577. [Google Scholar] [CrossRef] [PubMed]
  2. Walsh, S.; De Jong, E.E.C.; Van Timmeren, J.E.; Ibrahim, A.; Compter, I.; Peerlings, J.; Sanduleanu, S.; Refaee, T.; Keek, S.; Larue, R.T.H.M.; et al. Decision Support Systems in Oncology. JCO Clinical Cancer Informatics 2019, 1–9. [Google Scholar] [CrossRef]
  3. Shur, J.; Blackledge, M.; D’Arcy, J.; Collins, D.J.; Bali, M.; O’Leach, M.; Koh, D.-M. MRI Texture Feature Repeatability and Image Acquisition Factor Robustness, a Phantom Study and in Silico Study. Eur Radiol Exp 2021, 5, 2. [Google Scholar] [CrossRef] [PubMed]
  4. Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; Van Stiphout, R.G.P.M.; Granton, P.; Zegers, C.M.L.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting More Information from Medical Images Using Advanced Feature Analysis. European Journal of Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef] [PubMed]
  5. Yip, S.S.F.; Aerts, H.J.W.L. Applications and Limitations of Radiomics. Phys. Med. Biol. 2016, 61, R150–R166. [Google Scholar] [CrossRef] [PubMed]
  6. Hosny, A.; Aerts, H.J.; Mak, R.H. Handcrafted versus Deep Learning Radiomics for Prediction of Cancer Therapy Response. The Lancet Digital Health 2019, 1, e106–e107. [Google Scholar] [CrossRef] [PubMed]
  7. Aerts, H.J.W.L.; Velazquez, E.R.; Leijenaar, R.T.H.; Parmar, C.; Grossmann, P.; Carvalho, S.; Bussink, J.; Monshouwer, R.; Haibe-Kains, B.; Rietveld, D.; et al. Decoding Tumour Phenotype by Noninvasive Imaging Using a Quantitative Radiomics Approach. Nat Commun 2014, 5, 4006. [Google Scholar] [CrossRef] [PubMed]
  8. Bae, S.; Choi, Y.S.; Ahn, S.S.; Chang, J.H.; Kang, S.-G.; Kim, E.H.; Kim, S.H.; Lee, S.-K. Radiomic MRI Phenotyping of Glioblastoma: Improving Survival Prediction. Radiology 2018, 289, 797–806. [Google Scholar] [CrossRef] [PubMed]
  9. Oikonomou, A.; Khalvati, F.; Tyrrell, P.N.; Haider, M.A.; Tarique, U.; Jimenez-Juan, L.; Tjong, M.C.; Poon, I.; Eilaghi, A.; Ehrlich, L.; et al. Radiomics Analysis at PET/CT Contributes to Prognosis of Recurrence and Survival in Lung Cancer Treated with Stereotactic Body Radiotherapy. Sci Rep 2018, 8, 4003. [Google Scholar] [CrossRef] [PubMed]
  10. Wu, W.; Parmar, C.; Grossmann, P.; Quackenbush, J.; Lambin, P.; Bussink, J.; Mak, R.; Aerts, H.J.W.L. Exploratory Study to Identify Radiomics Classifiers for Lung Cancer Histology. Front. Oncol. 2016, 6. [Google Scholar] [CrossRef] [PubMed]
  11. Blüthgen, C.; Patella, M.; Euler, A.; Baessler, B.; Martini, K.; Von Spiczak, J.; Schneiter, D.; Opitz, I.; Frauenfelder, T. Computed Tomography Radiomics for the Prediction of Thymic Epithelial Tumor Histology, TNM Stage and Myasthenia Gravis. PLoS ONE 2021, 16, e0261401. [Google Scholar] [CrossRef] [PubMed]
  12. E, L.; Lu, L.; Li, L.; Yang, H.; Schwartz, L.H.; Zhao, B. Radiomics for Classification of Lung Cancer Histological Subtypes Based on Nonenhanced Computed Tomography. Academic Radiology 2019, 26, 1245–1252. [Google Scholar] [CrossRef] [PubMed]
  13. Stefan, P.-A.; Puscas, M.E.; Csuak, C.; Lebovici, A.; Petresc, B.; Lupean, R.; Mihu, C.M. The Utility of Texture-Based Classification of Different Types of Ascites on Magnetic Resonance. J BUON 2020, 25, 1237–1244. [Google Scholar] [PubMed]
  14. Csutak, C.; Ștefan, P.-A.; Lupean, R.-A.; Lenghel, L.M.; Mihu, C.M.; Lebovici, A. Computed Tomography in the Diagnosis of Intraperitoneal Effusions: The Role of Texture Analysis. Bosn J Basic Med Sci 2021, 21, 488–494. [Google Scholar] [CrossRef] [PubMed]
  15. Horvat, N.; Veeraraghavan, H.; Khan, M.; Blazic, I.; Zheng, J.; Capanu, M.; Sala, E.; Garcia-Aguilar, J.; Gollub, M.J.; Petkovska, I. MR Imaging of Rectal Cancer: Radiomics Analysis to Assess Treatment Response after Neoadjuvant Therapy. Radiology 2018, 287, 833–843. [Google Scholar] [CrossRef] [PubMed]
  16. Tharmalingam, H.; Tsang, Y.M.; Alonzi, R.; Beasley, W.; Taylor, N.J.; McWilliam, A.; Padhani, A.; Choudhury, A.; Hoskin, P.J. Changes in Magnetic Resonance Imaging Radiomic Features in Response to Androgen Deprivation Therapy in Patients with Intermediate- and High-Risk Prostate Cancer. Clin Oncol (R Coll Radiol) 2022, 34, e246–e253. [Google Scholar] [CrossRef] [PubMed]
  17. Ștefan, P.-A.; Lupean, R.-A.; Mihu, C.M.; Lebovici, A.; Oancea, M.D.; Hîțu, L.; Duma, D.; Csutak, C. Ultrasonography in the Diagnosis of Adnexal Lesions: The Role of Texture Analysis. Diagnostics (Basel) 2021, 11, 812. [Google Scholar] [CrossRef]
  18. Ștefan, R.-A.; Ștefan, P.-A.; Mihu, C.M.; Csutak, C.; Melincovici, C.S.; Crivii, C.B.; Maluțan, A.M.; Hîțu, L.; Lebovici, A. Ultrasonography in the Differentiation of Endometriomas from Hemorrhagic Ovarian Cysts: The Role of Texture Analysis. JPM 2021, 11, 611. [Google Scholar] [CrossRef] [PubMed]
  19. Lohmann, P.; Bousabarah, K.; Hoevels, M.; Treuer, H. Radiomics in Radiation Oncology-Basics, Methods, and Limitations. Strahlenther Onkol 2020, 196, 848–855. [Google Scholar] [CrossRef]
  20. Ibrahim, A.; Primakov, S.; Beuque, M.; Woodruff, H.C.; Halilaj, I.; Wu, G.; Refaee, T.; Granzier, R.; Widaatalla, Y.; Hustinx, R.; et al. Radiomics for Precision Medicine: Current Challenges, Future Prospects, and the Proposal of a New Framework. Methods 2021, 188, 20–29. [Google Scholar] [CrossRef] [PubMed]
  21. Mali, S.A.; Ibrahim, A.; Woodruff, H.C.; Andrearczyk, V.; Müller, H.; Primakov, S.; Salahuddin, Z.; Chatterjee, A.; Lambin, P. Making Radiomics More Reproducible across Scanner and Imaging Protocol Variations: A Review of Harmonization Methods. JPM 2021, 11, 842. [Google Scholar] [CrossRef] [PubMed]
  22. Midya, A.; Chakraborty, J.; Gönen, M.; Do, R.K.G.; Simpson, A.L. Influence of CT Acquisition and Reconstruction Parameters on Radiomic Feature Reproducibility. J Med Imaging (Bellingham) 2018, 5, 011020. [Google Scholar] [CrossRef] [PubMed]
  23. Reiazi, R.; Abbas, E.; Famiyeh, P.; Rezaie, A.; Kwan, J.Y.Y.; Patel, T.; Bratman, S.V.; Tadic, T.; Liu, F.-F.; Haibe-Kains, B. The Impact of the Variation of Imaging Parameters on the Robustness of Computed Tomography Radiomic Features: A Review. Comput Biol Med 2021, 133, 104400. [Google Scholar] [CrossRef] [PubMed]
  24. Espinasse, M.; Pitre-Champagnat, S.; Charmettant, B.; Bidault, F.; Volk, A.; Balleyguier, C.; Lassau, N.; Caramella, C. CT Texture Analysis Challenges: Influence of Acquisition and Reconstruction Parameters: A Comprehensive Review. Diagnostics (Basel) 2020, 10. [Google Scholar] [CrossRef] [PubMed]
  25. Zhao, B. Understanding Sources of Variation to Improve the Reproducibility of Radiomics. Front Oncol 2021, 11, 633176. [Google Scholar] [CrossRef] [PubMed]
  26. Shiri, I.; Abdollahi, H.; Shaysteh, S.; Rabi Mahdavi, S. Test-Retest Reproducibility and Robustness Analysis of Recurrent Glioblastoma MRI Radiomics Texture Features. Iran J Radiol 2017. Special iss. [Google Scholar] [CrossRef]
  27. Peerlings, J.; Woodruff, H.C.; Winfield, J.M.; Ibrahim, A.; Van Beers, B.E.; Heerschap, A.; Jackson, A.; Wildberger, J.E.; Mottaghy, F.M.; DeSouza, N.M.; et al. Stability of Radiomics Features in Apparent Diffusion Coefficient Maps from a Multi-Centre Test-Retest Trial. Sci Rep 2019, 9, 4800. [Google Scholar] [CrossRef] [PubMed]
  28. Pfaehler, E.; Beukinga, R.J.; Jong, J.R.; Slart, R.H.J.A.; Slump, C.H.; Dierckx, R.A.J.O.; Boellaard, R. Repeatability of 18 F- FDG PET Radiomic Features: A Phantom Study to Explore Sensitivity to Image Reconstruction Settings, Noise, and Delineation Method. Med. Phys. 2019, 46, 665–678. [Google Scholar] [CrossRef] [PubMed]
  29. Prayer, F.; Hofmanninger, J.; Weber, M.; Kifjak, D.; Willenpart, A.; Pan, J.; Röhrich, S.; Langs, G.; Prosch, H. Variability of Computed Tomography Radiomics Features of Fibrosing Interstitial Lung Disease: A Test-Retest Study. Methods 2021, 188, 98–104. [Google Scholar] [CrossRef] [PubMed]
  30. Zhao, B.; Tan, Y.; Tsai, W.-Y.; Qi, J.; Xie, C.; Lu, L.; Schwartz, L.H. Reproducibility of Radiomics for Deciphering Tumor Phenotype with Imaging. Sci Rep 2016, 6, 23428. [Google Scholar] [CrossRef] [PubMed]
  31. Ibrahim, A.; Refaee, T.; Primakov, S.; Barufaldi, B.; Acciavatti, R.J.; Granzier, R.W.Y.; Hustinx, R.; Mottaghy, F.M.; Woodruff, H.C.; Wildberger, J.E.; et al. The Effects of In-Plane Spatial Resolution on CT-Based Radiomic Features’ Stability with and without ComBat Harmonization. Cancers (Basel) 2021, 13. [Google Scholar] [CrossRef] [PubMed]
  32. Zhovannik, I.; Bussink, J.; Traverso, A.; Shi, Z.; Kalendralis, P.; Wee, L.; Dekker, A.; Fijten, R.; Monshouwer, R. Learning from Scanners: Bias Reduction and Feature Correction in Radiomics. Clin Transl Radiat Oncol 2019, 19, 33–38. [Google Scholar] [CrossRef] [PubMed]
  33. Ibrahim, A.; Refaee, T.; Primakov, S.; Barufaldi, B.; Acciavatti, R.J.; Granzier, R.W.Y.; Hustinx, R.; Mottaghy, F.M.; Woodruff, H.C.; Wildberger, J.E.; et al. Reply to Orlhac, F.; Buvat, I. Comment on “Ibrahim et al. The Effects of In-Plane Spatial Resolution on CT-Based Radiomic Features’ Stability with and without ComBat Harmonization. Cancers 2021, 13, 1848”. Cancers (Basel) 2021, 13. [Google Scholar] [CrossRef] [PubMed]
  34. Ibrahim, A.; Lu, L.; Yang, H.; Akin, O.; Schwartz, L.H.; Zhao, B. The Impact of Image Acquisition Parameters and ComBat Harmonization on the Predictive Performance of Radiomics: A Renal Cell Carcinoma Model. Appl Sci (Basel) 2022, 12, 9824. [Google Scholar] [CrossRef] [PubMed]
  35. Park, J.E.; Park, S.Y.; Kim, H.J.; Kim, H.S. Reproducibility and Generalizability in Radiomics Modeling: Possible Strategies in Radiologic and Statistical Perspectives. Korean J Radiol 2019, 20, 1124–1137. [Google Scholar] [CrossRef] [PubMed]
  36. Meyer, M.; Ronald, J.; Vernuccio, F.; Nelson, R.C.; Ramirez-Giraldo, J.C.; Solomon, J.; Patel, B.N.; Samei, E.; Marin, D. Reproducibility of CT Radiomic Features within the Same Patient: Influence of Radiation Dose and CT Reconstruction Settings. Radiology 2019, 293, 583–591. [Google Scholar] [CrossRef] [PubMed]
  37. Lu, L.; Ehmke, R.C.; Schwartz, L.H.; Zhao, B. Assessing Agreement between Radiomic Features Computed for Multiple CT Imaging Settings. PLoS One 2016, 11, e0166550. [Google Scholar] [CrossRef] [PubMed]
  38. Ibrahim, A.; Refaee, T.; Leijenaar, R.T.H.; Primakov, S.; Hustinx, R.; Mottaghy, F.M.; Woodruff, H.C.; Maidment, A.D.A.; Lambin, P. The Application of a Workflow Integrating the Variable Reproducibility and Harmonizability of Radiomic Features on a Phantom Dataset. PLoS One 2021, 16, e0251147. [Google Scholar] [CrossRef] [PubMed]
  39. Shafiq-ul-Hassan, M.; Zhang, G.G.; Latifi, K.; Ullah, G.; Hunt, D.C.; Balagurunathan, Y.; Abdalah, M.A.; Schabath, M.B.; Goldgof, D.G.; Mackin, D.; et al. Intrinsic Dependencies of CT Radiomic Features on Voxel Size and Number of Gray Levels. Med. Phys. 2017, 44, 1050–1062. [Google Scholar] [CrossRef] [PubMed]
  40. Wichtmann, B.D.; Harder, F.N.; Weiss, K.; Schönberg, S.O.; Attenberger, U.I.; Alkadhi, H.; Pinto Dos Santos, D.; Baeßler, B. Influence of Image Processing on Radiomic Features From Magnetic Resonance Imaging. Invest Radiol 2023, 58, 199–208. [Google Scholar] [CrossRef]
  41. Altazi, B.A.; Zhang, G.G.; Fernandez, D.C.; Montejo, M.E.; Hunt, D.; Werner, J.; Biagioli, M.C.; Moros, E.G. Reproducibility of F18-FDG PET Radiomic Features for Different Cervical Tumor Segmentation Methods, Gray-level Discretization, and Reconstruction Algorithms. J Applied Clin Med Phys 2017, 18, 32–48. [Google Scholar] [CrossRef] [PubMed]
  42. Bailly, C.; Bodet-Milin, C.; Couespel, S.; Necib, H.; Kraeber-Bodéré, F.; Ansquer, C.; Carlier, T. Revisiting the Robustness of PET-Based Textural Features in the Context of Multi-Centric Trials. PLoS ONE 2016, 11, e0159984. [Google Scholar] [CrossRef] [PubMed]
  43. Leijenaar, R.T.H.; Nalbantov, G.; Carvalho, S.; Van Elmpt, W.J.C.; Troost, E.G.C.; Boellaard, R.; Aerts, H.J.W.L.; Gillies, R.J.; Lambin, P. The Effect of SUV Discretization in Quantitative FDG-PET Radiomics: The Need for Standardized Methodology in Tumor Texture Analysis. Sci Rep 2015, 5, 11075. [Google Scholar] [CrossRef] [PubMed]
  44. Van Timmeren, J.E.; Cester, D.; Tanadini-Lang, S.; Alkadhi, H.; Baessler, B. Radiomics in Medical Imaging—“How-to” Guide and Critical Reflection. Insights Imaging 2020, 11, 91. [Google Scholar] [CrossRef]
  45. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-Based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef] [PubMed]
  46. Collewet, G.; Strzelecki, M.; Mariette, F. Influence of MRI Acquisition Protocols and Image Intensity Normalization Methods on Texture Classification. Magnetic Resonance Imaging 2004, 22, 81–91. [Google Scholar] [CrossRef] [PubMed]
  47. Vallières, M.; Freeman, C.R.; Skamene, S.R.; El Naqa, I. A Radiomics Model from Joint FDG-PET and MRI Texture Features for the Prediction of Lung Metastases in Soft-Tissue Sarcomas of the Extremities. Phys. Med. Biol. 2015, 60, 5471–5496. [Google Scholar] [CrossRef] [PubMed]
  48. Giannini, V.; Panic, J.; Regge, D.; Balestra, G.; Rosati, S. Could Normalization Improve Robustness of Abdominal MRI Radiomic Features? Biomed. Phys. Eng. Express 2023, 9, 055002. [Google Scholar] [CrossRef] [PubMed]
  49. Kremer, L.E.; Perri, N.; Sorber, E.; Chapman, A.; Armato, S.G. Normalization of MRI Signal Intensity in Polycystic Kidney Disease and the Effect on Radiomic Features. In Proceedings of the Medical Imaging 2022: Computer-Aided Diagnosis; Iftekharuddin, K.M., Drukker, K., Mazurowski, M.A., Lu, H., Muramatsu, C., Samala, R.K., Eds.; SPIE: San Diego, United States, April 4, 2022; p. 132. [Google Scholar]
  50. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. BMJ 2021, n71. [Google Scholar] [CrossRef]
  51. Whiting, P.F. QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies. Ann Intern Med 2011, 155, 529. [Google Scholar] [CrossRef] [PubMed]
  52. Larue, R.T.H.M.; Van Timmeren, J.E.; De Jong, E.E.C.; Feliciani, G.; Leijenaar, R.T.H.; Schreurs, W.M.J.; Sosef, M.N.; Raat, F.H.P.J.; Van Der Zande, F.H.R.; Das, M.; et al. Influence of Gray Level Discretization on Radiomic Feature Stability for Different CT Scanners, Tube Currents and Slice Thicknesses: A Comprehensive Phantom Study. Acta Oncologica 2017, 56, 1544–1553. [Google Scholar] [CrossRef] [PubMed]
  53. Mackin, D.; Fave, X.; Zhang, L.; Yang, J.; Jones, A.K.; Ng, C.S.; Court, L. Harmonizing the Pixel Size in Retrospective Computed Tomography Radiomics Studies. PLoS One 2017, 12, e0178524. [Google Scholar] [CrossRef] [PubMed]
  54. Stüber, A.T.; Coors, S.; Schachtner, B.; Weber, T.; Rügamer, D.; Bender, A.; Mittermeier, A.; Öcal, O.; Seidensticker, M.; Ricke, J.; et al. A Comprehensive Machine Learning Benchmark Study for Radiomics-Based Survival Analysis of CT Imaging Data in Patients With Hepatic Metastases of CRC. Invest Radiol 2023, 58, 874–881. [Google Scholar] [CrossRef] [PubMed]
  55. Demircioğlu, A. The Effect of Preprocessing Filters on Predictive Performance in Radiomics. Eur Radiol Exp 2022, 6, 40. [Google Scholar] [CrossRef] [PubMed]
  56. Linsalata, S.; Borgheresi, R.; Marfisi, D.; Barca, P.; Sainato, A.; Paiar, F.; Neri, E.; Traino, A.C.; Giannelli, M. Radiomics of Patients with Locally Advanced Rectal Cancer: Effect of Preprocessing on Features Estimation from Computed Tomography Imaging. BioMed Research International 2022, 2022, 1–21. [Google Scholar] [CrossRef] [PubMed]
  57. Au, R.C.; Tan, W.C.; Bourbeau, J.; Hogg, J.C.; Kirby, M. Impact of Image Pre-Processing Methods on Computed Tomography Radiomics Features in Chronic Obstructive Pulmonary Disease. Phys. Med. Biol. 2021, 66, 245015. [Google Scholar] [CrossRef] [PubMed]
  58. Nazari, M.; Shiri, I.; Zaidi, H. Radiomics-Based Machine Learning Model to Predict Risk of Death within 5-Years in Clear Cell Renal Cell Carcinoma Patients. Computers in Biology and Medicine 2021, 129, 104135. [Google Scholar] [CrossRef] [PubMed]
  59. Kolossváry, M.; Szilveszter, B.; Karády, J.; Drobni, Z.D.; Merkely, B.; Maurovich-Horvat, P. Effect of Image Reconstruction Algorithms on Volumetric and Radiomic Parameters of Coronary Plaques. Journal of Cardiovascular Computed Tomography 2019, 13, 325–330. [Google Scholar] [CrossRef]
  60. Fave, X.; Zhang, L.; Yang, J.; Mackin, D.; Balter, P.; Gomez, D.; Followill, D.; Jones, A.K.; Stingo, F.; Court, L.E. Impact of Image Preprocessing on the Volume Dependence and Prognostic Potential of Radiomics Features in Non-Small Cell Lung Cancer. Transl. Cancer Res. 2016, 5, 349–363. [Google Scholar] [CrossRef]
  61. Hunter, L.A.; Krafft, S.; Stingo, F.; Choi, H.; Martel, M.K.; Kry, S.F.; Court, L.E. High Quality Machine-Robust Image Features: Identification in Nonsmall Cell Lung Cancer Computed Tomography Images: Robust Quantitative Image Features. Med. Phys. 2013, 40, 121916. [Google Scholar] [CrossRef] [PubMed]
  62. Palani, D.; Ganesh, K.M.; Karunagaran, L.; Govindaraj, K.; Shanmugam, S. Statistical Analysis on Impact of Image Preprocessing of CT Texture Patterns and Its CT Radiomic Feature Stability: A Phantom Study. Asian Pac J Cancer Prev 2023, 24, 2061–2072. [Google Scholar] [CrossRef] [PubMed]
  63. Delgadillo, R.; Spieler, B.O.; Ford, J.C.; Kwon, D.; Yang, F.; Studenski, M.; Padgett, K.R.; Abramowitz, M.C.; Dal Pra, A.; Stoyanova, R.; et al. Repeatability of CBCT Radiomic Features and Their Correlation with CT Radiomic Features for Prostate Cancer. Med. Phys. 2021, 48, 2386–2399. [Google Scholar] [CrossRef] [PubMed]
  64. Li, Y.; Ammari, S.; Balleyguier, C.; Lassau, N.; Chouzenoux, E. Impact of Preprocessing and Harmonization Methods on the Removal of Scanner Effects in Brain MRI Radiomic Features. Cancers (Basel) 2021, 13. [Google Scholar] [CrossRef]
  65. McHugh, D.J.; Porta, N.; Little, R.A.; Cheung, S.; Watson, Y.; Parker, G.J.M.; Jayson, G.C.; O’Connor, J.P.B. Image Contrast, Image Pre-Processing, and T1 Mapping Affect MRI Radiomic Feature Repeatability in Patients with Colorectal Cancer Liver Metastases. Cancers 2021, 13, 240. [Google Scholar] [CrossRef] [PubMed]
  66. Carré, A.; Klausner, G.; Edjlali, M.; Lerousseau, M.; Briend-Diop, J.; Sun, R.; Ammari, S.; Reuzé, S.; Alvarez Andres, E.; Estienne, T.; et al. Standardization of Brain MR Images across Machines and Protocols: Bridging the Gap for MRI-Based Radiomics. Sci Rep 2020, 10, 12340. [Google Scholar] [CrossRef] [PubMed]
  67. Traverso, A.; Kazmierski, M.; Welch, M.L.; Weiss, J.; Fiset, S.; Foltz, W.D.; Gladwish, A.; Dekker, A.; Jaffray, D.; Wee, L.; et al. Sensitivity of Radiomic Features to Inter-Observer Variability and Image Pre-Processing in Apparent Diffusion Coefficient (ADC) Maps of Cervix Cancer Patients. Radiotherapy and Oncology 2020, 143, 88–94. [Google Scholar] [CrossRef] [PubMed]
  68. Um, H.; Tixier, F.; Bermudez, D.; Deasy, J.O.; Young, R.J.; Veeraraghavan, H. Impact of Image Preprocessing on the Scanner Dependence of Multi-Parametric MRI Radiomic Features and Covariate Shift in Multi-Institutional Glioblastoma Datasets. Phys. Med. Biol. 2019, 64, 165011. [Google Scholar] [CrossRef] [PubMed]
  69. Bleker, J.; Roest, C.; Yakar, D.; Huisman, H.; Kwee, T.C. The Effect of Image Resampling on the Performance of Radiomics-Based Artificial Intelligence in Multicenter Prostate MRI. Magnetic Resonance Imaging 2024, 59, 1800–1806. [Google Scholar] [CrossRef] [PubMed]
  70. Bologna, M.; Tenconi, C.; Corino, V.D.A.; Annunziata, G.; Orlandi, E.; Calareso, G.; Pignoli, E.; Valdagni, R.; Mainardi, L.T.; Rancati, T. Repeatability and Reproducibility of MRI-Radiomic Features: A Phantom Experiment on a 1.5 T Scanner. Med Phys 2023, 50, 750–762. [Google Scholar] [CrossRef] [PubMed]
  71. Marzi, C.; Marfisi, D.; Barucci, A.; Del Meglio, J.; Lilli, A.; Vignali, C.; Mascalchi, M.; Casolo, G.; Diciotti, S.; Traino, A.C.; et al. Collinearity and Dimensionality Reduction in Radiomics: Effect of Preprocessing Parameters in Hypertrophic Cardiomyopathy Magnetic Resonance T1 and T2 Mapping. Bioengineering 2023, 10, 80. [Google Scholar] [CrossRef]
  72. Marfisi, D.; Tessa, C.; Marzi, C.; Del Meglio, J.; Linsalata, S.; Borgheresi, R.; Lilli, A.; Lazzarini, R.; Salvatori, L.; Vignali, C.; et al. Image Resampling and Discretization Effect on the Estimate of Myocardial Radiomic Features from T1 and T2 Mapping in Hypertrophic Cardiomyopathy. Sci Rep 2022, 12, 10186. [Google Scholar] [CrossRef] [PubMed]
  73. Koçak, B.; Yüzkan, S.; Mutlu, S.; Karagülle, M.; Kala, A.; Kadıoğlu, M.; Solak, S.; Sunman, Ş.; Temiz, Z.H.; Ganiyusufoğlu, A.K. Influence of Image Preprocessing on the Segmentation-Based Reproducibility of Radiomic Features: In Vivo Experiments on Discretization and Resampling Parameters. dir 2023, 0, 0–0. [Google Scholar] [CrossRef] [PubMed]
  74. Brynolfsson, P.; Nilsson, D.; Torheim, T.; Asklund, T.; Karlsson, C.T.; Trygg, J.; Nyholm, T.; Garpebring, A. Haralick Texture Features from Apparent Diffusion Coefficient (ADC) MRI Images Depend on Imaging and Pre-Processing Parameters. Sci Rep 2017, 7, 4041. [Google Scholar] [CrossRef] [PubMed]
  75. Abunahel, B.M.; Pontre, B.; Petrov, M.S. Effect of Gray Value Discretization and Image Filtration on Texture Features of the Pancreas Derived from Magnetic Resonance Imaging at 3T. J. Imaging 2022, 8, 220. [Google Scholar] [CrossRef] [PubMed]
  76. Toffoli, T.; Saut, O.; Etchegaray, C.; Jambon, E.; Le Bras, Y.; Grenier, N.; Marcelin, C. Differentiation of Small Clear Renal Cell Carcinoma and Oncocytoma through Magnetic Resonance Imaging-Based Radiomics Analysis: Toward the End of Percutaneous Biopsy. JPM 2023, 13, 1444. [Google Scholar] [CrossRef] [PubMed]
  77. Abunahel, B.M.; Pontre, B.; Ko, J.; Petrov, M.S. Towards Developing a Robust Radiomics Signature in Diffuse Diseases of the Pancreas: Accuracy and Stability of Features Derived from T1-Weighted Magnetic Resonance Imaging. Journal of Medical Imaging and Radiation Sciences 2022, 53, 420–428. [Google Scholar] [CrossRef] [PubMed]
  78. Dewi, D.E.O.; Sunoqrot, M.R.S.; Nketiah, G.A.; Sandsmark, E.; Giskeødegård, G.F.; Langørgen, S.; Bertilsson, H.; Elschot, M.; Bathen, T.F. The Impact of Pre-Processing and Disease Characteristics on Reproducibility of T2-Weighted MRI Radiomics Features. Magn Reson Mater Phy 2023, 36, 945–956. [Google Scholar] [CrossRef] [PubMed]
  79. Foltyn-Dumitru, M.; Schell, M.; Rastogi, A.; Sahm, F.; Kessler, T.; Wick, W.; Bendszus, M.; Brugnara, G.; Vollmuth, P. Impact of Signal Intensity Normalization of MRI on the Generalizability of Radiomic-Based Prediction of Molecular Glioma Subtypes. Eur Radiol 2023, 34, 2782–2790. [Google Scholar] [CrossRef] [PubMed]
  80. Salome, P.; Sforazzini, F.; Grugnara, G.; Kudak, A.; Dostal, M.; Herold-Mende, C.; Heiland, S.; Debus, J.; Abdollahi, A.; Knoll, M. MR Intensity Normalization Methods Impact Sequence Specific Radiomics Prognostic Model Performance in Primary and Recurrent High-Grade Glioma. Cancers (Basel) 2023, 15. [Google Scholar] [CrossRef]
  81. Ubaldi, L.; Saponaro, S.; Giuliano, A.; Talamonti, C.; Retico, A. Deriving Quantitative Information from Multiparametric MRI via Radiomics: Evaluation of the Robustness and Predictive Value of Radiomic Features in the Discrimination of Low-Grade versus High-Grade Gliomas with Machine Learning. Physica Medica 2023, 107, 102538. [Google Scholar] [CrossRef] [PubMed]
  82. Granzier, R.W.Y.; Ibrahim, A.; Primakov, S.; Keek, S.A.; Halilaj, I.; Zwanenburg, A.; Engelen, S.M.E.; Lobbes, M.B.I.; Lambin, P.; Woodruff, H.C.; et al. Test–Retest Data for the Assessment of Breast MRI Radiomic Feature Repeatability. Magnetic Resonance Imaging 2022, 56, 592–604. [Google Scholar] [CrossRef]
  83. Hoebel, K.V.; Patel, J.B.; Beers, A.L.; Chang, K.; Singh, P.; Brown, J.M.; Pinho, M.C.; Batchelor, T.T.; Gerstner, E.R.; Rosen, B.R.; et al. Radiomics Repeatability Pitfalls in a Scan-Rescan MRI Study of Glioblastoma. Radiology: Artificial Intelligence 2021, 3, e190199. [Google Scholar] [CrossRef] [PubMed]
  84. Scalco, E.; Belfatto, A.; Mastropietro, A.; Rancati, T.; Avuzzi, B.; Messina, A.; Valdagni, R.; Rizzo, G. T2w-MRI Signal Normalization Affects Radiomics Features Reproducibility. Med. Phys. 2020, 47, 1680–1691. [Google Scholar] [CrossRef] [PubMed]
  85. Shiri, I.; Hajianfar, G.; Sohrabi, A.; Abdollahi, H.; Shayesteh, S.P.; Geramifar, P.; Zaidi, H.; Oveisi, M.; Rahmim, A. Repeatability of Radiomic Features in Magnetic Resonance Imaging of Glioblastoma: Test–Retest and Image Registration Analyses. Med. Phys. 2020, 47, 4265–4280. [Google Scholar] [CrossRef] [PubMed]
  86. Simpson, G.; Ford, J.C.; Llorente, R.; Portelance, L.; Yang, F.; Mellon, E.A.; Dogan, N. Impact of Quantization Algorithm and Number of Gray Level Intensities on Variability and Repeatability of Low Field Strength Magnetic Resonance Image-Based Radiomics Texture Features. Physica Medica 2020, 80, 209–220. [Google Scholar] [CrossRef] [PubMed]
  87. Schwier, M.; Van Griethuysen, J.; Vangel, M.G.; Pieper, S.; Peled, S.; Tempany, C.; Aerts, H.J.W.L.; Kikinis, R.; Fennessy, F.M.; Fedorov, A. Repeatability of Multiparametric Prostate MRI Radiomics Features. Sci Rep 2019, 9, 9441. [Google Scholar] [CrossRef] [PubMed]
  88. Traverso, A.; Kazmierski, M.; Shi, Z.; Kalendralis, P.; Welch, M.; Nissen, H.D.; Jaffray, D.; Dekker, A.; Wee, L. Stability of Radiomic Features of Apparent Diffusion Coefficient (ADC) Maps for Locally Advanced Rectal Cancer in Response to Image Pre-Processing. Physica Medica 2019, 61, 44–51. [Google Scholar] [CrossRef] [PubMed]
  89. Ferreira, M.; Lovinfosse, P.; Hermesse, J.; Decuypere, M.; Rousseau, C.; Lucia, F.; Schick, U.; Reinhold, C.; Robin, P.; Hatt, M.; et al. [18F]FDG PET Radiomics to Predict Disease-Free Survival in Cervical Cancer: A Multi-Scanner/Center Study with External Validation. Eur J Nucl Med Mol Imaging 2021, 48, 3432–3443. [Google Scholar] [CrossRef]
  90. Xu, H.; Lv, W.; Zhang, H.; Ma, J.; Zhao, P.; Lu, L. Evaluation and Optimization of Radiomics Features Stability to Respiratory Motion in 18 F-FDG 3D PET Imaging. Med. Phys. 2021, 48, 5165–5178. [Google Scholar] [CrossRef] [PubMed]
  91. Crandall, J.P.; Fraum, T.J.; Lee, M.; Jiang, L.; Grigsby, P.; Wahl, R.L. Repeatability of 18 F-FDG PET Radiomic Features in Cervical Cancer. J Nucl Med 2021, 62, 707–715. [Google Scholar] [CrossRef] [PubMed]
  92. Ger, R.B.; Meier, J.G.; Pahlka, R.B.; Gay, S.; Mumme, R.; Fuller, C.D.; Li, H.; Howell, R.M.; Layman, R.R.; Stafford, R.J.; et al. Effects of Alterations in Positron Emission Tomography Imaging Parameters on Radiomics Features. PLoS ONE 2019, 14, e0221877. [Google Scholar] [CrossRef] [PubMed]
  93. Li, Y.; Ammari, S.; Balleyguier, C.; Lassau, N.; Chouzenoux, E. Impact of Preprocessing and Harmonization Methods on the Removal of Scanner Effects in Brain MRI Radiomic Features. Cancers 2021, 13, 3000. [Google Scholar] [CrossRef] [PubMed]
Figure 1. PRISMA-P flow chart: articles selection process.
Figure 1. PRISMA-P flow chart: articles selection process.
Preprints 110195 g001
Figure 2. Risk of bias (left) and applicability concerns (right).
Figure 2. Risk of bias (left) and applicability concerns (right).
Preprints 110195 g002
Table 1. The distribution of found records across databases.
Table 1. The distribution of found records across databases.
Database (n° of record) n° of total records
(with duplicates)
n° of total records
(without duplicates)
Medline Embase Cochrane Scopus
208 286 11 41 546 459
Table 2. The number of articles included and stratified by voxel resampling types (N.A. regards the case of missing this information).
Table 2. The number of articles included and stratified by voxel resampling types (N.A. regards the case of missing this information).
Modality Acquisition parameters reporting Voxel resampling
Isotropic Multiple isotropic Non-isotropic N.A. None
CT 10 (83.3%) 2 (16.7%) 3 (25.0%) 2 (16.7%) 5 (41.7%) 0 (0.0%)
Ref. [54,55] [53,56,57] [39,52] [58,59,60,61,62]
CBCT 1 (100%) 1 (100%) [63] 0 (0%) 0 (0%) 0 (0%) 0 (0%)
Ref. [63]
MRI 21 (80.8%) 9 (34.6%) 2 (7.7%) 3 (11.5%) 5 (19.2%) 7 (26.9%)
Ref. [55,64,65,66,67,68,69,70,71] [72,73] [74,75,76] [77,78,79,80,81] [82,83,84,85,86,87,88]
PET/CT 5 (100%) 2 (40%) 2 (40%) 0 (0%) 0 (0%) 1 (20%)
Ref. [89,90] [28,91] [92]
Table 3. Normalization strategies grouped by modality. Combination stands for absolute and relative strategies used for comparison.
Table 3. Normalization strategies grouped by modality. Combination stands for absolute and relative strategies used for comparison.
Modality Absolute Relative Combination None
CT 0 (0%) 1 (8.3%) 1 (8.3%) 10 (83.3%)
Ref. [55] [57] [39,52,53,54,56,58,59,60,61,62]
CBCT 0 (0%) 1 (100%) 0 (0%) 0 (0%)
Ref. [63]
MRI 0 (0%) 14 (53.8%) 4 (15.4%) 8 (30.8%)
Ref. [55,67,70,73,75,76,78,79,80,81,82,83,86,87] [64,65,66,84] [68,69,71,72,74,77,85,88]
PET/CT 0 (0%) 1 (20%) 1 (20%) 3 (60%)
Ref. [89] [91] [28,90,92]
Table 4. Discretization strategies grouped by modality.
Table 4. Discretization strategies grouped by modality.
BN BW BN + BW None
CT 2 (16.7%) 6 (50%) 1 (8.3%) 3 (25%)
Ref. [39,58] [52,54,55,56,57,60] [59] [53,61,62]
CBCT 1 (100%) 0 (0%) 0 (0%) 0 (0%)
Ref. [63]
MRI 10 (38.5%) 9 (36.6%) 5 (19.2%) 2 (7.7%)
Ref. [64,68,70,74,80,81,83,84,85,86] [55,65,71,72,75,76,82,87,88] [66,67,73,77,78] [69,79]
PET/CT 1 (20%) 0 (0%) 4 (80%) 0 (0%)
Ref. [91] [28,89,90,92]
Table 5. The study aims per modality.
Table 5. The study aims per modality.
Best performance Repeatability Reproducibility Repeatability+reproducibility
CT 3 (25%) 2 (16.7%) 5 (41.6%) 2 (16.7%)
Ref. [54,55,58] [53,61] [39,56,57,59,60] [52,62]
CBCT 0.0% 0.0% 0.0% 1 (100%)
Ref. [63]
MRI 8 (30.8%) 7 (26.9%) 6 (23.1%) 5 (19.2%)
Ref. [55,69,71,74,76,77,79,81] [65,72,75,83,85,86,87] [64,66,67,73,78,80] [68,70,82,84,88]
PET/CT 2 (40%) 3 (60%) 0 (0%) 0 (0%)
Ref. [89,90] [28,91,92]
Table 6. Investigated “macro” anatomic districts divided per modality.
Table 6. Investigated “macro” anatomic districts divided per modality.
Abdomen Brain Thorax Pelvis N.A.
CT 1 (8.4%) 0 3 (25%) 4 (33.3%) 4 (33.3%)
Ref. [54] [53,60,61] [56,57,58,59] [39,52,55,62]
CBCT 0 0 0.0% 1 (100%)
Ref. [63]
MRI 2 (7.7%) 10 (38.5%) 2 (7.7%) 9 (34.6%) 4 (15.3%)
Ref. [75,77] [66,68,73,74,79,80,81,83,85,93] [71,72] [65,67,69,74,76,78,84,87,88] [55,70,82,86]
PET/CT 0 0 3 (60%) 2 (40%) 0
Ref. [28,90,92] [89,91]
Table 7. Comparison metrics per imaging modality.
Table 7. Comparison metrics per imaging modality.
ICC CCC AUC Other
CT 3 3 2 4
Ref. [56,57,59] [52,53,61] [55,58] [39,54,60,62]
CBCT 0 1 0 0
Ref. [63]
MRI 15 7 5 9
Ref. [65,66,67,70,72,73,78,81,82,83,84,85,87,88,93] [66,75,82,84,85,86,88] [55,69,76,79,81] [67,68,70,71,74,77,80,83,84]
PET/CT 2 1 1 1
Ref. [28,92] [91] [89] [90]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated