1. Introduction
Breast cancer is the most diagnosed cancer among women worldwide. The Lancet’s Breast Cancer Commission [
1] predicts that by 2040, the global incidence of new cases of breast cancer will be more than 3 million per year. In the UK alone, there are 56,800 new cases of breast cancer diagnosed each year, with 11,500 deaths annually [
2]. The mortality rate has started to decrease recently [
3], which can be attributed to progress in early diagnostics [
4,
5]. Any delay in cancer detection can be fatal for a patient. All cancers currently require a histopathological examination, but such a procedure also expects the exclusion of benign lesions. Across most services, biopsies with a benign diagnosis are eight times more likely than those with a malignant one. Regardless, all require the same input level, and currently, there is no effective triage system to ‘fast-track’ potential malignant cases.
In an average NHS Trust, there are about 60,000 breast biopsies per year, the majority of which will be benign. This places an enormous burden on Pathology Departments, where only 3% of laboratories are currently fully staffed [
6], contributing to significant delays in turnaround time (TAT). Thus, there is a real need to adopt a different approach to tissue diagnostics that can effectively triage samples at an early stage, allowing those with potentially life-threatening conditions to be fast-tracked and, ideally, offering early reassurance to those without significant diseases.
To address this, we have investigated the feasibility of using the structural biomarkers obtained from X-ray diffraction to rapidly distinguish between cancer and benign breast tissues. Previously, cancer-induced structural changes were examined in two distinct ranges of scattering angles. Small-angle X-ray scattering (SAXS) and wide-angle X-ray scattering (WAXS) are defined to cover the (overlapping) momentum transfer values of 0.1 <
q < 5 nm
-1 and 3 <
q < 40 nm
-1, respectively. SAXS probes, for example, modifications to collagen fibril repeat distances [
7,
8,
9,
10], alterations to the amorphous scattering profile [
9,
10,
11,
12], and disruption to triglyceride molecular packing [
13,
14]. For the latter, a diffraction maximum at
q = 1.5 nm
−1 is characteristic of healthy tissue, and it has been shown [
14] that this peak is absent in benign tissues. In contrast, WAXS addresses modifications to lipid and aqueous components [
15,
16,
17,
18,
19,
20,
21]. Specifically, it has been shown [
13,
16] that in cancerous tissues, the intensity of a maximum peak at approximately
q = 14 nm
−1 is reduced, while the intensity maximum at approximately
q = 20 nm
-1 increases. The first feature is attributed to inter-fatty-acid molecular distances, while the second one is related to the oxygen-oxygen distance in the tetrahedral structure of water. The diminution of the 14 nm
−1 peak is caused by cancer-altered lipid metabolism, rendering it a potential structural biomarker for cancer.
In the current study, we revisit and extend the measurements of the Breast Cancer Now Biobank (BCNB) [
22] samples previously reported in [
23] to specifically address the benign/cancer classification and determine the optimal preprocessing and classification methods. We utilize three distinct data representations, one of which is based on the analysis of one-dimensional (1D) momentum transfer profiles obtained after azimuthal integration. This representation was employed in [
23]. Another approach was introduced in [
24] for XRD-based cancer detection in dogs’ claws and implements the Fourier coefficients of this 1D profile. It was demonstrated in [
25] for the same dataset of dogs’ claws that 2D Fourier analysis of the XRD images, without azimuthal integration, provides better metrics, which is our third data representation method. We employ various data preprocessing steps, both standard and custom-developed, including Principal Component Analysis (PCA), and several classes of machine learning algorithms. Logistic Regression, Random Forest, Support Vector Machine, K-Nearest Neighbor, Naive Bayes Classifier, Light Gradient-Boosting Machine, and XGBoost were used to determine the performance metrics. The first five classifiers are sourced from the scikit-learn library [
26,
27], while the last two are derived from [
28] and the XGBoost library [
29,
30], respectively. The code is organized into pipelines, which describe different combinations of preprocessing, processing, and classification steps.
Our analysis shows excellent cancer/benign discrimination, with many pipelines exhibiting a balanced accuracy exceeding 0.9 for patients. In this, the obtained classification metrics are better for the WAXS images. A comparison of the data representation methods demonstrates that the 2D Fourier coefficients have an advantage, whereas, for 1D representation, the conventional approach is preferable to the 1D Fourier transform.
2. Materials and Methods
2.1. Experimental Design
2.1.1. Breast Tissue Specimens
To test whether XRD could distinguish between cancer and benign tissue samples, we accessed ex-vivo biopsy samples from the Breast Cancer Now Biobank (BCNB). Ethical approval for the project was obtained locally via Keele University (NS-210096) and for the collection and use of specimens via the BCNB (NRES Approval Number 23/EE/0229). These were fresh-frozen (FF) samples, approximately 10 × 2 × 2 mm in size, taken from patients who consented to the BCNB. All cases underwent full histopathological diagnosis on H&E sections and were categorized into invasive cancer or benign lesions. The cancer cohort mainly consisted of invasive ductal carcinoma and invasive lobular carcinoma of different grades. The benign cohort includes various conditions, excluding fibroadenomas and macromastia. Each tissue specimen was placed into a bespoke aluminum sample holder, 2 mm thick, utilizing a SPEXTM 6 µm thick mylar window film to seal and secure the tissue within a 5 mm aperture.
2.1.2. X-Ray Diffraction (XRD) Measurements
XRD measurements were conducted using a bespoke X-ray diffractometer engineered and built by EosDx, Inc. (Menlo Park, CA, USA), a US-based company developing X-ray scattering for medical diagnostics. The radiation produced by the copper-based Incoatec Microfocus Source (Geesthacht, Germany) was collected using two multilayer curved mirror optics, resulting in a low-divergence, monochromatic beam with a wavelength of
λ = 0.154 nm. The two-dimensional detector employed to record the X-ray scatter was a MiniPix SN1442 Si 500 µm detector (ADVACAM, Prague, Czech Republic) with a 256 × 256-pixel array and a 55 × 55 µm pixel size. Silver behenate powder (Thermoscientific
® 045494.06, Heysham, Lancashire, UK) was scanned (utilizing the bespoke aluminum holders) to allow for accurate sample-to-detector distance calibration. The experimental data were stored as a 256 by 256 matrix of integers representing the photon counts. All the experiments were performed at room temperature (19 °C) under atmospheric pressure. To assess specimen heterogeneity, data were collected from several individual spots on each sample, from 4 to 9, depending on the size of the sample, which results in several diffractograms per specimen. The measurements were performed at two specific sample-to-detector distances to examine WAXS (2 cm) and SAXS (16 cm) specifically. Each diffractogram was collected for 1 min for SAXS and 30 sec for WAXS, with a count time of 0.1 sec. Examples of the XRD images are shown in
Figure 1a,
b).
2.2. Data Analysis
2.2.1. Image Preprocessing
In total, 3047 WAXS and 3055 SAXS images for 47 benign and 164 cancer patients were collected and examined in this work. The raw XRD data files contained the intensities at the 256 × 256 pixel arrays. In the first pre-processing step, any faulty detector pixels (e.g., dead pixels) were removed.
The maximum difference in sample-to-detector distance, which influences calibration, was within 2.6%. For the 1D models, which include azimuthal integration, automatic calibration was performed using the open-source library PyFAI. For 2D data representation, this error was considered negligible, and no size calibration was performed.
Additional pre-processing steps for the 2D models include normalizing the images outside the primary beam region. Furthermore, to exclude the contributions of random features, the areas outside of the main patterns, which are similar for all images, were nullified. The pre-processed images are presented in
Supplemental Materials, Figure S1a,b).
For 1D models, the azimuthal integration was performed using PyFAI to obtain an average radial profile starting from the beam position. It was subsequently represented in terms of the momentum transfer
q = (4π sin
θ)/
λ, where 2
θ is the scattering angle, tan 2
θ is calculated as the ratio of the distances from the pixel to the beam center and from the sample to the detector, and
λ is the X-ray wavelength. To eliminate artifacts, the full
q-range was limited to retain 3 ÷ 20 nm
-1 for WAXS and 0.3 ÷ 3.7 nm
-1 for SAXS. The obtained intensities were normalized. The representative curves are shown in
Figure 2.
2.2.2. Fourier Coefficients Representation
Both 1D and 2D Fourier transformations were implemented to describe initial XRD data. The 1D Fourier coefficients were calculated for azimuthally integrated curves in
Figure 2 using either the Discrete Fourier Transformation implemented in the SciPy and NumPy libraries (
1DF) or the custom procedure described in [
23] (
1DFC). The slope of the curve was removed for improved Fourier series convergence (
SR).
The 2D Fourier coefficients (
2DF) were calculated for the preprocessed XRD data (
Figure S1a,b) using the two-dimensional Discrete Fourier Transformation functions provided by the SciPy and NumPy libraries. This implemented preprocessing enabled analysis of the same area in 2D images, thereby enhancing the influence of cancer on the XRD pattern and reducing the impact of optical alignment. We tested different combinations of Fourier coefficient components: real parts (
Re), imaginary parts (
Im), amplitudes (
Am), and phases (
Ph).
To eliminate artifacts and accelerate processing, for some pipelines, we employed Low-Pass Fourier Filtration (LPF), which removed high-frequency coefficients. Specifically, for discriminant analysis, 30 coefficients were selected for each curve in the 1D case, and a 20th-order cutoff was implemented in the 2D case to retain only 1,257 out of the total 65,536 coefficients.
2.2.3. Measurements-to-Patients Transition
We have examined two samples from most patients, with a few providing one, three, or four samples, and each sample was measured multiple times. The model was optimized using all measurements in the training dataset. Then, the class probability of each testing measurement was evaluated using the optimized model, which provided the decision threshold to achieve maximum balanced accuracy. Cancer and benign diagnoses were assigned for each measurement when the probability exceeded or fell below the threshold, respectively. The final diagnosis of a patient was established by averaging all predicted class probabilities for that patient and comparing them with the optimal decision threshold (
Figure 3).
2.2.4. Data Analysis Procedure and Machine Learning Methods
We used Visual Studio and Jupyter Notebook as the primary coding platforms. The majority of preprocessing methods were sourced from the scikit-learn library [
27], from which most classifiers were also adopted, except for gradient-boosting algorithms [
28,
29,
30]. Here, we provide only a brief description of the computation method, as the overall classification algorithm is the same as described in [
25], where all details can be found. All preprocessing steps are optional and were combined differently in pipelines. Abbreviations given here in italics will be used in the Results section to describe the pipelines. Optionally, the standardization (
STD) of Fourier coefficients can be applied before classification. As a dimensionality reduction technique, principal component analysis (PCA) was used with three different numbers of principal components: 3 (
PCA_3), 50 (
PCA_50), and 100 (
PCA_100). Two methods for removing the primary beam were implemented in the code. The beam was removed from the original images before the Fourier transformations (
BR) and, alternatively, in reciprocal space (
BRF) (see [
25] for details).
The code was organized into pipelines, which contained different combinations of preprocessing, processing, and classification steps. It should be emphasized that the values of the optimal decision threshold vary broadly between the pipelines. Some pipelines with too many features used in modeling require the Stochastic Gradient Descent (
SGD) method [
27] to ensure reasonable computation time. The list of classifiers used in this work included Logistic Regression (
LR), Gaussian Naive Bayes (
GNB), K-nearest Neighbors Classifier (
KNN), linear Support Vector Classifier (
SVC), Random Forest Classifier (
RF), XGBoost (
XGB), and LightGBM (
LGBM). The pros and cons of each classifier were previously discussed in [
25].
In total, we produced 574 pipelines, both for WAXS and SAXS, divided into three main types: 1D models with Fourier transformation (
1DF) and without it (
1D), as well as 2D models with Fourier transformation (
2DF). A complete list of abbreviations for utilized methods is provided in
Supplemental Table S1.
3. Results
The measured XRD patterns were randomly separated into training and testing datasets, with 20 cancerous (1-20) and 12 benign (21-32) patients selected for testing. After both the training and testing datasets were preprocessed using the procedure described above, the training dataset was used to optimize the model. The testing dataset was then classified based on the optimized estimator.
We used sensitivity (Sen_M), specificity (Spec_M), and balanced accuracy (BA_M) as performance metrics to assess the measurements. Sensitivity and specificity are the proportions of cancerous and benign samples, respectively, that were correctly identified, and balanced accuracy is the average of these two metrics. We also determined the receiver operating characteristics (ROC) curve and used the area under the ROC curve (AUC_M) as a metric. After the transition to patients described above, all performance metrics (Sen_P, Spec_P, BA_P, and AUC_P) were also calculated for the patients. All the procedures were performed separately for the WAXS and SAXS measurements.
The pipelines, i.e., combinations of preprocessing steps and classifiers, were ranked in terms of BA_P and AUC_P (for equal BA_P values). The five highest-ranked sets are presented in
Table 1 and
Table 2 for WAXS and SAXS, respectively. The total rankings for WAXS (W) and SAXS (S) are provided in the first column. The second column describes the set of preprocessing steps and the classifier using the nomenclature introduced in
Section 2. Columns 3-6 show the metrics for the measurements, and columns 7-10 display the metrics for patients.
It is evident that many pipelines provided excellent metrics, demonstrating clear separation of the diffraction patterns belonging to cancerous and benign tissues. For WAXS, the most discriminating results are for the two-dimensional analysis. However, the metrics for the 1D approach are also reasonably good. The best one-dimensional pipelines based on the 1D Fourier coefficients are ranked 84W – 92W, with identical BA_P = 0.91 and AUC_P = 0.9 – 0.93 for RF, XGB, KNN, and LGBM classifiers. The conventional 1D analysis is ranked higher, with 43W – 48W, yielding BA_P = 0.93 and AUC_P = 0.9 – 0.93 for the RF, XGB, KNN, SVC, and LGBM classifiers. The LR classifier produces an even higher value, with AUC_P = 0.97. In general, BA_P for 99 out of 574 pipelines exceeds 0.9. For SAXS, the metrics for 2D and conventional 1D are similar, while the best 1D Fourier is ranked 25S, with BA_P = 0.88 and AUC_P = 0.86 for the LR classifier. The results for SAXS are worse than for WAXS, with only 12 pipelines providing BA_P better than 0.9.
In the analysis of WAXS, the metrics for all classifiers are similar, except for GNB, which showed impressive results only when combined with PCA. It is worth noting that this is the only classifier to perform better with PCA, while all the others showed no improvement. For SAXS, LR and SVC perform better than others, while KNN performs worse, with metrics sometimes improved by PCA.
To determine the causes of our <1 metrics, we examined the results of the best pipelines for all patients from the testing group. The resulting performances are displayed in
Figure 4 for WAXS (
a and
b) and SAXS (
c and
d). In this, we also demonstrate the indirect performances of the best WAXS (SAXS) pipelines of all three types on SAXS (WAXS) datasets. They are shown in black, while the best direct ones are in green, blue, and red, depending on the pipeline type.
It is evident from this figure that pipelines with excellent performance on WAXS are not always reliable for the SAXS dataset, and vice versa. A good illustration is
Figure 4a, which shows many false negative results (black squares) for the indirect 1DF pipeline.
This analysis revealed two outstanding patients, Patient 2, with a cancer diagnosis, and Patient 22, with benign conditions. For Patient 2, all pipelines yielded false negatives in WAXS, as well as the 2D pipelines in SAXS. For Patient 22, all pipelines, except the direct 2D pipeline, result in a false positive. New histopathological analyses were performed for these patients. For Patient 22, a fibroadenoma diagnosis was obtained, which explained the false-positive results, as fibroadenoma patients were not included in the benign training dataset. For Patient 2, the new tissue assessment described the tissue as only 5% invasive ductal carcinoma.
To gain a better understanding of Patient 2, we examined all the measurements of the corresponding samples, along with the class probabilities and optimal decision thresholds, as shown in
Figure 5, for the best pipelines of all types.
The two samples belonging to Patient 2 exhibit different behaviors seen in the 1D pipelines, while for the 2D pipelines, they are similar. The first sample, measured at 9 points, was closer to the benign response, while the second sample was rather cancerous. However, even in the first sample, the class probabilities for different measurements vary from 0 to 1 in the conventional 1D classification. This agrees with a new tissue assessment reporting that this patient had an initial or even pre-cancerous stage; the tumor microenvironment is not entirely restructured by the cancer cells, and the diffraction patterns strongly depend on the point of measurement. Such non-uniformity of the X-ray scattering is especially seen in the SAXS measurements.
4. Discussion
In this study, we measured X-ray diffraction patterns from 211 patients with benign and malignant histological diagnoses. The measurements were performed in two separate ranges of the scattering momentum transfer, varying the sample-to-detector distance of the diffractometer. We compared various data analytics methods with different sets of pre-processing steps and machine-learning classifiers (pipelines). We achieved excellent benign/cancer classification, with more than a hundred pipelines having balanced accuracy exceeding 0.9. These results are significantly better than previously obtained in [
23] for part of the dataset. However, it is worth noting that for the present paper, we excluded the “healthy” samples derived from cosmetic procedures and benign samples with confirmed diagnoses of fibroadenoma and macromastia.
We compared three different data representations and demonstrated that the custom-developed approach, based on the 2D Fourier coefficients [
25], yields better metrics for wide-angle scattering. For small angles, the results obtained by this approach are similar to those obtained by the conventional method based on azimuthal integration. At the same time, using the Fourier transformation for the examination of the 1D curves [
24] appears not beneficial, as the metrics are worse. Generally, the classification performs better for WAXS measurements than for SAXS.
We also compared various machine-learning classifiers and demonstrated that the results are comparable, with minor exceptions. One such exception is the performance of the Logistic Regression. In [
25], where the X-ray diffraction of the dogs’ claws was examined, this classifier was worse than any of the others. Here, it provides improved metrics, especially in the SAXS range. A possible explanation is that cancer and benign clusters are well-defined, in contrast to the dogs’ claws, and the straightforward separation using Logistic Regression works well.
We also investigated the reasons why our analysis was non-ideal and identified two outliers in the test dataset, for which almost all pipelines produced either a false negative (for the cancer patient) or a false positive (for the benign patient). A new histopathological tissue assessment revealed that the benign patient had a fibroadenoma, which was an exclusion criterion for the training dataset; therefore, these data do not belong to either of the established clusters. For the second anomalous patient data, a similar new tissue assessment showed that this is only 5% invasive ductal carcinoma. A more scrutinized examination of the two samples belonging to this patient revealed the non-uniformity of results between the samples and even within different positions on the same sample. This is expected, as the cancer-induced structural changes have not yet extended to the macroscopic scale in such patients.
The revealed heterogeneity informs us regarding one of the future improvements of our method. In the present work, we employed simple averaging of the results from different measurements and samples, as shown in
Figure 3. However, for non-conclusive cases, a more advanced weighted approach could be used. Another improvement can be the inclusion of healthy samples and patients with fibroadenoma and macromastia. In this case, the binary classification would be replaced with a multiclass classification, as several clusters are expected. However, we should emphasize that the present work is only the first step, and further research is necessary with greatly enhanced statistics.
5. Conclusions
Our results clearly showed that cancer-induced structural biomarkers can be successfully revealed by X-ray diffraction of human breast tissues. Our prototype measurement system features a variable sample-to-detector distance, enabling it to accommodate a range of momentum transfer values from the SAXS to the WAXS regions. Such measurements probe different independent molecular components of tissues and, therefore, complement each other.
In the present work, we developed and compared various data analytics approaches, which include both conventional 1D data representation based on the azimuthal integration of the diffraction patterns and 1D and 2D Fourier transformations. We obtained excellent performance metrics, especially with 2D Fourier coefficients in the WAXS range. Our measurements and data analysis are fast and inexpensive, making X-ray diffraction a valuable tool for pathological laboratories.
Supplementary Materials
The following supporting information can be downloaded at the website of this paper posted on Preprints.org. Table S1: List of abbreviations; Figure S1: The pre-processed images: (a) WAXS and (b) SAXS.
Author Contributions
Conceptualization, L.J., L.M. and P.L.; methodology, A.Al., O.A., A.Aj., C.G., K.R. and P.L.; software, A.Al., S.S., O.A., S.D., V.K., B.B., and S.M.; validation, A.Al., K.R., L.J., L.M. and P.L.; formal analysis, A.Al. and L.M.; investigation, A.Al., S.S., O.A., S.D., V.K., B.B., S.M., A.Aj., L.A. and C.G.; resources, C.G., L.J. and P.L.; data curation, A.Al., S.S., O.A., S.D., V.K., B.B., and A.Aj.; writing—original draft preparation, A.Al., L.J. and L.M.; writing—review and editing, A.Al., S.S., V.K., C.G., K.R., L.J. and L.M.; visualization, A.Al. and L.M.; supervision, C.G., K.R., L.M. and P.L.; project administration, C.G., L.J. and P.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Tissue was collected under NRES Approval number 23/EE/0229, approved November 21, 2023, with university ethics approval granted for this research (NS-210096).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study by the Breast Cancer Now Biobank.
Data Availability Statement
The files with the XRD patterns for the training and test datasets, as well as the complete tables with sets of metrics obtained from different approaches, are available at
https://zenodo.org/records/15129858. The codes are available upon request.
Acknowledgments
The authors wish to acknowledge the role of the Breast Cancer Now Tissue Bank in collecting and making available the samples used in the generation of this publication, and the patients who donated to the Bank. This work acknowledges the support of the National Institute for Health and Care Research Barts Biomedical Research Centre (NIHR203330); a delivery partnership of Barts Health NHS Trust, Queen Mary University of London, St George’s University Hospitals NHS Foundation Trust, and St George’s University of London.
Conflicts of Interest
P.L. is a shareholder of Matur UK, Ltd and EosDx, Inc. A.Al., S.S., O.A., S.D., V.K., B.B., and S.M. are consultants for Matur UK, Ltd. K.R., L.J. and L.M. are consultants for EosDx, Inc.
References
- Coles, C.E.; Earl, H.; Anderson, B.O.; Barrios, C.H.; Bienz, M.; Bliss, J.M.; Cameron, D.A.; Cardoso, F.; Cui, W.; Francis, P.A; et al. The Lancet Breast Cancer Commission. Lancet 2024, 403, 1895–1950. [Google Scholar] [CrossRef]
- https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/breast-cancer, accessed March 18, 2025.
- Siegel, R.L.; Giaquinto, A.N.; Jemal, A. Cancer statistics, 2024. CA Cancer J Clin. 2024, 74, 12–49. [Google Scholar] [CrossRef] [PubMed]
- Migowski, A. Early detection of breast cancer and the interpretation of results of survival studies. Cienc. Saude Coletiva 2015, 20, 1309. [Google Scholar] [CrossRef] [PubMed]
- Taylor, C.; McGale, P.; Probert, J.; Broggio, J.; Charman, J.; Darby, S.C.; Kerr, A.J.; Whelan, T.; Cutter, D.J.; Mannu, G.; Dodwell, D. Breast cancer mortality in 500 000 women with early invasive breast cancer diagnosed in England, 1993-2015: population based observational cohort study. BMJ 2023, 381, e074684. [Google Scholar] [CrossRef] [PubMed]
- https://www.rcpath.org/discover-pathology/public-affairs/the-pathology-workforce.html, accessed March 18, 2025.
- Lewis, R.A.; Rogers, K.D.; Hall, C.J.; Towns-Andrews, E.; Slawson, S.; Evans, A.; Pinder, S.E.; Ellis, I.O.; Boggis, C.R.M.; Hufton, A.P.; Dance, D.R. Breast cancer diagnosis using scattered X-rays. J. Synchrotron Rad. 2000, 7, 348–352. [Google Scholar] [CrossRef]
- Fernandez, M.; Keyrilainen, J.; Serimaa, R.; Torkkeli, M.; Karjalainen-Lindsberg, M.-L.; Tenhunen, M.; Thomlinson, W.; Urban, V.; Suortti, P. Small-angle x-ray scattering studies of human breast tissue samples. Phys. Med. Biol. 2002, 47, 577–592. [Google Scholar] [CrossRef]
- Sidhu, S.; Siu, K.K.W.; Falzon, G.; Nazaretian, S.; Hart, S.A.; Fox, J.G.; Susil, B.J.; Lewis, R.A. X-ray scattering for classifying tissue types associated with breast disease. Med. Phys. 2008, 35, 4660–4670. [Google Scholar] [CrossRef]
- Conceição, A.L.; Antoniassi, M.; Poletti, M.E. Analysis of breast cancer by small angle X-ray scattering. Analyst 2009, 134, 1077–1082. [Google Scholar] [CrossRef]
- Sidhu, S.; Siu, K.K.W.; Falzon, G.; Hart, S.A.; Fox, J.G.; Lewis, R.A. Mapping structural changes in breast tissue disease using x-ray scattering. Med. Phys. 2009, 36, 3211–3217. [Google Scholar] [CrossRef]
- Conceiçao, A.L.C.; Antoniassi, M.; Geraldelli, W.; Poletti, M.E. Mapping transitions between healthy and pathological lesions in human breast tissues by diffraction enhanced imaging computed tomography (DEI-CT) and small angle x-ray scattering (SAXS). Radiat. Phys. Chem. 2014, 95, 313–316. [Google Scholar] [CrossRef]
- Conceicao, A.L.C.; Meehan, K.; Antoniassi, M.; Piacenti-Silva, M.; Poletti, M.E. The influence of hydration on the architectural rearrangement of normal and neoplastic human breast tissues. Heliyon 2019, 5, e01219. [Google Scholar] [CrossRef] [PubMed]
- Mohd Sobri, S.N.; Abdul Sani, S.F.; Sabtu, S.N.; Looi, L.M.; Chiew, S.F.; Pathmanathan, D.; Chio-Srichan, S.; Bradley, D.A. Structural Studies of Epithelial Mesenchymal Transition Breast Tissues. Sci. Rep. 2020, 10, 1997. [Google Scholar] [CrossRef] [PubMed]
- Kidane, G.; Speller, R.D.; Royle, G.J.; Hanby, A.M. X-ray scatter signatures for normal and neoplastic breast tissues. Phys. Med. Biol. 1999, 44, 1791. [Google Scholar] [CrossRef] [PubMed]
- Poletti, M.E.; Gonçalves, O.D.; Mazzaro, I. X-ray scattering from human breast tissues and breast-equivalent materials. Phys. Med. Biol. 2002, 47, 47–63. [Google Scholar] [CrossRef]
- Oliveira, O.R.; Conceiçao, A.L.C.; Cunha, D.M.; Poletti, M.E.; Pela, C.A. Identification of neoplasias of breast tissues using a powder diffractometer. J. Radiat. Res. 2008, 49, 527–532. [Google Scholar] [CrossRef]
- Conceicao, A.L.C.; Antoniassi, M.; Poletti, M.E. Assessment of the differential linear coherent scattering coefficient of biological samples. Nucl. Instr. Meth. Phys. Res. A 2010, 619, 67–70. [Google Scholar] [CrossRef]
- Griffiths, J.A.; Royle, G.J.; Hanby, A.M.; Horrocks, J.A.; Bohndiek, S.E.; Speller, R.D. Correlation of energy dispersive diffraction signatures and microCT of small breast tissue samples with pathological analysis. Phys. Med. Biol. 2007, 52, 6151–6164. [Google Scholar] [CrossRef]
- Cunha, D.M.; Oliveira, O.R.; Pérez, C.A.; Poletti, M.E. X-ray scattering profiles of some normal and malignant human breast tissues. X-ray Spectrometry 2006, 35, 370–374. [Google Scholar] [CrossRef]
- Conceicao, A.L.C.; Antoniassi, M.; Cunha, D.M.; Ribeiro-Silva, A.; Poletti, M.E. Multivariate analysis of the scattering profiles of healthy and pathological human breast tissues. Nucl. Instr. Meth. Phys. Res. A 2011, 652, 870–873. [Google Scholar] [CrossRef]
- https://breastcancernow.org/our-research/information-for-researchers/apply-to-our-biobank/, accessed , 2025. 18 March.
- Denisov, S.; Blinchevsky, B.; Friedman, J.; Gerbelli, B.; Ajeer, A.; Adams, L.; Greenwood, C.; Rogers, K.; Mourokh, L.; Lazarev, P. Vitacrystallography: Structural Biomarkers of Breast Cancer Obtained by X-ray Scattering. Cancers 2024, 16, 2499. [Google Scholar] [CrossRef]
- Alekseev, A.; Yuk, D.; Lazarev, A.; Labelle, D.; Mourokh, L.; Lazarev, P. Canine Cancer Diagnostics by X-ray Diffraction of Claws. Cancers 2024, 16, 2422. [Google Scholar] [CrossRef] [PubMed]
- Alekseev, A.; Avdieiev, O.; Murokh, S.; Yuk, D.; Lazarev, A.; Labelle, D.; Mourokh, L.; Lazarev, P. Fourier Transformation-Based Analysis of X-Ray Diffraction Pattern of Keratin for Cancer Detection. Crystals 2025, 15, 57. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; Vanderplas, J.; Passos, A.; Cournapeau, D.; Brucher, M.; Perrot, M.; Duchesnay, E. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- https://scikit-learn.org/stable/, accessed March 18, 2025.
- https://lightgbm.readthedocs.io/en/stable/, accessed March 18, 2025.
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’16. New York, NY, 2016, 785–794., USA: Association for Computing Machinery.
- https://xgboost.readthedocs.io/en/stable/, accessed March 18, 2025.
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).