AI in Hepatology: Revolutionizing the Diagnosis and Management of Liver Disease

Sheza Malik; Rishi Das; Thanita Thongtan; Kathryn Thompson; Nader Dbouk

doi:10.20944/preprints202411.2084.v1

Submitted:

26 November 2024

Posted:

27 November 2024

You are already at the latest version

Abstract

The integration of artificial intelligence (AI) into hepatology is revolutionizing the diagnosis and management of liver diseases amidst a rising global burden of conditions like metabolic-associated steatotic liver disease (MASLD). AI, particularly machine learning (ML) and deep learning (DL), harnesses vast datasets and complex algorithms to enhance clinical decision-making and patient outcomes. AI's applications in hepatology span a variety of conditions, including autoimmune hepatitis, primary biliary cholangitis, primary sclerosing cholangitis, MASLD, hepatitis B, and hepatocellular carcinoma. It enables early detection, predicts disease progression, such as fibrosis and cirrhosis, and supports more precise treatment strategies. Despite its transformative potential, challenges remain, including data integration, algorithm transparency, and computational demands. This review examines the current state of AI in hepatology, exploring its applications, limitations, and the opportunities it presents to enhance liver health and care delivery.

Keywords:

Artificial Intelligence

;

Machine Learning

;

Deep Learning

;

Cirrhosis

;

Hepatocellular carcinoma

Subject:

Medicine and Pharmacology - Clinical Medicine

1. Introduction

As artificial intelligence (AI) continues to evolve, its integration into various medical subspecialties has grown significantly, offering innovative approaches to diagnosis, treatment, and disease management. Hepatology is no exception. With the rising global burden of liver conditions such as metabolic-associated steatotic liver disease (MASLD), the need for efficient and accurate tools for early detection and management has never been more critical. AI technologies have emerged as powerful tools, leveraging vast datasets to improve clinical decision-making and patient outcomes. Machine learning (ML), a subfield of AI, leverages diverse algorithms to enhance predictive capabilities and decision-making. ML techniques share a common goal: systematically organizing inputs into features, labels, clusters, or neural networks to improve prediction accuracy and refine outcomes across successive iterations. ML models span a spectrum of human involvement, from supervised learning, where labeled data guides training, to unsupervised learning, which identifies patterns in unlabeled data. Reinforcement learning, another subset, involves algorithms learning optimal actions through trial and error. Deep learning (DL), a specialized branch of ML, mimics human brain processing by employing multi-layered neural networks with minimal human supervision but requires significant computational power [1]. These approaches collectively drive advancements in AI applications, transforming complex data into actionable insights in medicine and beyond.

A review of the current literature highlights AI’s applications in diagnosing and managing a range of hepatic conditions, including autoimmune hepatitis (AIH), primary biliary cholangitis (PBC), primary sclerosing cholangitis (PSC), MASLD, and hepatitis B (HBV). Beyond diagnosis, AI shows promise in predicting disease progression, such as advanced fibrosis and cirrhosis, and plays a growing role in hepatocellular carcinoma (HCC) diagnosis and management. By offering insights into how AI is transforming hepatology, this review aims to provide a comprehensive overview of its applications, limitations, and future potential in advancing liver health.

2. Applications of AI in the Diagnosis and Management of Cirrhosis and Its Complications:

AI has demonstrated significant potential in improving the accuracy of advanced fibrosis and cirrhosis detection through advanced imaging techniques, ML algorithms, and biomarker analysis. Furthermore, AI is being applied to predict complications such as hepatic encephalopathy, variceal bleeding, and ascites, enabling more personalized and timely interventions. This section explores the current applications of AI in diagnosing cirrhosis, managing its complications, and its potential to revolutionize care in this critical area of hepatology.

2.1. Detecting Advanced Fibrosis and Cirrhosis

Chang et al. evaluated the accuracy of various ML models, including logistic regression, random forest, and artificial neural networks (ANN), for predicting histological stages of fibrosis in 1,370 patients with biopsy-proven MASLD [2]. These models, which incorporated seventeen demographic and clinical features, demonstrated superior predictive accuracy compared to traditional non-invasive fibrosis assessments such as Fibroscan, FIB-4, NFS, and FAST. Fan et al. investigated ML algorithms that included liver stiffness measurements (LSM) alongside aMAP scores (a composite index incorporating age, gender, albumin, bilirubin, and platelet count) for diagnosing fibrosis and cirrhosis in 946 biopsy-confirmed MASLD patients from China and the United States. The study reported diagnostic accuracy (96.8% for advanced fibrosis and 91.2% for cirrhosis), with high specificity (92.6% and 98.3%) and sensitivity (82.4% and 88.9%) [3]. In another large, multinational study, Anushiravani et al. utilized the random forest algorithm to develop the FIB-6, a scoring system which combines readily available laboratory parameters to reliably rule out cirrhosis in patients with hepatitis C (HCV). The FIB-6 index was subsequently validated in a large study including 2,472 patients from Egypt, Iran, KSA, Greece, Turkey, and Oman who had biopsy-proven MASLD and was again noted to be superior to other non-invasive indices in ruling out advanced fibrosis and cirrhosis with a sensitivity of 97% and negative predictive value of 94.7% [4]. Similarly, Wang et al. constructed an ML model using the Light Gradient Boosting Machine (Light GBM) algorithm to predict cirrhosis in patients with chronic HBV, leveraging platelet count and bile acid levels. This model demonstrated superior predictive accuracy compared to conventional non-invasive measures, underscoring the significance of bile acids as biomarkers of liver dysfunction [5]. In addition to the study by Wang et al., several other studies have explored the use of the LightGBM algorithm to identify cirrhosis. For instance, Orhanbulucu et al. employed ensemble learning techniques, including LightGBM, to predict chronic liver disease (CLD) [6]. In this study, LightGBM achieved the highest accuracy (98.8%) among various boosting algorithms, including Random Forest and AdaBoost. Kuzhippallil et al., used LightGBM to predict CLD alongside other algorithms like XGBoost and stacking. After feature selection and outlier elimination, LightGBM demonstrated competitive accuracy, reinforcing its utility in disease prediction models [7]. LightGBM’s success in these studies highlights its strengths: memory efficiency, fast processing, and suitability for large-scale data analysis, making it an optimal choice for identifying cirrhosis in patients with CLD. In another study, Pu et al. explored a comprehensive ML algorithm incorporating 55 clinical and laboratory features to predict fibrosis in 1,023 patients with chronic HBV that had biopsy-proven fibrosis. This study’s extensive use of diverse data points offers insight into the benefits of large-scale feature integration, which may enhance the model’s clinical applicability in Hepatology [8]. Chen et al. used EmpowerStats to develop an ML model that incorporated general clinical data, complete blood counts (CBC), 24-hour urinary copper levels, and serum ceruloplasmin, to accurately predict the presence of cirrhosis in a cohort of 346 patients with Wilson disease [9].

Various ML models that incorporate imaging have also been developed to predict cirrhosis. Mazumder et al. used a cohort of 1,590 computed tomography (CT) scans to train an automated liver segmentation model using 3D-U-Net and Google’s DeepLabv3+. Their approach also integrated routine lab values and demographic data, achieving an area under the curve (AUC) of 0.85 in a validation cohort of 352 patients, including 72 liver transplant recipients [10]. Similarly, Nowak et al. utilized deep transfer learning with a ResNet50 convolutional neural network (CNN) pre-trained on the ImageNet archive to accurately detect cirrhosis in a sample of 713 patients, outperforming a board-certified radiologist [11]. Luetkens et al. utilized DL to distinguish alcohol- from non-alcohol- related cirrhosis using magnetic resonance imaging (MRI). In a cohort of 465 patients—221 with alcohol-related cirrhosis and 244 with non-alcohol-related cirrhosis—T2-weighted images were analyzed using ResNet50 and DenseNet121 CNN models. ResNet50 achieved the highest performance, with an AUC of 0.82 and an accuracy of 0.75, demonstrating the potential of DL models to differentiate alcohol-related cirrhosis from other causes [12]. Wei et al. used Pyradiomics software on MRI images from 280 patients with CLD to extract radiomics features, combining MRI sequences to predict fibrosis staging and inflammation grading [13]. Destrempes et al. developed a model using ultrasound and shear wave elastography data from 82 patients with CLD, achieving more accurate identification of steatosis, inflammation, and fibrosis than shear wave elastography alone [14]. Similarly, Meng et al. used ultrasound and shear wave elastography data from 618 patients with diabetes and MASLD, identifying 25 fibrosis-related radiomics features and highlighting five as most predictive using a support vector machine model [15].

Other ML models incorporate a broad range of clinical, imaging, and laboratory data to predict cirrhosis. For example, Ahn et al. developed a DL-based model using electrocardiogram (ECG) signals to identify patients with cirrhosis and generated an AI-Cirrhosis-ECG (ACE) score which correlated with disease severity. The model was validated and tested in a large sample of patients with cirrhosis and had a sensitivity of 84.9% and specificity of 83.2%, highlighting the potential of AI-enabled ECG analysis to assist in cirrhosis prediction [16]. Xiao et al. used DL to analyze images from slit lamps and retinal fundal examinations in 1252 patients with hepatobiliary disorders including cirrhosis and created models using these images that were highly effective in detecting cirrhosis with an AUC of 0.90 for slit lamp and 0.83 for retinal fundal imaging [17]. In another interesting study, Dou et al. developed a support vector machine algorithm using urine fluorescence spectroscopy to detect cirrhosis, showing a sensitivity of 73.85% and specificity of 89.34% [18]. These studies collectively illustrate the effectiveness of ML in combining a broad array of clinical data points with advanced algorithmic frameworks to enhance diagnostic accuracy for cirrhosis and advanced fibrosis.

2.2. Clinically Significant Portal Hypertension

Early detection of clinically significant portal hypertension (CSPH) is essential since early interventions can prevent decompensation. Hepatic venous pressure gradient (HVPG) measurement is considered the gold standard for the diagnosis of portal hypertension. However, this is an invasive procedure that is not readily available outside of tertiary care centers. Identifying non-invasive tests and models that can accurately detect CSPH is therefore paramount. Bosch et al. used trichrome-stained liver biopsy samples obtained from patients with cirrhosis due to MASLD to develop a ML model that accurately identified patients with CSPH with AUCs of 0.85 and 0.76 in the training and test phases [19]. In another study, Yu et al. used an auto-machine-learning CT radiomics HVPG quantitative model (aHVPG) that was effective in assessing portal hypertension severity and outperformed several other non-invasive measures, including elastography [20]. Noureddin et al. obtained liver biopsies from patients with cirrhosis due to MASLD and used ML to create a score based on septa, nodules, and fibrosis (SNOF) which predicted CSPH with high accuracy [21], while Reinis et al. used ML models based on readily available lab parameters (bilirubin, platelet count, and international normalized ratio) to predict CSPH in patients with compensated advanced CLD with an AUC of 0.775 [22].

2.3. Esophageal Varices

ML models have proven valuable in managing esophageal varices. Liu et al. demonstrated the effectiveness of extreme gradient boosting (XGBoost) in predicting variceal rebleeding, while Agarwal et al. used ML to create a model for predicting bleeding risk in compensated advanced CLD, offering high accuracy and a non-invasive alternative to endoscopic evaluations [23,24]. Bayani et al. applied ensemble learning techniques that combined clinical data and lab parameters, outperforming traditional methods like the platelet-to-spleen ratio with an AUC of 0.87 [25]. By integrating elastography and lab data, this approach provides a viable, non-invasive method for identifying patients at risk of variceal bleeding, potentially reducing reliance on invasive diagnostics.

Recent advances in ML for variceal management also include DL-based imaging techniques. Gao et al. developed an imaging-based ML model using endoscopic images and structured data, outperforming traditional clinical risk scores in predicting variceal bleeding in patients with cirrhosis [26]. Similarly, Jin et al. introduced an AI-driven non-invasive method for measuring esophageal varix diameter via endoscopy, enabling precise assessment of variceal size and bleeding risk without direct visualization [27]. Yan et al. developed a novel radiomics model for diagnosing high-risk esophageal varices in patients with cirrhosis using portal venous-phase enhanced CT images. The model demonstrated high accuracy, outperforming Baveno VI criteria [28].

CNNs have also been used to predict variceal bleeding risk. Hou et al. created an ANN model to predict the 1-year risk of esophagogastric variceal bleeding in patients with cirrhosis using data from 12 independent risk factors. This model demonstrated superior predictive accuracy compared to traditional indices [29]. In another large study, Chen et al. evaluated ENDOANGEL, a deep convolutional neural network, for diagnosing gastroesophageal varices (EVs and GVs) and predicting rupture risk. Trained on 14,718 endoscopic images, ENDOANGEL outperformed endoscopists in detecting EVs and GVs, identifying red color signs, and recognizing risk factors for bleeding with high accuracy [30].

These studies collectively demonstrate the potential of AI to enhance the accuracy and efficiency of variceal management by leveraging non-invasive methods that minimize patient burden, optimize clinical workflows, and refine risk-based surveillance strategies.

2.4. Ascites:

ML models can help guide management of ascites. For example, Wurstle et al. developed a model incorporating 34 readily available clinical data points to identify patients with infected ascites. This model, demonstrated high negative predictive values for infected ascites across various pre-test probabilities, potentially providing a reliable non-invasive alternative to paracentesis [31]. Another study by Hatami et al., utilized ML models like k-nearest neighbors (KNN), support vector machines, and neural networks to predict ascites grades based on routine clinical data. The KNN model demonstrated the highest accuracy at 94%, underscoring ML’s effectiveness in non-invasively predicting ascites severity [32]. These ML-based approaches can reduce the need for invasive procedures and thus minimize unnecessary risks and costs, and allow more personalized care for cirrhotic patients with ascites.

2.5. Hepatic Encephalopathy

ML models can also predict outcomes in patients with cirrhosis and hepatic encephalopathy (HE). In a retrospective study utilizing the MIMIC-IV database, Yang et al. developed and validated ML models to predict early mortality in HE patients. This study found that the artificial neural network NNET model provided the highest predictive accuracy, achieving an AUC of 0.837. The NNET model also demonstrated superior calibration and discrimination compared to traditional clinical scores such as the MELD and MELD-Na [33]. These findings suggest that NNET can identify patients with HE who are at high risk for mortality and who may benefit from more aggressive treatment and possibly early referral for a liver transplant; however, external prospective validation is needed to confirm the model’s utility across diverse patient populations. In another study involving patients with cirrhosis complicated by HE, Zhang et al. employed weighted ML algorithms to accurately predict mortality risk. The weighted random forest model achieved high sensitivity and specificity, with an AUC of 0.82, and effectively distinguished between high-risk and uncomplicated HE cases further underscoring the potential of ML to support clinical decision-making in complex liver disease cases [34].

3. Applications of AI in the Diagnosis and Management of HCC

HCC represents a significant global health challenge due to its high mortality rate and complex management requirements. HCC is the most common form of primary liver cancer and the third leading cause of cancer-related death worldwide [35]. The high mortality rate associated with HCC is largely due to late-stage diagnosis and the limited effectiveness of conventional treatments, with 5-year survival of only 15% for unresectable disease [36]. Traditional predictive and risk stratification models for HCC, such as PAGE-B, CAMD, REAL-B, HCC-RESCUE, AASL-HCC, and modified REACH-B scores, rely on demographic data, laboratory findings, and elastography to generate risk profiles ranging from low to high using multivariate regression modeling [37,38,39]. Current guidelines for HCC are informed by its natural history, lesion detection sensitivity, and cost-effectiveness of screening intervals. However, advancements in big data processing and the refinement of modern algorithms offer exciting possibilities for improving risk stratification, diagnostic accuracy, and tailored therapeutic approaches. In the following sections we will outline efforts to implement ML methods to identify risks of developing HCC among patients with CLD; diagnose HCC with imaging, pathology and genomic markers; and predict treatment response.

3.1. HCC Risk Stratification

ML models have shown effectiveness in predicting the risk of HCC in patients with cirrhosis. Singal et al. developed both a random forest ML model and a regression model using a cohort of 442 patients with cirrhosis who were enrolled in an HCC surveillance program and found that the ML model outperformed the regression model in identifying patients at high risk for developing HCC [40]. Similarly, Kim et al. used gradient machine boosting algorithms in a cohort of 6051 Korean patients with HBV to create an ML model that was compared to other predictive models in Korean and Caucasian validation cohorts. The ML model was found to be superior in predicting HCC risk in both validation cohorts outperforming various scoring indices including PAGE-B, modified PAGE-B, REACH-B and CU-HCC [41]. In a large retrospective cohort of 4242 patients with HBV and HCV, Sato et al. evaluated multiple ML methods for the prediction of HCC. The authors found that gradient boosting produced the highest accuracy (87.34%; AUC = 0.940) followed by random forest (86.08%; AUC = 0.923) for the prediction of HCC compared to logistic regression (79.74%; AUC = 0.866). Traditional decision tree-based models such as random forest outperformed neural networks in this study [42]. In a similar study from Hong Kong, Wong et al. compared multiple ML models for predicting HCC risk among 124,006 patients with HBV and HCV. Ridge regression (AUROC) 0.844, sensitivity = 0.90, specificity = 0.90 with dual cutoffs) and random forest (AUROC = 0.837, sensitivity = 0.90, specificity = 0.910 with dual cut-offs) produced the highest model accuracy in the validation cohort [43]. More recently, Lin et al. used a sample of 5155 Korean patients with multiple etiologies of CLD to develop an ML- based model named SMART-HCC incorporating elastography measurements and validated this model in Korean and Caucasian cohorts. SMART-HCC outperformed multiple other traditional predictive models in both cohorts. The authors also successfully used this model to stratify patients into low, intermediate, and high-risk groups with significantly higher incidence of HCC in the high-risk group (p<0.001) [44]. Sarkar el. compared multiple ML models to predict HCC risk in a cohort of 2247 patients with MASLD and demonstrated that tree algorithms such as gradient boost, decision tree, and random forest produced the highest accuracy in predicting HCC compared to a probabilistic neural network [45]. Audureau et al. used several ML models to determine HCC risk in a cohort of 836 patients with chronic HCV based on SVR status. The random survival forest model performed best, suggesting that ML can enhance HCC risk assessment and support more cost-effective, personalized surveillance programs [46]. Phan et al. evaluated various DL models in a large cohort of patients with viral hepatitis and found that the CNN model outperformed other models in predicting HCC among patients with chronic hepatitis with an accuracy of 0.980 (AUC = 0.886) [47]. Nam et al. also developed a CNN model that outperformed six older generation models in predicting HCC risk in a sample of 424 patients with cirrhosis due to HBV [48].

3.2. HCC Diagnosis

CNNs are highly effective for analyzing radiology and pathology images, as they can create feature layers and pool layers to emphasize predictive characteristics. Several studies have investigated CNN applications for diagnosing HCC.

Chaiteeraakij et al. conducted a retrospective analysis of ultrasound data from 5,444 patients using a CNN, achieving an HCC detection rate of 82.3% (95% CI: 77.1–87.5) [49]. Mokrane et al. analyzed data from 178 patients with cirrhosis across 27 institutions from 2010 to 2015, using 1,160 imaging features extracted from multiphase CT to classify indeterminate liver nodules. They identified a radiological signature reflecting changes between arterial and venous phases, yielding an AUC of 0.66 to 0.70 (sensitivity = 0.70, specificity = 0.54) for HCC prediction [50]. In another study, Wang et al. developed two CNN models using CT data from 7,512 patients, achieving an AUROC of 0.883 (95% CI: 0.855–0.911) on the validation set [51]. Zhang et al. applied a three-dimensional CNN to MRI data from 267 patients to predict microvascular invasion, with a combined model yielding an AUC of 0.81, sensitivity of 69%, and specificity of 79% in the training set [52]. CNNs have also been applied in limited studies to assess tumor progression on CT following locoregional therapy [53]. Kim et al. used CNN on contrast-enhanced MRI from 549 patients, achieving 87% sensitivity and 93% specificity for HCC detection [54]. Additionally, a meta-analysis of 44 studies by Salehi et al. found that AI algorithms across various imaging modalities had a pooled sensitivity of 85% (95% CI: 78–89) and specificity of 84% (95% CI: 72–91) [55]. Chen et al. used CNNs to analyze pathology slides from 377 patients in the Genomics Data Commons Database, training a neural network (Inception V3) to classify whole slide images (WSI) cropped to smaller tiles at 20x magnification. The model performed exceptionally well in distinguishing tumor from normal tissue (AUC = 0.961) and showed near-perfect agreement with an experienced pathologist. Additionally, the model achieved good accuracy in differentiating tumor grades [56]. In another study, Wang et al. trained a neural network using WSI from 304 tumors, also developing the model to perform single-cell segmentation based on 246 image features, which characterized individual nuclei and their relation to infiltrating lymphocytes. This enabled the identification of three distinct histological subtypes and, by integrating survival data, the prediction of prognosis for each subtype. This study emphasizes the potential of DL not only in diagnosing HCC but also in distinguishing tumor subtypes based on genomic and tumor microenvironment characteristics, supporting more personalized treatment approaches [57]. Neural networks have also been successfully applied to differentiate HCC from other liver tumors as demonstrated by Jang et al. who evaluated three neural networks on 20x tile images, achieving an AUC of 0.995 in distinguishing HCC from cholangiocarcinoma and metastatic colon cancer [58].

3.3. Predicting HCC Treatment Response

Numerous ML models have been created to forecast patient survival, tumor recurrence, microvascular invasion, and treatment response in HCC. A notable approach is radiomics, which analyzes regions of interest from imaging data retrospectively collected from HCC patient cohorts [59]. Many of these models have undergone validation and demonstrated superior performance compared to traditional prognostic tools [60]. For instance, Sheen et al. developed a radiomics normogram using CT-derived Rad score, T stage, alpha-fetoprotein (AFP) level and bi-lobar distribution that accurately identified tumors refractory to trans-arterial chemoembolization (TACE) with AUC ranging from 0.91 to 0.95 [61]. Ji et al. developed validated predictive models that incorporated ML to analyze contrast-enhanced CT images, along with variables such as the ALBI grade, AFP level, cirrhosis, and tumor margin characteristics. Their ML models demonstrated effective prediction of HCC recurrence, achieving concordance index (C-index) values ranging from 0.633 to 0.699 across various datasets and outperforming traditional models and staging systems [62]. Additionally, Qiao et al. designed a prognostic ANN model to predict early HCC outcomes after partial hepatectomy in a cohort of 543 patients. This model demonstrated superior predictive accuracy, achieving an AUC of 0.855 in the training cohort and was better than established staging systems such as TNM, BCLC, and IHPBA [63].

DL models can identify conventional and novel histopathological patterns associated with HCC recurrence after surgical resection or transplantation. By integrating these patterns, DL models generate reliable predictions that improve prognostic accuracy and aid clinical decision-making [64,65,66,67]. For example, Saillard et al. utilized two DL algorithms based on digitalized WSI to predict patient survival post-surgery. The first model, SCHMOWDER, incorporated an attention mechanism focused on tumoral areas annotated by a pathologist, while the second model, CHOWDER, operated independently of human expertise. In a discovery set of 194 patients, SCHMOWDER achieved a C-index of 0.78, and CHOWDER achieved a C-index of 0.75, both outperforming traditional survival prediction methods. Validation in an independent dataset from The Cancer Genome Atlas (TCGA) confirmed the superior discriminatory power of these models [68]. Chaudhary et al. investigated DL based multi-omics approaches to predict survival outcomes in a cohort of 360 patients with HCC. The model incorporated data encompassing RNA sequencing, microRNA sequencing, and methylation profiles sourced from TCGA and demonstrated effective prognostic capabilities, achieving a C-index of 0.68 and identifying two distinct patient subgroups with significant survival disparities (p = 7.13e-6). Validation across five external datasets confirmed the model’s strong predictive performance, with C-indices ranging from 0.67 to 0.82 [69].

AI also has the potential to tailor treatment plans based on individual patient characteristics [70]. ML can predict waitlist dropout rates among liver transplant candidates with HCC [71], tumor recurrence in post-liver transplant patients [72], as well as graft survival [73]. Such predictive capabilities can allow for the appropriate prioritization of patients with HCC on the transplant waitlist. Furthermore, AI algorithms can analyze genetic and molecular data to identify potential therapeutic targets and predict responses to specific treatments. For example, AI-driven analyses of genomic data can identify mutations associated with HCC, thereby guiding targeted therapies [74]. Large Language Models can also process extensive literature, electronic health records, and clinical guidelines, to provide evidence-based treatment recommendations tailored to individual patient profiles. In a recent study, Ge et al. utilized the Versa generative pretrained transformer (GPT-4) application programming interface to extract data from unstructured text in imaging reports related to HCC. This study compared the performance of Versa GPT-4 versus manual chart reviews for extracting eight specific data elements, including maximum LI-RADS score, size of HCC lesions, and the presence of macrovascular invasion or metastases from 1,101 imaging reports of 753 patients with HCC. The results demonstrated that Versa GPT-4 achieved an overall accuracy of 0.934, with specific accuracies ranging from 0.886 to 0.989, depending on the element being assessed [75]. While the model performed well in classification tasks, it encountered challenges with more complex data extraction.

4. Applications of AI in Autoimmune Liver Disease

AI has also been applied in the diagnosis and management of various liver disorders including AIH, PBC and PSC.

Ercan et al. created a DL tool AI(H) using the Aiforia platform that was able to analyze liver biopsies and accurately identify features of AIH [76]. Guadalupi et al. analyzed salivary samples from patients with AIH, PBC, and healthy controls using a gel-based proteomic approach and machine learning to distinguish AIH from PBC. Mass spectroscopy identified protein variations, with random forest and multidimensional scaling pinpointing a distinct set of proteins. These proteins were mostly linked to immune function and liver fibrosis. This AI-assisted approach offers a promising, non-invasive method to understand the pathophysiology of AIH and PBC [77]. Singh et al. extracted MRI data from 169 patients with large-duct PSC and trained a decision tree model to identify patients that were at high risk for decompensation within one year of the initial MRI [78]. Additionally, two studies by Cristoferi et al. and Vuppalanchi et al., used MRCP+ software to produce three-dimensional models of the biliary tree to calculate biliary tree metrics and predict disease progression of PSC [79,80]. Other studies have employed ML techniques to identify biomarkers associated with more severe disease and progression to cirrhosis amongst patients with PSC. Snir et al. used ML to analyze blood samples from 45 patients with PSC and 30 controls, identifying 2,870 proteins linked to PSC severity and progression to cirrhosis. An elastic net regression model highlighted 16 proteins strongly correlated with the enhanced liver fibrosis (ELF) score, which mapped to 13 enriched cell pathways, including those involving CCL24, a protein associated with PSC pathogenesis and cirrhosis risk [81]. Mousa et al. profiled plasma bile acids in 508 PSC patients and 302 controls using liquid chromatography-tandem mass spectrometry and used gradient boosting to create a multivariate model that identified high taurine-conjugated bile acids as predictive of hepatic decompensation. The resulting PSC-BAP score, derived from six bile acid variables, shows promise for predicting decompensation risk and assessing anti-cholestatic drug efficacy. These findings highlight potential biomarkers and therapeutic targets for PSC [82].

5. Challenges and Future Directions:

Despite notable advancements, the integration of AI into the routine clinical management of chronic liver disease presents substantial challenges, necessitating strategic planning and careful consideration. Several critical issues must be addressed to enable the effective application of AI in hepatology, including the following:

5.1. Data Limitations and Overfitting Risks

One of the primary challenges in developing AI models for hepatology is the risk of overfitting [83]. Overfitting occurs when a model learns specific patterns from the training data without capturing underlying, generalizable features, leading to strong performance during model development but poor external validity in varied clinical contexts. Deep learning models trained on small datasets are particularly susceptible to capturing “noise” or spurious correlations rather than meaningful clinical signals, thereby reducing their predictive reliability [84]. Effective mitigation strategies include employing cross-validation, conducting rigorous external validation on independent datasets, and utilizing interpretable models when data size is limited [85].

5.2. Biases and Generalizability Issues

AI models in hepatology often encounter biases resulting from imbalanced or non-representative training data [86]. For instance, many studies rely on datasets derived from specific geographic regions, ethnic groups, or healthcare environments, which may not reflect the global patient population affected by liver diseases. This issue is particularly evident in models developed using predominantly Western or Asian cohorts, which may struggle to generalize to patient populations with distinct disease etiologies, such as MASLD or ALD in Western countries versus viral hepatitis in South Asia [87]. Addressing these challenges requires incorporating diverse patient demographics and leveraging bias-aware machine-learning techniques to detect and correct skewed data distributions [88].

5.3. Challenges in Model Interpretability and Adoption

A significant obstacle to the clinical adoption of AI is the “black box” nature of complex models, particularly deep learning algorithms [89]. These models often lack transparency, making it difficult for clinicians to trust their diagnostic or prognostic outputs. Explainable AI (XAI) methodologies, such as Shapley Additive Explanations (SHAP) [90] and Local Interpretable Model-Agnostic Explanations (LIME⁾ [91]^, are increasingly being employed to demystify the decision-making processes of AI models. Moving forward, developing models that balance high accuracy with clear, interpretable insights aligned with clinical reasoning frameworks will be essential for facilitating clinician acceptance and integration into practice.

5.4. Regulatory and Ethical Considerations

The evolving regulatory landscape of healthcare poses challenges for the approval and deployment of AI-based tools in hepatology. While regulatory agencies such as the FDA and EMA have issued guidelines for AI-driven medical devices, the lack of standardized protocols for evaluating model safety, efficacy, and equity remains a significant hurdle. Ethical concerns, including data privacy, patient consent, and equitable access, must also be addressed. Transparent and comprehensive policy frameworks, coupled with standardized evaluation metrics and robust validation processes, are necessary to ensure AI applications meet clinical and ethical standards before widespread adoption.

5.5. Data Integration and Interoperability Issues

Effective AI integration in hepatology requires the analysis of diverse data types, including imaging, genomics, clinical notes, and laboratory results [92]. However, inconsistencies in electronic health record (EHR) systems across institutions exacerbate this challenge, hindering the creation of comprehensive datasets necessary for robust AI model training [93]. Advances in multimodal learning frameworks, which enable the simultaneous analysis of multiple data sources, hold promise for addressing these challenges but require further refinement to improve data integration and overall model performance.

5.6. Need for Prospective, Multi-Center Validation

Most AI models in hepatology are developed and validated on retrospective datasets, which limits their applicability to real-world clinical settings. Prospective, multi-center studies are crucial to evaluate model performance across diverse patient populations and healthcare environments, enhancing their generalizability and reliability [94].

5.7. Continuous Learning and Adaptive AI Systems

To maintain clinical relevance, AI models must adapt to new data and evolving clinical practices. Continuous learning frameworks allow models to update as new data becomes available, thereby mitigating the risk of performance degradation over time [95]. However, implementing such systems requires robust monitoring mechanisms to detect shifts in data distribution—referred to as model drift—and ensure sustained accuracy and reliability in predictions.

5.8. Economic and Resource Considerations

The development, implementation, and maintenance of AI systems in healthcare entails substantial costs. Beyond initial investments in technology and infrastructure, ongoing expenses related to data storage, model updates, clinician training, and system integration must be considered. Conducting cost-effectiveness analyses will be vital to determine whether AI tools deliver sufficient clinical benefits to justify their financial impact, particularly in resource-limited settings [96].

5.9. Patient and Public Engagement

Engaging patients and the public in the development and deployment of AI tools can enhance transparency and trust in these technologies. Education initiatives that inform patients about AI’s capabilities and limitations, along with feedback mechanisms for data usage, can foster more ethical and patient-centered applications [97].

6. Conclusions

Addressing these challenges will require a multidisciplinary approach involving collaboration between clinicians, AI researchers, regulatory bodies, and patient advocacy groups. By adopting transparent, inclusive, and evidence-based strategies, the field of AI in hepatology can advance toward the development of effective, equitable, and trustworthy clinical applications.

Author Contributions

Conceptualization, ND.; investigation, S.M., R.D., T.T., K.T., N.D.; resources, S.M., R.D., T.T., K.T., N.D; writing—original draft preparation, S.M., R.D., T.T., K.T., N.D.; writing—review and editing, S.M., R.D., T.T., K.T., N.D. supervision, N.D. All authors have read and agreed to the published version of the manuscript. All authors contributed substantially to the work reported.

Funding

This research received no external funding

Acknowledgments

The authors would like to acknowledge Mr. John Nemeth, Clinical Informationist, EUHM Library for assisting with literature search.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Esteva A, Robicquet A, et al. A guide to deep learning in healthcare. Nat Med. 2019 Jan;25(1):24-29. Epub 2019 Jan 7. [CrossRef] [PubMed]
Chang D, Truong E, et al. Machine learning models are superior to noninvasive tests in identifying clinically significant stages of NAFLD and NAFLD-related cirrhosis. Hepatology. 2023 Feb 1;77(2):546-557. Epub 2022 Aug 9. [CrossRef] [PubMed]
Fan R, Yu N, et al. Machine-learning model comprising five clinical indices and liver stiffness measurement can accurately identify MASLD-related liver fibrosis. Liver Int. 2024 Mar;44(3):749-759. Epub 2023 Dec 22. [CrossRef] [PubMed]
Anushiravani A, Alswat K, et al. Multicenter validation of FIB-6 as a novel machine learning non-invasive score to rule out liver cirrhosis in biopsy-proven MAFLD. Eur J Gastroenterol Hepatol. 2023 Nov 1;35(11):1284-1288. Epub 2023 Sep 5. [CrossRef] [PubMed]
Wang Z, Zhang A, et al. Clinical prediction of HBV-associated cirrhosis using machine learning based on platelet and bile acids. Clin Chim Acta. 2023 Nov 1;551:117589. Epub 2023 Oct 10. [CrossRef] [PubMed]
Orhanbulucu F, Acer I et al. Predicting liver disease using decision tree ensemble methods. J Inst Sci Technol. 2022, 38, 261–267. [Google Scholar]
Kuzhippallil MA, Joseph C, et al. Comparative analysis of machine learning techniques for Indian liver disease patients. In: 6th International Conference on Advanced Computing and Communication Systems (ICACCS); 2020: Coimbatore, India.
Pu X, Deng D, et al. High-dimensional hepatopath data analysis by machine learning for predicting HBV-related fibrosis. Sci Rep. 2021 Mar 3;11(1):5081. Erratum in: Sci Rep. 2021 Aug 12;11(1):16730. [CrossRef] [PubMed]
Chen K, Wan Y, et al. Liver cirrhosis prediction for patients with Wilson disease based on machine learning: a case-control study from southwest China. Eur J Gastroenterol Hepatol. 2022 Oct 1;34(10):1067-1073. Epub 2022 Jul 25. [CrossRef] [PubMed]
Mazumder NR, Enchakalody B, et al. Using Artificial Intelligence to Predict Cirrhosis From Computed Tomography Scans. Clin Transl Gastroenterol. 2023;14(10):e00616. Published 2023 Oct 1. [CrossRef]
Nowak S, Mesropyan N, et al. Detection of liver cirrhosis in standard T2-weighted MRI using deep transfer learning. Eur Radiol. 2021;31(11):8807-8815. [CrossRef]
Luetkens JA, Nowak S, et al. Deep learning supports the differentiation of alcoholic and other-than-alcoholic cirrhosis based on MRI. Sci Rep. 2022;12(1):8297. Published 2022 May 18. [CrossRef]
Wei H, Shao Z, et al. Value of multimodal MRI radiomics and machine learning in predicting staging liver fibrosis and grading inflammatory activity. Br J Radiol. 2023;96(1141):20220512. [CrossRef]
Destrempes F, Gesnik M, et al. Quantitative ultrasound, elastography, and machine learning for assessment of steatosis, inflammation, and fibrosis in chronic liver disease. PLoS One. 2022;17(1):e0262291. [CrossRef]
Meng F, Wu Q, et al. Application of Interpretable Machine Learning Models Based on Ultrasonic Radiomics for Predicting the Risk of Fibrosis Progression in Diabetic Patients with Nonalcoholic Fatty Liver Disease. Diabetes Metab Syndr Obes. 2023;16:3901-3913. Published 2023 Dec 2. [CrossRef]
Ahn JC, Attia ZI, et al. Development of the AI-Cirrhosis-ECG Score: An Electrocardiogram-Based Deep Learning Model in Cirrhosis. Am J Gastroenterol. 2022;117(3):424-432. [CrossRef]
Xiao W, Huang X, et al. Screening and identifying hepatobiliary diseases through deep learning using ocular images: a prospective, multicentre study. Lancet Digit Health. 2021;3(2):e88-e97. [CrossRef]
Dou J, Dawuti W, Zheng X, et al. Urine fluorescence spectroscopy combined with machine learning for screening of hepatocellular carcinoma and liver cirrhosis. Photodiagnosis Photodyn Ther. 2022;40:103102. [CrossRef]
Bosch J, Chung C, et al. A Machine Learning Approach to Liver Histological Evaluation Predicts Clinically Significant Portal Hypertension in NASH Cirrhosis. Hepatology. 2021;74(6):3146-3160. [CrossRef]
Yu Q, Huang Y, et al. An imaging-based artificial intelligence model for non-invasive grading of hepatic venous pressure gradient in cirrhotic portal hypertension. Cell Rep Med. 2022;3(3):100563. Published 2022 Mar 15. [CrossRef]
Noureddin M, Goodman Z, et al. Machine learning liver histology scores correlate with portal hypertension assessments in nonalcoholic steatohepatitis cirrhosis. Aliment Pharmacol Ther. 2023;57(4):409-417. [CrossRef]
Reiniš J, Petrenko O, et al. Assessment of portal hypertension severity using machine learning models in patients with compensated cirrhosis. J Hepatol. 2023;78(2):390-400. [CrossRef]
Liu L, Nie Y, et al. A Practical Model for Predicting Esophageal Variceal Rebleeding in Patients with Hepatitis B-Associated Cirrhosis. Int J Clin Pract. 2023 Aug 3;2023:9701841. [CrossRef] [PubMed]
Agarwal S, Sharma S, et al. Development of a machine learning model to predict bleed in esophageal varices in compensated advanced chronic liver disease: A proof of concept. J Gastroenterol Hepatol. 2021 Oct;36(10):2935-2942. Epub 2021 Jun 13. [CrossRef] [PubMed]
Bayani A, Hosseini A, et al. Identifying predictors of varices grading in patients with cirrhosis using ensemble learning. Clin Chem Lab Med. 2022 Jul 19;60(12):1938-1945. [CrossRef]
Gao Y, Yu Q, et al. An imaging-based machine learning model outperforms clinical risk scores for prognosis of cirrhotic variceal bleeding. Eur Radiol. 2023 Dec;33(12):8965-8973. Epub 2023 Jul 15. [CrossRef] [PubMed]
Jin J, Dong B, et al. A Noninvasive Technology Using Artificial Intelligence to Measure the Diameter of Esophageal Varices Under Endoscopy. Surg Laparosc Endosc Percutan Tech. 2023 Jun 1;33(3):282-285. [CrossRef] [PubMed]
Yan, Y, Li, Y, et al. A novel machine learning-based radiomic model for diagnosing high bleeding risk esophageal varices in cirrhotic patients. Hepatol Int 16, 423–432 (2022). [CrossRef]
Hou, Y, Yu, H, et al. Machine learning-based model for predicting the esophagogastric variceal bleeding risk in liver cirrhosis patients. Diagn Pathol 18, 29 (2023). [CrossRef]
Chen M, Wang J, et al. Automated and real-time validation of gastroesophageal varices under esophagogastroduodenoscopy using a deep convolutional neural network: a multicenter retrospective study (with video). Gastrointest. Endosc. 2021; 93: 422–432.e3. Epub 2020 Jun 26. [CrossRef]
Würstle S, Hapfelmeier A, et al. A Novel Machine Learning-Based Point-Score Model as a Non-Invasive Decision-Making Tool for Identifying Infected Ascites in Patients with Hydropic Decompensated Liver Cirrhosis: A Retrospective Multicentre Study. Antibiotics. 2022; 11(11):1610. [CrossRef]
Hatami B, Asadi F, et al. “Machine learning-based system for prediction of ascites grades in patients with liver cirrhosis using laboratory and clinical data: design and implementation study” Clinical Chemistry and Laboratory Medicine (CCLM), vol. 60, no. 12, 2022, pp. 1946-1954. [CrossRef]
Yang H, Li X, et al. Using machine learning methods to predict hepatic encephalopathy in cirrhotic patients with unbalanced data. Comput Methods Programs Biomed. 2021;211:106420. [CrossRef]
Zhang Z, Wang J, et al. Using machine learning methods to predict 28-day mortality in patients with hepatic encephalopathy. BMC Gastroenterol. 2023;23(1):111. Published 2023 Apr 6. [CrossRef]
Baecker A, Liu, X, et al. (2018). Worldwide incidence of hepatocellular carcinoma cases attributable to major risk factors. European Journal of Cancer Prevention, 27(3). [CrossRef]
Finn RS, Qin, S, et al. (2020). Atezolizumab plus Bevacizumab in Unresectable Hepatocellular Carcinoma. New England Journal of Medicine, 382(20). [CrossRef]
Kim BK, Ahn, SH (2023). Prediction model of hepatitis B virus-related hepatocellular carcinoma in patients receiving antiviral therapy. In Journal of the Formosan Medical Association (Vol. 122, Issue 12). [CrossRef]
Yu JH, Cho, S, et al. (2022). The best predictive model for hepatocellular carcinoma in patients with chronic hepatitis B infection. In Clinical and Molecular Hepatology (Vol. 28, Issue 3). [CrossRef]
Lin CL, Wu, S, et al. (2023). Predicting Hepatocellular Carcinoma Risk in Chronic Hepatitis B Patients Receiving Finite Periods of Antiviral Therapy. Cancers, 15(13). [CrossRef]
Singal AG, Mukherjee A, et al. (2013). Machine learning algorithms outperform conventional regression models in predicting development of hepatocellular carcinoma. American Journal of Gastroenterology, 108(11). [CrossRef]
Kim HY, Lampertico P, et al. (2022). An artificial intelligence model to predict hepatocellular carcinoma risk in Korean and Caucasian patients with chronic hepatitis B. Journal of Hepatology, 76(2). [CrossRef]
Sato M, Morimoto K, et al. (2019). Machine-learning Approach for the Development of a Novel Predictive Model for the Diagnosis of Hepatocellular Carcinoma. Scientific Reports, 9(1). [CrossRef]
Wong GL, Hui V, et al. (2022). Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis. JHEP Reports, 4(3). [CrossRef]
Lin H, Li G, et al. (2024). A Liver Stiffness–Based Etiology-Independent Machine Learning Algorithm to Predict Hepatocellular Carcinoma. Clinical Gastroenterology and Hepatology, 22(3). [CrossRef]
Sarkar S, Alurwar A, et al. (2024). A Machine Learning Model to Predict Risk for Hepatocellular Carcinoma in Patients With Metabolic Dysfunction-Associated Steatotic Liver Disease. Gastro Hep Advances, 3(4). [CrossRef]
Audureau E, Carrat F, et al. (2020). Personalized surveillance for hepatocellular carcinoma in cirrhosis – using machine learning adapted to HCV status. Journal of Hepatology, 73(6). [CrossRef]
Phan D, Chan CL, et al. (2020). Liver cancer prediction in a viral hepatitis cohort: A deep learning approach. International Journal of Cancer, 147(10). [CrossRef]
Nam JY, Sinn DH, et al. (2020). Deep learning model for prediction of hepatocellular carcinoma in patients with HBV-related cirrhosis on antiviral therapy. JHEP Reports, 2(6). [CrossRef]
Chaiteerakij R, Ariyaskul D, et al. Artificial intelligence for ultrasonographic detection and diagnosis of hepatocellular carcinoma and cholangiocarcinoma. Sci Rep. 2024, 14, 20617. [Google Scholar] [CrossRef] [PubMed]
Mokrane FZ, Lu L, et al. Radiomics machine-learning signature for diagnosis of hepatocellular carcinoma in cirrhotic patients with indeterminate liver nodules. Eur Radiol. 2020 Jan;30(1):558-570. [CrossRef]
Wang H, Jiang Y, et al. (2020). Single-cell spatial analysis of tumor and immune microenvironment on whole-slide image reveals hepatocellular carcinoma subtypes. Cancers, 12(12). [CrossRef]
Zhang R, Xu L, et al. A nomogram based on bi-regional radiomics features from multimodal magnetic resonance imaging for preoperative prediction of microvascular invasion in hepatocellular carcinoma. Quant Imaging Med Surg. 2019, 9, 1503–1515. [Google Scholar] [CrossRef] [PubMed]
Lim S, Shin Y, et al. Arterial enhancing local tumor progression detection on CT images using convolutional neural network after hepatocellular carcinoma ablation: a preliminary study. Sci Rep. 2022 Feb 2;12(1):1754. [CrossRef]
Kim J, Min JH, et al. Detection of Hepatocellular Carcinoma in Contrast-Enhanced Magnetic Resonance Imaging Using Deep Learning Classifier: A Multi-Center Retrospective Study. Sci Rep. 2020 Jun 11;10(1):9458. [CrossRef]
Salehi MA, Harandi H, et al. Diagnostic Performance of Artificial Intelligence in Detection of Hepatocellular Carcinoma: A Meta-analysis. J Imaging Inform Med. 2024 Aug;37(4):1297-1311. [CrossRef]
Chen M, Zhang B, et al. (2020). Classification and mutation prediction based on histopathology H&E images in liver cancer using deep learning. Npj Precision Oncology, 4(1). [CrossRef]
Wang H, Jiang Y, et al. (2020). Single-cell spatial analysis of tumor and immune microenvironment on whole-slide image reveals hepatocellular carcinoma subtypes. Cancers, 12(12). [CrossRef]
Jang HJ, Go JH, et al. (2023). Deep Learning for the Pathologic Diagnosis of Hepatocellular Carcinoma, Cholangiocarcinoma, and Metastatic Colorectal Cancer. Cancers, 15(22). [CrossRef]
Lévi-Strauss T, Tortorici B, et al. Radiomics, a Promising New Discipline: Example of Hepatocellular Carcinoma. Diagnostics (Basel). 2023 Mar 30;13(7):1303. [CrossRef] [PubMed]
Lai Q, Spoletini G, et al. Prognostic role of artificial intelligence among patients with hepatocellular cancer: A systematic review. World J Gastroenterol. 2020 Nov 14;26(42):6679-6688. [CrossRef]
Sheen H, Kim JS, et al. A radiomics nomogram for predicting transcatheter arterial chemoembolization refractoriness of hepatocellular carcinoma without extrahepatic metastasis or macrovascular invasion. Abdom Radiol (NY). 2021, 46, 2839–2849. [Google Scholar] [CrossRef] [PubMed]
Ji GW, Zhu FP, et al. Machine-learning analysis of contrast-enhanced CT radiomics predicts recurrence of hepatocellular carcinoma after resection: A multi-institutional study. EBioMedicine. 2019 Dec;50:156-165. [CrossRef]
Qiao G, Li J, et al. Artificial neural networking model for the prediction of post-hepatectomy survival of patients with early hepatocellular carcinoma. J Gastroenterol Hepatol. 2014 Dec;29(12):2014-20. [CrossRef]
Song D, Wang Y, et al. Using deep learning to predict microvascular invasion in hepatocellular carcinoma based on dynamic contrast-enhanced MRI combined with clinical parameters. J Cancer Res Clin Oncol. 2021, 147, 3757–3767. [Google Scholar] [CrossRef] [PubMed]
Zhang Y, Lv X, et al. Deep Learning With 3D Convolutional Neural Network for Noninvasive Prediction of Microvascular Invasion in Hepatocellular Carcinoma. J Magn Reson Imaging. 2021 Jul;54(1):134-143. [CrossRef]
Jiang YQ, Cao SE, et al. Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning. J Cancer Res Clin Oncol. 2021 Mar;147(3):821-833. [CrossRef]
Calderaro J, Kather JN. Artificial intelligence-based pathology for gastrointestinal and hepatobiliary cancers. Gut. 2021 Jun;70(6):1183-1193. [CrossRef]
Saillard C, Schmauch B, et al. Predicting Survival After Hepatocellular Carcinoma Resection Using Deep Learning on Histological Slides. Hepatology. 2020 Dec;72(6):2000-2013. [CrossRef]
Chaudhary K, Poirion OB, et al. Deep Learning-Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer. Clin Cancer Res. 2018 Mar 15;24(6):1248-1259. [CrossRef]
Choi GH, Yun J, et al. Development of machine learning-based clinical decision support system for hepatocellular carcinoma. Sci Rep. 2020 Sep 9;10(1):14855. [CrossRef]
Kwong A, Hameed B, et al. Machine learning to predict waitlist dropout among liver transplant candidates with hepatocellular carcinoma. Cancer Med. 2022 Mar;11(6):1535-1541. [CrossRef]
Nam JY, Lee JH, et al. Novel Model to Predict HCC Recurrence after Liver Transplantation Obtained Using Deep Learning: A Multicenter Study. Cancers (Basel). 2020 Sep 29;12(10):2791. [CrossRef]
Briceño J, Cruz-Ramírez M, et al. Use of artificial intelligence as an innovative donor-recipient matching model for liver transplantation: results from a multicenter Spanish study. J Hepatol. 2014 Nov;61(5):1020-8. [CrossRef]
Elmas A, Lujambio A, et al. Proteomic Analyses Identify Therapeutic Targets in Hepatocellular Carcinoma. Front Oncol. 2022 Mar 30;12:814120. [CrossRef]
Ge J, Li M, et al. (2024). A Comparison of a Large Language Model vs Manual Chart Review for the Extraction of Data Elements From the Electronic Health Record. Gastroenterology, 166(4), 707-709. [CrossRef]
Ercan C, Kordy K, et al. A deep-learning-based model for assessment of autoimmune hepatitis from histology: AI(H). Virchows Arch. Published online June 15, 2024. [CrossRef]
Guadalupi G, Contini C, et al. Combined Salivary Proteome Profiling and Machine Learning Analysis Provides Insight into Molecular Signature for Autoimmune Liver Diseases Classification. Int J Mol Sci. 2023;24(15):12207. [CrossRef]
Singh Y, Jons WA, et al. Algebraic topology-based machine learning using MRI predicts outcomes in primary sclerosing cholangitis. Eur Radiol Exp. 2022;6(1):58. [CrossRef]
Cristoferi L, Porta M, et al. A quantitative MRCP-derived score for medium-term outcome prediction in primary sclerosing cholangitis. Dig Liver Dis. 2023;55(3):373-380. [CrossRef]
Vuppalanchi R, Are V, et al. A composite score using quantitative magnetic resonance cholangiopancreatography predicts clinical outcomes in primary sclerosing cholangitis. JHEP Rep. 2023;5(10):100834. [CrossRef]
Snir T, Greenman R, et al. Machine Learning Identifies Key Proteins in Primary Sclerosing Cholangitis Progression and Links High CCL24 to Cirrhosis. Int J Mol Sci. 2024;25(11):6042. Published 2024 May 30. [CrossRef]
Mousa OY, Juran BD, et al. Bile Acid Profiles in Primary Sclerosing Cholangitis and Their Ability to Predict Hepatic Decompensation. Hepatology. 2021;74(1):281-295. [CrossRef]
Charilaou P, Battat R. Machine learning models and over-fitting considerations. World J Gastroenterol. 2022;28(5):605-607. [CrossRef]
Cawley GC, Talbot NLC. On over-fitting in model selection and subsequent selection bias in performance evaluation. J Mach Learn Res. 2010, 11, 2079–2107. [Google Scholar]
Little MA, Varoquaux G, et al. Using and understanding cross-validation strategies. Perspectives on Saeb et al. Gigascience. 2017;6(5):1-6. [CrossRef]
Siddique S, Haque MA, et al. Survey on Machine Learning Biases and Mitigation Techniques. Digital. 2024; 4(1):1-68. [CrossRef]
Celi LA, Cellini J, et al. Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review. PLOS Digit Health. 2022;1(3):e0000022. Published 2022 Mar 31. [CrossRef]
Chen RJ, Wang JJ, et al. Algorithmic fairness in artificial intelligence for medicine and healthcare. Nat Biomed Eng. 2023;7(6):719-742. [CrossRef]
Marey A, Arjmand P, et al. Explainability, transparency and black box challenges of AI in radiology: impact on patient care in cardiovascular radiology. Egypt J Radiol Nucl Med 55, 183 (2024). [CrossRef]
Louhichi M, Nesmaoui R, et al. Shapley Values for Explaining the Black Box Nature of Machine Learning Model Clustering. Procedia Computer Science. 2023;220:806-811. [CrossRef]
Ribeiro MT, Singh S, et al. Why should I trust you?: Explaining the predictions of any classifier; Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; San Francisco, CA, USA. 13–17 August 2016; pp. 1135–1144.
Panayides AS, Amini A, et al. AI in Medical Imaging Informatics: Current Challenges and Future Directions. IEEE J Biomed Health Inform. 2020;24(7):1837-1857. [CrossRef]
Holmes JH, Beinlich J, et al. Why Is the Electronic Health Record So Challenging for Research and Clinical Care?. Methods Inf Med. 2021;60(1-02):32-48. [CrossRef]
Kalapala R, Rughwani H, et al. Artificial Intelligence in Hepatology- Ready for the Primetime. J Clin Exp Hepatol. 2023;13(1):149-161. [CrossRef]
Hurtado J, Salvati D, et al. Continual learning for predictive maintenance: Overview and challenges. Intelligent Systems with Applications. 2023;19:200251. [CrossRef]
Khanna NN, Maindarkar MA, et al. Economics of Artificial Intelligence in Healthcare: Diagnosis vs. Treatment. Healthcare (Basel). 2022;10(12):2493. Published 2022 Dec 9. [CrossRef]
Maleki Varnosfaderani S, Forouzanfar M. The Role of AI in Hospitals and Clinics: Transforming Healthcare in the 21st Century. Bioengineering (Basel). 2024;11(4):337. Published 2024 Mar 29. [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.