Crossed references.

D. M. Fiore,R. Weinstein,K. Boyer,E. Linn

DOI: https://doi.org/10.1097/00005721-198807000-00003

Abstract:

What problem does this paper attempt to address?

Machine learning improves the prediction of significant fibrosis in Asian patients with metabolic dysfunction‐associated steatotic liver disease – The Gut and Obesity in Asia (GO‐ASIA) Study

Nipun Verma,Ajay Duseja,Manu Mehta,Arka De,Huapeng Lin,Vincent Wai‐Sun Wong,Grace Lai‐Hung Wong,Ruveena Bhavani Rajaram,Wah‐Kheong Chan,Sanjiv Mahadeva,Ming‐Hua Zheng,Wen‐Yue Liu,Sombat Treeprasertsuk,Thaninee Prasoppokakorn,Satoru Kakizaki,Yosuke Seki,Kazunori Kasama,Phunchai Charatcharoenwitthaya,Phalath Sathirawich,Anand Kulkarni,Hery Djagat Purnomo,Lubna Kamani,Yeong Yeh Lee,Mung Seong Wong,Eunice X. X. Tan,Dan Yock Young

DOI: https://doi.org/10.1111/apt.17891

IF: 9.524

2024-02-03

Alimentary Pharmacology & Therapeutics

Abstract:As compared to current standard FIB‐4, machine learning outperforms detection of significant fibrosis in patients with biopsy proven metabolic dysfunction‐associated steatotic liver disease by 10 times higher odds of fibrosis detection, 28% reduction of unnecessary referrals and 78% prevention of missed referrals. Summary Background The precise estimation of cases with significant fibrosis (SF) is an unmet goal in non‐alcoholic fatty liver disease (NAFLD/MASLD). Aims We evaluated the performance of machine learning (ML) and non‐patented scores for ruling out SF among NAFLD/MASLD patients. Methods Twenty‐one ML models were trained (N = 1153), tested (N = 283), and validated (N = 220) on clinical and biochemical parameters of histologically‐proven NAFLD/MASLD patients (N = 1656) collected across 14 centres in 8 Asian countries. Their performance for detecting histological‐SF (≥F2fibrosis) were evaluated with APRI, FIB4, NFS, BARD, and SAFE (NPV/F1‐score as model‐selection criteria). Results Patients aged 47 years (median), 54.6% males, 73.7% with metabolic syndrome, and 32.9% with histological‐SF were included in the study. Patients with SFvs.no‐SF had higher age, aminotransferases, fasting plasma glucose, metabolic syndrome, uncontrolled diabetes, and NAFLD activity score (p 140) was next best in ruling out SF (NPV of 0.757, 0.724 and 0.827 in overall, test and validation set). Conclusions ML with clinical, anthropometric data and simple blood investigations perform better than FIB‐4 for ruling out SF in biopsy‐proven Asian NAFLD/MASLD patients.

pharmacology & pharmacy,gastroenterology & hepatology
Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China

Han Ma,Cheng-fu Xu,Zhe Shen,Chao-hui Yu,You-ming Li

DOI: https://doi.org/10.1155/2018/4304376

2018-01-01

BioMed Research International

Abstract:Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.
Machine Learning to Predict Progression of Non‐alcoholic Fatty Liver to Non‐alcoholic Steatohepatitis or Fibrosis

Sina Ghandian,Rahul Thapa,Anurag Garikipati,Gina Barnes,Abigail Green-Saxena,Jacob Calvert,Qingqing Mao,Ritankar Das

DOI: https://doi.org/10.1002/jgh3.12716

2022-01-01

JGH Open

Abstract:Abstract Background Non‐alcoholic fatty liver (NAFL) can progress to the severe subtype non‐alcoholic steatohepatitis (NASH) and/or fibrosis, which are associated with increased morbidity, mortality, and healthcare costs. Current machine learning studies detect NASH; however, this study is unique in predicting the progression of NAFL patients to NASH or fibrosis. Aim To utilize clinical information from NAFL‐diagnosed patients to predict the likelihood of progression to NASH or fibrosis. Methods Data were collected from electronic health records of patients receiving a first‐time NAFL diagnosis. A gradient boosted machine learning algorithm (XGBoost) as well as logistic regression (LR) and multi‐layer perceptron (MLP) models were developed. A five‐fold cross‐validation grid search was utilized for hyperparameter optimization of variables, including maximum tree depth, learning rate, and number of estimators. Predictions of patients likely to progress to NASH or fibrosis within 4 years of initial NAFL diagnosis were made using demographic features, vital signs, and laboratory measurements. Results The XGBoost algorithm achieved area under the receiver operating characteristic (AUROC) values of 0.79 for prediction of progression to NASH and 0.87 for fibrosis on both hold‐out and external validation test sets. The XGBoost algorithm outperformed the LR and MLP models for both NASH and fibrosis prediction on all metrics. Conclusion It is possible to accurately identify newly diagnosed NAFL patients at high risk of progression to NASH or fibrosis. Early identification of these patients may allow for increased clinical monitoring, more aggressive preventative measures to slow the progression of NAFL and fibrosis, and efficient clinical trial enrollment.
Prediction of Fatty Liver Disease in a Chinese Population Using Machine-Learning Algorithms

Shuwei Weng,Die Hu,Jin Chen,Yanyi Yang,Daoquan Peng

DOI: https://doi.org/10.3390/diagnostics13061168

IF: 3.6

2023-03-19

Diagnostics

Abstract:Background: Fatty liver disease (FLD) is an important risk factor for liver cancer and cardiovascular disease and can lead to significant social and economic burden. However, there is currently no nationwide epidemiological survey for FLD in China, making early FLD screening crucial for the Chinese population. Unfortunately, liver biopsy and abdominal ultrasound, the preferred methods for FLD diagnosis, are not practical for primary medical institutions. Therefore, the aim of this study was to develop machine learning (ML) models for screening individuals at high risk of FLD, and to provide a new perspective on early FLD diagnosis. Methods: This study included a total of 30,574 individuals between the ages of 18 and 70 who completed abdominal ultrasound and the related clinical examinations. Among them, 3474 individuals were diagnosed with FLD by abdominal ultrasound. We used 11 indicators to build eight classification models to predict FLD. The model prediction ability was evaluated by the area under the curve, sensitivity, specificity, positive predictive value, negative predictive value, and kappa value. Feature importance analysis was assessed by Shapley value or root mean square error loss after permutations. Results: Among the eight ML models, the prediction accuracy of the extreme gradient boosting (XGBoost) model was highest at 89.77%. By feature importance analysis, we found that the body mass index, triglyceride, and alanine aminotransferase play important roles in FLD prediction. Conclusion: XGBoost improves the efficiency and cost of large-scale FLD screening.

medicine, general & internal
Application of Interpretable Machine Learning Models Based on Ultrasonic Radiomics for Predicting the Risk of Fibrosis Progression in Diabetic Patients with Nonalcoholic Fatty Liver Disease

Fei Meng,Qin Wu,Wei Zhang,Shirong Hou

DOI: https://doi.org/10.2147/DMSO.S439127

2023-12-02

Abstract:Introduction: Patients with nonalcoholic fatty liver disease (NAFLD) and type 2 diabetes mellitus (T2DM) face a significant risk of hepatic fibrosis. Liver stiffness measurement (LSM) is commonly used to exclude advanced fibrosis, but its effectiveness in predicting fibrosis progression, especially in initially fibrosis-free patients, remains under-investigated. Although radiomics and machine learning (ML) models show promise in interpreting intricate data and predicting clinical outcomes, their application in assessing the fibrosis progression risk has not been fully explored. This study aimed to address this gap by developing and validating ML-based models to identify patients at risk of fibrosis progression using clinical data and multimodal radiomics features, thereby enhancing NAFLD and T2DM management. Methods: The study involved a retrospective analysis of 618 diabetic patients with NAFLD. These patients were divided into training and external validation cohorts. Based on LSM values, patients were classified into "Low-risk" and "Fibrosis-risk" groups. Radiomics features from multimodal ultrasound imaging were extracted, standardized, and utilized to develop various ML models. The models were internally validated based on these radiomics or clinical data, and the optimal model's feature importance was analyzed using the Shapley Additive Explanations (SHAP) approach, followed by external validation. Results: Of the 618 patients, 18.1% demonstrated an LSM≥6.5kPa, indicating a higher risk of hepatic fibrosis. The study identified 25 significant fibrosis-related radiomics features, with the support vector machine (SVM) model demonstrating superior performance in both internal and external validations. The SHAP analysis identified five key determinants of fibrosis risk, which included three radiomics features from shear wave elastography (SWE) and two from grayscale imaging. Conclusion: This study demonstrates the utility of an SVM model based on radiomics features derived from SWE and grayscale imaging for predicting fibrosis progression in diabetic patients with NAFLD, thereby enabling timely and effective therapeutic interventions.
Machine-Learning Algorithm for Predicting Fatty Liver Disease in a Taiwanese Population

Yang-Yuan Chen,Chun-Yu Lin,Hsu-Heng Yen,Pei-Yuan Su,Ya-Huei Zeng,Siou-Ping Huang,I-Ling Liu

DOI: https://doi.org/10.3390/jpm12071026

IF: 3.5083

2022-06-24

Journal of Personalized Medicine

Abstract:The rising incidence of fatty liver disease (FLD) poses a health challenge, and is expected to be the leading global cause of liver-related morbidity and mortality in the near future. Early case identification is crucial for disease intervention. A retrospective cross-sectional study was performed on 31,930 Taiwanese subjects (25,544 training and 6386 testing sets) who had received health check-ups and abdominal ultrasounds in Changhua Christian Hospital from January 2009 to January 2019. Clinical and laboratory factors were included for analysis by different machine-learning algorithms. In addition, the performance of the machine-learning algorithms was compared with that of the fatty liver index (FLI). Totally, 6658/25,544 (26.1%) and 1647/6386 (25.8%) subjects had moderate-to-severe liver disease in the training and testing sets, respectively. Five machine-learning models were examined and demonstrated exemplary performance in predicting FLD. Among these models, the xgBoost model revealed the highest area under the receiver operating characteristic (AUROC) (0.882), accuracy (0.833), F1 score (0.829), sensitivity (0.833), and specificity (0.683) compared with those of neural network, logistic regression, random forest, and support vector machine-learning models. The xgBoost, neural network, and logistic regression models had a significantly higher AUROC than that of FLI. Body mass index was the most important feature to predict FLD according to the feature ranking scores. The xgBoost model had the best overall prediction ability for diagnosing FLD in our study. Machine-learning algorithms provide considerable benefits for screening candidates with FLD.

medicine, general & internal,health care sciences & services
Machine learning-based mortality prediction models for non-alcoholic fatty liver disease in the general United States population

Jiarui Zheng,Zilong Wang,Bo Feng

DOI: https://doi.org/10.1101/2024.07.10.24310253

2024-07-16

Abstract:Background & Aims : Nowadays, the global prevalence of non-alcoholic fatty liver disease (NAFLD) has reached about 25%, which is the most common chronic liver disease worldwide, and the mortality risk of NAFLD patients is higher. Our research created five machine learning (ML) models for predicting overall mortality in ultrasound-proven NAFLD patients and compared their performance with conventional non-invasive scoring systems, aiming to find a generalizable and valuable model for early mortality prediction in NAFLD patients. Methods: National Health and Nutrition Examination Survey (NHANES)-III from 1988 to 1994 and NHANES-III related mortality data from 2019 were used. 70% of subjects were separated into the training set (N = 2262) for development, while 30% were in the testing set (N= 971) for validation. The outcome was all-cause death at the end of follow-up. Twenty-nine related variables were trained as predictor features for five ML–based models: Logistic regression (LR), K-nearest neighbors (KNN), Gradient-boosted decision tree (XGBoost), Random forest (RF) and Decision tree. Five typical evaluation indexes including area under the curve (AUC), F1 score, accuracy, sensitivity and specificity were used to measure the prediction performance. Results: 3233 patients with NAFLD in total were eligible for the inclusion criteria, with 1231 death during the average 25.3 years follow up time. AUC of the LR model in predicting the mortality of NAFLD was 0.888 (95% confidence interval [CI] 0.867-0.909), the accuracy was 0.808, the sensitivity was 0.819, the specificity was 0.802, and the F1 score was 0.765, which showed the best performance compared with other models (AUC were: RF, 0.876 [95%CI 0.852-0.897]; XGBoost, 0.875 [95%CI 0.853-0.898]; Decision tree, 0.793 [95%CI 0.766-0.819] and KNN, 0.787 [95%CI 0.759-0.816]) and conventional clinical scores (AUC were: Fibrosis-4 Score (FIB-4), 0.793 [95%CI 0.777-0.809]; NAFLD fibrosis score (NFS), 0.770 [95%CI 0.753-0.787] and aspartate aminotransferase-to-platelet ratio index (APRI), 0.522 [95%CI 0.502-0.543]). Conclusions: ML–based models, especially LR model, had better discrimination performance in predicting all-cause mortality in patients with NAFLD compared to the conventional non-invasive scores, and an interpretable model like Decision tree, which only used three predictors: age, systolic pressure and glycated hemoglobin, is simple to use in clinical practice.

Infectious Diseases (except HIV/AIDS)
Ultrasound‐Based Machine Learning Approach for Detection of Nonalcoholic Fatty Liver Disease

Aylin Tahmasebi,Shuo Wang,Corinne E. Wessner,Trang Vu,Ji‐Bin Liu,Flemming Forsberg,Jesse Civan,Flavius F. Guglielmo,John R. Eisenbrey

DOI: https://doi.org/10.1002/jum.16194

2023-02-22

Journal of Ultrasound in Medicine

Abstract:Objectives Current diagnosis of nonalcoholic fatty liver disease (NAFLD) relies on biopsy or MR‐based fat quantification. This prospective study explored the use of ultrasound with artificial intelligence for the detection of NAFLD. Methods One hundred and twenty subjects with clinical suspicion of NAFLD and 10 healthy volunteers consented to participate in this institutional review board‐approved study. Subjects were categorized as NAFLD and non‐NAFLD according to MR proton density fat fraction (PDFF) findings. Ultrasound images from 10 different locations in the right and left hepatic lobes were collected following a standard protocol. MRI‐based liver fat quantification was used as the reference standard with >6.4% indicative of NAFLD. A supervised machine learning model was developed for assessment of NAFLD. To validate model performance, a balanced testing dataset of 24 subjects was used. Sensitivity, specificity, positive predictive value, negative predictive value, and overall accuracy with 95% confidence interval were calculated. Results A total of 1119 images from 106 participants was used for model development. The internal evaluation achieved an average precision of 0.941, recall of 88.2%, and precision of 89.0%. In the testing set AutoML achieved a sensitivity of 72.2% (63.1%–80.1%), specificity of 94.6% (88.7%–98.0%), positive predictive value (PPV) of 93.1% (86.0%–96.7%), negative predictive value of 77.3% (71.6%–82.1%), and accuracy of 83.4% (77.9%–88.0%). The average agreement for an individual subject was 92%. Conclusions An ultrasound‐based machine learning model for identification of NAFLD showed high specificity and PPV in this prospective trial. This approach may in the future be used as an inexpensive and noninvasive screening tool for identifying NAFLD in high‐risk patients.

radiology, nuclear medicine & medical imaging,acoustics
Development and validation of machine learning models for nonalcoholic fatty liver disease

Hong-Ye Peng,Shao-Jie Duan,Liang Pan,Mi-Yuan Wang,Jia-Liang Chen,Yi-Chong Wang,Shu-Kun Yao

DOI: https://doi.org/10.1016/j.hbpd.2023.03.009

2023-03-27

Abstract:Background Nonalcoholic fatty liver disease (NAFLD) had become the most prevalent liver disease worldwide. Early diagnosis could effectively reduce NAFLD-related morbidity and mortality. This study aimed to combine the risk factors to develop and validate a novel model for predicting NAFLD. Methods We enrolled 578 participants completing abdominal ultrasound into the training set. The least absolute shrinkage and selection operator (LASSO) regression combined with random forest (RF) was conducted to screen significant predictors for NAFLD risk. Five machine learning models including logistic regression (LR), RF, extreme gradient boosting (XGBoost), gradient boosting machine (GBM), and support vector machine (SVM) were developed. To further improve model performance, we conducted hyperparameter tuning with train function in Python package 'sklearn'. We included 131 participants completing magnetic resonance imaging into the testing set for external validation. Results There were 329 participants with NAFLD and 249 without in the training set, while 96 with NAFLD and 35 without were in the testing set. Visceral adiposity index, abdominal circumference, body mass index, alanine aminotransferase (ALT), ALT/AST (aspartate aminotransferase), age, high-density lipoprotein cholesterol (HDL-C) and elevated triglyceride (TG) were important predictors for NAFLD risk. The area under curve (AUC) of LR, RF, XGBoost, GBM, SVM were 0.915 [95% confidence interval (CI): 0.886-0.937], 0.907 (95% CI: 0.856-0.938), 0.928 (95% CI: 0.873-0.944), 0.924 (95% CI: 0.875-0.939), and 0.900 (95% CI: 0.883-0.913), respectively. XGBoost model presented the best predictive performance, and its AUC was enhanced to 0.938 (95% CI: 0.870-0.950) with further parameter tuning. Conclusions This study developed and validated five novel machine learning models for NAFLD prediction, among which XGBoost presented the best performance and was considered a reliable reference for early identification of high-risk patients with NAFLD in clinical practice.

gastroenterology & hepatology
Comparison of Machine Learning Models and the Fatty Liver Index in Predicting Lean Fatty Liver

Pei-Yuan Su,Yang-Yuan Chen,Chun-Yu Lin,Wei-Wen Su,Siou-Ping Huang,Hsu-Heng Yen

DOI: https://doi.org/10.3390/diagnostics13081407

IF: 3.6

2023-04-14

Diagnostics

Abstract:The reported prevalence of non-alcoholic fatty liver disease in studies of lean individuals ranges from 7.6% to 19.3%. The aim of the study was to develop machine-learning models for the prediction of fatty liver disease in lean individuals. The present retrospective study included 12,191 lean subjects with a body mass index < 23 kg/m2 who had undergone a health checkup from January 2009 to January 2019. Participants were divided into a training (70%, 8533 subjects) and a testing group (30%, 3568 subjects). A total of 27 clinical features were analyzed, except for medical history and history of alcohol or tobacco consumption. Among the 12,191 lean individuals included in the present study, 741 (6.1%) had fatty liver. The machine learning model comprising a two-class neural network using 10 features had the highest area under the receiver operating characteristic curve (AUROC) value (0.885) among all other algorithms. When applied to the testing group, we found the two-class neural network exhibited a slightly higher AUROC value for predicting fatty liver (0.868, 0.841–0.894) compared to the fatty liver index (FLI; 0.852, 0.824–0.81). In conclusion, the two-class neural network had greater predictive value for fatty liver than the FLI in lean individuals.

medicine, general & internal
Ceftibuten-containing agar plate for detecting group B streptococci with reduced penicillin susceptibility (PRGBS).

Chitose Kamiya,K. Kimura,Yo Doyama,Akira Miyazaki,M. Morimoto,Hirotsugu Banno,N. Nagano,Wanchun Jin,J. Wachino,Keiko Yamada,Y. Arakawa

DOI: https://doi.org/10.1016/j.diagmicrobio.2015.04.010

IF: 2.983

2015-08-01

Diagnostic Microbiology and Infectious Disease

Abstract:
Machine‐learning model comprising five clinical indices and liver stiffness measurement can accurately identify MASLD‐related liver fibrosis

Rong Fan,Ning Yu,Guanlin Li,Tamoore Arshad,Wen‐Yue Liu,Grace Lai‐Hung Wong,Xieer Liang,Yongpeng Chen,Xiao‐Zhi Jin,Howard Ho‐Wai Leung,Jinjun Chen,Xiao‐Dong Wang,Terry Cheuk‐Fung Yip,Arun J. Sanyal,Jian Sun,Vincent Wai‐Sun Wong,Ming‐Hua Zheng,Jinlin Hou

DOI: https://doi.org/10.1111/liv.15818

IF: 8.754

2023-12-23

Liver International

Abstract:Background & Aims aMAP score, as a hepatocellular carcinoma risk score, is proven to be associated with the degree of chronic hepatitis B‐related liver fibrosis. We aimed to evaluate the ability of aMAP score for metabolic dysfunction‐associated steatotic liver disease (MASLD; formerly NAFLD)‐related fibrosis diagnosis and establish a machine‐learning (ML) model to improve the diagnostic performance. Methods A total of 946 biopsy‐proved MASLD patients from China and the United States were included in the analysis. The aMAP score, demographic/clinical indices and liver stiffness measurement (LSM) were included in seven ML algorithms to build fibrosis diagnostic models in the training set (N = 703). The performance of ML models was evaluated in the external validation set (N = 125). Results The AUROCs of aMAP versus fibrosis‐4 index (FIB‐4) and aspartate aminotransferase‐platelet ratio (APRI) in cirrhosis and advanced fibrosis were (0.850 vs. 0.857 [P = 0.734], 0.735 [P = 0.001]) and (0.759 vs. 0.795 [P = 0.027], 0.709 [P = 0.049]). When using dual cut‐off values, aMAP had a smaller uncertainty area and higher accuracy (26.9%, 86.6%) than FIB‐4 (37.3%, 85.0%) and APRI (59.0%, 77.3%) in cirrhosis diagnosis. The seven ML models performed satisfactorily in most cases. In the validation set, the ML model comprising LSM and 5 indices (including age, sex, platelets, albumin and total bilirubin used in aMAP calculator), built by logistic regression algorithm (called LSM‐plus model), exhibited excellent performance. In cirrhosis and advanced fibrosis detection, the LSM‐plus model had higher accuracy (96.8%, 91.2%) than LSM alone (86.4%, 67.2%) and Agile score (76.0%, 83.2%), respectively. Additionally, the LSM‐plus model also displayed high specificity (cirrhosis: 98.3%; advanced fibrosis: 92.6%) with satisfactory AUROC (0.932, 0.875, respectively) and sensitivity (88.9%, 82.4%, respectively). Conclusions The aMAP score is capable of diagnosing MASLD‐related fibrosis. The LSM‐plus model could accurately identify MASLD‐related cirrhosis and advanced fibrosis.

gastroenterology & hepatology
An explainable machine learning model for prediction of high-risk nonalcoholic steatohepatitis

Basile Njei,Eri Osta,Nelvis Njei,Yazan A. Al-Ajlouni,Joseph K. Lim

DOI: https://doi.org/10.1038/s41598-024-59183-4

IF: 4.6

2024-04-15

Scientific Reports

Abstract:Early identification of high-risk metabolic dysfunction-associated steatohepatitis (MASH) can offer patients access to novel therapeutic options and potentially decrease the risk of progression to cirrhosis. This study aimed to develop an explainable machine learning model for high-risk MASH prediction and compare its performance with well-established biomarkers. Data were derived from the National Health and Nutrition Examination Surveys (NHANES) 2017-March 2020, which included a total of 5281 adults with valid elastography measurements. We used a FAST score ≥ 0.35, calculated using liver stiffness measurement and controlled attenuation parameter values and aspartate aminotransferase levels, to identify individuals with high-risk MASH. We developed an ensemble-based machine learning XGBoost model to detect high-risk MASH and explored the model's interpretability using an explainable artificial intelligence SHAP method. The prevalence of high-risk MASH was 6.9%. Our XGBoost model achieved a high level of sensitivity (0.82), specificity (0.91), accuracy (0.90), and AUC (0.95) for identifying high-risk MASH. Our model demonstrated a superior ability to predict high-risk MASH vs. FIB-4, APRI, BARD, and MASLD fibrosis scores (AUC of 0.95 vs. 0.50, 0.50, 0.49 and 0.50, respectively). To explain the high performance of our model, we found that the top 5 predictors of high-risk MASH were ALT, GGT, platelet count, waist circumference, and age. We used an explainable ML approach to develop a clinically applicable model that outperforms commonly used clinical risk indices and could increase the identification of high-risk MASH patients in resource-limited settings.

multidisciplinary sciences
Age-related changes in [3H]GBR 12935 binding site density in the prefrontal cortex of controls and schizophrenics

A. Hitri,M. Casanova,J. Kleinman,D. Weinberger,R. Wyatt

DOI: https://doi.org/10.1016/0006-3223(94)00202-E

IF: 12.81

1995-02-01

Biological Psychiatry

Abstract:
Comparison and development of advanced machine learning tools to predict nonalcoholic fatty liver disease: An extended study

Yuan-Xing Liu,Xi Liu,Chao Cen,Xin Li,Ji-Min Liu,Zhao-Yan Ming,Song-Feng Yu,Xiao-Feng Tang,Lin Zhou,Jun Yu,Ke-Jie Huang,Shu-Sen Zheng

DOI: https://doi.org/10.1016/j.hbpd.2021.08.004

2021-10-01

Abstract:BACKGROUND: Nonalcoholic fatty liver disease (NAFLD) is a public health challenge and significant cause of morbidity and mortality worldwide. Early identification is crucial for disease intervention. We recently proposed a nomogram-based NAFLD prediction model from a large population cohort. We aimed to explore machine learning tools in predicting NAFLD.METHODS: A retrospective cross-sectional study was performed on 15 315 Chinese subjects (10 373 training and 4942 testing sets). Selected clinical and biochemical factors were evaluated by different types of machine learning algorithms to develop and validate seven predictive models. Nine evaluation indicators including area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), accuracy, positive predictive value, sensitivity, F1 score, Matthews correlation coefficient (MCC), specificity and negative prognostic value were applied to compare the performance among the models. The selected clinical and biochemical factors were ranked according to the importance in prediction ability.RESULTS: Totally 4018/10 373 (38.74%) and 1860/4942 (37.64%) subjects had ultrasound-proven NAFLD in the training and testing sets, respectively. Seven machine learning based models were developed and demonstrated good performance in predicting NAFLD. Among these models, the XGBoost model revealed the highest AUROC (0.873), AUPRC (0.810), accuracy (0.795), positive predictive value (0.806), F1 score (0.695), MCC (0.557), specificity (0.909), demonstrating the best prediction ability among the built models. Body mass index was the most valuable indicator to predict NAFLD according to the feature ranking scores.CONCLUSIONS: The XGBoost model has the best overall prediction ability for diagnosing NAFLD. The novel machine learning tools provide considerable beneficial potential in NAFLD screening.

gastroenterology & hepatology
High-Throughput, Machine Learning–Based Quantification of Steatosis, Inflammation, Ballooning, and Fibrosis in Biopsies From Patients With Nonalcoholic Fatty Liver Disease

Roberta Forlano,Benjamin H. Mullish,Nikolaos Giannakeas,James B. Maurice,Napat Angkathunyakul,Josephine Lloyd,Alexandros T. Tzallas,Markos Tsipouras,Michael Yee,Mark R. Thursz,Robert D. Goldin,Pinelopi Manousou

DOI: https://doi.org/10.1016/j.cgh.2019.12.025

IF: 13.576

2020-08-01

Clinical Gastroenterology and Hepatology

Abstract:Background & AimsLiver biopsy is the reference standard for staging and grading non-alcoholic fatty liver disease (NAFLD), but histologic scoring systems are semi-quantitative with marked inter- and intra-observer variation. We used machine learning to develop fully automated software for quantification of steatosis, inflammation, ballooning, and fibrosis in biopsies from patients with NAFLD and validated the technology in a separate group of patients.MethodsWe collected data from 246 consecutive patients with biopsy-proven NAFLD and followed in London, the United Kingdom, from January 2010 through December 2016. Biopsies from the first 100 patients were used to derive the algorithm and biopsies from the following 146 were used to validate it. Biopsies were independently scored by pathologists using the nonalcoholic steatohepatitis clinical research network criteria and digitalized. Areas of steatosis, inflammation, ballooning, and fibrosis were annotated on biopsies by 2 hepatobiliary histopathologists to facilitate machine learning. Images of biopsies from the derivation and validation sets were then analyzed by the algorithm to compute percentages of fat, inflammation, ballooning, and fibrosis, as well as collagen proportionate area (CPA), and compared with findings from pathologists' manual annotations and conventional scoring systems.ResultsIn the derivation group, results from manual annotation and the software had an inter-class correlation coefficient (ICC) of 0.97 for steatosis (95%CI, 0.95–0.99; P<.001); ICC, 0.96 for inflammation (95%CI, 0.9–0.98; P<.001); ICC, 0.94 for ballooning (95%CI, 0.87–0.98; P<.001); and ICC, 0.92 for fibrosis (95%CI, 0.88–0.96; P=.001). Percentages of fat, inflammation, ballooning, and CPA from the derivation group were confirmed in the validation cohort. The software identified histological features of NAFLD with levels of inter- and intra-observer agreement ranging from 0.95 to 0.99; this value was higher than that of semi-quantitative scoring systems which ranged 0.58 to 0.88. In a subgroup of paired liver biopsies, quantitative analysis was more sensitive in detecting differences compared to the nonalcoholic steatohepatitis Clinical Research Network scoring system.ConclusionsWe used machine learning to develop software to rapidly and objectively analyse liver biopsies for histologic features of NAFLD. The results from the software correlate with those from histopathologists, with high levels of inter- and intra-observer agreement. Findings were validated in a separate group of patients. This tool might be used for objective assessment of response to therapy for NAFLD in practice and clinical trials.

gastroenterology & hepatology
Advancing non-alcoholic fatty liver disease prediction: a comprehensive machine learning approach integrating SHAP interpretability and multi-cohort validation

Bo Yang,Huaguan Lu,Yinghui Ran

DOI: https://doi.org/10.3389/fendo.2024.1450317

IF: 6.055

2024-10-09

Frontiers in Endocrinology

Abstract:Introduction: Non-alcoholic fatty liver disease (NAFLD) represents a major global health challenge, often undiagnosed because of suboptimal screening tools. Advances in machine learning (ML) offer potential improvements in predictive diagnostics, leveraging complex clinical datasets. Methods: We utilized a comprehensive dataset from the Dryad database for model development and training and performed external validation using data from the National Health and Nutrition Examination Survey (NHANES) 2017–2020 cycles. Seven distinct ML models were developed and rigorously evaluated. Additionally, we employed the SHapley Additive exPlanations (SHAP) method to enhance the interpretability of the models, allowing for a detailed understanding of how each variable contributes to predictive outcomes. Results: A total of 14,913 participants were eligible for this study. Among the seven constructed models, the light gradient boosting machine achieved the highest performance, with an area under the receiver operating characteristic curve of 0.90 in the internal validation set and 0.81 in the external NHANES validation cohort. In detailed performance metrics, it maintained an accuracy of 87%, a sensitivity of 92.9%, and an F1 score of 0.92. Key predictive variables identified included alanine aminotransferase, gammaglutamyl transpeptidase, triglyceride glucose–waist circumference, metabolic score for insulin resistance, and HbA1c, which are strongly associated with metabolic dysfunctions integral to NAFLD progression. Conclusions: The integration of ML with SHAP interpretability provides a robust predictive tool for NAFLD, enhancing the early identification and potential management of the disease. The model's high accuracy and generalizability across diverse populations highlight its clinical utility, though future enhancements should include longitudinal data and lifestyle factors to refine risk assessments further.

endocrinology & metabolism
Machine learning approaches for early detection of non-alcoholic steatohepatitis based on clinical and blood parameters

Amir Reza Naderi Yaghouti,Hamed Zamanian,Ahmad Shalbaf

DOI: https://doi.org/10.1038/s41598-024-51741-0

IF: 4.6

2024-01-30

Scientific Reports

Abstract:This study aims to develop a machine learning approach leveraging clinical data and blood parameters to predict non-alcoholic steatohepatitis (NASH) based on the NAFLD Activity Score (NAS). Using a dataset of 181 patients, we performed preprocessing including normalization and categorical encoding. To identify predictive features, we applied sequential forward selection (SFS), chi-square, analysis of variance (ANOVA), and mutual information (MI). The selected features were used to train machine learning classifiers including SVM, random forest, AdaBoost, LightGBM, and XGBoost. Hyperparameter tuning was done for each classifier using randomized search. Model evaluation was performed using leave-one-out cross-validation over 100 repetitions. Among the classifiers, random forest, combined with SFS feature selection and 10 features, obtained the best performance: Accuracy: 81.32% ± 6.43%, Sensitivity: 86.04% ± 6.21%, Specificity: 70.49% ± 8.12% Precision: 81.59% ± 6.23%, and F1-score: 83.75% ± 6.23% percent. Our findings highlight the promise of machine learning in enhancing early diagnosis of NASH and provide a compelling alternative to conventional diagnostic techniques. Consequently, this study highlights the promise of machine learning techniques in enhancing early and non-invasive diagnosis of NASH based on readily available clinical and blood data. Our findings provide the basis for developing scalable approaches that can improve screening and monitoring of NASH progression.

multidisciplinary sciences
A machine learning-based model analysis for serum markers of liver fibrosis in chronic hepatitis B patients

Congjie Zhang,Zhenyu Shu,Shanshan Chen,Jiaxuan Peng,Yueyue Zhao,Xuan Dai,Jie Li,Xuehan Zou,Jianhua Hu,Haijun Huang

DOI: https://doi.org/10.1038/s41598-024-63095-8

IF: 4.6

2024-05-29

Scientific Reports

Abstract:Early assessment and accurate staging of liver fibrosis may be of great help for clinical diagnosis and treatment in patients with chronic hepatitis B (CHB). We aimed to identify serum markers and construct a machine learning (ML) model to reliably predict the stage of fibrosis in CHB patients. The clinical data of 618 CHB patients between February 2017 and September 2021 from Zhejiang Provincial People's Hospital were retrospectively analyzed, and these data as a training cohort to build the model. Six ML models were constructed based on logistic regression, support vector machine, Bayes, K-nearest neighbor, decision tree (DT) and random forest by using the maximum relevance minimum redundancy (mRMR) and gradient boosting decision tree (GBDT) dimensionality reduction selected features on the training cohort. Then, the resampling method was used to select the optimal ML model. In addition, a total of 571 patients from another hospital were used as an external validation cohort to verify the performance of the model. The DT model constructed based on five serological biomarkers included HBV-DNA, platelet, thrombin time, international normalized ratio and albumin, with the area under curve (AUC) values of the DT model for assessment of liver fibrosis stages (F0-1, F2, F3 and F4) in the training cohort were 0.898, 0.891, 0.907 and 0.944, respectively. The AUC values of the DT model for assessment of liver fibrosis stages (F0-1, F2, F3 and F4) in the external validation cohort were 0.906, 0.876, 0.931 and 0.933, respectively. The simulated risk classification based on the cutoff value showed that the classification performance of the DT model in distinguishing hepatic fibrosis stages can be accurately matched with pathological diagnosis results. ML model of five serum markers allows for accurate diagnosis of hepatic fibrosis stages, and beneficial for the clinical monitoring and treatment of CHB patients.

multidisciplinary sciences
[Computer-Aided Assessment of Liver Fibrosis Progression in Patients with Chronic Hepatitis B: an Exploratory Research].

T,Z Yao,H Ding,Z T Xu,M R Yang,J H Yu,W P Wang

DOI: https://doi.org/10.3760/cma.j.issn.0376-2491.2019.07.003

2019-01-01

Abstract:Objective: To establish automatic liver fibrosis classification models by using traditional machine learning and deep learning methods and preliminaryly evaluate the efficiency. Methods: Gray scale ultrasound images and corresponding elastic images of 354 patients, 247 males and 107 females, mean age (54±12) years undergoing partial hepatectomy in Zhongshan Hospital of Fudan University from November 2014 to January 2016 were enrolled in this study. By using traditional machine learning and deep learning methods, an automatic classification model of liver fibrosis stages (S0 to S4) were established through feature extraction and classification of ultrasound image data sets and the accuracy in different classification categories of each model were calculated, by using liver biopsy as the reference standard. Results: Pathological examination showed 73 cases in pathological stage S0, 40 cases in S1, 49 cases in S2, 41 cases in S3, and 151 cases in S4. The traditional machine classification model based on support vector machine (SVM) classifier and sparse representation classifier and the deep learning classification model based on LeNet-5 neural network, their accuracy rates in the two categories (S0/S1/S2 and S3/S4) were 89.8%, 91.8% and 90.7% respectively; the accuracy rates in the three categories (S0/S1 and S2/S3 and S4) were 75.3%, 79.4% and 82.8% respectively; the accuracy in the three categories (S0 and S1/S2/S3 and S4) were 79.3%, 82.7% and 87.2% respectively. Conclusions: Computer-aided assessment of liver fibrosis progression in patients with chronic hepatitis B has a high accuracy, and can achieve a more detailed classification. This method is expected to be applied in the non-invasive evaluation of liver fibrosis in patients with hepatitis B in clinical work in the future.

Crossed references.

Machine learning improves the prediction of significant fibrosis in Asian patients with metabolic dysfunction‐associated steatotic liver disease – The Gut and Obesity in Asia (GO‐ASIA) Study

Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China

Machine Learning to Predict Progression of Non‐alcoholic Fatty Liver to Non‐alcoholic Steatohepatitis or Fibrosis

Prediction of Fatty Liver Disease in a Chinese Population Using Machine-Learning Algorithms

Application of Interpretable Machine Learning Models Based on Ultrasonic Radiomics for Predicting the Risk of Fibrosis Progression in Diabetic Patients with Nonalcoholic Fatty Liver Disease

Machine-Learning Algorithm for Predicting Fatty Liver Disease in a Taiwanese Population

Machine learning-based mortality prediction models for non-alcoholic fatty liver disease in the general United States population

Ultrasound‐Based Machine Learning Approach for Detection of Nonalcoholic Fatty Liver Disease

Development and validation of machine learning models for nonalcoholic fatty liver disease

Comparison of Machine Learning Models and the Fatty Liver Index in Predicting Lean Fatty Liver

Ceftibuten-containing agar plate for detecting group B streptococci with reduced penicillin susceptibility (PRGBS).

Machine‐learning model comprising five clinical indices and liver stiffness measurement can accurately identify MASLD‐related liver fibrosis

An explainable machine learning model for prediction of high-risk nonalcoholic steatohepatitis

Age-related changes in [3H]GBR 12935 binding site density in the prefrontal cortex of controls and schizophrenics

Comparison and development of advanced machine learning tools to predict nonalcoholic fatty liver disease: An extended study

High-Throughput, Machine Learning–Based Quantification of Steatosis, Inflammation, Ballooning, and Fibrosis in Biopsies From Patients With Nonalcoholic Fatty Liver Disease

Advancing non-alcoholic fatty liver disease prediction: a comprehensive machine learning approach integrating SHAP interpretability and multi-cohort validation

Machine learning approaches for early detection of non-alcoholic steatohepatitis based on clinical and blood parameters

A machine learning-based model analysis for serum markers of liver fibrosis in chronic hepatitis B patients

[Computer-Aided Assessment of Liver Fibrosis Progression in Patients with Chronic Hepatitis B: an Exploratory Research].