Comments on validation of conventional non-invasive fibrosis scoring systems in patients with metabolic associated fatty liver disease

2022-12-03 06:31:06JianGuoHongLunJieYanXianLiShengYuYaoPengSuHaiChaoLiZiNiuDingDongXuWangzhaoRuDongTaoLi

World Journal of Gastroenterology 2022年6期

Jian-Guo Hong,Lun-Jie Yan, Xian Li, Sheng-Yu Yao, Peng Su, Hai-Chao Li, Zi-Niu Ding, Dong-Xu Wang,zhao-Ru Dong,Tao Li

Abstract To evaluate and predict liver fibrosis in patients with nonalcoholic fatty liver disease (NAFLD), several non-invasive scoring systems were built and widely used in the progress of diagnosis and treatment, which showed great diagnostic efficiency, such as aspartate aminotransferase to platelet ratio index, fibrosis-4 index, body mass index, aspartate aminotransferase to alanine aminotransferase ratio, diabetes score and NAFLD fibrosis score. Since the new concept of metabolic associated fatty liver disease (MAFLD) was proposed, the clinical application value of the non-invasive scoring systems mentioned above has not been assessed in MAFLD. The evaluation of the diagnostic performance of these non-invasive scoring systems will provide references for clinicians in the diagnosis of MAFLD.

Key Words: Metabolic associated fatty liver disease; Prediction model; Calibration;Normal distribution; Nonalcoholic fatty liver disease

TO THE EDITOR

We read the recent article entitled “Validation of Conventional Non-invasive Fibrosis Scoring Systems in Patients with Metabolic Associated Fatty Liver Disease” published by Wuet al[1 ] with great interest. In the article, the authors designed and performed a retrospective study to evaluate the diagnostic performance of four non-invasive scoring systems, including aspartate aminotransferase to platelet ratio index (APRI),fibrosis-4 index (FIB-4 ), body mass index (BMI), aspartate aminotransferase to alanine aminotransferase (ALT) ratio, diabetes score (BARD score) and nonalcoholic fatty liver disease fibrosis score (NFS), in patients with metabolic associated fatty liver disease(MAFLD). We would like to share our opinions and criticisms about this valuable work.

The specificity, sensitivity, positive predictive value (PPV), negative predictive value (NPV) and discrimination of the scoring systems mentioned above were evaluated in the prediction of advanced fibrosis in patients with MAFLD in the study by Wuet al[1 ]. Clinical characteristics, laboratory variables and non-invasive scores were compared between patients with advanced fibrosis and those with mild fibrosis or without fibrosis. The results showed that the FIB-4 (P ＜ 0 .001 ), NFS (P ＜ 0 .001 ),APRI (P= 0 .003 ) and BARD (P ＜ 0 .001 ) scores were all significantly higher in patients with advanced fibrosis. Unfortunately, only univariate analysis was performed in this study. In our opinion, multivariate analysis should be performed to recognize the independent variables in the prediction of advanced fibrosis, as in the study by Nielsenet al[2 ].

The authors evaluated the diagnostic efficiency of the prediction scores using the following statistical indices: Sensitivity, specificity, accuracy, PPV, NPV and the area under the receiver operating characteristic curve (AUROC). All the above indices are important statistical variables in the development and validation of prediction models.In our opinion, it would be better if the authors had evaluated the calibration of the prediction scores in the study. In fact, calibration of the prediction model is a critical statistical index in the evaluation of diagnostic efficiency[3 ]. Calibration of the prediction scores can be performed using the Hosmer-Lemeshow goodness-of-fit test and calibration curves, the latter of which can be easily plotted using R software. We advise the authors to evaluate the calibration of prediction models in future studies.

The calculations of PPV and NPV are very important in the development and validation of prediction models. Unlike sensitivity and specificity, PPV and NPV cannot be compared directly among different samples, except for samples with the same prevalence rate of the disease. This is because both PPV and NPV can be affected by the prevalence rate of disease[4 ,5 ]. The authors compared the PPV and NPV in table 4 and stated that “PPV and NPV was better in the HBV-MAFLD group” in the article.In our opinion, the comparisons of PPV and NPV between the HBV-MAFLD group and the pure MAFLD group will be valuable only when advanced fibrosis accounts for the same proportion of the two groups.

Although the authors stated that the continuous variables were expressed in the format of mean ± SD or median value with interquartile range (IQR) and the differences were calculated using Student’sttest in the case of normally distributed data or the Mann-Whitney test in the remaining cases, there were no continuous variables expressed as the median (IQR) in the article. Normally, the distribution of continuous variables should be tested; then, the continuous variables in normal distribution will be expressed as the mean ± SD, and the non-normally distributed continuous variables will be expressed as the median (IQR). If all the continuous variables are expressed as the mean ± SD but the authors do not indicate that all the continuous variables fit a normal distribution, the readers will doubt whether the normal distribution tests were performed in the study. After all, it rarely happens that all the laboratory variables fit a normal distribution in one study. In most studies, the laboratory variables and scores, including ALT, AST, APRI and other variables, do not fit a normal distribution and should be expressed as the median (interquartile range)[2 ,6 -8 ]. For any parameter in biomedical research, a true normal distribution is rare.The European Medicines Agency has issued the general guidance that data should be checked for normality of distribution and should be analyzed and presented based on the results of normal distribution tests. It is also possible that some continuous variables did not fit a normal distribution, but these variables were accidentally expressed as the mean ± SD in the study conducted by Wuet al[1 ]. Of course, this situation indeed does not affect the accuracy of the study. We advise that the authors indicate whether the continuous variables fit a normal distribution and if the normal distribution tests have been performed in future studies.

The authors compared the diagnostic ability of the NFS, FIB-4 , APRI and BARD score for a late stage of fibrosis in MAFLD. The results demonstrated that the APRI and BARD scores performed poorly, but the FIB-4 and NFS showed a promising prospect in clinical use. The new thresholds of the FIB-4 and NFS proposed in this study were 1 .05 and -2 .1 , respectively. The two thresholds proposed by the authors were determined based on their specific study sample. The diagnostic efficiency of the thresholds in the prediction of advanced fibrosis should be further evaluated in an external validation cohort and/or in a prospective validation cohort. Additionally, if possible, the authors can try to develop models based on multiple variables, including the FIB-4 and/or NFS, to predict advanced fibrosis in patients with MAFLD. He et al[9 ]proposed that a diagnostic model containing valuable parameters extracted from more examination tools might provide more satisfactory results[9 ]. Compared to using a single variable, we believe that prediction models based on multiple variables,including clinical characteristics, radiology examinations and laboratory examinations,would exhibit higher sensitivity, higher specificity, higher accuracy, higher PPV,higher NPV, better discrimination and better calibration in the prediction of advanced fibrosis in patients with MAFLD. Because there is now evidence from a prospective cohort that common genetic variants can capture additional prognostic insights not conveyed by validated clinical/biochemical parameters[10 ], we encourage the integration of genetics (perhaps epigenetics) with clinical fibrosis scores, as it may refine individual risk and improve risk stratification and prediction of severe liver disease.

In general, we are very interested in the study by Wuet al[1 ]. The authors demonstrated the prediction values of APRI, FIB-4 , NFS and BARD in a large sample of histology-proven MAFLD. As MAFLD is a new entity, this study will provide important references for clinicians in the prediction of advanced fibrosis in MAFLD patients. The study performed by Wuet al[1 ] could also provide important references for other studies of non-invasive scores and prediction models.

World Journal of Gastroenterology2022年6期

World Journal of Gastroenterology的其它文章: COVID-19 , liver dysfunction and pathophysiology: A conceptual discussion; Gallbladder Burkitt’s lymphoma mimicking gallbladder cancer: A case report; Validation of the PAGE-B score to predict hepatocellular carcinoma risk in caucasian chronic hepatitis B patients on treatment; Atrophic gastritis and gastric cancer tissue miRNome analysis reveals hsa-miR-129 -1 and hsa-miR-196 a as potential early diagnostic biomarkers; Effects of viremia and CD4 recovery on gut “microbiome-immunity” axis in treatment-naïve HIV-1 -infected patients undergoing antiretroviral therapy; Recent advances in the diagnostic evaluation of pancreatic cystic lesions