- 1Department of Pathology, IRCCS Humanitas Research Hospital, Milan, Italy
- 2Department of Biomedical Sciences, Humanitas University, Milan, Italy
- 3Department of Physics, Università di Milano-Bicocca, Milan, Italy
- 4SC Medicina Indirizzo Metabolico, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Milan, Italy
- 5Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
- 6Division of Pathology, Foundation IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy
- 7Division of Internal Medicine and Hepatology, Department of Gastroenterology, IRCCS Humanitas Research Hospital, Rozzano, Italy
- 8Division of Immunology, Transplantation and Infectious Diseases IRCCS San Raffaele Scientific Institute, Vita-Salute San Raffaele University, Milan, Italy
Background and Aims: The risk of disease progression in MASH increases proportionally to the pathological stage of fibrosis. This latter is evaluated through a semi-quantitative process, which has limited sensitivity in reflecting changes in disease or response to treatment. This study aims to test the clinical impact of Artificial Intelligence (AI) in characterizing liver fibrosis in MASH patients.
Methods: The study included 60 patients with clinical pathological diagnosis of MASH. Among these, 17 received a medical treatment and underwent a post-treatment biopsy. For each biopsy (n = 77) a Sirius Red digital slide (SR-WSI) was obtained. AI extracts >30 features from SR-WSI, including estimated collagen area (ECA) and entropy of collagen (EnC).
Results: AI highlighted that different histopathological stages are associated with progressive and significant increase of ECA (F2: 2.6% ± 0.4; F3: 5.7% ± 0.4; F4: 10.9% ± 0.8; p: 0.0001) and EnC (F2: 0.96 ± 0.05; F3: 1.24 ± 0.06; F4: 1.80 ± 0.11, p: 0.0001); disclosed the heterogeneity of fibrosis among pathological homogenous cases; revealed post treatment fibrosis modification in 76% of the cases (vs 56% detected by histopathology).
Conclusion: AI characterizes the fibrosis process by its true, continuous, and non-categorical nature, thus allowing for better identification of the response to anti-MASH treatment.
1 Introduction
Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD) is the most common cause of chronic liver disease worldwide, with a global prevalence of 25% in the adult population, 60% in patients with type 2 diabetes and 80% in obese individuals (1, 2). Only a small proportion of MASLD cases progress to cirrhosis and severe complications, but biomarkers predicting patients at risk are still lacking. Conversely, in patients where steatosis is complicated by inflammation and cellular damage (i.e., MASH - Metabolic Dysfunction-Associate Steatohepatitis), the possibility of severe liver diseases increases proportionally with the amount of fibrosis, as estimated by the pathological stage (3, 4). In keeping with this, the histopathological reduction of fibrosis is one of the main endpoints for clinical trials on MASH patients (5–7). However, the histopathological assessment of fibrosis stage in chronic liver disease, including MASH (8, 9), is a descriptive and semi-quantitative process with moderate reproducibility even among expert liver pathologists (10, 11). Moreover, the limited categories used for pathological staging scarcely illustrate a disease driven by a continuum of injury, resulting in limited sensitivity to reflect changes in disease severity over time (11).
Approaches based on whole slide digital images (WSI) and Artificial Intelligence (AI) have been applied successfully to the evaluation of morphological features of MASH, including fibrosis (12–18). Notably, two methods have shown clinically meaningful results. Collagen Proportionate Area (CPA), a quantitative assessment of collagen area expressed as the ratio of collagen-stained pixels over full-biopsy-pixels, demonstrated its superiority over semi-quantitative staging in predicting clinical decompensation (19, 20). Second Harmonic Generation (SHG) analysis, which focuses on the possibility to visualize collagen allowing its quantification on a continuous scale and evaluation of spatial relationship with the surrounding cells (14, 21), demonstrated its superiority over pathological staging in detecting treatment-associated reduction of fibrosis (15). These new methods, allowing to characterize minimal difference in the quantitative features of the fibrous tissue, pave the way to precision medicine also in hepatology (11).
The aim of this study is to test the clinical impact of an AI-based approach in characterizing liver fibrosis features in a series of MASH patients.
2 Materials and methods
2.1 Study cohort
We searched the files of the Departments of Hepatology and Pathology of IRCCS, Humanitas Clinical Institute (Rozzano, Italy), for all cases with a matched clinical and pathological diagnosis of MASH. Specifically, we selected only cases fulfilling the following criteria: (1) clinical diagnosis of MASLD or MetALD (MASLD predominant), (2) histopatological diagnosis of steato-hepatitis (8, 9); (3) biopsy tissue of >15 mm length, showing >10 portal spaces.
2.2 Adjudication of ground truth
For each biopsy, we obtained a recut and stained all of them in a single batch for Sirius Red (SR). All the SR slides were then digitized using Ultra Fast Scanner (Philips, Netherlands). Imaging was performed with an Olympus 40x air objective (NA = 0.75, Plan Apo) with a pixel-size of 0.25 𝜇m.
SR digital slides (SR-WSI) were then evaluated using the Philips viewer independently by four pathologists (two senior, expert in liver pathology; two young, without specific experience in liver pathology). Each of them evaluated the pathological stage according to the CNR NASH semi-quantitative system (8, 9). Discordant diagnoses were reviewed in a consensus session.
Inter-observer variability was evaluated using Fleiss’s kappa value, κ (22, 23). Kappa values indicate slight agreement when the value ranges between 0.01 and 0.20, fair agreement (0.21–0.40), moderate agreement (0.41–0.60), substantial agreement (0.61–0.80), and almost perfect agreement when the values are >0.81. This process allowed to quantify inter-observer concordance and to establish a conclusive microscopic-stage for each case (ground truth).
2.3 AI-based analysis
Once labeled with a conclusive histopathological stage, each SR-WSI underwent an AI-based analysis to carefully measure several fibrosis features (see Supplementary material for details). Dependent t-test (called the paired-samples t-test) was then used in order to assess differences for each fibrosis feature. Statistical significance has been calculated for all paired categories and considered significant when p-value less than 0.05. This approach allowed to explore differences existing in the fibrosis features in cases labeled with the same histopathological stage.
2.4 Assessment of treatment effect: histopathology versus AI
We then focused our attention on a subset of cases from the study series, specifically those who received a liver biopsy before and after medical therapy for MASH within a clinical trial. In particular, we compared the impact of AI versus histopathology in assessing changes in fibrosis, specifically the increase or reduction of fibrosis, to evaluate the treatment effect in paired biopsy.
2.5 Validation
To validate the clinical findings emerging from the study cohort, particularly the impact of AI in assessing an increase or reduction of fibrosis, we collected an adequate external series. We searched the files of the Departments of Hepatology and Pathology at IRCCS, Ca′ Granda, Ospedale Maggiore Policlinico (Milano, Italy) for cases fulfilling the following criteria: (1) clinical diagnosis MASLD or metALD (MASLD predominant); (2) histopatological diagnosis of steatohepatitis (8, 9); (3) biopsy tissue of >15 mm length, showing >10 portal spaces; (4) liver biopsy obtained before and after medical treatment within a clinical trial for MASH. Once the cases were selected and recuts obtained, all samples were stained in a single batch for Sirius red (SR) at the original Institution. All the SR slides were then digitized using the same scanner of the study cohort. Once generated, SR-WSIs were evaluated by the same AI solution used in the study cohort. In the validation cohort, the original pathological stage, based on CNR NASH semi-quantitative system, evaluated by an expert liver pathologist, was considered as truth.
2.6 Assessment of clinical outcomes
We assessed clinical outcomes in terms of weight loss. A clinical outcome was defined as a weight reduction of more than 10% between the baseline biopsy and the follow-up biopsy, based on its known association with improvements in liver histology in terms of liver fibrosis in MASH patients (24). This criterion was applied to both the study and validation cohorts. Written informed consent was obtained from the individuals for the publication of any potentially identifiable images or data included in this article.
3 Results
The study cohort consisted of 52 patients (Table 1). Among them, 9 underwent a post-treatment biopsy. Therefore, the total number of cases was 61. The results of the pathologists’ agreement in staging these cases are shown in Figure 1. The agreement was fair for F1a, F1b and F1c (respectively κ: 0.19, 0.15 and 0.17), moderate for F2 (κ: 0.43) and F3 (κ: 0.58) and substantial for F4 (κ: 0.73). The overall agreement was only moderate (κ: 0.44 Figure 1A, rising to κ: 0.55 when grouping all F1 classes into a single category, Figure 1B). After the adjudication, the series was represented by 9 F1 (a,b,c: 4,1,4), 19 F2, 28 F3 and 5 F4. These diagnoses were then considered as the ground truth.
Figure 1. Agreement between pathologists in the evaluation of NASH stage. In panel (A), the subgroups of F1, namely F1a, F1b, F1c, have been considered separately, while in panel (B) they have been grouped. The overall Fleiss’s Kappa agreement is 0.44 for panel A and 0.55 for panel B. The agreement for each category is reported over the column. The hypothesis that the agreement is caused by random chance can be rejected (*** p ≤ 0.001).
3.1 Homogeneous histopathological stages are finely dissected by AI-based analysis
The AI-based analysis finely dissected the histopathological category of “fibrosis” into several parameters (n = 33, see Supplementary Table S1 for a detailed list). Significant differences along the spectrum F1-F4 were observed for 14 parameters (see Supplementary Table S2 for details). Among these latter, we focused our interest on Estimated Collagen Area (ECA) and Entropy of Collagen (EnC), since both have an easy reference to a morphological counterpart. Figure 2 illustrates and explains the transition from histopathological stage to ECA and EnC.
Figure 2. Correlation between histopathological NASH stage and features of fibrosis generated by AI. The figure illustrates the process of transition from the original image (SR-WSI) to AI-features and the comparison of the histopathological NASH stage (F1 to F4) to a heatmap generated by the AI for ECA and EnC. After preprocessing (see Supplementary Figure S1), the original SR-WSI is analyzed to segment collagen. The quantification process involves the extraction of both intensity and textural features at pixel level and within Regions of Interest (ROIs). Estimated Collagen Area (ECA) is computed as a fraction of collagen pixels (Sirius Red positive) over the total number of pixels representing the tissue section. Entropy of Collagen (EnC) is a textural parameter that encodes for the randomness of SR optical density values with respect to its neighborhood in terms of intensity distribution. Low entropy values correspond to a uniform and homogeneous image.
3.1.1 Estimated collagen area compared to microscopic stages
The mean values of ECA were progressively higher and significantly different in F2, F3 and F4 categories (Figure 3A; Supplementary Table S3). Interestingly, ECA of individual cases within the same histopathological stage showed considerable heterogeneous values; some values overlapped with those of cases in adjacent stages. This heterogeneity characterized in particular F3 cases, where ECA ranged between 2.8 and 10% (mean 5.7%, standard deviation ±0.4%, IQR 2.3). The heterogeneity of ECA in two exemplificative cases is shown in Figure 4.
Figure 3. AI features (ECA and EnC) of single cases according to histopathological NASH stage. The figure shows for each biopsy, grouped according to the histopathological stage, the results of AI evaluation. Panel (A) illustrates the Estimated Collagen Area (ECA), panel (B) describes the evaluation of Entropy of Collagen (EnC). Significant differences between subgroups are assessed by one-way ANOVA corrected by a Tukey post hoc test. (** p ≤ 0.01; *** p ≤ 0.001; **** p ≤ 0.0001). Error bars represent the standard error of mean.
Figure 4. ECA heterogeneity in cases with homogenous histopathological NASH stage. (A,B) SR-WSI of two cases labeled as F2 at histopathological level by all pathologists (perfect agreement). (C,D) Overlay between the original WSI and the Estimated Collagen Area (ECA) heatmap for each case. AI shows that in the first case (heatmap C) the percentage of tissue area covered by collagen (ECA) is 1.67% while in the second case (heatmap D) this percentage is to 3.85%. (E,F) SR-WSI of two cases labeled as F3 at histopathological level, after the adjudication process. (G,H) AI computes, that the area covered by collagen is lower in the first case (heatmap G, ECA = 3.83%) as compared to the second (heatmap H, ECA = 9.95%).
3.1.2 Entropy of collagen compared to microscopic stages
EnC also showed a statistically significant increase from F2 to F4 (Figure 3B; Supplementary Table S4), with considerable heterogeneity of values within homogenous pathological categories. In particular, F3 cases exhibited EnC levels ranging between 0.86 and 2.04 (mean: 1.24; standard deviation: ±0.06; IQR: 0.4), while F4 cases ranged between 1.56 and 2.12 (mean: 1.8; standard deviation ±0.1; IQR: 0.5). Supplementary Figure S2 illustrates the heterogeneity of EnC in two exemplificative cases.
3.2 AI-based analysis highlights treatment efficacy
In the study cohort, 9 patients underwent a liver biopsy before and after medical treatment for MASH (see Supplementary Table S5a for clinical data). We compared the ability of histopathology (stage) and AI (ECA and EnC) to recognize changes due to therapy. The results are shown in Table 2. Briefly, 56% of cases showed a stage change according to pathologist-based evaluation, compared to 100% depicted by AI. According to AI analysis, all post treatment biopsies differed from their paired pretreatment counterparts in terms of ECA and EnC. Among cases without changes at the microscopic level, 75% were characterized by a homogenous decrease of ECA and EnC, while the remaining 25% had homogeneous increase in both AI values. Among cases that showed a stage change at the microscopic level, 80% had congruous modifications in both ECA and EnC; in the remaining 20%, ECA and EnC were congruous but opposite to the histopathological finding. At clinical level two patients (n° 2 and 6) achieved a weight loss greater than 10% between the baseline and post-treatment biopsy; in both cases, AI demonstrated a congruous reduction in ECA and EnC while histopathology recognize a change of stage (reduction) only in one case.
In the validation cohort, there were 8 patients who received liver biopsy before and after medical treatment for MASH (see Supplementary Table S5b for clinical data). Changes in pathological stage, ECA and EnC observed in these patients are reported in Table 3. Similar to the study cohort, histopathological evaluation disclosed stage change after treatment in 50% of cases as compared to 100% detected by AI. Among cases without changes at microscopic level, 50% were characterized by a clear and homogenous decrease of both ECA and EnC. Among cases with a proven change of stage at histopathology, 50% had congruous modifications of both ECA and EnC. In the validation cohort, no patient achieved the >10% weight loss threshold.
Table 3. Histopathological stage, ECA and EnC in pre- and post- treatment biopsy (validation cohort).
The heatmap shown in Figure 5 offers an overview of the impact of AI, compared to histopathology, in the assessment of treatment efficacy, using the post-treatment biopsy as a reference. Briefly, histopathology proved 29% responders (decreasing stage), 24% non-responders (increasing stage), but was not conclusive in 47% of cases (no change in stage). In contrast, AI identified 53% of cases as responders (congruous decrease of both EC and EnC), 23% non-responders (congruous increase of both EC and EnC) and 24% as not conclusive (discordant ECA and EnC). Figure 6 illustrates the impact of AI on the evaluation of fibrosis modification due to treatment.
Figure 5. Impact of AI, as compared to histopathology, in the assessment of treatment efficacy taking as reference the post treatment biopsy. The figure compares the results of histopathology and AI in assessing the response to treatment for MASH. Results shown refers to post treatment biopsy (PTB). Histopathology: green = reduction of stage in PTB (responder at histopathology); yellow = no change of stage in PTB (not conclusive at histology); red = increase of stage in PTB (not responder at histopathology). AI (ECA and EnC): green = reduction of value in PTB; red = increase of value in PTB; cases with congrous decrease of ECA and EnC are responders, with congrous decrease not responders, and discordant case as not conclusive, according to AI analysis.
Figure 6. Evaluation of treatment efficacy on fibrosis modification. The figure illustrates the microscopic (SR-WSI, left) and AI (segmentation, ECA and EnC, right) features in pre- and post- treatment biopsy in three patients (A, B, C). In patient A (case 6, Table 2), the pre treatment biopsy (A1) was diagnosed as stage F3, AI reveled a ECA of 9.8% and EnC of 2.04; post treatment biopsy (A2) was still diagnosed as F3 at histopathological level; however AI revealed that EC decreased to 4.8% and EnC to 1.42. In patient B (case 4, Table 2), the pre treatment biopsy (B1) was diagnosed as F3, with ECA of 9.6% and EnC of 1.48; after treatment (B2) the histopathological stage did not change (F3), but AI disclosed a reduction for both ECA (8.0%) and EnC (1.42). In Patient C (case 1, Table 2), the histopathological evaluation disclosed a stage reduction from F4 seen in pre-treatment (C1) to F3 seen in post treatment biopsy (C2); AI features were consistent with this reduction (ECA from 8.6 to 6.3%; EnC from 1.56 to 1.38).
4 Discussion
Histopathological characterization of fibrosis is the only parameter that correlates with the evolution of steatohepatitis (5–7). A reduction in fibrosis is one of the goals of the treatments for steatohepatitis, as well as one of the parameters used by the FDA for drug approval (25). Nonetheless, the evaluation of fibrosis modification under the microscope has some limitations. First, the assessment is a highly subjective process: numerous studies have demonstrated good but not exceptional agreement even among expert hepato-pathologists (26). Then, the system in use classifies into defined categories, similar to impermeable compartments, a process, the appearance, and modulation of fibrosis, which in nature appears as continuous (27). Finally, liver fibrosis remodeling is a slow event that might not be depicted properly by the few categories of current staging systems. To overcome these limitations, some studies started using AI to quantify fibrosis in a more objective way and as a continuous parameter, or to identify specific parameters that might not be defined by histopathology, for example the entropy of collagen fibers (13, 16, 28, 29).
The results of the present study, conducted on MASH biopsy confirm that AI-based evaluation characterizes at the deepest level, compared to classic histopathological evaluation, the fibrotic process occurring in patients with MASH. In particular, we showed that cases consistently labeled as belonging to the same histopathological stage might show significant differences in collagen amount. Thus, cases classified as F3 might present values identical to a F2- or to a F4- cases. This infinitesimal evaluation of the collagen amount likely represents the result of the multi-resolution approach we adopted in the study. Indeed, the AI we designed combined features related to texture and intensity obtained at two clearly separated spatial scales: single pixel and ROI (32x32pixel). Interestingly, Taylor-Weiner et al. (16), using a similar approach, a deep-learning method able to measure the amount and distribution of fibrosis across the whole biopsy at different resolutions, proved a significant heterogeneity across patients with the same stage. A significant heterogeneity across patients with the same stage was also proved by Wang et al. (29) and Forlano et al. (13) using Collagen Proportionate Area (CPA). Interestingly enough the values of CPA reported by these two studies for specific MASH stages (F2: 3.3 and 2.1%; F3: 6.8 and 5.5%; F4: 12.4 and 11.1%) overlap those of ECA in our study (F2: 2.6%; F3: 5.7% and F4: 10.9%).
The present study also confirmed that AI allows a more accurate identification of minimal changes conditioned by the therapy in MASH patients. In fact, while histopathology identified stage changes in 53% of cases, the congruous combination of two AI features representing the amount and entropy of collagen highlighted the presence of changes in 76% of the cases. These findings are strictly in keeping with the few similar data available in the literature. Naoumov et al. (15) proved a change in stage in 97% of cases using digital quantification on SHG as compared to 40% detected by histopathology. In specific, stage regression was observed in 69% of F2 and F3 cases treated with the optimal dose of drug. Taylor-Wayner et al. (16) calculated 27 and 40% of responders using, respectively, a rigorous or less-stringent thresholds as compared to none detected by histopathology. In our series, AI proved a stage regression in 53% of cases; histopatology in 29%; weight loss greater than 10%, a clinical feature that has been associated with improvements in liver fibrosis in MASH (24), in 12%. We also showed that the reduction in absolute terms of collagen was frequently accompanied by a reduction of entropy of collagen (EnC). This latter is a textural parameter that translates the randomness of Sirius Red optical density values with respect to its neighborhood in terms of the intensity distribution. Low entropy values correspond to more uniform and homogeneous image thus to more grouped collagen fibers. Interestingly enough, our findings overlap the description of regressing fibrosis reported by Sanyal et al. (7) characterized by the narrowing or even disappearance of the fibrotic bridge; as well as that described by Naoumov et al. (15) showing that after treatment septa became thin and compact, with sharp borders. Once inserted into this framework, our results, although limited to a small series of post treatment biopsy, do not appear random but rather representative of the important role played by AI in highlighting changes caused by medical therapy.
Finally, we showed that some post treatment liver biopsy totally unchanged under the microscope, have only minimal changes also in the parameters extracted by AI. This result mirrors those of a SHG study showing that fibrosis regression starts in a piecemeal manner (15). Interestingly enough, a recent paper, based on the spatial information returned by SHG proved, in a murine model of steatohepatitis, that this minute fibrosis regression is induced by a reduction in food amount (30).
In conclusion, our data confirm that AI allows a better characterization of the fibrosis process by its true, continuous, and non-categorical nature. Of clinical importance is also the possibility, guaranteed by AI algorithms, of identifying among the cases that the pathologist would have judged as unchanged, those in which there was actually a response to treatment. Finally, it does not appear irrelevant to observe that there is a certain percentage of cases where the objective parameters generated by AI highlight an initial process of modification of the fibrosis. Taken together, the intrinsic possibilities of AI, pave the way for renewed staging system, where the information generated by pathological description (site of fibrosis; formation of septa) would benefit of an integration with data generated by AI.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Humanitas University review board. The studies were conducted in accordance with the local legislation and institutional requirements. The human samples used in this study were acquired from previously diagnosed human biopsies. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individuals for the publication of any potentially identifiable images or data included in this article.
Author contributions
RA: Methodology, Resources, Supervision, Writing – original draft, Writing – review & editing, Conceptualization. DP: Conceptualization, Visualization, Methodology, Software, Writing – original draft, Writing – review & editing. CC: Writing – review & editing. VB: Writing – review & editing. BD: Writing – review & editing. GC: Writing – review & editing. RL: Writing – review & editing. AF: Writing – review & editing. MM: Resources, Writing – review & editing. IA: Writing – review & editing. MR: Writing – review & editing. LTe: Writing – review & editing. DI: Writing – review & editing. AA: Writing – review & editing. NP: Writing – review & editing, Visualization. LS: Conceptualization, Visualization, Supervision, Resources, Formal analysis, Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. LTo: Conceptualization, Investigation, Methodology, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. The research leading to these results has received funding from AIRC under IG 2020 – ID. 25087 project by P.I. Di Tommaso Luca and by Academic Funding UNIMIB 2023 (Laura Sironi, 2023-ATEQC-0075). This work was supported by “Ricerca Corrente” funding from the Italian Ministry of Health to IRCCS Humanitas Research Hospital for the cost of the publication.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2024.1480866/full#supplementary-material
Abbreviations
MASLD, Metabolic Dysfunction-Associated Steatotic Liver Disease; MASH, Metabolic Dysfunction-Associate Steatohepatitis; WSI, Whole slide digital images; AI, Artificial intelligence; CPA, Collagen proportionate area; SHG, Second Harmonic Generation; SR, Sirius Red; SR-WSI, Sirius red-digital slides; ECA, Estimated collagen area; EnC, Entropy of collagen.
References
1. Rinella, ME, Neuschwander-Tetri, BA, Siddiqui, MS, Abdelmalek, MF, Caldwell, S, Barb, D, et al. AASLD practice guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology. (2023) 77:1797–835. doi: 10.1097/HEP.0000000000000323
2. Younossi, Z, Anstee, QM, Marietti, M, Hardy, T, Henry, L, Eslam, M, et al. Global burden of NAFLD and NASH: trends, predictions, risk factors and prevention. Nat Rev Gastroenterol Hepatol. (2018) 15:11–20. doi: 10.1038/nrgastro.2017.109
3. Angulo, P, Kleiner, DE, Dam-Larsen, S, Adams, LA, Bjornsson, ES, Charatcharoenwitthaya, P, et al. Liver fibrosis, but no other histologic features, is associated with long-term outcomes of patients with nonalcoholic fatty liver disease. Gastroenterology. (2015) 149:389–397.e10. doi: 10.1053/j.gastro.2015.04.043
4. Hagström, H, Nasr, P, Ekstedt, M, Hammar, U, Stål, P, Hultcrantz, R, et al. Fibrosis stage but not NASH predicts mortality and time to development of severe liver disease in biopsy-proven NAFLD. J Hepatol. (2017) 67:1265–73. doi: 10.1016/j.jhep.2017.07.027
5. Friedman, SL, Neuschwander-Tetri, BA, Rinella, M, and Sanyal, AJ. Mechanisms of NAFLD development and therapeutic strategies. Nat Med. (2018) 24:908–22. doi: 10.1038/s41591-018-0104-9
6. Sanyal, AJ, van Natta, ML, Clark, J, Neuschwander-Tetri, BA, Diehl, A, Dasarathy, S, et al. Prospective study of outcomes in adults with nonalcoholic fatty liver disease. N Engl J Med. (2021) 385:1559–69. doi: 10.1056/NEJMoa2029349
7. Sanyal, AJ, Foucquier, J, Younossi, ZM, Harrison, SA, Newsome, PN, Chan, W-K, et al. Enhanced diagnosis of advanced fibrosis and cirrhosis in individuals with NAFLD using FibroScan-based agile scores. J Hepatol. (2023) 78:247–59. doi: 10.1016/j.jhep.2022.10.034
8. Brunt, EM, Janney, CG, di Bisceglie, AM, Neuschwander-Tetri, BA, and Bacon, BR. Nonalcoholic Steatohepatitis: a proposal for grading and staging the histological lesions. Am J Gastroenterol. (1999) 94:2467–74. doi: 10.1111/j.1572-0241.1999.01377.x
9. Kleiner, DE, Brunt, EM, van Natta, M, Behling, C, Contos, MJ, Cummings, OW, et al. Design and validation of a histological scoring system for nonalcoholic fatty liver disease. Hepatology. (2005) 41:1313–21. doi: 10.1002/hep.20701
10. Altamirano-Barrera, A, Barranco-Fragoso, B, and Méndez-Sánchez, N. Management strategies for liver fibrosis. Ann Hepatol. (2017) 16:48–56. doi: 10.5604/16652681.1226814
11. Friedman, SL, and Pinzani, M. Hepatic fibrosis 2022: unmet needs and a blueprint for the future. Hepatology. (2022) 75:473–88. doi: 10.1002/hep.32285
12. Dinani, AM, Kowdley, KV, and Noureddin, M. Application of artificial intelligence for diagnosis and risk stratification in NAFLD and NASH: the state of the art. Hepatology. (2021) 74:2233–40. doi: 10.1002/hep.31869
13. Forlano, R, Mullish, BH, Giannakeas, N, Maurice, JB, Angkathunyakul, N, Lloyd, J, et al. High-throughput, machine learning–based quantification of steatosis, inflammation, ballooning, and fibrosis in biopsies from patients with nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol. (2020) 18:2081–2090.e9. doi: 10.1016/j.cgh.2019.12.025
14. Liu, F, Goh, GB-B, Tiniakos, D, Wee, A, Leow, W-Q, Zhao, J-M, et al. qFIBS: an automated technique for quantitative evaluation of fibrosis, inflammation, ballooning, and steatosis in patients with nonalcoholic Steatohepatitis. Hepatology. (2020) 71:1953–66. doi: 10.1002/hep.30986
15. Naoumov, NV, Brees, D, Loeffler, J, Chng, E, Ren, Y, Lopez, P, et al. Digital pathology with artificial intelligence analyses provides greater insights into treatment-induced fibrosis regression in NASH. J Hepatol. (2022) 77:1399–409. doi: 10.1016/j.jhep.2022.06.018
16. Taylor-Weiner, A, Pokkalla, H, Han, L, Jia, C, Huss, R, Chung, C, et al. A machine learning approach enables quantitative measurement of liver histology and disease monitoring in NASH. Hepatology. (2021) 74:133–47. doi: 10.1002/hep.31750
17. Vanderbeck, S, Bockhorst, J, Komorowski, R, Kleiner, DE, and Gawrieh, S. Automatic classification of white regions in liver biopsies by supervised machine learning. Hum Pathol. (2014) 45:785–92. doi: 10.1016/j.humpath.2013.11.011
18. Vanderbeck, S, Bockhorst, J, Kleiner, D, Komorowski, R, Chalasani, N, and Gawrieh, S. Automatic quantification of lobular inflammation and hepatocyte ballooning in nonalcoholic fatty liver disease liver biopsies. Hum Pathol. (2015) 46:767–75. doi: 10.1016/j.humpath.2015.01.019
19. Buzzetti, E, Hall, A, Ekstedt, M, Manuguerra, R, Guerrero Misas, M, Covelli, C, et al. Collagen proportionate area is an independent predictor of long-term outcome in patients with non-alcoholic fatty liver disease. Aliment Pharmacol Ther. (2019) 49:1214–22. doi: 10.1111/apt.15219
20. Tsochatzis, E, Bruno, S, Isgro, G, Hall, A, Theocharidou, E, Manousou, P, et al. Collagen proportionate area is superior to other histological methods for sub-classifying cirrhosis and determining prognosis. J Hepatol. (2014) 60:948–54. doi: 10.1016/j.jhep.2013.12.023
21. Wang, Y, Wong, GL-H, He, F-P, Sun, J, Chan, AW-H, Yang, J, et al. Quantifying and monitoring fibrosis in non-alcoholic fatty liver disease using dual-photon microscopy. Gut. (2020) 69:1116–26. doi: 10.1136/gutjnl-2019-318841
22. Fleiss, JL. Measuring nominal scale agreement among many raters. Psychol Bull. (1971) 76:378–82. doi: 10.1037/h0031619
23. Fleiss, JL, Levin, B, and Paik, MC. The measurement of interrater agreement In: JL Fleiss, editor. Statistical methods for rates and proportions. Hoboken, NJ: Wiley (2003)
24. Vilar-Gomez, E, Martinez-Perez, Y, Calzadilla-Bertot, L, Torres-Gonzalez, A, Gra-Oramas, B, Gonzalez-Fabian, L, et al. Weight loss through lifestyle modification significantly reduces features of nonalcoholic Steatohepatitis. Gastroenterology. (2015) 149:367–378.e5. doi: 10.1053/j.gastro.2015.04.005
25. United States Food and Drug Administration. Noncirrhotic nonalcoholic Steatohepatitis with liver fibrosis: developing drugs for treatment guidance for industry. United States Food and Drug Administration (2018). Available at: https://www.fda.gov/media/119044/download.
26. Davison, BA, Harrison, SA, Cotter, G, Alkhouri, N, Sanyal, A, Edwards, C, et al. Suboptimal reliability of liver biopsy evaluation has implications for randomized clinical trials. J Hepatol. (2020) 73:1322–32. doi: 10.1016/j.jhep.2020.06.025
27. Schwabe, RF, Tabas, I, and Pajvani, UB. Mechanisms of fibrosis development in nonalcoholic Steatohepatitis. Gastroenterology. (2020) 158:1913–28. doi: 10.1053/j.gastro.2019.11.311
28. Gawrieh, S, Knoedler, DM, Saeian, K, Wallace, JR, and Komorowski, RA. Effects of interventions on intra- and interobserver agreement on interpretation of nonalcoholic fatty liver disease histology. Ann Diagn Pathol. (2011) 15:19–24. doi: 10.1016/j.anndiagpath.2010.08.001
29. Wang, Z, Jeffrey, GP, Huang, Y, de Boer, B, Garas, G, Wallace, M, et al. Liver fibrosis quantified by image morphometry predicts clinical outcomes in patients with non-alcoholic fatty liver disease. Hepatol Int. (2023) 17:1162–9. doi: 10.1007/s12072-023-10564-3
Keywords: liver, MASH, fibrosis, treatment, artificial intelligence
Citation: Akpinar R, Panzeri D, De Carlo C, Belsito V, Durante B, Chirico G, Lombardi R, Fracanzani AL, Maggioni M, Arcari I, Roncalli M, Terracciano LM, Inverso D, Aghemo A, Pugliese N, Sironi L and Di Tommaso L (2024) Role of artificial intelligence in staging and assessing of treatment response in MASH patients. Front. Med. 11:1480866. doi: 10.3389/fmed.2024.1480866
Edited by:
Gianluca Marucci, IRCCS Carlo Besta Neurological Institute Foundation, ItalyReviewed by:
Nafis Shafizadeh, Kaiser Permanente, United StatesMaria Guido, University of Padua, Italy
Copyright © 2024 Akpinar, Panzeri, De Carlo, Belsito, Durante, Chirico, Lombardi, Fracanzani, Maggioni, Arcari, Roncalli, Terracciano, Inverso, Aghemo, Pugliese, Sironi and Di Tommaso. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Luca Di Tommaso, bHVjYS5kaV90b21tYXNvQGh1bmltZWQuZXU=; Laura Sironi, bGF1cmEuc2lyb25pQHVuaW1pYi5pdA==; Nicola Pugliese, bmljb2xhLnB1Z2xpZXNlQGh1bWFuaXRhcy5pdA==
†These authors have contributed equally to this work and share first authorship
‡These authors have contributed equally to this work and share senior authorship