Literature DB >> 24497882

Inter-observer and Intra-observer Agreement in Pathological Evaluation of Non-alcoholic Fatty Liver Disease Suspected Liver Biopsies.

Omid Pournik1, Seyed Moayed Alavian2, Leila Ghalichi3, Bahram Seifizarei4, Leila Mehrnoush5, Azam Aslani6, Soghra Anjarani7, Saeid Eslami8.   

Abstract

BACKGROUND: Histopathologic assessment of liver tissue is an essential step in management and follow-up of non-alcoholic fatty liver disease (NAFLD) while inter- and intra-observer variations limit the accuracy of these assessments.
OBJECTIVES: The aim of this study was to assess the inter- and intra-observer reproducibility of histopathologic assessment of liver biopsies based on NAFLD activity score (NAS) scoring system.
MATERIALS AND METHODS: The anonymous liver biopsy samples of 100 consecutive NAFLD suspected adults were randomly assigned to four pathologists. Then, the samples were randomly reassigned to the pathologists for the second time in a way that each sample would be evaluated by two different pathologists. Biopsies were revisited by their first evaluator after two months. The results were reported based on NAS scoring system.
RESULTS: Inter-observer agreement of the pathology scores based on NAS scoring system was acceptable for steatosis, lobular inflammation, and fibrosis, but not for hepatocyte ballooning. The intra-observer agreement was acceptable in all scales, with lowest intra-class correlation observed for lobular inflammation.
CONCLUSIONS: NAS scoring system has good overall inter- and intra-observer agreement, but more attention should be given to defining the hepatocyte ballooning and lobular inflammation, and training the pathologists to improve the accuracy of pathology reports.

Entities:  

Keywords:  Fatty Liver; Observer Variation; Pathology

Year:  2014        PMID: 24497882      PMCID: PMC3909644          DOI: 10.5812/hepatmon.15167

Source DB:  PubMed          Journal:  Hepat Mon        ISSN: 1735-143X            Impact factor:   0.660


1. Background

Considering the increasing prevalence of non-alcoholic fatty liver disease (NAFLD) world-wide, it is essential to have methods and procedures for accurate diagnosis of the cases, as well as to identify patients with non-alcoholic steatohepatitis (NASH) (1). The diagnosis of NASH and its distinction from non-alcoholic fatty liver (NAFL) affects the prognosis and treatment plan as the former group have a higher risk of acquiring cirrhosis and hepatocellular carcinoma (2). Histopathologic assessment of liver tissue is an essential step in the management and follow-up of chronic liver conditions (3, 4). It is generally agreed that steatosis, hepatocellular ballooning, and lobular inflammation are the histopathological characteristics of NASH, while fibrosis is not essential in the diagnosis (2). Although liver biopsy is the gold standard of staging and evaluating the progress of the disease, it is a risky procedure and there are a few limitations that affect the clinical acceptance of the process (1). Invasiveness is one of these factors which affects both patient and clinician tendency to the procedure (5), as well as patients compliance for repeated biopsies which might be necessary during the long-term management of the cases (6). Sampling error is another issue which affects the acceptance of liver biopsy and it mainly results from sampling variability and observer variation. Sampling variability reflects the uneven distribution of histologic lesions in the liver tissue and may result in misdiagnosis and staging inaccuracy (5, 7). Inter- and intra-observer variation also limits the accuracy of the histopathologic evaluations and thus, affect the clinical judgment of the physicians (5, 6). This issue has been widely discussed in chronic liver disease especially viral hepatitis during the past 20 years, (8-10) and less frequently in biopsies obtained from NAFL-suspected cases. NAFLD comprises a wide morphological spectrum which makes the pathologic evaluation and distinction difficult. NAFLD is histologically further categorized into NAFL and NASH. The diagnosis is made based on the degree of steatosis, hepatocellular ballooning, and lobular inflammation (2). Histopathologic grading and staging of liver biopsies can be different between the pathologists (10, 11), and general pathologists and expert hepatopathologists can perform significantly different in assessing NASH suspected liver biopsies (12). Hepatocellular ballooning is highly prone to intra- as well as inter-observer variation (2), while the agreement is higher in fibrosis (13). Interventions such as image review by the pathologists and use of scoring sheet with written diagnostic criteria for different NAFLD phenotypes have been applied to improve the agreement (14). Also, various scoring systems have been proposed to improve the agreement of the pathologists (12). NAFLD activity score (NAS) is a histologic scoring system, widely accepted for evaluating NASH suspected liver biopsies (15). NAS ranges between 0 and 8, and NAS scores of 0 to 2 are not considered as NASH; Scores of 3 to 4 are considered indeterminate and scores of 5 to 8 recommend NASH diagnosis. The primary studies in NASH Clinical Research Network (16) have shown acceptable inter- and intra-rater agreement between the pathologists, but there have been few studies to assess the validity of NAS scoring systems outside NASH Clinical Research Network (16, 17). Some researchers believe that NAS is a valuable tool in clinical trials, while its generalizability and diagnostic accuracy should be studied (18). Although histopathologic evaluation of liver biopsies using NAS is becoming a routine practice in clinic, its accuracy is not usually considered during clinical decision making in diagnosis and follow-up. Also, the degree of accuracy is not clear in settings other than NASH Clinical Research Network. In this study, we intended to assess both inter-observer and intra-observer reproducibility of NAS scoring system in a group of Iranian pathologists.

2. Objectives

The aim of this study was to help the clinicians better judge the results of the liver biopsies both on diagnosis and follow-up of NAFLD patients.

3. Materials and Methods

In this cross-sectional study, 100 consecutive liver samples of adult cases suspected to NAFLD whom were biopsied in 6 different hospitals in Tehran, Iran, between 2010 and 2012 were included. All of the cases were diagnosed for NAFLD based on clinical evaluation and evidence of steatosis in ultrasonography after ruling out the other etiologies of fatty liver like excessive alcohol consumption and other chronic liver disease. The cases were Iranian and older than 18 years old. Ultrasound-assisted percutaneous liver biopsy was performed using Tru-Cut biopsy needles. The biopsy samples were sectioned in 3 different levels and stained by hematoxylin and eosin, trichrome, and reticulin methods. For the purpose of the study, these anonymous biopsy section samples were randomly assigned to four pathologists (3 general pathologists and one hepatopathologist) who had agreed to evaluate the samples. The pathologists were not aware of patient identity or open label pathology report, and reported the samples based on NAS scoring system (15). The samples were excluded from the study if the observers reported inadequate quality for the biopsy or staining. When all samples were evaluated, they were randomly reassigned to the pathologists for the second time in a way that each sample would be evaluated by two different pathologists. The pathologists were blinded to the result of the prior evaluation and the identity of the first evaluator. The agreement between the 2 raters was evaluated by intra-class correlation (ICC). From 100 biopsies, 91 cases were revisited by their first evaluator after 2 months and reported based on NAS scoring system. The agreement between the 2 evaluations was evaluated by ICC to assess the intra-observer agreement. The study was approved by the ethical committee of Middle East Center of Liver Disease (MELD). The results were analyzed using (SPSS version 16, Chicago, IL) Mean and standard deviation were used to describe the data. ICC was applied to assess the correlation between the pathologists' evaluations in different scales.

4. Results

One hundred liver biopsies of adult cases were evaluated. Four cases were excluded due to technical problems in staining. The mean age of participants was 41.5 (SD: 9.74) years and the range was 18 to 58 years. Sixty four cases were male. The results of the NAS total score based on the first pathologists' assessment is presented in Table 1.
Table 1.

Results of Total NAFLD Activity Score, Scores Based on the First Pathologists' Evaluation [a]

NAFLD Activity ScoreSamples, No.
0–235
3–4 30
5-8 31

aAccording to NAS system, if NAS score was 0-2 the diagnosis was NAFL; if NAS score was 3-4 the diagnosis was indeterminate; if NAS score was 5-8 the diagnosis was NASH.

aAccording to NAS system, if NAS score was 0-2 the diagnosis was NAFL; if NAS score was 3-4 the diagnosis was indeterminate; if NAS score was 5-8 the diagnosis was NASH. The ICC of the total NAS scores and steatosis, lobular infiltration, and ballooning scores are presented in Table 2. The ICC for total NAS score, steatosis, and fibrosis was more than 0.5. For lobular inflammation, the ICC was low but significant. The Inter-observer ICC for hepatocyte ballooning was not acceptable.
Table 2.

Inter-Observer Agreement of the Pathology Scores Based on NAS Scoring

ICC [a]95% confidence IntervalP value
Steatosis0.6540.523-0.755< 0.001
Lobular inflammation 0.2880.095-0.4610.002
Hepatocyte Ballooning 0.012-0.187-0.2110.452
Total score 0.6230.484-0.731< 0.001
Fibrosis 0.5040.338-0.639< 0.001

aAbbreviations: ICC, intra-class correlation.

aAbbreviations: ICC, intra-class correlation. The intra-observer agreement of the pathologists in different scales is demonstrated in Table 3. All scales showed significant level of ICC. Lowest intra-observer ICC was observed for lobular inflammation and the highest was observed for steatosis. There was no significant difference in the intra-observer agreement of the pathologists.
Table 3.

The Intra-Observer Agreement of the Pathology Scores Based on NAS Scoring

ICC [a]95% confidence IntervalP value
steatosis0.7540.648-0.831< 0.001
Lobular inflammation 0.4200.234-0.577< 0.001
Hepatocyte Ballooning 0.5630.403-0.690< 0.001
Total score 0.6860.558-0.782< 0.001
Fibrosis 0.7440.632-0.836< 0.001

a Abbreviation: ICC, intra-class correlation.

a Abbreviation: ICC, intra-class correlation.

5. Discussion

In our study, highest agreement was observed in steatosis scale both in inter-observer and intra-observer assessments. The agreement for hepatocyte ballooning and lobular inflammation was lower compared to other scales. Research has shown that ballooning and lobular inflammation are important pathologic features that enable discrimination of NASH (19). The study of Kleiner et al. (15) showed that the agreement on ballooning feature was acceptable although the agreement was low in inter-rater agreement of pediatric cases (kappa = 0.22). Fukusato et al. who evaluated inter-rater agreement of experienced hepatopathologists observed slight or poor agreement in steatosis, ballooning, intralobular necroinflammatory changes, and portal inflammation (20). Some experts believe that hepatocellular ballooning is “an ill-defined form of liver cell injury associated with cell swelling and rounding of the cytoplasm, the detection of which is prone to intra- as well as inter-observer variation”(2). We observed moderate inter-rater agreement and good intra-rater agreement in fibrosis score, while Kleiner et al. (15) showed highest agreement in this feature compared to other scales, although fibrosis is not essential in the diagnosis of NASH (2). Previous studies have shown lower agreement in pediatric cases compared to adults (11, 15). Our cases were at least 18 years and no comparison could be provided. Variability in the distribution of hepatic lesions within the liver reduces the accuracy of liver biopsy as the gold standard (21). Considering the mentioned weak points, some researchers suggest considering liver biopsy as the best rather than gold standard accurate staging and grading of chronic liver conditions (22). Experts have suggested a few methods for reducing the risk of misclassification. Interpretation of biopsies by experienced liver pathologists is one of these methods (22). Others have proposed that evaluation of hepatocellular ballooning might be more accurately performed by immunostaining methods (2). Providing clinical and laboratory findings of the patients could also be helpful in correct evaluation of the patients by the pathologists (23). Despite the importance of histopathologic findings in the management of liver diseases and their increasing prevalence, most of the pathologists have not received enough formal training and have little experience with liver biopsy (23). Some researchers have shown that in evaluation of liver samples from chronic viral hepatitis, the level of experience of the pathologist in terms of specialization, duration, and location of practice has a stronger effect on the agreement compared to the characteristics of the specimen (10). Many non-invasive methods have been proposed for evaluation of NAFLD–suspected cases. Fibroscan® is one of these methods which has gained popularity in the clinic. Accuracy of Fibroscan® is variable especially in health conditions such as steatosis, metabolic syndrome, high body mass index, hepatic hemangioma, and heart failure (24). Another limitation of Fibroscan® is based on the fact that the results are displayed as a single score and the clinicians do not have the chance to evaluate different sub-scales which are present in routine histopathologic reports. Although many experts believe that histological assessment of liver in NAFLD cases “is far from being both accurate and precise” (1), it is still the best available and acceptable method for evaluation and follow up of NAFLD cases. The downsides of the procedure are to be considered with attention and ancillary methods should be applied to improve the accuracy and reduce the risks. Development and improvement of scoring systems and complementary educational programs for pathologists are among possible solutions for improving the accuracy. Future technical developments may lead to new and improved methods and higher precision in laboratory and clinical evaluation of NAFLD patients. Finally, NAS scoring system has good overall inter-observer and intra-observer agreement, but more attention should be paid both in defining the hepatocyte ballooning and lobular inflammation and training of the pathologists to improve the accuracy of the pathology reports.
  24 in total

1.  Second opinion pathology in liver biopsy interpretation.

Authors:  P A Bejarano; A Koehler; K E Sherman
Journal:  Am J Gastroenterol       Date:  2001-11       Impact factor: 10.864

2.  Sources of variability in histological scoring of chronic viral hepatitis.

Authors:  Marie-Christine Rousselet; Sophie Michalak; Florence Dupré; Anne Croué; Pierre Bedossa; Jean-Paul Saint-André; Paul Calès
Journal:  Hepatology       Date:  2005-02       Impact factor: 17.425

Review 3.  Assessment of fibrosis in chronic liver diseases.

Authors:  Kun Zhou; Lun Gen Lu
Journal:  J Dig Dis       Date:  2009-02       Impact factor: 2.325

4.  Liver biopsy: the best, not the gold standard.

Authors:  Pierre Bedossa; Fabrice Carrat
Journal:  J Hepatol       Date:  2008-11-06       Impact factor: 25.083

5.  Correlation of vitamin E, uric acid, and diet composition with histologic features of pediatric NAFLD.

Authors:  Miriam B Vos; Ryan Colvin; Patricia Belt; Jean P Molleston; Karen F Murray; Philip Rosenthal; Jeffrey B Schwimmer; James Tonascia; Aynur Unalp; Joel E Lavine
Journal:  J Pediatr Gastroenterol Nutr       Date:  2012-01       Impact factor: 2.839

Review 6.  [Liver biopsy: Which role in patient management?].

Authors:  Jean-Yves Scoazec
Journal:  Ann Pathol       Date:  2010-11-26       Impact factor: 0.407

7.  Validation of the non-alcoholic fatty liver disease activity score.

Authors:  M Hjelkrem; C Stauch; J Shaw; S A Harrison
Journal:  Aliment Pharmacol Ther       Date:  2011-05-18       Impact factor: 8.171

8.  Design and validation of a histological scoring system for nonalcoholic fatty liver disease.

Authors:  David E Kleiner; Elizabeth M Brunt; Mark Van Natta; Cynthia Behling; Melissa J Contos; Oscar W Cummings; Linda D Ferrell; Yao-Chang Liu; Michael S Torbenson; Aynur Unalp-Arida; Matthew Yeh; Arthur J McCullough; Arun J Sanyal
Journal:  Hepatology       Date:  2005-06       Impact factor: 17.425

9.  A comparison of hepatopathologists' and community pathologists' review of liver biopsy specimens from patients with hepatitis C.

Authors:  Maire Robert; Andre N Sofair; Ann Thomas; Beth Bell; Stephanie Bialek; Christopher Corless; Grace Van Ness; Sharon Huie-White; Nicole Stabach; Atif Zaman
Journal:  Clin Gastroenterol Hepatol       Date:  2008-12-13       Impact factor: 11.382

10.  Liver Hemangioma Might Lead to overestimation of Liver Fibrosis by Fibroscan; A Missed Issue in Two Cases.

Authors:  Seyed Hossein Aalaei-Andabili; Leila Mehrnoush; Shima Salimi; Mustafa Shafiei; Seyed Moayed Alavian
Journal:  Hepat Mon       Date:  2012-06-30       Impact factor: 0.660

View more
  17 in total

1.  Prediction of nonalcoholic fatty liver disease (NAFLD) activity score (NAS) with multiparametric hepatic magnetic resonance imaging and elastography.

Authors:  Ziying Yin; Matthew C Murphy; Jiahui Li; Kevin J Glaser; Amy S Mauer; Taofic Mounajjed; Terry M Therneau; Heshan Liu; Harmeet Malhi; Armando Manduca; Richard L Ehman; Meng Yin
Journal:  Eur Radiol       Date:  2019-03-18       Impact factor: 5.315

2.  Feasibility and agreement of stiffness measurements using gradient-echo and spin-echo MR elastography sequences in unselected patients undergoing liver MRI.

Authors:  Guilherme Moura Cunha; Kevin J Glaser; Anke Bergman; Rodrigo P Luz; Eduardo H de Figueiredo; Flavia Paiva Proença Lobo Lopes
Journal:  Br J Radiol       Date:  2018-05-17       Impact factor: 3.039

3.  Inter-reader agreement of magnetic resonance imaging proton density fat fraction and its longitudinal change in a clinical trial of adults with nonalcoholic steatohepatitis.

Authors:  Jonathan C Hooker; Gavin Hamilton; Charlie C Park; Steven Liao; Tanya Wolfson; Soudabeh Fazeli Dehkordy; Cheng William Hong; Adrija Mamidipalli; Anthony Gamst; Rohit Loomba; Claude B Sirlin
Journal:  Abdom Radiol (NY)       Date:  2019-02

Review 4.  Histological assessment based on liver biopsy: the value and challenges in NASH drug development.

Authors:  Xiao-Fei Tong; Qian-Yi Wang; Xin-Yan Zhao; Ya-Meng Sun; Xiao-Ning Wu; Li-Ling Yang; Zheng-Zhao Lu; Xiao-Juan Ou; Ji-Dong Jia; Hong You
Journal:  Acta Pharmacol Sin       Date:  2022-02-14       Impact factor: 7.169

5.  Magnetic resonance elastography SE-EPI vs GRE sequences at 3T in a pediatric population with liver disease.

Authors:  Juan S Calle-Toro; Suraj D Serai; Erum A Hartung; David J Goldberg; Bradley D Bolster; Kassa Darge; Sudha A Anupindi
Journal:  Abdom Radiol (NY)       Date:  2019-03

6.  A diagnostic model for cirrhosis in patients with non-alcoholic fatty liver disease: an artificial neural network approach.

Authors:  Omid Pournik; Sara Dorri; Hedieh Zabolinezhad; Seyyed Moayed Alavian; Saeid Eslami
Journal:  Med J Islam Repub Iran       Date:  2014-10-21

Review 7.  Nonalcoholic Fatty liver disease/non-alcoholic steatohepatitis in childhood: endocrine-metabolic "mal-programming".

Authors:  Sara Manti; Claudio Romano; Valeria Chirico; Martina Filippelli; Caterina Cuppari; Italia Loddo; Carmelo Salpietro; Teresa Arrigo
Journal:  Hepat Mon       Date:  2014-05-01       Impact factor: 0.660

Review 8.  The influence of dietary fat on liver fat accumulation.

Authors:  Charlotte J Green; Leanne Hodson
Journal:  Nutrients       Date:  2014-11-10       Impact factor: 5.717

9.  Nonalcoholic fatty liver disease: different classifications concordance and relationship between degrees of morphological features and spectrum of the disease.

Authors:  Juliana Maya Monteiro; Geysa Maya Monteiro; Adriana Caroli-Bottino; Vera Lucia Pannain
Journal:  Anal Cell Pathol (Amst)       Date:  2014-12-02       Impact factor: 2.916

10.  Inter-observer Variability in Histomorphological Evaluation of Non-neoplastic Liver Biopsy Tissue and Impact of Clinical Information on Final Diagnosis in Shahid Beheshti University of Medical Sciences Affiliated Hospitals.

Authors:  Zeinab Kishanifarahani; Mahsa Ahadi; Behrang Kazeminejad; Tahmineh Mollasharifi; Malihe Saber Afsharian; Amir Sadeghi; Farahnaz Bidari; Elena Jamali; Niki Hasanzadeh; Abolfazl Movafagh; Arash Dehghan; Arsham Moradi; Afshin Moradi
Journal:  Iran J Pathol       Date:  2019-08-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.