S J Park 1 , S H Park 1 , Y J Choi 2 , D W Kim 3 , E J Son 1 , H S Lee 1 , J H Yoon 4 , E-K Kim 1 , H J Moon 1 , J Y Kwak 1 . Show Affiliations »
Abstract
PURPOSE: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. MATERIALS AND METHODS: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size < 5 mm), group 2 (5 mm ≤ nodule size < 10 mm), group 3 (10 mm ≤ nodule size < 20 mm), and group 4 (nodule size ≥ 20 mm). Three experienced (7 - 10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. RESULTS: Composition represented substantial or good agreement (k = 0.719 - 0.89). Margin showed the lowest agreement (k = 0.322 - 0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k = 0.674), group 2 (k = 0.596), group 3 (k = 0.674), and group 4 (k = 0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value < 0.05) and lowest in group 1. NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p < 0.05). CONCLUSION: Interobserver agreements were relatively good (k = 0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5 mm in maximum diameter. © Georg Thieme Verlag KG Stuttgart · New York.
PURPOSE: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. MATERIALS AND METHODS: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size < 5 mm), group 2 (5 mm ≤ nodule size < 10 mm), group 3 (10 mm ≤ nodule size < 20 mm), and group 4 (nodule size ≥ 20 mm). Three experienced (7 - 10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. RESULTS: Composition represented substantial or good agreement (k = 0.719 - 0.89). Margin showed the lowest agreement (k = 0.322 - 0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k = 0.674), group 2 (k = 0.596), group 3 (k = 0.674), and group 4 (k = 0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value < 0.05) and lowest in group 1. NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p < 0.05). CONCLUSION: Interobserver agreements were relatively good (k = 0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5 mm in maximum diameter. © Georg Thieme Verlag KG Stuttgart · New York.
Entities: Chemical
Mesh: See more »
Year: 2012
PMID: 23108925 DOI: 10.1055/s-0032-1325404
Source DB: PubMed Journal: Ultraschall Med ISSN: 0172-4614 Impact factor: 6.548