Literature DB >> 35242444

A Study of Thyroid Fine Needle Aspiration of Follicular Adenoma in the "Atypia of Undetermined Significance" Bethesda Category Using Digital Image Analysis.

Keluo Yao1, Xin Jing2, Jerome Cheng2, Ulysses G J Balis2, Liron Pantanowitz2, Madelyn Lew2.   

Abstract

BACKGROUND: Originally designed for computerized image analysis, ThinPrep is underutilized in that role outside gynecological cytology. It can be used to address the inter/intra-observer variability in the evaluation of thyroid fine needle aspiration (TFNA) biopsy and help pathologists to gain additional insight into thyroid cytomorphology.
METHODS: We designed and validated a feature engineering and supervised machine learning-based digital image analysis method using ImageJ and Python scikit-learn . The method was trained and validated from 400 low power (100x) and 400 high power (400x) images generated from 40 TFNA cases. RESULT: The area under the curve (AUC) for receiver operating characteristics (ROC) is 0.75 (0.74-0.82) for model based from low-power images and 0.74 (0.69-0.79) for the model based from high-power images. Cytomorphologic features were synthesized using feature engineering and when performed in isolation, they achieved AUC of 0.71 (0.64-0.77) for chromatin, 0.70 (0.64-0.73) for cellularity, 0.65 (0.60-0.69) for cytoarchitecture, 0.57 (0.51-0.61) for nuclear size, and 0.63 (0.57-0.68) for nuclear shape.
CONCLUSION: Our study proves that ThinPrep is an excellent preparation method for digital image analysis of thyroid cytomorphology. It can be used to quantitatively harvest morphologic information for diagnostic purpose.
© 2022 The Authors.

Entities:  

Keywords:  Bethesda; Digital image analysis; Fine needle aspiration; ThinPrep; Thyroid

Year:  2022        PMID: 35242444      PMCID: PMC8864759          DOI: 10.1016/j.jpi.2022.100004

Source DB:  PubMed          Journal:  J Pathol Inform


List of Abbreviations

Introduction

As one of the more accessible organs for fine needle aspiration (FNA) biopsy, thyroid nodules are frequently evaluated for cytologic diagnosis to determine surgical versus conservative management. While a subset of thyroid FNA (T-FNA) contains clear cytomorphologic features of neoplastic lesions that can be definitively and reliably diagnosed amongst cytopathologists, up to 21% of cases within some institutions can display cellular and architectural atypia insufficient for definitive diagnosis, leaving a significant element of uncertainty of appropriate management for clinicians to pursue., Many indeterminate results due to architectural atypia identified within T-FNAs are reported by pathologists as “atypia of undetermined significance/follicular lesion of undetermined significance (AUS/FLUS)” (Bethesda category III) in the Bethesda System for Reporting Thyroid Cytopathology TBSRTC. While TBSRTC recommends molecular assays for both categories to guide management, many clinicians are seeking lower cost options to enhance the diagnostic accuracy of the existing cytological material, particularly in the indeterminate diagnostic categories. In our current study, we evaluated an alternative pathway to an objective, reproducible diagnosis by utilizing an existing cytologic preparation technique optimized for digital pathology and machine learning algorithms., The use of this technology can provide a substitute pathway to resolve indeterminate diagnostic categories through digital evaluation and classification of cytomorphologic features (follicular group architecture, smear cellularity, amount of colloid, and cytologic atypia) associated with follicular neoplasms. To our best knowledge to date, ThinPrep ® is underutilized in this regard but is widely used by many institutions for the evaluation of thyroid aspirate material. ThinPrep ® is conveniently primed for digital image analysis (DIA), as it is created to reduce the variability of stains and was originally developed for the ThinPrep Imaging System. In this study, we aim to evaluate the feasibility of applying DIA on T-FNA material prepared by the ThinPrep ® procedure and use it to gain more insight to improve the diagnostic accuracy of thyroid aspiration cytology.

Method

Case Collection and Image Capture

To reduce the complexity of the study, we decided to focus on the morphologic difference between surgically verified benign thyroid vs. thyroid with follicular adenoma, as the extent and degree of morphologic criteria are more subjective rendering less reproducible diagnoses in comparison to other thyroid lesions with cytologic (nuclear) atypia such as papillary thyroid lesions. From our laboratory information system (LIS), we performed a structured query language (SQL) search for all surgical resection cases diagnosed as follicular adenoma or thyroid with nodular hyperplasia. Cross referencing the prior T-FNAs , we identified 20 T-FNAs diagnosed as AUS/FLUS, with subsequent diagnoses of follicular adenoma on surgical resections and 20 T-FNAs with subsequent diagnoses of benign thyroid nodules on surgical resections. Digital images of 10 mid-power (100x) and 10 high-power (400x) fields on the ThinPrep material were obtained using a DP71 camera (3500 Corporate Parkway, Center Valley, PA 18034, Olympus, USA) on an Olympus BX51 microscope with CellSens Entry v1.12 (Olympus, USA). The mid-power fields were randomly taken to evaluate overall specimen cellularity while the high-power fields captured follicular cells. All images associated with each case were grouped together and further reviewed by a board-certified cytopathologist (ML) to evaluate for adequate cellularity and to render a diagnosis within the Bethesda classification system. Unsatisfactory cases with insufficient cellularity were removed from the study. In total, we curated 800 images through the above process.

Image Analysis

To maximize the use of the images, a custom image analysis algorithm was developed based on cytomorphology feature engineering and supervised machine learning.

Cytomorphology Feature Engineering

We used ImageJ v1.51p (NIH, USA) to develop cytomorphology feature engineering. The process consists of image segmentation followed by feature extraction (Fig. 1). For image segmentation, we started with preprocessing of the images by substracting the background, followed by red‑green‑blue color channel separation. We only extracted the green channels and created masks for all nuclei using an automatic threshold method. The feature extraction processed focused on the nuclei which were treated as individual “particles” with low-level features. The low-level features are selectively grouped together based on the authors’ cytomorphology knowledge to form medium and high level features (Table 1). For example, a medium level feature, nuclear size, or simply size, is composed of mean and standard deviation of nuclear area, which are low level features. Cytology, a high-level feature, is composed of three medium level features, chromatin, shape, and size. For the high-power images, the “particles” were filtered by some low level features such as size and circularity to remove background noise. These low level features were also used to distinguish or “gate” individual nuclei from closely grouped clusters to detect crowding of follicular cells. The “cellularity” high level features were extracted only from the mid-power images. Altogether, we have a total of 86 low level nuclear features used to construct three medium level feature models (chromatin, shape, and size), three high level feature models (cellularity, architecture, and cytology), and two models based on magnification (low and high power).
Figure 1

The segmentation and feature extraction of each image (A) starts with background subtraction (B), followed by conversion to 8-bit grayscale image (green channel only) through color deconvolution (C), automatic threshold segmentation, and finally a mask (red) for the nuclear features (D). The extracted nuclei features are further “gated” (high-power only) using size and circularity to separate out individual nuclei from closely grouped clusters.

Table 1

Details of the predictive models and features performance.

Magnification1High2Medium3Low4Gate5Mean/StdDv6Feature importance7FA value8B9 value9T-test score10
Mid-powerCellularityN/ACountINoneMean0.143268.57164.310.000
Mid-powerCellularityN/ATotal AreaIINoneMean0.10441191.5224885.810.001
Mid-powerN/AN/AAreaIIINoneMean0.060153.44129.370.122
Mid-powerN/AN/ACircIVNoneMean0.0830.900.900.832
Mid-powerN/AN/AMaxFeretVNoneMean0.06712.8412.270.248
Mid-powerN/AN/AIntDenVINoneMean0.05625581.1321781.700.175
Mid-powerN/AN/AKurtVIINoneMean0.066-1.03-1.020.586
Mid-powerN/AN/AMeanVIIINoneMean0.061174.81177.030.263
Mid-powerN/AN/AMedianIXNoneMean0.057176.21178.170.329
Mid-powerN/AN/AMinFeretXNoneMean0.0678.598.250.258
Mid-powerN/AN/AModeXINoneMean0.061177.56179.190.465
Mid-powerN/AN/APerimXIINoneMean0.06235.1233.140.180
Mid-powerN/AN/ASkewXIIINoneMean0.055-0.28-0.260.050
Mid-powerN/AN/ASolidityXIVNoneMean0.0580.890.890.519
High-powerArchitectureN/AARXVClusterStdDv0.0100.380.300.017
High-powerArchitectureN/AARClusterMean0.0091.741.460.001
High-powerArchitectureN/AAreaClusterMean0.0145711.592252.090.000
High-powerArchitectureN/AAreaClusterStdDv0.0094731.231443.710.000
High-powerArchitectureN/ACircClusterStdDv0.0140.110.070.000
High-powerArchitectureN/ACircClusterMean0.0090.320.270.003
High-powerArchitectureN/AMaxFeretClusterMean0.01698.8567.440.000
High-powerArchitectureN/AMaxFeretClusterStdDv0.00940.9118.640.000
High-powerArchitectureN/AMinFeretClusterMean0.01160.0240.960.000
High-powerArchitectureN/AMinFeretClusterStdDv0.00925.4511.620.000
High-powerArchitectureN/APerimClusterMean0.015432.76264.520.000
High-powerArchitectureN/APerimClusterStdDv0.010274.11108.590.000
High-powerArchitectureN/ARoundClusterMean0.0120.520.440.001
High-powerArchitectureN/ARoundClusterStdDv0.0080.110.090.004
High-powerArchitectureN/ASolidityClusterMean0.0150.690.570.000
High-powerArchitectureN/ASolidityClusterStdDv0.0120.060.040.000
High-powerCytologyChromatinIntDenSingleStdDv0.02017007.1812785.690.000
High-powerCytologyChromatinIntDenClusterMean0.016478875.76169419.420.000
High-powerCytologyChromatinIntDenSingleMean0.01640865.4134437.490.000
High-powerCytologyChromatinIntDenClusterStdDv0.010395319.99139400.120.004
High-powerCytologyChromatinKurtSingleStdDv0.0130.470.430.121
High-powerCytologyChromatinKurtClusterMean0.013-0.48-0.380.046
High-powerCytologyChromatinKurtSingleMean0.012-0.51-0.460.079
High-powerCytologyChromatinKurtClusterStdDv0.0080.380.300.126
High-powerCytologyChromatinMaxXVISingleStdDv0.0284.552.960.000
High-powerCytologyChromatinMaxSingleMean0.020127.32115.620.001
High-powerCytologyChromatinMaxClusterMean0.015132.7099.760.000
High-powerCytologyChromatinMaxClusterStdDv0.00811.497.990.010
High-powerCytologyChromatinMeanClusterMean0.01876.3456.040.000
High-powerCytologyChromatinMeanClusterStdDv0.0166.673.760.000
High-powerCytologyChromatinMeanSingleStdDv0.01411.759.780.000
High-powerCytologyChromatinMeanSingleMean0.01282.0073.230.001
High-powerCytologyChromatinMedianXVIIClusterStdDv0.0198.294.690.000
High-powerCytologyChromatinMedianClusterMean0.01575.0854.080.000
High-powerCytologyChromatinMedianSingleStdDv0.01514.3311.990.000
High-powerCytologyChromatinMedianSingleMean0.01378.8669.880.001
High-powerCytologyChromatinMinXVIIISingleStdDv0.01514.1411.140.001
High-powerCytologyChromatinMinSingleMean0.01443.5940.290.108
High-powerCytologyChromatinMinClusterMean0.01228.3922.310.001
High-powerCytologyChromatinMinClusterStdDv0.0097.884.860.000
High-powerCytologyChromatinModeXIXClusterMean0.01970.1447.870.000
High-powerCytologyChromatinModeClusterStdDv0.01814.819.010.000
High-powerCytologyChromatinModeSingleStdDv0.01420.4916.810.000
High-powerCytologyChromatinModeSingleMean0.01271.1062.520.001
High-powerCytologyChromatinRawIntDenSingleMean0.01840865.4134437.490.000
High-powerCytologyChromatinRawIntDenSingleStdDv0.01617007.1812785.690.000
High-powerCytologyChromatinRawIntDenClusterMean0.015478875.76169419.420.000
High-powerCytologyChromatinRawIntDenClusterStdDv0.009395319.99139400.120.004
High-powerCytologyChromatinSkewSingleMean0.0210.340.470.000
High-powerCytologyChromatinSkewClusterMean0.0150.140.250.006
High-powerCytologyChromatinSkewSingleStdDv0.0100.390.370.089
High-powerCytologyChromatinSkewClusterStdDv0.0100.260.170.000
High-powerCytologyChromatinStdDevSingleMean0.02320.0718.200.002
High-powerCytologyChromatinStdDevSingleStdDv0.0124.583.490.000
High-powerCytologyChromatinStdDevClusterMean0.01121.9016.840.000
High-powerCytologyChromatinStdDevClusterStdDv0.0112.881.910.000
High-powerCytologyShapeARSingleMean0.0171.311.340.038
High-powerCytologyShapeARSingleStdDv0.0120.270.280.163
High-powerCytologyShapeCircSingleMean0.0220.730.760.004
High-powerCytologyShapeCircSingleStdDv0.0120.100.090.405
High-powerCytologyShapeMaxFeretSingleMean0.01629.1329.010.804
High-powerCytologyShapeMaxFeretSingleStdDv0.0126.205.770.064
High-powerCytologyShapeMinFeretSingleMean0.01622.3321.770.127
High-powerCytologyShapeMinFeretSingleStdDv0.0134.283.760.001
High-powerCytologyShapePerimSingleMean0.01887.9085.510.108
High-powerCytologyShapePerimSingleStdDv0.01119.2317.450.010
High-powerCytologyShapeRoundSingleMean0.0190.770.760.230
High-powerCytologyShapeRoundSingleStdDv0.0090.130.130.995
High-powerCytologyShapeSoliditySingleMean0.0190.890.900.265
High-powerCytologyShapeSoliditySingleStdDv0.0110.030.030.371
High-powerCytologySizeAreaSingleMean0.016482.09461.700.100
High-powerCytologySizeAreaSingleStdDv0.011183.26157.580.000

1. The magnification models are consisted of mid-power (100x) and high-power (400x) models. 2. High = high level features; 3. Medium = medium level features; 4. Low = low level features. 5. The “gate” filters follicular cell nuclei into single vs overlapping clusters based on: single nuclei have areas (III) between 100 and 1200 pixels and circ (IV) between 0.5 and 1.0; overlapping clusters have areas (III) between 1200 to infinite pixels and circularity between 0.0 and 1.0. 6. Values collected as mean vs standard deviation. 7. Feature importance dictates contribution (in percentage) of each feature to the predictive accuracy of the model. 8. Average value collected from follicular adenoma images. 9. Average value collected from benign (B9) thyroid images. 10. Student’s P values of each feature based on comparing values from follicular adenoma vs benign thyroid.

Count (I) - Number of separated nuclei, including both single and clustered nuclei; Total area (II) - Total area of the image occupied by nuclei in pixels; each pixel corresponds to an area of 0.064 μM2 for the high-power model and 0.016 μM2 for the mid-power model; Area (III) - Area of region of individual nuclei in square pixels; Circ (IV) - 4 π (Area/Perimeter2); 1.0 is a perfect circle; the value approaches 0 as the shape elongates; MaxFeret (V) - Feret's diameter: Maximum caliper; conversion factor 0.08 μM for high-power model and 0.04 μM for mid-power model; IntDen (VI) - Integrated density: area times mean gray value; Kurt (VII) - Kurtosis: The fourth-order moment about the mean; Mean (VIII) - Average gray value of the pixels in each nucleus/cluster of nuclei; The values range from 0 to 255; Medium (IX) - The median gray value of the pixels in the entire image; MinFeret (X) - Minmum Feret's diameter: minimum caliper; conversion factor 0.08 μM for high-power model and 0.04 μM for mid-power model; Mode (XI) - Most frequently occurring gray value of the pixels in each nucleus/cluster of nuclei; Perim (XII) - The length of the outside boundary of each nucleus/cluster of nuclei; multiple the value by 0.08 to get a measurement in μM for the high-power model and 0.04 for the mid-power model; Skew (XIII) - The third-order moment about the mean; Solidity (XIV) - Area/Convex Area; AR (XV) - Aspect ratio: Major axis/Minor axis; Max (XVI) - Maximum gray values of the pixels in each nucleus/cluster of nuclei; value range from 0 to 255; Median (XVII) - The median value of the pixels in the entire image; values range from 0 to 255; Min (XVIII) - Minimum gray values of the pixels in each nucleus/cluster of nuclei; values range from 0 to 255; Mode (XIX) - Most frequently occurring gray value of the pixels in each nucleus/cluster of nuclei; values range from 0 to 255.

The segmentation and feature extraction of each image (A) starts with background subtraction (B), followed by conversion to 8-bit grayscale image (green channel only) through color deconvolution (C), automatic threshold segmentation, and finally a mask (red) for the nuclear features (D). The extracted nuclei features are further “gated” (high-power only) using size and circularity to separate out individual nuclei from closely grouped clusters. Details of the predictive models and features performance. 1. The magnification models are consisted of mid-power (100x) and high-power (400x) models. 2. High = high level features; 3. Medium = medium level features; 4. Low = low level features. 5. The “gate” filters follicular cell nuclei into single vs overlapping clusters based on: single nuclei have areas (III) between 100 and 1200 pixels and circ (IV) between 0.5 and 1.0; overlapping clusters have areas (III) between 1200 to infinite pixels and circularity between 0.0 and 1.0. 6. Values collected as mean vs standard deviation. 7. Feature importance dictates contribution (in percentage) of each feature to the predictive accuracy of the model. 8. Average value collected from follicular adenoma images. 9. Average value collected from benign (B9) thyroid images. 10. Student’s P values of each feature based on comparing values from follicular adenoma vs benign thyroid. Count (I) - Number of separated nuclei, including both single and clustered nuclei; Total area (II) - Total area of the image occupied by nuclei in pixels; each pixel corresponds to an area of 0.064 μM2 for the high-power model and 0.016 μM2 for the mid-power model; Area (III) - Area of region of individual nuclei in square pixels; Circ (IV) - 4 π (Area/Perimeter2); 1.0 is a perfect circle; the value approaches 0 as the shape elongates; MaxFeret (V) - Feret's diameter: Maximum caliper; conversion factor 0.08 μM for high-power model and 0.04 μM for mid-power model; IntDen (VI) - Integrated density: area times mean gray value; Kurt (VII) - Kurtosis: The fourth-order moment about the mean; Mean (VIII) - Average gray value of the pixels in each nucleus/cluster of nuclei; The values range from 0 to 255; Medium (IX) - The median gray value of the pixels in the entire image; MinFeret (X) - Minmum Feret's diameter: minimum caliper; conversion factor 0.08 μM for high-power model and 0.04 μM for mid-power model; Mode (XI) - Most frequently occurring gray value of the pixels in each nucleus/cluster of nuclei; Perim (XII) - The length of the outside boundary of each nucleus/cluster of nuclei; multiple the value by 0.08 to get a measurement in μM for the high-power model and 0.04 for the mid-power model; Skew (XIII) - The third-order moment about the mean; Solidity (XIV) - Area/Convex Area; AR (XV) - Aspect ratio: Major axis/Minor axis; Max (XVI) - Maximum gray values of the pixels in each nucleus/cluster of nuclei; value range from 0 to 255; Median (XVII) - The median value of the pixels in the entire image; values range from 0 to 255; Min (XVIII) - Minimum gray values of the pixels in each nucleus/cluster of nuclei; values range from 0 to 255; Mode (XIX) - Most frequently occurring gray value of the pixels in each nucleus/cluster of nuclei; values range from 0 to 255.

Supervised Machine Learning

Supervised machine learning methods aim to automatically create algorithms based on known paired input (features) and expected output (e.g., ground truth) data. Training data are used to optimize the weights and parameters of the algorithms while the validation data were used to validate the generalizability and performance of the trained algorithm. Utilizing the above rules, the various combination of features based on the models were utilized as the input and surgical report (Follicular adenoma vs benign thyroid) of the T-FNA were the expected output. Follicular adenoma was considered as a positive result. Using Python sklearn library, we used gradient boost classifier (GBC) and extra tree classifier (ETC) as our supervised machine learning methods. The training and validation data were randomly split 1:1 from the collected data using a data splitting algorithm. The process was also repeated three times to further ensure generalization and to prevent overfitting. We also used extra tree classifier to evaluate the importance of low level features using all available data.

Result

The measure of a predictive test performance calls for measurement in accuracy, the closeness of the measurements to a specific value; precision, also known as positive-predictive value; recall, sensitivity. Since all features were used between the high and low power models, their performances are the direct measurement of the DIA algorithm design. Using validation data only, the mid-power model achieved an average accuracy of 0.71 (0.70–0.74), precision 0.72 (0.69–0.74), and recall 0.71 (0.64–0.75); the high-power model achieved an average accuracy of 0.67 (0.63–0.72), precision of 0.67 (0.62–0.74), and recall 0.69 (0.60–0.74). By direct comparison, the cytopathologist who reviewed all the images achieved an accuracy of 0.625, precision 0.57, and recall 0.95. Receiver operating characteristics (ROC) is also used to evaluate the diagnostic ability of a test as its discrimination threshold is changed. Using validation data only, the AUCs are 0.75 (0.74–0.82) for mid-power magnification model and 0.74 (0.69–0.79) for high-power magnification model. For high level features models, AUCs are 0.70 (0.64–0.73) for cellularity, 0.65 (0.60–0.69) for architecture, and 0.74 (0.69–0.80) for cytology. The AUC for ROC for medium level features are 0.57 (0.51–0.61) for nuclear size, 0.63 (0.57–0.68) for nuclear shape, and 0.71 (0.64–0.77) for nuclear chromatin (Fig. 2). Table 1 gives additional details on the breakdown of prediction accuracy contribution and statistical analyses of all features.
Figure 2

The predictive performance evaluation of mid-power (A) and high-power (B) models, high level features (A-1, B-1, B-2), and medium level features (B-3, B-4, B-5) using receiver operating characteristics (ROC) and quantified by area under the curve (AUC); ETC = extra tree classifier; GBC = gradient boost classifier.

The predictive performance evaluation of mid-power (A) and high-power (B) models, high level features (A-1, B-1, B-2), and medium level features (B-3, B-4, B-5) using receiver operating characteristics (ROC) and quantified by area under the curve (AUC); ETC = extra tree classifier; GBC = gradient boost classifier. Since the high- and medium-power magnification models have reasonable performance base on the validation results, the high and medium level features models can be considered as statistical hypothesis tests to evaluate the importance of each group of features and their contribution to the accuracy of the models. Based on this method, all three high level features, cellularity, architecture, and cytology appear to contribute significantly. For the medium level features, nuclear chromatin appears to be the strongest contributor while nuclear shape is a distant second. The nuclei size, on the other hand, appears to be non-contributory , a finding collaborated by statistical analysis (P=0.10) of the size variation between T-FNA from follicular adenoma and benign thyroid(Table 1). The minimal presence of colloid material in ThinPrep combined with technical limitations prevented the incorporation of these morphologic features into our models.

Discussion

The current evaluation of T-FNA relies on manual visual evaluation by cytopathologists. It is known that while the human visual system is excellent at recognizing patterns, it performs poorly on quantitative tasks and is susceptible to optical illusions. Most suspicious or malignant (Bethesda category IV to VI) T-FNA cases show higher rates of diagnostic reproducibility among cytopathologists as they present with more pronounced architectural and cytologic features. In these cases, there is little need for repeated T-FNA or ancillary molecular tests for further characterization as the evidence for surgical management is well established. However, for Bethesda category III, the degree of cytologic and architectural atypia may be subtle and variable, which explains the high degree of inter-observer variability. Furthermore, the Bethesda criteria for this diagnostic category, whether architectural or cytologic atypia, are not defined in quantifiable methods and therefore are fundamentally subjective. Our study shows that while looking at the exact same set of images, board-certified cytopathologists may err on the side of caution and sacrifice overall accuracy. While molecular testing provides an alternative to repeat T-FNA , it comes at a cost of additional needle passes and the assay itself. Additionally, their exact predictive performance for entities like follicular adenoma remains controversial based on existing published data. Our results show that routine T-FNA augmented by using ThinPrep material can produce predictions with the pre-existing diagnostic material with increased overall accuracy by quantitatively evaluating morphologic features. Therefore, concurrent evaluation of preliminarily indeterminate T-FNA with DIA may present as a more cost-effective method for evaluating thyroid nodules without additional biopies or molecular studies. Additionally, as a liquid-based cytology preparation that uses standardized instruments to produce monolayers of well-stained and well-preserved cells, ThinPrep may be further explored for further non-gynecologic image analysis applications. Our DIA design also examines the morphologic difference between T-FNA from follicular adenoma and benign thyroid. The performances of the high level feature models show cellularity, architecture, and cytology appear to contribute to the accuracy of the models (Table 1 and Fig. 2). ThinPrep material from follicular adenoma has a higher degree of cellularity, greater follicular cell crowding, and quantifiable nuclear difference than benign thyroid (Table 1 and Fig. 2). Further characterization of the nuclear morphology profiles using the medium level features shows nuclear chromatin appears to be the strongest contributor to accuracy while the nuclear shape is a distant second. The nuclear size was not a discriminating feature (with AUC close to 0.5) and this finding is further supported by the student T-test (P > 0.05) for nuclear size (Table 1). Characterization of the nuclear chromatin profile and shape difference beyond the listed performance and statistical metrics is suboptimal due to limitations of sample size and technical limitations (Table 1). However, the above findings reaffirm that cellularity, chromatin texture, and architectural features are diagnostically important in ThinPrep-based T-FNA for follicular adenoma. To the best of our knowledge, this is the first attempt to apply DIA to simultaneously build predictive models to better separate indeterminate thyroid diagnostic categories (Bethesda III) and to investigate T-FNA cytomorphology in ThinPrep material. While T-FNA cytomorphology is well studied on manually made smears, the decades of utilization in computer image analysis assisted diagnosis for gynecologic cytology (e. g., ThinPrep Imaging System) and the recent advances in digital image analysis merit a second look for expanded applications for liquid-based preparation such as ThinPrep. Limitations of our current study include the low number of cases in the dataset and comparing DIA against a single cytopathologist. The scope of the DIA algorithm is currently limited to T-FNA of follicular adenoma or benign thyroid nodule with ThinPrep material. A whole slide imaging method was not used due to limited development time. We do believe that mid- and high-power models can sufficiently capture the vast majority of the morphologic features and thus this study can serve as proof-of-concept and pave ways for more advanced future studies to build DIA-based decision-support tools for T-FNA .
AUCArea under the curve
AUSAtypia of undetermined significance
DIADigital image analysis
ETCExtra tree classifier
FLUSFollicular lesion of undetermined significance
FNAFine needle aspiration
GBCGradient boost classifier
ROCReceiver operating characteristics
SQLStructured query language
TFNAThyroid fine needle aspiration
  12 in total

Review 1.  Can we change the way we screen?: the ThinPrep Imaging System.

Authors:  Andrea E Dawson
Journal:  Cancer       Date:  2004-12-25       Impact factor: 6.860

Review 2.  The atypical thyroid fine-needle aspiration: past, present, and future.

Authors:  Massimo Bongiovanni; Jeffrey F Krane; Edmund S Cibas; William C Faquin
Journal:  Cancer Cytopathol       Date:  2011-07-12       Impact factor: 5.284

Review 3.  The Gold Standard Paradox in Digital Image Analysis: Manual Versus Automated Scoring as Ground Truth.

Authors:  Famke Aeffner; Kristin Wilson; Nathan T Martin; Joshua C Black; Cris L Luengo Hendriks; Brad Bolon; Daniel G Rudmann; Roberto Gianani; Sally R Koegler; Joseph Krueger; G Dave Young
Journal:  Arch Pathol Lab Med       Date:  2017-05-30       Impact factor: 5.534

4.  Application of the Bethesda Classification for Thyroid Fine-Needle Aspiration: Institutional Experience and Meta-analysis.

Authors:  Elliot A Krauss; Megan Mahon; Jean M Fede; Lanjing Zhang
Journal:  Arch Pathol Lab Med       Date:  2016-10       Impact factor: 5.534

Review 5.  Update on Molecular Testing for Cytologically Indeterminate Thyroid Nodules.

Authors:  Michiya Nishino; Marina Nikiforova
Journal:  Arch Pathol Lab Med       Date:  2018-01-16       Impact factor: 5.534

6.  Performance differences between conventional smears and liquid-based preparations of thyroid fine-needle aspiration samples: analysis of 47,076 responses in the College of American Pathologists Interlaboratory Comparison Program in Non-Gynecologic Cytology.

Authors:  Andrew H Fischer; Amy C Clayton; Joel S Bentz; Patricia G Wasserman; Michael R Henry; Rhona J Souers; Ann T Moriarty
Journal:  Arch Pathol Lab Med       Date:  2013-01       Impact factor: 5.534

7.  Assisted primary screening using the automated ThinPrep Imaging System.

Authors:  Charles V Biscotti; Andrea E Dawson; Bruce Dziura; Luis Galup; Teresa Darragh; Amir Rahemtulla; Lisa Wills-Frank
Journal:  Am J Clin Pathol       Date:  2005-02       Impact factor: 2.493

Review 8.  The thyroid gland: recurring problems in histologic and cytologic evaluation.

Authors:  William C Faquin
Journal:  Arch Pathol Lab Med       Date:  2008-04       Impact factor: 5.534

Review 9.  Comparative analysis of diagnostic performance, feasibility and cost of different test-methods for thyroid nodules with indeterminate cytology.

Authors:  Salvatore Sciacchitano; Luca Lavra; Alessandra Ulivieri; Fiorenza Magi; Gian Paolo De Francesco; Carlo Bellotti; Leila B Salehi; Maria Trovato; Carlo Drago; Armando Bartolazzi
Journal:  Oncotarget       Date:  2017-07-25

Review 10.  Thyroid nodule update on diagnosis and management.

Authors:  Shrikant Tamhane; Hossein Gharib
Journal:  Clin Diabetes Endocrinol       Date:  2016-10-03
View more
  1 in total

1.  Successful integration of thyroid cytopathology and surgical pathology education in an E-module format.

Authors:  Joseph M Rohr; Maheswari Mukherjee; Amber Donnelly; Sarah Sprinkle; Ernesto Martinez Duarte; Ana Yuil Valdes
Journal:  J Pathol Inform       Date:  2022-07-05
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.