| Literature DB >> 27798083 |
Gila Lithwick-Yanai1, Nir Dromi1, Alexander Shtabsky2,3, Sara Morgenstern3,4, Yulia Strenov3,4, Meora Feinmesser3,4, Vladimir Kravtsov3,5, Marino E Leon6, Marián Hajdúch7, Syed Z Ali8, Christopher J VandenBussche8, Xinmin Zhang9,10, Leonor Leider-Trejo2,3, Asia Zubkov2, Sergey Vorobyov11, Michal Kushnir1, Yaron Goren1,12, Sarit Tabak1, Etti Kadosh1, Hila Benjamin13, Temima Schnitzer-Perlman1, Hagai Marmor1, Maria Motin1, Danit Lebanony1, Sharon Kredo-Russo1, Heather Mitchell13, Melissa Noller13, Alexis Smith13, Olivia Dattner13, Karin Ashkenazi13, Mats Sanden13, Kenneth A Berlin13, Dganit Bar1, Eti Meiri1.
Abstract
AIMS: The distinction between benign and malignant thyroid nodules has important therapeutic implications. Our objective was to develop an assay that could classify indeterminate thyroid nodules as benign or suspicious, using routinely prepared fine needle aspirate (FNA) cytology smears.Entities:
Keywords: DIAGNOSTICS; LABORATORY TESTS; MOLECULAR ONCOLOGY; THYROID; THYROID CANCER
Mesh:
Substances:
Year: 2016 PMID: 27798083 PMCID: PMC5484037 DOI: 10.1136/jclinpath-2016-204089
Source DB: PubMed Journal: J Clin Pathol ISSN: 0021-9746 Impact factor: 3.411
Figure 1Assay development. The study was composed of three stages: (1) a discovery phase, (2) training, (3) validation. Discovery studies: (I) An initial set of 96 miRNAs was selected based on their differential expression in benign and malignant samples (53 formalin-fixed paraffin-embedded (FFPE), 73 cell block and 84 fine needle aspirate (FNA) samples) as seen on custom microarray and next generation sequencing (NGS) experiments. (II) This set of miRNAs was then evaluated on FNA smears (n=82) using qRT-PCR and 24 miRNAs were selected for the assay training and validation stages. Training: the final assay classifier was developed and cross validated on an FNA training set (n=375). Validation: the test was validated on a blinded set of 189 indeterminate samples from Bethesda classes III, IV and V for which at least one out of two independent pathologists agreed with the original pathologist regarding the final diagnosis (benign or malignant) of the sample. The results of the test on a subset of validation set samples for which all three pathologists agreed (n=150) were also assessed.
Tumour samples used in the study
| Training* | Validation | |
|---|---|---|
| Cohort | ||
| #Samples | 375 | 201 |
| #Patients | 357 | 201 |
| % Malignant | 49 | 30 |
| Age (median) | 54 | 53 |
| % Females | 73 | 80 |
| Cytology | ||
| #Giemsa | 212 | 90 |
| #Diff-Quik | 95 | 21 |
| #Papanicolaou | 62 | 90 |
| #Bethesda II | 27 | 0 |
| #Bethesda III | 80 | 29 |
| #Bethesda IV | 142 | 131 |
| #Bethesda V | 77 | 41 |
| #Bethesda VI | 49 | 0 |
*Patient age was missing for 64 training samples and patient gender was missing for 10 training samples. Three training samples were created by mixing more than one slide (with different stains), two were unstained, and for one the stain was unknown.
MicroRNAs profiled in the assay
| MicroRNA* | Sequence† | Forward primer sequence‡ |
|---|---|---|
| hsa-miR-31-5p | AGGCAAGATGCTGGCATAGCT | AGGCAAGATGCTGGCATAGCT |
| hsa-miR-5701 | TTATTGTCACGTTCTGATT | AGTCATTTGGCTTATTGTCACGTTCTGATT |
| hsa-miR-424-3p | CAAAACGTGAGGCGCTGCTAT | CAAAACGTGAGGCGCTGCTAT |
| MID-50971 | ATACTCTGGTTTCTTTTC | CAGTCATTTGGCATACTCTGGTTTCTTTTC |
| MID-20094 | TAAGCCAGTTTCTGTCTGATA | CATTTGGCTAAGCCAGTTTCTGTCTGATA |
| MID-50976 | CTGTCTGAGCGCCGCTC | CCTGTCTGAGCGCCGCTC |
| hsa-miR-3074-5p | GTTCCTGCTGAACTGAGCCAG | CGTTCCTGCTGAACTGAGCCAG |
| hsa-miR-222-3p | AGCTACATCTGGCTACTGGGT | GCAGCTACATCTGGCTACTGGGT |
| MID-50969 | ATGACAGATTGACATGGACAATT | TGGCATGACAGATTGACATGGACAATT |
| hsa-miR-146b-5p | TGAGAACTGAATTCCATAGGCT | TGGCTGAGAACTGAATTCCATAGGCT |
| hsa-miR-346 | TGTCTGCCCGCATGCCTGCCTCT | TGTCTGCCCGCATGCCTGCCTCT |
| MID-16582 | AGTGAAGCATTGGACTGTA | TTGGCAGTGAAGCATTGGACTGTA |
| hsa-miR-342-3p | TCTCACACAGAAATCGCACCCGT | CAGTCATTTGGGTCTCACACAGAAATCG |
| hsa-miR-181c-5p | AACATTCAACCTGTCGGTGAGT | CAGTCATTTGGCAACATTCAACCTGTCG |
| hsa-miR-125b-5p | TCCCTGAGACCCTAACTTGTGA | CAGTCATTTGGGTCCCTGAGACCCTAAC |
| hsa-miR-375 | TTTGTTCGTTCGGCTCGCGTGA | CAGTCATTTGGGTTTGTTCGTTCGGCTC |
| hsa-miR-486-5p | TCCTGTACTGAGCTGCCCCGAG | CAGTCATTTGGCTCCTGTACTGAGCTGC |
| hsa-miR-551b-3p | GCGACCCATACTTGGTTTCAG | CAGTCATTTGGCGCGACCCATACTTGGT |
| hsa-miR-23a-3p | ATCACATTGCCAGGGATTTCC | CAGTCATTTGGCATCACATTGCCAGGGA |
| hsa-miR-574-3p | CACGCTCATGCACACACCCACA | CAGTCATTTGGCCACGCTCATGCACACA |
| hsa-miR-152-3p | TCAGTGCATGACAGAACTTGG | CAGTCATTTGGCTCAGTGCATGACAGAA |
| hsa-miR-200c-3p | TAATACTGCCGGGTAATGATGGA | CAGTCATTTGGGTAATACTGCCGGGTAA |
| hsa-miR-138-5p | AGCTGGTGTTGTGAATCAGGCCG | CAGTCATTTGGCAGCTGGTGTTGTGAAT |
| hsa-miR-345-5p | GCTGACTCCTAGTCCAGGGCTC | CAGTCATTTGGCGCTGACTCCTAGTCCA |
*microRNA names that begin with ‘hsa’ are in miRBase, those that begin with ‘MID’ were sequenced and/or predicted at Rosetta Genomics.
†microRNA sequence is from miRBase28 V.20.
‡Reverse primer sequence: GCGAGCACAGAATTAATACGAC.
Performance of the assay
| Indeterminate (Bethesda III, IV,V)* | Indeterminate (Bethesda III,IV)* | Bethesda II and Bethesda VI* | ||
|---|---|---|---|---|
| Training† | #Malignant | 115 | 59 | 40 |
| #Benign | 147 | 137 | 26 | |
| Sensitivity | 86 [78–92] | 78 [65–88] | 96 [85–100] | |
| Specificity | 75 [67–81] | 76 [68–83] | 82 [62–94] | |
| Validation‡, entire set | #Malignant | 61 | 31 | 0 |
| #Benign | 128 | 119 | 0 | |
| Sensitivity | 85 [74–93] | 74 [55–88] | NA | |
| Specificity | 72 [63–79] | 74 [65–82] | NA | |
| NPV | 91 [84–96] | 92 [84–96] | NA | |
| PPV | 59 [48–69] | 43 [29–57] | NA | |
| Validation‡, agreement set | #Malignant | 40 | 14 | 0 |
| #Benign | 110 | 102 | 0 | |
| Sensitivity | 98 [87–100] | 100 [77–100] | NA | |
| Specificity | 78 [69–85] | 80 [71–88] | NA | |
| NPV | 99 [94–100] | 100 [96–100] | NA | |
| PPV | 62 [49–74] | 41 [25–59] | NA | |
| Evaluation study on Bethesda II/VI samples§ | #Malignant | NA | NA | 9 |
| #Benign | NA | NA | 32 | |
| Sensitivity | NA | NA | 89 [52–100] | |
| Specificity | NA | NA | 63 [44–79] |
*95% CIs are in square brackets.
†For training, estimates are based on the mean of 10 10-fold cross-validation runs. Samples with very low expression in any of the classification steps, as well as medullary samples, are not included.
‡Samples that failed quality control are not included.
§Additional blinded study.
NA, not applicable.
Performance of the assay for different histological types
| Training* | Validation, entire set | Validation, agreement set | ||||
|---|---|---|---|---|---|---|
| Histological type | #Samples† | % Correct‡ | #Samples† | % Correct‡ | #Samples†§ | % Correct‡ |
| Medullary | 5 | 100 [48–100] | 3 | 100 [29–100] | 1 (33.3%) | 100 [3–100] |
| PTC classic | 48 | 94 [83–100] | 17 | 88 [64–99] | 15 (88.2%) | 100 [78–100] |
| FVPTC | 40 | 81 [65–92] | 37 | 84 [68–94] | 23 (62.2%) | 96 [78–100] |
| FC | 16 | 56 [30–80] | 3 | 67 [9–99] | 1 (33.3%) | 100 [3–100] |
| PDC | 5 | 100 [48–100] | 1 | 100 [3–100] | 0 (0%) | NA |
| Papillary, other | 6 | 100 [54–100] | 0 | NA | 0 | NA |
| FA | 90 | 76 [66–84] | 95 | 76 [66–84] | 82 (86.3%) | 82 [72–89] |
| Nodular hyperplasia | 48 | 75 [60–86] | 28 | 64 [44–81] | 23 (82.1%) | 74 [52–90] |
| CLT | 9 | 82 [44–99] | 5 | 40 [5–85] | 5 (100.0%) | 40 [5–85] |
| Total | 267 | 189 | 150 | |||
*Only indeterminate training samples are listed in the table. Estimates for the training performance are based on the mean of 10 10-fold cross-validation runs. Samples with very low expression in any of the classification steps, as well as medullary samples, are not included.
†Number of samples includes only those that passed quality control steps.
‡95% CIs are in square brackets.
§Numbers in parentheses signify the percentage of test samples in the agreement set.
CLT, chronic lymphocytic thyroiditis; FA, follicular adenoma; FC, follicular carcinoma; FVPTC, follicular variant of papillary carcinoma; NA, not applicable; PDC, poorly differentiated carcinoma.
The malignant validation set samples misclassified as benign
| Bethesda | Stain | Extracted amount (ng) | Gender | Histological type | Histological subtype | In agreement set? |
|---|---|---|---|---|---|---|
| V | Giemsa | 294 | Female | Papillary carcinoma | Follicular variant, non-encapsulated | Yes |
| IV | Giemsa | 4716 | Female | Papillary carcinoma | Classic variant | No |
| IV | PAP | 138 | Male | Papillary carcinoma | Follicular variant, encapsulated | No |
| III | PAP | 115 | Female | Papillary carcinoma | Follicular variant, encapsulated | No |
| IV | PAP | 103 | Female | Papillary carcinoma | Follicular variant, encapsulated | No |
| IV | Giemsa | 51 | Female | Papillary carcinoma | Follicular variant, encapsulated | No |
| IV | PAP | 1242 | Female | Papillary carcinoma | Follicular variant, encapsulated | No |
| IV | Giemsa | 249 | Female | Follicular carcinoma | Minimal capsular invasion | No |
| IV | Giemsa | 451 | Male | Papillary carcinoma | Classic variant | No |
PAP, Papanicolaou.
Figure 2Negative predictive value (NPV) and positive predictive value (PPV) for varying prevalence values. NPV and PPV were calculated, based on the observed sensitivity and specificity in the blinded validation set, for varying prevalence values. Dashed lines: the entire validation set (sensitivity: 85.2%, specificity: 71.9%), solid lines: the agreement subset (sensitivity: 97.5%, specificity: 78.2%). Red line: calculated NPV. Blue line: calculated PPV.
The malignant histological types in the validation set
| Agreement set | Not in agreement set | |||
|---|---|---|---|---|
| Total | #Misclassified* | Total | #Misclassified* | |
| Medullary† | 1 | 0 | 2 | 0 |
| Papillary classic | 15 | 0 | 2 | 2 |
| FVPTC, encapsulated | 12 | 0 | 14 | 5 |
| FVPTC, non-encapsulated | 10 | 1 | 0 | 0 |
| FC | 1 | 0 | 2 | 1 |
| PDC | 0 | 0 | 1 | 0 |
| Total‡ | 39 | 1 | 21 | 8 |
*Misclassified as benign.
†The two independent pathologists did not have information regarding immunostaining.
‡One FVPTC sample (in the agreement set and correctly classified) is not included in the table, since there was no information available regarding the encapsulation status.
FC, follicular carcinoma; FVPTC, follicular variant of papillary carcinoma; PDC, poorly differentiated carcinoma.
Figure 3Medullary carcinoma linear discriminant analysis (LDA) step. An LDA classifier based on the expression of hsa-miR-375 is used to differentiate medullary carcinoma samples. All the training medullary carcinoma stained smears and two of the three medullary smears in the test set demonstrate overexpression of hsa-miR-375. Yellow diamonds: malignant non-medullary training samples. Blue squares: benign training samples. Green circles: medullary carcinoma training samples. Red stars: medullary carcinoma validation samples.