Multicentre validation of a microRNA-based assay for diagnosing indeterminate thyroid nodules utilising fine needle aspirate smears.

Gila Lithwick-Yanai¹, Nir Dromi¹, Alexander Shtabsky²,³, Sara Morgenstern³,⁴, Yulia Strenov³,⁴, Meora Feinmesser³,⁴, Vladimir Kravtsov³,⁵, Marino E Leon⁶, Marián Hajdúch⁷, Syed Z Ali⁸, Christopher J VandenBussche⁸, Xinmin Zhang⁹,¹⁰, Leonor Leider-Trejo²,³, Asia Zubkov², Sergey Vorobyov¹¹, Michal Kushnir¹, Yaron Goren¹,¹², Sarit Tabak¹, Etti Kadosh¹, Hila Benjamin¹³, Temima Schnitzer-Perlman¹, Hagai Marmor¹, Maria Motin¹, Danit Lebanony¹, Sharon Kredo-Russo¹, Heather Mitchell¹³, Melissa Noller¹³, Alexis Smith¹³, Olivia Dattner¹³, Karin Ashkenazi¹³, Mats Sanden¹³, Kenneth A Berlin¹³, Dganit Bar¹, Eti Meiri¹.

Abstract

AIMS: The distinction between benign and malignant thyroid nodules has important therapeutic implications. Our objective was to develop an assay that could classify indeterminate thyroid nodules as benign or suspicious, using routinely prepared fine needle aspirate (FNA) cytology smears.
METHODS: A training set of 375 FNA smears was used to develop the microRNA-based assay, which was validated using a blinded, multicentre, retrospective cohort of 201 smears. Final diagnosis of the validation samples was determined based on corresponding surgical specimens, reviewed by the contributing institute pathologist and two independent pathologists. Validation samples were from adult patients (≥18 years) with nodule size >0.5 cm, and a final diagnosis confirmed by at least one of the two blinded, independent pathologists. The developed assay, RosettaGX Reveal, differentiates benign from malignant thyroid nodules, using quantitative RT-PCR.
RESULTS: Test performance on the 189 samples that passed quality control: negative predictive value: 91% (95% CI 84% to 96%); sensitivity: 85% (CI 74% to 93%); specificity: 72% (CI 63% to 79%). Performance for cases in which all three reviewing pathologists were in agreement regarding the final diagnosis (n=150): negative predictive value: 99% (CI 94% to 100%); sensitivity: 98% (CI 87% to 100%); specificity: 78% (CI 69% to 85%).
CONCLUSIONS: A novel assay utilising microRNA expression in cytology smears was developed. The assay distinguishes benign from malignant thyroid nodules using a single FNA stained smear, and does not require fresh tissue or special collection and shipment conditions. This assay offers a valuable tool for the preoperative classification of thyroid samples with indeterminate cytology. Published by the BMJ Publishing Group Limited.

Keywords: DIAGNOSTICS; LABORATORY TESTS; MOLECULAR ONCOLOGY; THYROID; THYROID CANCER

Substances:
MicroRNAs

Year: 2016 PMID: 27798083 PMCID: PMC5484037 DOI: 10.1136/jclinpath-2016-204089

Source DB: PubMed Journal: J Clin Pathol ISSN: 0021-9746 Impact factor: 3.411

Introduction

Thyroid cancer has been increasing worldwide over the past few decades and is the most rapidly increasing cancer in the US.1 More than 64 000 new cases are expected to be diagnosed in the US in 2016, with 1980 associated deaths.2 Thyroid cancer usually presents as a palpable thyroid nodule identified on physical exam or incidentally when imaging studies are performed. Fine needle aspiration (FNA) is currently the recommended method for sampling thyroid tissue in order to diagnose thyroid nodules. FNA cytology results in a definitive benign or malignant diagnosis in the majority of cases. However, depending on the institution, approximately 10–40% of FNAs are not conclusively diagnosed by cytology and are categorised as indeterminate.3 4 In the Bethesda System for Reporting Thyroid Cytopathology, indeterminate categories include: atypia of undetermined significance/follicular lesion of undetermined significance (AUS/FLUS; Bethesda category III); follicular neoplasm or suspicious for a follicular neoplasm (FN/SFN; Bethesda category IV); and suspicious for malignancy (SM; Bethesda category V). 
Most patients with cytologically indeterminate nodules are referred for a diagnostic lobectomy or complete thyroidectomy; however, as many as 70% of these nodules prove to be benign on final surgical pathology.3 4 To overcome this limitation of FNA cytology, several molecular tests have been developed, offering a refined diagnosis for cytologically indeterminate thyroid nodules and leading to a reduction in unnecessary surgeries.5–9
MicroRNAs (miRNAs) comprise a class of short (∼21–23 nucleotides), non-coding endogenous RNAs that regulate gene expression by directing their target mRNAs for degradation or translational repression.10–12 miRNA expression profiling has identified signatures associated with cancer diagnosis, prognosis and response to treatment.10 13–16 In addition, miRNA expression profiles have been shown to differentiate histological types17 18 and are currently used in several commercially available tests.9 19 20 Numerous studies have described the role of miRNAs in the pathogenesis of thyroid cancer.21–24 miRNAs are extremely stable and remain intact in tissues, whether fresh, frozen or formalin-fixed paraffin-embedded (FFPE).25 This property of miRNAs has been exploited for the development of several commercially available miRNA-based molecular tests.19 20 It has also allowed us, as described here, to develop a miRNA-based diagnostic test, RosettaGX Reveal. Unlike other commercially available molecular tests, this test does not require fresh FNA tissue or special collection and shipment conditions, and can be performed on a single, routinely prepared FNA smear, stained with Papanicolaou stain or Romanowsky-type stains (Diff-Quik and Giemsa).
We describe here the discovery, development and blinded validation of this miRNA-based diagnostic test. The test measures a set of miRNAs by qRT-PCR to classify a nodule as benign or ‘suspicious for malignancy by miRNA profiling’. The test also measures a miRNA specific to medullary carcinoma. The negative predictive value (NPV) in indeterminate nodules for which all three reviewing pathologists agreed on the final diagnosis is 99%; it is 91% for the entire validation set.

Methods

Patients and samples

The study was composed of three stages: (1) discovery, (2) training and (3) validation (figure 1). Under Institutional Review Board (IRB)-approved protocols, archived, preoperative stained FNA smear samples were gathered from several sources, as detailed in the online supplementary materials and methods. In the discovery and training sets, we sought to enrich for the various histological types and subtypes and therefore collected non-consecutive samples of Bethesda categories II–VI. Samples for the independent, retrospectively collected, validation set were received with a corresponding H&E-stained slide, obtained from the excised nodule, along with its associated histological diagnosis (reference diagnosis). The validation set consisted of indeterminate samples from five sources, which were blinded both to the lab technicians and to the investigators performing the analyses. To approximate the true distribution in the population, the samples in the validation set were consecutive (ie, each institute gave all the indeterminate smears that had a matching resection sample, gathered within a defined period of time). A detailed description of the training and validation samples is presented in table 1.

Figure 1

Assay development. The study was composed of three stages: (1) a discovery phase, (2) training, (3) validation. Discovery studies: (I) An initial set of 96 miRNAs was selected based on their differential expression in benign and malignant samples (53 formalin-fixed paraffin-embedded (FFPE), 73 cell block and 84 fine needle aspirate (FNA) samples) as seen on custom microarray and next generation sequencing (NGS) experiments. (II) This set of miRNAs was then evaluated on FNA smears (n=82) using qRT-PCR and 24 miRNAs were selected for the assay training and validation stages. Training: the final assay classifier was developed and cross validated on an FNA training set (n=375). Validation: the test was validated on a blinded set of 189 indeterminate samples from Bethesda classes III, IV and V for which at least one out of two independent pathologists agreed with the original pathologist regarding the final diagnosis (benign or malignant) of the sample. The results of the test on a subset of validation set samples for which all three pathologists agreed (n=150) were also assessed.

Table 1

Tumour samples used in the study

	Training*	Validation
Cohort
#Samples	375	201
#Patients	357	201
% Malignant	49	30
Age (median)	54	53
% Females	73	80
Cytology
#Giemsa	212	90
#Diff-Quik	95	21
#Papanicolaou	62	90
#Bethesda II	27	0
#Bethesda III	80	29
#Bethesda IV	142	131
#Bethesda V	77	41
#Bethesda VI	49	0

*Patient age was missing for 64 training samples and patient gender was missing for 10 training samples. Three training samples were created by mixing more than one slide (with different stains), two were unstained, and for one the stain was unknown.

In a separate evaluation study, 41 Bethesda II and Bethesda VI samples and 48 FNA cell block samples were tested with the final assay classifier.

Cytopathological assessment

All cytological slides were categorised according to the Bethesda system4 by the contributing institute. Since some samples date back to before the establishment of the Bethesda system, all samples were assigned a Bethesda category by the cytopathologist of the medical centre of origin (‘the original pathologist’), based on the entire set of cytological slides. The cytological samples were stained with either Papanicolaou stain or Romanowsky-type stains (Diff-Quik and Giemsa).

Histological diagnosis and inclusion criteria

For all FNA samples, the reference diagnosis was based on the pathological assessment of the H&E stained excised tumour. Samples were included in the training and validation cohorts if the patient was at least 18 years old and if the nodule size was greater than 0.5 cm. For the samples in the discovery and training sets, the original pathologist's review was the sole review and determined the final reference diagnosis. Samples in the validation set were also reviewed by two additional independent pathologists (ASh and LL-T). If at least one of the two independent pathologists agreed with the original pathologist’s diagnosis regarding whether the resection sample was benign or malignant, then that sample was included (36 samples did not meet this criterion, and were therefore not included). All cases in which the reference diagnosis was medullary carcinoma were included, since only the original pathologist had information regarding calcitonin immunostaining. The histological type that was used for analyses was the one assigned by the original pathologist. Data regarding the new diagnosis of ‘non-invasive follicular thyroid neoplasm with papillary-like nuclear features’ (NIFTP) were not collected, since this diagnosis was suggested after the study was concluded.26 The test was run after receiving the pathological reviews and defining the validation set, and thus, pathologists were blinded to the test results. Two smears were excluded from the validation set following unblinding, since it was discovered that there were two other smears from the same two samples. The duplicates had identical test results.
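The inclusion rules described above can be summarised as a small decision function. This is only an illustrative sketch: the `ValidationSample` record and its field names are hypothetical and do not come from the study, and the handling of edge cases is an assumption.

```python
from dataclasses import dataclass

@dataclass
class ValidationSample:
    """Hypothetical record; field names are illustrative, not from the study."""
    age_years: int
    nodule_size_cm: float
    original_malignant: bool        # benign/malignant call of the original pathologist
    independent_malignant: tuple    # calls of the two independent pathologists
    is_medullary: bool = False      # reference diagnosis is medullary carcinoma

def n_agreeing(sample):
    """Number of independent pathologists who agree with the original call."""
    return sum(call == sample.original_malignant
               for call in sample.independent_malignant)

def include_in_validation(sample):
    """Adult patient, nodule > 0.5 cm, and at least one independent reviewer
    agreeing with the original diagnosis; medullary cases are always kept."""
    if sample.age_years < 18 or sample.nodule_size_cm <= 0.5:
        return False
    if sample.is_medullary:
        return True
    return n_agreeing(sample) >= 1

def in_agreement_set(sample):
    """Subset in which all three pathologists agree on benign vs malignant."""
    return include_in_validation(sample) and n_agreeing(sample) == 2
```

A sample on which neither independent pathologist agrees with the original diagnosis is excluded unless it is medullary, which mirrors the exception described in the text.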

Classifier

The classifier combines several linear discriminant analysis (LDA) steps and a K-nearest neighbour (KNN) classifier step to differentiate between benign samples and samples that are ‘suspicious for malignancy by miRNA profiling’. Samples classified by one of the LDA steps are marked as being positive for expression of the medullary marker. Several quality control (QC) steps accompany the test.27 Further details regarding the assay protocol and classifier can be found in the online supplementary materials and methods.
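To give a feel for how an LDA step and a KNN step can be chained, the toy sketch below uses a single-feature LDA threshold as a marker gate, followed by a KNN vote. It is not the RosettaGX Reveal classifier, whose actual steps, features and parameters are described only in the supplementary material. The ordering of the steps, the value of `k`, the feature layout and all numbers are assumptions made for illustration.

```python
from collections import Counter
from statistics import mean

def fit_lda_1d(negatives, positives):
    """Two-class LDA on one feature with equal priors and a shared variance
    reduces to a threshold at the midpoint of the two class means."""
    return (mean(negatives) + mean(positives)) / 2.0

def knn_predict(train_x, train_y, x, k=3):
    """Majority vote among the k nearest training points (squared Euclidean distance)."""
    ranked = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_x, train_y)
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def classify(sample, marker_threshold, train_x, train_y, k=3):
    """Return (label, marker_positive). A sample above the marker threshold is
    called suspicious and flagged; otherwise the KNN step decides."""
    if sample["marker"] > marker_threshold:
        return "suspicious", True
    return knn_predict(train_x, train_y, sample["features"], k), False
```

In this sketch the marker gate plays the role of the medullary step, so a flagged sample is reported as suspicious without consulting the KNN step.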

Results

We developed an assay which classifies indeterminate thyroid smears as benign or ‘suspicious for malignancy by miRNA profiling’. In addition, the assay tests for the presence of a medullary carcinoma marker (hsa-miR-375). There were three phases in the development of the assay (figure 1): a discovery phase in which the set of miRNA biomarkers was selected; a training phase in which the final classifier was determined; and a validation study, in which the diagnostic protocol was tested in the CLIA-approved US laboratory, on a blinded independent validation cohort (table 1). The validation study was preceded by an inter-laboratory validation study and other analytical validation studies.27 In addition, there was an evaluation study on Bethesda II/VI samples and cell blocks.

Discovery studies

To select the set of miRNAs for classification, several screening stages were performed (figure 1). In the first stage, 53 FFPE samples of resected tumours, 73 cell blocks of FNAs and a set of 156 stained FNA smears, corresponding to 84 unique samples, were profiled on Agilent custom-designed miRNA microarrays containing over 2000 miRNA probes. In addition, a subset of the follicular FFPE samples were profiled using next generation sequencing (data not shown). Next, a subset of 96 miRNAs that showed differential expression in benign and malignant tumours was selected. The selected miRNA set also included biomarkers described in the literature, biomarkers of epithelial cells and markers of various blood components discovered based on the profiling of smears that contained only blood.27 These miRNAs were measured using qRT-PCR analysing 95 stained FNA smears, corresponding to 82 unique samples (71 of which were previously profiled on microarrays). Based on these experiments, a final set of 24 miRNAs was selected (table 2).

Table 2

MicroRNAs profiled in the assay

MicroRNA*	Sequence†	Forward primer sequence‡
hsa-miR-31-5p	AGGCAAGATGCTGGCATAGCT	AGGCAAGATGCTGGCATAGCT
hsa-miR-5701	TTATTGTCACGTTCTGATT	AGTCATTTGGCTTATTGTCACGTTCTGATT
hsa-miR-424-3p	CAAAACGTGAGGCGCTGCTAT	CAAAACGTGAGGCGCTGCTAT
MID-50971	ATACTCTGGTTTCTTTTC	CAGTCATTTGGCATACTCTGGTTTCTTTTC
MID-20094	TAAGCCAGTTTCTGTCTGATA	CATTTGGCTAAGCCAGTTTCTGTCTGATA
MID-50976	CTGTCTGAGCGCCGCTC	CCTGTCTGAGCGCCGCTC
hsa-miR-3074-5p	GTTCCTGCTGAACTGAGCCAG	CGTTCCTGCTGAACTGAGCCAG
hsa-miR-222-3p	AGCTACATCTGGCTACTGGGT	GCAGCTACATCTGGCTACTGGGT
MID-50969	ATGACAGATTGACATGGACAATT	TGGCATGACAGATTGACATGGACAATT
hsa-miR-146b-5p	TGAGAACTGAATTCCATAGGCT	TGGCTGAGAACTGAATTCCATAGGCT
hsa-miR-346	TGTCTGCCCGCATGCCTGCCTCT	TGTCTGCCCGCATGCCTGCCTCT
MID-16582	AGTGAAGCATTGGACTGTA	TTGGCAGTGAAGCATTGGACTGTA
hsa-miR-342-3p	TCTCACACAGAAATCGCACCCGT	CAGTCATTTGGGTCTCACACAGAAATCG
hsa-miR-181c-5p	AACATTCAACCTGTCGGTGAGT	CAGTCATTTGGCAACATTCAACCTGTCG
hsa-miR-125b-5p	TCCCTGAGACCCTAACTTGTGA	CAGTCATTTGGGTCCCTGAGACCCTAAC
hsa-miR-375	TTTGTTCGTTCGGCTCGCGTGA	CAGTCATTTGGGTTTGTTCGTTCGGCTC
hsa-miR-486-5p	TCCTGTACTGAGCTGCCCCGAG	CAGTCATTTGGCTCCTGTACTGAGCTGC
hsa-miR-551b-3p	GCGACCCATACTTGGTTTCAG	CAGTCATTTGGCGCGACCCATACTTGGT
hsa-miR-23a-3p	ATCACATTGCCAGGGATTTCC	CAGTCATTTGGCATCACATTGCCAGGGA
hsa-miR-574-3p	CACGCTCATGCACACACCCACA	CAGTCATTTGGCCACGCTCATGCACACA
hsa-miR-152-3p	TCAGTGCATGACAGAACTTGG	CAGTCATTTGGCTCAGTGCATGACAGAA
hsa-miR-200c-3p	TAATACTGCCGGGTAATGATGGA	CAGTCATTTGGGTAATACTGCCGGGTAA
hsa-miR-138-5p	AGCTGGTGTTGTGAATCAGGCCG	CAGTCATTTGGCAGCTGGTGTTGTGAAT
hsa-miR-345-5p	GCTGACTCCTAGTCCAGGGCTC	CAGTCATTTGGCGCTGACTCCTAGTCCA

*microRNA names that begin with ‘hsa’ are in miRBase, those that begin with ‘MID’ were sequenced and/or predicted at Rosetta Genomics.

†microRNA sequence is from miRBase28 V.20.

‡Reverse primer sequence: GCGAGCACAGAATTAATACGAC.

Training set and classifier

To establish the final sample reference set and classifier, the 24 miRNAs were quantified in 375 samples, according to the final assay protocol, in two laboratories (252 FNA smear samples profiled in the Rosetta Genomics Israel laboratory and 123 samples profiled in the Rosetta Genomics US laboratory in Philadelphia, Pennsylvania, USA). The type of cytological stain used did not affect the classification performance.27 The classification method used for this miRNA-based assay, named RosettaGX Reveal, combines several LDA steps along with a KNN-based classifier. The performance of the training set is summarised in table 3. Based on the results from this training set (as estimated using cross validation), the sensitivity of the classifier on indeterminate samples (Bethesda categories III, IV and V) was estimated to be 86%, and the specificity was estimated to be 75%.
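The training estimates are described as the mean of 10 runs of 10-fold cross validation. A minimal sketch of that resampling scheme follows. The `fit_predict` callback is a hypothetical stand-in for fitting the classifier on the training indices and predicting the held-out ones. Whether the original analysis kept smears from the same patient in the same fold is not stated, and this sketch does not attempt it.

```python
import random

def repeated_kfold(n_samples, k=10, repeats=10, seed=0):
    """Yield (run, train_idx, test_idx) for `repeats` independently shuffled k-fold splits."""
    rng = random.Random(seed)
    for run in range(repeats):
        order = list(range(n_samples))
        rng.shuffle(order)
        for fold in range(k):
            test = order[fold::k]          # every k-th shuffled index forms one fold
            held = set(test)
            train = [i for i in order if i not in held]
            yield run, train, test

def mean_cv_accuracy(labels, fit_predict, k=10, repeats=10, seed=0):
    """Average, over runs, of the fraction of held-out samples predicted correctly."""
    correct = [0] * repeats
    for run, train, test in repeated_kfold(len(labels), k, repeats, seed):
        predictions = fit_predict(train, test)   # hypothetical callback
        correct[run] += sum(p == labels[i] for p, i in zip(predictions, test))
    return sum(c / len(labels) for c in correct) / repeats
```

Within one run every sample is held out exactly once, so each run yields one accuracy over all samples, and the reported figure would correspond to the mean of those per-run values.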

Table 3

Performance of the assay

		Indeterminate (Bethesda III, IV,V)*	Indeterminate (Bethesda III,IV)*	Bethesda II and Bethesda VI*
Training†	#Malignant	115	59	40
	#Benign	147	137	26
	Sensitivity	86 [78–92]	78 [65–88]	96 [85–100]
	Specificity	75 [67–81]	76 [68–83]	82 [62–94]
Validation‡, entire set	#Malignant	61	31	0
	#Benign	128	119	0
	Sensitivity	85 [74–93]	74 [55–88]	NA
	Specificity	72 [63–79]	74 [65–82]	NA
	NPV	91 [84–96]	92 [84–96]	NA
	PPV	59 [48–69]	43 [29–57]	NA
Validation‡, agreement set	#Malignant	40	14	0
	#Benign	110	102	0
	Sensitivity	98 [87–100]	100 [77–100]	NA
	Specificity	78 [69–85]	80 [71–88]	NA
	NPV	99 [94–100]	100 [96–100]	NA
	PPV	62 [49–74]	41 [25–59]	NA
Evaluation study on Bethesda II/VI samples§	#Malignant	NA	NA	9
	#Benign	NA	NA	32
	Sensitivity	NA	NA	89 [52–100]
	Specificity	NA	NA	63 [44–79]

*95% CIs are in square brackets.

†For training, estimates are based on the mean of 10 10-fold cross-validation runs. Samples with very low expression in any of the classification steps, as well as medullary samples, are not included.

‡Samples that failed quality control are not included.

§Additional blinded study.

NA, not applicable.

Validation set

An independent set of 201 consecutive, indeterminate FNA samples (table 1) from five sources was classified blindly by the assay in the US CLIA-approved laboratory. This set included only samples for which at least one of the two independent pathologists agreed with the original pathologist on the final diagnosis (benign or malignant) of the excised H&E-stained nodule. Only 12 of the 201 samples (6%) failed during processing or QC steps, most commonly because of low miRNA expression. All 12 of these samples were histologically benign based on the resections. Of the remaining 189 samples, 101 (53.4%) were classified as benign. The performance on the validation set was very similar to the performance estimates from the training set, as can be seen in tables 3 and 4 (NPV: 91%; sensitivity: 85%; specificity: 72%; positive predictive value (PPV): 59%). For nodules of size ≥1 cm (n=166), the sensitivity was 84% and the specificity was 72%. The sensitivity and specificity for the subset of Bethesda III and IV samples were both 74%, with an NPV of 92% and a PPV of 43% (table 3). The accuracy for oncocytic follicular adenoma (FA) samples was slightly lower than that for non-oncocytic FA samples; however, this difference was not statistically significant (see online supplementary results).
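The headline validation figures can be recomputed from a 2×2 table. The counts used below (52 true positives, 9 false negatives, 92 true negatives, 36 false positives) are not listed explicitly in the text. They are derived from the reported totals: 61 malignant and 128 benign samples, 101 samples called benign, and nine malignant samples called benign. The interval method is also an assumption, since the text does not name it: the sketch uses the exact binomial (Clopper-Pearson) interval, solved by bisection.

```python
from math import comb

def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k + 1))

def _bisect(f, target, increasing):
    """Find p in (0, 1) with f(p) == target for a monotonic function f."""
    lo, hi = 0.0, 1.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if (f(mid) < target) == increasing:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def clopper_pearson(k, n, alpha=0.05):
    """Exact two-sided binomial confidence interval for k successes out of n."""
    lower = 0.0 if k == 0 else _bisect(lambda p: 1 - binom_cdf(k - 1, n, p), alpha / 2, True)
    upper = 1.0 if k == n else _bisect(lambda p: binom_cdf(k, n, p), alpha / 2, False)
    return lower, upper

def metrics(tp, fn, tn, fp):
    """Point estimate and interval for each metric, keyed by name."""
    pairs = {
        "sensitivity": (tp, tp + fn),
        "specificity": (tn, tn + fp),
        "npv": (tn, tn + fn),
        "ppv": (tp, tp + fp),
    }
    return {name: (k / n, *clopper_pearson(k, n)) for name, (k, n) in pairs.items()}

# Counts derived from the reported totals (an assumption, see the text above).
for name, (est, lo, hi) in metrics(tp=52, fn=9, tn=92, fp=36).items():
    print(f"{name}: {est:.1%} [{lo:.0%} to {hi:.0%}]")
```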

Table 4

Performance of the assay for different histological types

	Training*		Validation, entire set		Validation, agreement set
Histological type	#Samples†	% Correct‡	#Samples†	% Correct‡	#Samples†§	% Correct‡
Medullary	5	100 [48–100]	3	100 [29–100]	1 (33.3%)	100 [3–100]
PTC classic	48	94 [83–100]	17	88 [64–99]	15 (88.2%)	100 [78–100]
FVPTC	40	81 [65–92]	37	84 [68–94]	23 (62.2%)	96 [78–100]
FC	16	56 [30–80]	3	67 [9–99]	1 (33.3%)	100 [3–100]
PDC	5	100 [48–100]	1	100 [3–100]	0 (0%)	NA
Papillary, other	6	100 [54–100]	0	NA	0	NA
FA	90	76 [66–84]	95	76 [66–84]	82 (86.3%)	82 [72–89]
Nodular hyperplasia	48	75 [60–86]	28	64 [44–81]	23 (82.1%)	74 [52–90]
CLT	9	82 [44–99]	5	40 [5–85]	5 (100.0%)	40 [5–85]
Total	267		189		150

*Only indeterminate training samples are listed in the table. Estimates for the training performance are based on the mean of 10 10-fold cross-validation runs. Samples with very low expression in any of the classification steps, as well as medullary samples, are not included.

†Number of samples includes only those that passed quality control steps.

‡95% CIs are in square brackets.

§Numbers in parentheses signify the percentage of test samples in the agreement set.

CLT, chronic lymphocytic thyroiditis; FA, follicular adenoma; FC, follicular carcinoma; FVPTC, follicular variant of papillary carcinoma; NA, not applicable; PDC, poorly differentiated carcinoma.

The nine malignant samples misclassified as benign (table 5) included samples from all three indeterminate Bethesda categories; included both Giemsa and Papanicolaou stained samples; and came from three different sources. The follicular carcinoma (FC) sample misclassified as benign by the assay was described as having minimal capsular invasion, according to the original pathologist, as were the other two FC samples that were correctly classified by the assay. The samples from patients with chronic lymphocytic thyroiditis (CLT) showed a lower correct classification rate (ie, relatively more were misclassified as ‘suspicious for malignancy by miRNA profiling’), relative to the training performance and to the other benign samples (table 4). However, this difference may be due to the small number of CLT samples in the validation set.

Table 5

The malignant validation set samples misclassified as benign

Bethesda	Stain	Extracted amount (ng)	Gender	Histological type	Histological subtype	In agreement set?
V	Giemsa	294	Female	Papillary carcinoma	Follicular variant, non-encapsulated	Yes
IV	Giemsa	4716	Female	Papillary carcinoma	Classic variant	No
IV	PAP	138	Male	Papillary carcinoma	Follicular variant, encapsulated	No
III	PAP	115	Female	Papillary carcinoma	Follicular variant, encapsulated	No
IV	PAP	103	Female	Papillary carcinoma	Follicular variant, encapsulated	No
IV	Giemsa	51	Female	Papillary carcinoma	Follicular variant, encapsulated	No
IV	PAP	1242	Female	Papillary carcinoma	Follicular variant, encapsulated	No
IV	Giemsa	249	Female	Follicular carcinoma	Minimal capsular invasion	No
IV	Giemsa	451	Male	Papillary carcinoma	Classic variant	No

PAP, Papanicolaou.

Validation agreement set

To test the assay on a set of samples with a higher degree of certainty in the final diagnosis, a subset of the validation samples (‘agreement set’) was compiled post hoc. This set was composed of 160 samples (80% of the validation set) for which all three pathologists were in agreement on the final diagnosis of benign or malignant; 150 of these samples passed QC steps. This set demonstrated very high performance (table 3). The NPV of the agreement set was 99% (only one malignant sample was misclassified as benign), with a sensitivity of 98%, a specificity of 78% and a PPV of 62%. The NPV and PPV for both the entire set and the agreement set are plotted in figure 2.
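The dependence of NPV and PPV on prevalence follows from Bayes' rule applied to a fixed sensitivity and specificity. The sketch below illustrates that calculation for the two sample sets. The fractions used for sensitivity and specificity are derived from the reported counts and percentages (52/61 and 92/128 for the entire set, 39/40 and 86/110 for the agreement set), and the prevalence grid is arbitrary.

```python
def ppv(sensitivity, specificity, prevalence):
    """Probability of malignancy given a 'suspicious' result."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)

def npv(sensitivity, specificity, prevalence):
    """Probability of a benign nodule given a 'benign' result."""
    true_neg = specificity * (1 - prevalence)
    false_neg = (1 - sensitivity) * prevalence
    return true_neg / (true_neg + false_neg)

# Sensitivity and specificity derived from the reported counts (an assumption).
SETS = {
    "entire validation set": (52 / 61, 92 / 128),
    "agreement set": (39 / 40, 86 / 110),
}

for name, (se, sp) in SETS.items():
    for prevalence in (0.1, 0.2, 0.3, 0.4, 0.5):
        print(f"{name}, prevalence {prevalence:.0%}: "
              f"NPV {npv(se, sp, prevalence):.1%}, PPV {ppv(se, sp, prevalence):.1%}")
```

Evaluating both functions at the observed prevalence of a set (for example 61/189 for the entire validation set) returns the NPV and PPV observed directly in that set.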

Figure 2

Negative predictive value (NPV) and positive predictive value (PPV) for varying prevalence values. NPV and PPV were calculated, based on the observed sensitivity and specificity in the blinded validation set, for varying prevalence values. Dashed lines: the entire validation set (sensitivity: 85.2%, specificity: 71.9%), solid lines: the agreement subset (sensitivity: 97.5%, specificity: 78.2%). Red line: calculated NPV. Blue line: calculated PPV.

As expected, the samples in the agreement set (table 4) had a much higher correct classification rate compared with the remainder of the validation set samples (ie, where only one of the independent pathologists agreed with the diagnosis made by the original pathologist): 125/150 (83%) samples in the agreement set were correctly classified, whereas 19/39 (49%) of the remaining samples were correctly classified (p=6.14e-06, χ2 test).
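The comparison of correct classification rates between the agreement set and the remaining samples can be checked with a Pearson χ2 test on the 2×2 table of correct and incorrect calls (125 and 25 versus 19 and 20). The sketch below assumes no continuity correction, which the text does not specify. For one degree of freedom the p value can be obtained from the complementary error function, so no statistics library is needed.

```python
from math import erfc, sqrt

def chi2_2x2(a, b, c, d):
    """Pearson chi-square test (no continuity correction) for the table
    [[a, b], [c, d]]; returns (statistic, p value) with 1 degree of freedom."""
    n = a + b + c + d
    statistic = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(statistic / 2))
    return statistic, p_value

# Rows: agreement set, remaining samples; columns: correct, incorrect.
statistic, p_value = chi2_2x2(125, 25, 19, 20)
print(f"chi2 = {statistic:.2f}, p = {p_value:.3g}")
```

Under this assumption the result is close to the reported p value.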

Concordance between pathologists

The assay performance is influenced by the accuracy of the reference diagnosis. Therefore, we examined the level of agreement between the pathologists for the different histological types (table 6). A large number of the encapsulated follicular variant of papillary carcinomas (FVPTCs) in the entire validation set were not in the agreement set. The proportion of encapsulated FVPTCs falling in the subset of samples for which only one of the two independent pathologists agreed with the original pathologist was significantly higher than the corresponding proportion of non-encapsulated FVPTCs (p=0.0029, Fisher’s exact test).

Table 6

The malignant histological types in the validation set

	Agreement set		Not in agreement set
	Total	#Misclassified*	Total	#Misclassified*
Medullary†	1	0	2	0
Papillary classic	15	0	2	2
FVPTC, encapsulated	12	0	14	5
FVPTC, non-encapsulated	10	1	0	0
FC	1	0	2	1
PDC	0	0	1	0
Total‡	39	1	21	8

*Misclassified as benign.

†The two independent pathologists did not have information regarding immunostaining.

‡One FVPTC sample (in the agreement set and correctly classified) is not included in the table, since there was no information available regarding the encapsulation status.

FC, follicular carcinoma; FVPTC, follicular variant of papillary carcinoma; PDC, poorly differentiated carcinoma.

Medullary carcinoma

Medullary carcinoma is a rare form of thyroid cancer which often demonstrates overexpression of hsa-miR-375.29 To identify medullary carcinoma, the assay tests for the upregulation of hsa-miR-375 in one of the LDA steps (figure 3). Elevated expression of this medullary marker is provided as part of the assay results. In the training set, there were 14 medullary samples, including five indeterminate medullary samples, and all of these presented high expression of hsa-miR-375. In the validation set, there were three medullary carcinoma samples. All were correctly classified as suspicious. However, one (assigned Bethesda V) did not demonstrate overexpression of hsa-miR-375 and was therefore not denoted as medullary carcinoma (this sample was confirmed to be medullary carcinoma, with positive calcitonin immunostaining).

Figure 3

Medullary carcinoma linear discriminant analysis (LDA) step. An LDA classifier based on the expression of hsa-miR-375 is used to differentiate medullary carcinoma samples. All the training medullary carcinoma stained smears and two of the three medullary smears in the test set demonstrate overexpression of hsa-miR-375. Yellow diamonds: malignant non-medullary training samples. Blue squares: benign training samples. Green circles: medullary carcinoma training samples. Red stars: medullary carcinoma validation samples.

Evaluation study on FNA cell blocks and Bethesda II/VI samples

The assay was also tested on cell blocks and on benign (Bethesda II) and malignant (Bethesda VI) smears. The sensitivity and specificity for the indeterminate cell block samples were 72% and 79%, respectively. The sensitivity for the malignant Bethesda VI smears was 89% and the specificity for the benign Bethesda II samples was 63% (table 3). More details can be found in the online supplementary results.

Discussion

We present here a first-of-its-kind assay in which miRNA is extracted from routinely stained FNA cytology smears and the sample is classified as ‘suspicious for malignancy by miRNA profiling’ or ‘benign’. In contrast to currently available tests,6 8 9 30 the test presented here does not require an additional FNA biopsy and can be performed on the same specimen as that initially used to categorise the sample as indeterminate. In addition, this test does not require specially designated preservation material or special shipment conditions. Instead, a single routinely prepared cytological slide, stained with Papanicolaou stain or Romanowsky-type stains (Diff-Quik and Giemsa), can be used. The test does not require a large amount of cytological material, and the failure rate is low provided there is at least minimally adequate cellularity: 94% of the samples in the validation set were successfully processed.

The assay's performance was evaluated on a validation set composed of blinded, indeterminate, consecutive samples gathered from five sources in the USA, Europe and Israel. Since the test is run on cytology slides routinely prepared for examination, and does not require any special preservation conditions, it was possible to perform the study on a retrospective cohort.

The development of a molecular test requires a reliable gold-standard reference diagnosis with which to compare the test results. This requirement leads to two inherent biases in the tested set of samples. The first bias is that only samples with a corresponding surgically excised histopathological specimen were gathered. The second bias is that only samples for which the reference resection-based diagnosis was confirmed by an independent pathologist were included. Since there is a high level of disagreement between pathologists regarding the diagnosis of such specimens,31–34 relying on a single pathologist may lead to the inclusion of samples with an inaccurate final diagnosis, which could in turn lead to an incorrect estimate of the assay's performance. However, we cannot rule out the possibility that the exclusion of these samples alters the true sample distribution and, as a result, affects the performance estimates.

The majority of malignant samples that were not included in the agreement subset were encapsulated FVPTCs. This is in accordance with previous reports of relatively high inter-observer variability between pathologists with regard to FVPTC diagnoses,31 33 in particular for non-invasive, encapsulated FVPTCs versus FA.35 Current evidence indicates that encapsulated FVPTC is a neoplasm of relatively low malignant potential, particularly if there is no capsular or vascular invasion.36 The molecular profile of these tumours provides additional evidence supporting a reclassification.37 Some authors have suggested that cases of encapsulated FVPTC that cannot be unequivocally diagnosed as benign or malignant should be reclassified as ‘follicular tumour of uncertain malignant potential’35 38 or, as proposed by the Endocrine Pathology Society, as NIFTP.26 We are actively collecting data on this new diagnosis for future studies of the classifier. It has also been suggested that papillary thyroid cancer should be reclassified according to its molecular profile.37 Our study offers additional evidence supporting the need for reclassification of encapsulated FVPTC.

The expression levels of several documented markers of thyroid malignancy are measured in our assay. For example, the miRNAs used in the assay include hsa-miR-146b-5p and hsa-miR-222-3p, which have both been found to be upregulated in papillary thyroid cancer and involved in tumour progression and aggressiveness.39–41 In contrast, hsa-miR-152-3p and hsa-miR-138-5p have been shown to be downregulated in papillary thyroid cancer.42 Various miRNAs, including several of those used in the assay, have previously been found to differentiate malignant and benign thyroid FNA samples,9 43–46 even in FNA smears.47–49

In conclusion, we have presented a new diagnostic assay and evaluated its performance on a blinded set of 189 samples from several sources. Additional cohorts, both academic and non-academic, could help to further validate the performance of the assay. The test described in this paper is a novel, commercially available assay, clinically evaluated in a multicentre study, that can accurately differentiate between malignant and benign thyroid nodules using routinely prepared, stained FNA smears.

10–40% of thyroid fine needle aspirates (FNAs) are not conclusively diagnosed by cytology and are categorised as indeterminate. The RosettaGX Reveal assay, which was validated in a blinded manner, differentiates benign from malignant thyroid nodules in indeterminate smears. The smear used for the assay can be the routinely prepared smear that was used to make the indeterminate diagnosis, so a repeat FNA is not required. In contrast with currently available tests, the assay does not require fresh tissue or special collection and shipment conditions.

49 in total

1. MicroRNA expression profile helps to distinguish benign nodules from papillary thyroid carcinomas starting from cells of fine-needle aspiration.

Authors: Patrizia Agretti; Eleonora Ferrarini; Teresa Rago; Antonio Candelieri; Giuseppina De Marco; Antonio Dimida; Filippo Niccolai; Angelo Molinaro; Giancarlo Di Coscio; Aldo Pinchera; Paolo Vitti; Massimo Tonacchera
Journal: Eur J Endocrinol Date: 2012-06-22 Impact factor: 6.664

2. The role of microRNA genes in papillary thyroid carcinoma.

Authors: Huiling He; Krystian Jazdzewski; Wei Li; Sandya Liyanarachchi; Rebecca Nagy; Stefano Volinia; George A Calin; Chang-Gong Liu; Kaarle Franssila; Saul Suster; Richard T Kloos; Carlo M Croce; Albert de la Chapelle
Journal: Proc Natl Acad Sci U S A Date: 2005-12-19 Impact factor: 11.205

3. Tumor microRNA-29a expression and the risk of recurrence in stage II colon cancer.

Authors: Alina Weissmann-Brenner; Michal Kushnir; Gila Lithwick Yanai; Ranit Aharonov; Hadas Gibori; Ofer Purim; Yulia Kundel; Sara Morgenstern; Marissa Halperin; Yaron Niv; Baruch Brenner
Journal: Int J Oncol Date: 2012-03-16 Impact factor: 5.650

4. A second-generation microRNA-based assay for diagnosing tumor tissue origin.

Authors: Eti Meiri; Wolf C Mueller; Shai Rosenwald; Merav Zepeniuk; Elizabeth Klinke; Tina Bocker Edmonston; Margot Werner; Ulrike Lass; Iris Barshack; Meora Feinmesser; Monica Huszar; Franz Fogt; Karin Ashkenazi; Mats Sanden; Eran Goren; Nir Dromi; Orit Zion; Ilanit Burnstein; Ayelet Chajut; Yael Spector; Ranit Aharonov
Journal: Oncologist Date: 2012-05-22

5. MicroRNA signature in thyroid fine needle aspiration cytology applied to "atypia of undetermined significance" cases.

Authors: Rulong Shen; Sandya Liyanarachchi; Wei Li; Paul E Wakely; Motoyasu Saji; Jie Huang; Rebecca Nagy; Tisha Farrell; Matthew D Ringel; Albert de la Chapelle; Richard T Kloos; Huiling He
Journal: Thyroid Date: 2011-12-02 Impact factor: 6.568

6. Integrated genomic characterization of papillary thyroid carcinoma.

Authors:
Journal: Cell Date: 2014-10-23 Impact factor: 41.582

7. Observer variation in the diagnosis of follicular variant of papillary thyroid carcinoma.

Authors: Ricardo V Lloyd; Lori A Erickson; Mary B Casey; King Y Lam; Christine M Lohse; Sylvia L Asa; John K C Chan; Ronald A DeLellis; H Ruben Harach; Kennichi Kakudo; Virginia A LiVolsi; Juan Rosai; Thomas J Sebo; Manuel Sobrinho-Simoes; Bruce M Wenig; Marick E Lae
Journal: Am J Surg Pathol Date: 2004-10 Impact factor: 6.394

8. Highly accurate diagnosis of cancer in thyroid nodules with follicular neoplasm/suspicious for a follicular neoplasm cytology by ThyroSeq v2 next-generation sequencing assay.

Authors: Yuri E Nikiforov; Sally E Carty; Simon I Chiosea; Christopher Coyne; Umamaheswar Duvvuri; Robert L Ferris; William E Gooding; Steven P Hodak; Shane O LeBeau; N Paul Ohori; Raja R Seethala; Mitchell E Tublin; Linwah Yip; Marina N Nikiforova
Journal: Cancer Date: 2014-09-10 Impact factor: 6.860

9. A mammalian microRNA expression atlas based on small RNA library sequencing.

Authors: Pablo Landgraf; Mirabela Rusu; Robert Sheridan; Alain Sewer; Nicola Iovino; Alexei Aravin; Sébastien Pfeffer; Amanda Rice; Alice O Kamphorst; Markus Landthaler; Carolina Lin; Nicholas D Socci; Leandro Hermida; Valerio Fulci; Sabina Chiaretti; Robin Foà; Julia Schliwka; Uta Fuchs; Astrid Novosel; Roman-Ulrich Müller; Bernhard Schermer; Ute Bissels; Jason Inman; Quang Phan; Minchen Chien; David B Weir; Ruchi Choksi; Gabriella De Vita; Daniela Frezzetti; Hans-Ingo Trompeter; Veit Hornung; Grace Teng; Gunther Hartmann; Miklos Palkovits; Roberto Di Lauro; Peter Wernet; Giuseppe Macino; Charles E Rogler; James W Nagle; Jingyue Ju; F Nina Papavasiliou; Thomas Benzing; Peter Lichter; Wayne Tam; Michael J Brownstein; Andreas Bosio; Arndt Borkhardt; James J Russo; Chris Sander; Mihaela Zavolan; Thomas Tuschl
Journal: Cell Date: 2007-06-29 Impact factor: 41.582

10. Molecular pathways associated with aggressiveness of papillary thyroid cancer.

Authors: Salvatore Benvenga; Christian A Koch
Journal: Curr Genomics Date: 2014-06 Impact factor: 2.236
