| Literature DB >> 32743554 |
Jina Nanayakkara1, Kathrin Tyryshkin1, Xiaojing Yang1, Justin J M Wong1, Kaitlin Vanderbeck1, Paula S Ginter2, Theresa Scognamiglio2, Yao-Tseng Chen2, Nicole Panarelli3, Nai-Kong Cheung4, Frederike Dijk5, Iddo Z Ben-Dov6, Michelle Kang Kim7, Simron Singh8, Pavel Morozov9, Klaas E A Max9, Thomas Tuschl9, Neil Renwick1.
Abstract
Neuroendocrine neoplasms (NENs) are clinically diverse and incompletely characterized cancers that are challenging to classify. MicroRNAs (miRNAs) are small regulatory RNAs that can be used to classify cancers. Recently, a morphology-based classification framework for evaluating NENs from different anatomical sites was proposed by experts, with the requirement of improved molecular data integration. Here, we compiled 378 miRNA expression profiles to examine NEN classification through comprehensive miRNA profiling and data mining. Following data preprocessing, our final study cohort included 221 NEN and 114 non-NEN samples, representing 15 NEN pathological types and 5 site-matched non-NEN control groups. Unsupervised hierarchical clustering of miRNA expression profiles clearly separated NENs from non-NENs. Comparative analyses showed that miR-375 and miR-7 expression is substantially higher in NEN cases than non-NEN controls. Correlation analyses showed that NENs from diverse anatomical sites have convergent miRNA expression programs, likely reflecting morphological and functional similarities. Using machine learning approaches, we identified 17 miRNAs to discriminate 15 NEN pathological types and subsequently constructed a multilayer classifier, correctly identifying 217 (98%) of 221 samples and overturning one histological diagnosis. Through our research, we have identified common and type-specific miRNA tissue markers and constructed an accurate miRNA-based classifier, advancing our understanding of NEN diversity.Entities:
Year: 2020 PMID: 32743554 PMCID: PMC7380486 DOI: 10.1093/narcan/zcaa009
Source DB: PubMed Journal: NAR Cancer ISSN: 2632-8674
Anatomical distribution and histopathological diagnoses of study samples
| NENs | Number of samples, | non-NENs | Number of samples, |
|---|---|---|---|
| Total | 221 | 114 | |
| Epithelial | |||
| Gastrointestinal | |||
| tract and pancreas | |||
| PanNET | 28 (13%) | PAAD | 10 (9%) |
| INET | 31 (14%) | ||
| AppNET | 15 (7%) | ||
| RNET | 7 (3%) | ||
| Lung | |||
| TC | 13 (6%) | LAC | 9 (8%)a |
| AC | 15 (7%) | LUNG | 15 (12%)a |
| SCLC | 11 (5%) | ||
| LCNEC | 13 (6%) | ||
| Parathyroid gland | |||
| PTA | 9 (4%) | PTG | 15 (13%) |
| Pituitary gland | |||
| PitNET | 10 (5%) | ||
| Skin | |||
| MCC | 17 (8%) | SK | 10 (9%) |
| Thyroid | |||
| MTC | 9 (4%) | TG | 10 (9%)b |
| TN | 45 (39%)b | ||
| Non-epithelial | |||
| Adrenal gland and | |||
| extra-adrenal sites | |||
| NB | 25 (11%) | ||
| PCC | 10 (5%) | ||
| PGL | 8 (4%) |
aFor lung NENs, neoplastic (LAC) and non-diseased (LUNG) tissue controls were available.
bFor MTC, neoplastic (TN) and non-diseased (TG) tissue controls were available.
Anatomical location and diagnostic histopathological information are presented for 221 NEN cases, comprising 15 pathological types from seven anatomical sites, and 114 site-matched non-NEN controls, comprising seven diagnostic entities from five anatomical sites. Sample abbreviations: AC, atypical carcinoid; AppNET, appendiceal NET; INET, ileal NET; LCNEC, large-cell NEC; MCC, Merkel cell carcinoma; MTC, medullary thyroid carcinoma; NB, neuroblastoma; PanNET, pancreatic NET; PCC, pheochromocytoma; PGL, paraganglioma; PitNET, pituitary adenoma; PTA, parathyroid adenoma; RNET, rectal NET; SCLC, small-cell lung carcinoma; TC, typical carcinoid. Non-NEN samples comprise lung (LUNG), lung adenocarcinoma (LAC), pancreatic adenocarcinoma (PAAD), parathyroid gland (PTG), skin (SK), thyroid gland (TG) and thyroid neoplasm (TN).
Figure 1.miR-375 and miR-7 expression in NEN and non-NEN samples. Normalized miR-375 and miR-7 expression was examined between 15 NEN pathological types and 7 site-matched non-NEN control groups. Site-matched NEN and non-NEN groups were designated by anatomical site in the color bar: pancreas (blue), lung (red), parathyroid (purple), skin (orange) and thyroid (green); NENs without a site-matched control were left blank. Both miR-375 and miR-7 were higher expressed in NEN cases than non-NEN controls. With the exception of PTA, miR-375 expression was higher in NEN pathological types than in site-matched non-NEN controls. With the exception of PTA, miR-7 was also higher in NEN pathological types compared to site-matched non-NEN controls. Abbreviation: log2 RF, log2 normalized relative frequency. Sample abbreviations are provided in Table 1 and Supplementary Table S1.
Figure 2.Unsupervised hierarchical clustering of study samples based on miRNA expression. Unsupervised hierarchical clustering using Euclidean distance and complete agglomeration clustering was performed using filtered (union of top 75% abundance) log2 normalized miRNA sequence reads for all NEN cases (n = 221) and non-NEN controls (n = 114). Anatomical groupings comprise the following pathological types described in Table 1 and Supplementary Table S1: thyroid (MTC, TG, TN), skin (MCC, SK), pituitary gland (PitNET), parathyroid gland (PTA, PTG), lung (AC, TC, SCLC, LCNEC, LAC, LUNG), GEP (AppNET, INET, PNET, RNET), and adrenal and extra-adrenal (PCC, PGL). With noted exceptions, NEN cases and non-NEN controls, and epithelial and non-epithelial samples, clustered distinctly and NEN pathological types preferentially clustered with each other than with site-matched non-NEN controls.
Figure 3.Multilayer miRNA-based classifier for predicting NEN pathological types. A multilayer classifier for predicting NEN pathological types was developed using supervised machine learning models. In the first layer, NEN miRNA profiles were classified as epithelial or non-epithelial based on miR-10b and miR-200a expression. In subsequent layers, epithelial and non-epithelial NENs were successively identified using the selected miRNAs as indicated. Sample abbreviations are provided in Table 1 and Supplementary Table S1.
Figure 4.Scatter plot assessment of miRNAs selected for classification. Epithelial and non-epithelial NENs are effectively discriminated based on miR-10b and miR-200a expression with one misclassification (A). Within epithelial NENs, PTA, PitNET, MCC and MTC were accurately discriminated from the remaining NENs based on miR-30a expression (B), miR-10a and miR-212-3p expression (C), miR-15b and miR-660 expression (D), and miR-335-5p, miR-29a and miR-222 expression (E); lung NENs and GEP NENs were discriminated based on miR-760, miR-1224-5p, miR-139, miR-205 and miR-9 expression (F, G). Within non-epithelial NENs, NB was accurately discriminated from PCC/PGL based on miR-93 expression (H), and PCC and PGL were separated based on miR-10b and miR-379 expression (I). Similar results were generated using relevant miRNA cluster data and are not presented. Arrows indicate misclassified samples. Abbreviation: log2 RF, log2 normalized relative frequency. Sample abbreviations are provided in Table 1 and Supplementary Table S1.
Figure 5.t-SNE for selected classificatory miRNAs. Sample grouping was visually assessed using miRNAs selected for multilayer classification and t-SNE analysis. With one notable exception, samples clustered as epithelial or non-epithelial NENs and tended to group by pathological type. The exception was a misdiagnosed PanNET later found to be a PGL on further testing. Sample abbreviations are provided in Table 1 and Supplementary Table S1.
Overall accuracy of multilayer classifier for discriminating NENs
| Established diagnosis | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| GEP NET | Lung NET | MTC | MCC | PitNET | PTA | PCC | PGL | NB | ||
| Multilayer classifier designation | GEP NET | 80 | 3 | |||||||
| Lung NET | 49 | |||||||||
| MTC | 9 | |||||||||
| MCC | 17 | |||||||||
| PitNET | 10 | |||||||||
| PTA | 9 | |||||||||
| PCC | 10 | |||||||||
| PGL | 1 | 8 | ||||||||
| NB | 25 | |||||||||
| Decision-level accuracy | 80/81 (99%) | 49/52 (94%) | 9/9 (100%) | 17/17 (100%) | 10/10 (100%) | 9/9 (100%) | 10/10 (100%) | 8/8 (100%) | 25/25 (100%) | |
| Overall accuracy | 217/221 (98%) | |||||||||
Using our multilayer classifier, NEN miRNA profiles were assigned to one of nine pathological subgroups or pathological types. Cases of GEP NENs (AppNET, INET, PNET, RNET) or lung NENs (TC, AC, SCLC, LCNEC) were not assigned to individual pathological types because we previously developed miRNA-based classifiers for these subgroups (18) (Wong et al., in preparation). By comparing classifier designations to established histopathological diagnoses, we determined our overall classifier accuracy to be 98%. Additional measures of classifier performance were also calculated: precision (0.98), recall (0.99) and Matthews correlation coefficient (0.98). Sample abbreviations are provided in Table 1 and Supplementary Table S1.