| Literature DB >> 27776138 |
Bing Zheng1,2, Jun Liu3,4,5, Jianlei Gu1,6, Jing Du7, Lin Wang7, Shengli Gu8, Juan Cheng8, Jun Yang4,5, Hui Lu1,6,9.
Abstract
BACKGROUND: A key challenge in thyroid carcinoma is preoperatively diagnosing malignant thyroid nodules. A novel diagnostic test that measures the expression of a 3-gene signature (DPP4, SCG5 and CA12) has demonstrated promise in thyroid carcinoma assessment. However, more reliable prediction methods combining clinical features with genomic signatures with high accuracy, good stability and low cost are needed. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2016 PMID: 27776138 PMCID: PMC5077123 DOI: 10.1371/journal.pone.0164570
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic, clinical and ultrasound characteristics of 913 patients.
| Characteristics | Recorded variables |
|---|---|
| Demographics | Age at last birthday |
| Sex: male/female | |
| Clinical | Course of disease: from nodule detection to operation |
| Nodule increase | |
| Mobility of mass in physical examination: mobile/fixed/nonpalpable | |
| Texture of mass upon physical examination: soft/medium/hard | |
| Ultrasound | Nodule maximum size: <1.00 cm/1.00–1.99 cm/2.00–2.99 cm/≥3 cm |
| Cystic lesion | |
| Nodule number: single/germination/multiple | |
| Nodule boundary: clear/vague | |
| Nodule echoes: homogeneous/inhomogeneous | |
| Nodule echo type: hypoechoic/isoechoic/hyperechoic/mixed echogenicity | |
| Nodule morphology: regular/irregular | |
| Nodule peripheral vessels: abundant/scarce | |
| Lymphadenopathy number | |
| Lymph node morphology: regular/irregular | |
| Lymph node boundary: clear/vague | |
| Lymph node peripheral vessels: abundant/scarce | |
| Lymph node structure: damaged/undamaged | |
| Calcification: null/calcification /micro calcification |
Fig 1Workflow of this study.
(a) Flow diagram of feature selection and validation of clinical data. Cohort 1 comprised 771 samples that were randomly divided into the training (539 samples) and test (232 samples) sets. Two additional independent data sets, Cohorts 2 and 3, included 70 and 72 samples, respectively, from Renji and Xinhua Hospital and were also employed as test sets to validate the predictive accuracy of the classification based on clinical data. (b) Flow diagram for the comparison between the classifier models based on the three gene expression levels, the clinical information, and integrating the gene expression with clinical data.
Comparison of the characteristics of benign and malignant tumors in the training set following histologic classification of thyroid nodules.
| Variable | Abbreviation | Training set (N = 539) | ||
|---|---|---|---|---|
| Benign (n = 241)n (%) | Malignant (n = 298)n (%) | |||
| Age (years, median) | Age | 52 | 45.5 | < .001 |
| Sex | Sex | 0.008 | ||
| Male | 59 (24.5) | 89 (29.9) | ||
| Female | 182 (75.5) | 209 (70.1) | ||
| Course of disease (months, median) | Medical_his | 6 | 3 | < .001 |
| Nodule increase present | nodule_increase | 111 (46.1) | 92 (30.9) | < .001 |
| Mobility of mass in physical examination | ME_L_mob | < .001 | ||
| Nonpalpable | 25 (10.4) | 28 (9.4) | ||
| Fixed | 3 (1.2) | 35 (11.7) | ||
| Mobile | 213 (88.4) | 235 (78.9) | ||
| Texture of mass in physical examination | ME_L_char | < .001 | ||
| Nonpalpable | 24 (10.0) | 30 (10.1) | ||
| Soft | 137 (56.8) | 99 (33.2) | ||
| Medium | 68 (28.2) | 96 (32.2) | ||
| Hard | 12 (5.0) | 73 (24.5) | ||
| Max nodule diameter | US_N_max_D | < .001 | ||
| <1.00 cm | 26(10.8) | 64(21.5) | ||
| 1.00–1.99 cm | 89(36.9) | 132(44.3) | ||
| 2.00–2.99 cm | 68(28.2) | 70(23.5) | ||
| ≥ 3 cm | 58(24.1) | 32(10.7) | ||
| Taller-than-wide sign (≥1) | US_N_LW_ratio | 45(18.7) | 85(28.5) | 0.008 |
| Nodule maximum area (mm2, median) | US_N_max_area | 276 | 169.5 | 0.002 |
| Cystic lesion present | US_cystic_lesion | 60 (24.9) | 6 (2.0) | < .001 |
| Nodule number | US_N_num | NS | ||
| Single | 78 (32.4) | 102 (34.2) | ||
| Germination | 40 (16.6) | 64 (21.5) | ||
| Multiple | 123 (51.0) | 132 (44.3) | ||
| Nodule position | US_N_pos | NS | ||
| One side | 97(40.2) | 128(43.0) | ||
| Two sides | 144(59.8) | 170(57.0) | ||
| Nodule boundary clear | US_N_bou | 203 (84.2) | 167 (56.0) | < .001 |
| Nodule echoes homogenous | US_L_echoes | 86 (35.7) | 28 (9.4) | < .001 |
| Nodule echo type | US_echoe_type | < .001 | ||
| Hypoechoic | 90 (37.4) | 240 (80.5) | ||
| Isoechoic | 21 (8.7) | 14 (4.7) | ||
| Hyperechoic | 3 (1.2) | 3 (1.0) | ||
| Mixed echogenicity | 127 (52.7) | 41 (13.8) | ||
| Nodule morphology regular | US_L_mor | 199 (82.6) | 121(40.6) | < .001 |
| Nodule peripheral vessels abundant | US_L_vas | 62 (25.7) | 159 (53.4) | < .001 |
| Lymph node number | US_LN_num | < .001 | ||
| None | 211(87.6) | 217(72.9) | ||
| Single | 2(0.8) | 13(4.3) | ||
| Germination | 1(0.4) | 4(1.3) | ||
| Multiple | 27(11.2) | 64(21.5) | ||
| Lymph node morphology regular if present | US_LN_mor | 25/30 | 56/81 | < .001 |
| Lymph node boundary clear if present | US_LN_bou | 27/30 | 64/81 | < .001 |
| Lymph node peripheral vessels abundant if present | US_LN_vas | 7/30 | 13/81 | < .001 |
| Lymph node structure undamaged if present | US_LN_inner_str | 28/30 | 61/81 | < .001 |
| Calcification | US_cal | < .001 | ||
| No calcification | 149 (61.9) | 86 (28.9) | ||
| Calcification | 69 (28.6) | 102 (34.2) | ||
| Micro-calcification | (9.5) | 110(36.9) | ||
NS, not statistically significant (P>0.05)
*These variables are listed as the median. Student’s t test was used for comparison between two groups. Other parameters were compared by χ2-test or Fisher’s test.
Predictive performance of three independent data sets using the clinical information model.
| Test set | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | Accuracy (%) |
|---|---|---|---|---|---|
| Cohort 1-TE | 83.6 | 80.7 | 84.3 | 80.0 | 82.3 |
| Cohort 2 | 74.2 | 87.1 | 82.1 | 81.0 | 81.4 |
| Cohort 3 | 85.3 | 78.9 | 78.4 | 85.7 | 81.9 |
PPV: positive predictive value; NPV: negative predictive value.
Fig 2Histogram of the relative gene expression levels of DPP4, SCG5 and CA12 in malignant and benign thyroid nodules.
**P<0.01 by two-tailed t test between benign and malignant thyroid tumor types. *P<0.05 by two-tailed t test between benign and malignant thyroid tumor types.
Comparison of thyroid cancer predictive performance based on the gene expression, clinical data, or integrated model.
| Test set | Variables | Sensitivity(%) | Specificity(%) | Accuracy(%) |
|---|---|---|---|---|
| Cohort 3 | 3 gene | 85.3 | 57.9 | 70.8 |
| 7 Clinical inform | 79.4 | 73.7 | 76.4 | |
| Gene+Clinical | 88.2 | 68.4 | 77.8 | |
| Cohort 2 | 3 gene | 87.1 | 77.0 | 81.4 |
| 7 Clinical inform | 74.2 | 92.3 | 84.3 | |
| Gene+Clinical | 93.54 | 84.6 | 88.6 | |
| Cohort2+ Cohort 3 | 3 gene | 72.3 | 80.5 | 76.8 |
| 7 Clinical inform | 66.2 | 81.8 | 74.6 | |
| Gene+Clinical | 83.1 | 84.4 | 83.8 |
a.Cohort 2 was used as the training set, and Cohort 3 was employed as the test set.
b.Cohort 3 was used as the training set, and Cohort 2 was employed as the test set.
c.Cohorts 2 and 3 were combined as a data set and validated by 10-fold cross-validation.
d.Model was developed based on the expression of DPP4, SCG5, and CA12.
e.Model was developed based on 7 significant clinical features.
f.Model was developed based on 10 input variables, including the expression levels of 3 genes and 7 significant clinical features.