| Literature DB >> 34906185 |
Jian Zhou1,2, Tong Cheng3, Xing Li3, Jie Hu1, Encheng Li4, Ming Ding5, Rulong Shen6, John P Pineda3, Chun Li1, Shaohua Lu1, Hongyu Yu7, Jiayuan Sun8, Wenbin Huang9, Xiaonan Wang3, Han Si3, Panying Shi3, Jing Liu10, Meijia Chang1, Maosen Dou1, Meng Shi11, Xiaofeng Chen11, Rex C Yung12, Qi Wang4, Ning Zhou13, Chunxue Bai14,15.
Abstract
BACKGROUND: Early lung cancer detection remains a clinical challenge for standard diagnostic biopsies due to insufficient tumor morphological evidence. As epigenetic alterations precede morphological changes, expression alterations of certain imprinted genes could serve as actionable diagnostic biomarkers for malignant lung lesions.Entities:
Keywords: Cancer biomarkers; Epigenetics; Genomic imprinting; In situ hybridization; Lung cancer early diagnosis; Pulmonary nodules
Mesh:
Substances:
Year: 2021 PMID: 34906185 PMCID: PMC8672623 DOI: 10.1186/s13148-021-01203-5
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1Workflow showing the study design and analysis steps of model building, testing and validation
Baseline characteristics of study population
| Model building set | Model testing set | Model validation set | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Normala | Benign | Malignant | Benign | Malignant | Benign | Malignant | ||||
| ( | ( | ( | ( | ( | ( | ( | ||||
| Age | < 0.001 | 0.700 | 0.001 | |||||||
| Median | 52 | 57 | 62 | 62 | 62 | 61 | 64 | |||
| IQR | 45–57 | 49–63 | 53–67 | 49–77 | 56–66 | 52–63 | 58–71 | |||
| Sex (%) | 0.467 | 0.640 | 0.692 | |||||||
| Male | 14 (66.7%) | 28 (54.9%) | 111 (63.8%) | 6 (66.7%) | 17 (81.0%) | 25 (65.8%) | 81 (69.2%) | |||
| Female | 7 (33.3%) | 23 (45.1%) | 63 (36.2%) | 3 (33.3%) | 4 (19.0%) | 13 (34.2%) | 36 (30.8%) | |||
| Sample type | ||||||||||
| Surgically resected tissue specimen | 21 | 51 | 174 | |||||||
| Small biopsy specimen | 9 | 21 | 38 | 36 | ||||||
| Cytology specimen | 81 | |||||||||
| Histologic characteristics no | ||||||||||
| Normal | 21 | |||||||||
| PC | 10 | |||||||||
| PSP | 13 | |||||||||
| TB | 10 | 2 | 7 | |||||||
| COP | 10 | 2 | ||||||||
| PIP | 3 | |||||||||
| Non-TB infections | 3 | 8 | ||||||||
| Inflammation | 3 | 4 | 19 | |||||||
| Granuloma | 2 | 1 | ||||||||
| Hamartoma | 1 | |||||||||
| AdC | 94 | 9 | 61 | |||||||
| SqCC | 76 | 6 | 28 | |||||||
| AdSqLC | 2 | 2 | ||||||||
| LCC | 1 | |||||||||
| SCLC | 5 | 23 | ||||||||
| Carcinoma of unknown primary | 1 | 1 | 3 | |||||||
| Nodule size | e | e | e | |||||||
| < 0.8 cm | 7 | 2 | 10 | 1 | ||||||
| ≥ 0.8–2.0 cm | 3 | 35 | 1 | 1 | 2 | 20 | ||||
| > 2.0–3.0 cm | 48 | 9 | 3 | 32 | ||||||
| > 3.0–5.0 cm | 1 | 52 | 4 | 4 | 42 | |||||
| > 5.0 cm | 28 | 6 | 18 | |||||||
| Unclear LDCT featuresb | 21 | 44 | 5 | 12 | 1 | |||||
| Not specifiedc | 3 | 4 | 1 | 1 | 7 | 3 | ||||
| Smoking statusd (%) | 0.001 | e | 0.006 | |||||||
| Current smoker | 3 (14.3%) | 3 (5.9%) | 56 (32.2%) | 1 (11.1%) | 6 (28.6%) | 7 (18.4%) | 48 (41.0%) | |||
| Former smoker | 1 (4.8%) | 1 (2.0%) | 11 (6.3%) | 0 (0.0%) | 3 (14.3%) | 2 (5.3%) | 11 (9.4%) | |||
| Non-smoker | 15 (71.4%) | 40 (78.4%) | 90 (51.7%) | 1 (11.1%) | 8 (38.1%) | 16 (42.1%) | 40 (34.2%) | |||
| Not specified | 2 (9.5%) | 7 (13.7%) | 17 (9.8%) | 7 (77.8%) | 4 (19.0%) | 13 (34.2%) | 18 (15.4%) | |||
PC, Pulmonary cryptococcosis. PSP, pulmonary sclerosing pneumocytoma. TB, pulmonary tuberculosis. COP, cryptogenic organizing pneumonia. PIP, pulmonary inflammatory pseudotumor. AdC, adenocarcinoma. SqCC, squamous cell carcinoma. AdSqLC, adenosquamous lung carcinoma. LCC, large cell carcinoma. SCLC, small cell lung cancer
aNormal tissue specimens were resected adjacent to the benign lesions
bNo typical nodule under LDCT
cNodule sizes not recorded by doctors
dCases classified as current or former smokers were combined into a single category prior to analysis to comply with the statistical test requirements
eNo analysis proceeded since data transformations applied failed to meet the statistical test requirements
Fig. 2Principle of QCIGISH technology and novel imprinted gene evaluation. A Illustration showing the QCIGISH principle and the respective equations used for calculating BAE, MAE and TE measurements. The QCIGISH method targets the non-coding intronic nascent RNAs to visualize the transcription loci of imprinted genes in the cell nuclei. Blue components in the image are cell nuclei stained using hematoxylin. The distinct red or brown dots represent the detected gene-expressing sites. The different allelic expressions of imprinted genes are quantified based on the transcription signals. Aberrant expressions for abnormal cells exhibit two or more dots, while normal cells contain no to a single dot. B ROC curves showing the significant differences in the AUC values determined for the BAE and MAE of HM13 as compared to the QCIGISH binary classification model during the imprinted gene selection study. *, significant differences between AUC values, p < 0.05
Fig. 3Photomicrographs showing the pathological assessment and confirmation of QCIGISH results. A Comparative analysis of QCIGISH and H&E staining applied on serially resected specimens from the same tissue block of an adenocarcinoma showing increasing imprinted gene SNRPN expression alterations in normal, paracancerous and cancer regions, respectively. The typical normal, paracancerous and cancer regions were all magnified for both QCIGISH and H&E staining. B Illustrated examples showing the visualized allelic expression of imprinted gene GNAS in resected tissue sections of benign lesion and lung cancer subtypes in the model building set. TB, pulmonary tuberculosis. PC, pulmonary cryptococcosis. PIP, pulmonary inflammatory pseudotumor. AdIS, adenocarcinoma in situ. AdC, adenocarcinoma. SqCC, squamous cell carcinoma. LCC, large cell carcinoma. Scale bar, 20 μm
Fig. 4QCIGISH diagnostic performance in benign lesions and lung cancers. A ROC chart showing the AUC performance of the QCIGISH diagnostic grading model in the model building, testing and validation sets. B Computed sensitivities and specificities of the final QCIGISH diagnostic grading model in the model building, testing and validation sets. C Heatmap analysis plot showing the elevated multiallelic expression patterns for the lung cancers as compared to the benign lesions for the cytology and small biopsy specimens from both model testing and validation sets. D Analysis showing high sensitivities of QCIGISH for the diagnosis of different lung cancer subtypes. E Analysis showing high specificities of QCIGISH for the diagnosis of different benign lung lesions. AdC, adenocarcinoma. SqCC, squamous cell carcinoma. AdSqLC, adenosquamous lung carcinoma. SCLC, small cell lung cancer. TB, pulmonary tuberculosis. COP, cryptogenic organizing pneumonia. NTBI, non-TB infections. Infl, inflammation. Gran, granuloma. Hama, hamartoma. Error bars on the bar charts showed the 95% CI
Fig. 5Comparison of the accuracies of QCIGISH and current small biopsy pathology for lung cancer diagnosis. A Analysis showing improved or comparable sensitivities of QCIGISH over small biopsy pathology for different stages of NSCLC. B Analysis showing improved or comparable sensitivity of QCIGISH over small biopsy pathology for different stages of SCLC. C Analysis showing improved or comparable sensitivities of QCIGISH over small biopsy pathology for pulmonary nodules and masses. D Analysis showing comparable specificities of QCIGISH with small biopsy pathology for pulmonary nodules and masses. E List of clinically available minimally invasive sampling procedures for lung lesions. F An illustrated example of clinical cases both positively classified as lung cancer by QCIGISH but were diagnosed as benign by small biopsy pathology. Surgical histopathology or clinical diagnosis with 2-year follow-up were used as golden standard. CIS, carcinoma in situ. N.E.D., no enough data. Error bars on the bar charts showed the 95% CI. Scale bar, 50 μm