Literature DB >> 33629637

Targeted next-generation sequencing for cancer-associated gene mutation and copy number detection in 206 patients with non-small-cell lung cancer.

Songbai Zheng1, Xiaodan Wang1, Ying Fu2,3,4, Beibei Li5,6, Jianhua Xu5, Haifang Wang7, Zhen Huang1, Hui Xu8, Yurong Qiu1, Yaozhou Shi2,3, Kui Li1,8.   

Abstract

The knowledge of genetic variation in Chinese patients with non-small-cell lung cancer (NSCLC) is still limited. We aimed to profile this genetic variation in 206 Chinese patients with NSCLC using next-generation sequencing. Tumor tissues or whole-blood samples were collected and subjected to whole-exome targeted next-generation sequencing, which included 565 tumor-associated genes, for somatic gene mutation screening and copy number variation (CNV) detection. Potential functions of most commonly mutated genes and genes with CNV were predicted by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. Atotal of 18,749 mutations were identified using targeted next-generation sequencing, and 85.3% of them were missense mutations. Among the mutation, conversions between pyrimidine and purine were predominant, and C> T/G > A was the most common substitution type. High frequencies of mutations were noted in TP53 (47.6%), EGFR (41.7%), CREBBP (23.1%), KMT2C (16.9%), MUC2 (16.6%), DNMT3A (15.5%), LRP1B (15.5%), MUC4 (15.5%), CDC27 (15.2%), and KRAS (12.8%). EGFR and KRAS mutations were mutually exclusive. The tumor mutation load showed differences depending on gender and tumor type. CNV analysis showed that BCORL1 and ARAF have the highest copy number amplification, whereas KDM6A and RBM10 showed the highest copy number deletion. GO and KEGG analyses indicated that high-frequency mutations and CNV genes were concentrated in tumor-related PI3K-Akt, FoxO, and Ras signaling pathway. Cumulatively, we studied somatic gene mutations involved in NSCLC and predicted their clinical significance in Chinese population. These findings may provide clues for etiology and drug target of NSCLC.

Entities:  

Keywords:  cancer; lung; mutation; panel; sequencing

Year:  2021        PMID: 33629637      PMCID: PMC8291840          DOI: 10.1080/21655979.2021.1890382

Source DB:  PubMed          Journal:  Bioengineered        ISSN: 2165-5979            Impact factor:   3.269


Introduction

Lung cancer has become the leading deadly malignancy in China and globally, in both men and women [1]. According to 2015 statistics, there were approximately 730,000 new cases of lung cancer in China and more than 430,000 people died from this disease. Lung cancer is divided into non–small-cell lung carcinoma (NSCLC) and small-cell lung carcinoma (SCLC) [2], with NSCLC accounting for more than 85% of cases [3]. Moreover, NSCLC has a high mortality rate. Despite extensive research on different treatment options, patients diagnosed with NSCLC (all stages) have a mortality rate of more than 50% within 1 year and an overall 5-year survival rate of less than 18% [4]. These data suggest that there is still a need for new targeted therapeutic drug research of NSCLC, and analyses of the underlying mechanism of NSCLC from a genetic level may provide clues for finding new therapeutic targets. Next-generation sequencing (NGS) is an approach widely used for the characterization of genetic features. Using an NGS platform, whole-genome sequencing, whole-exome sequencing, whole-transcriptome sequencing, and targeted sequencing can be performed for multiple specific genomic regions. It is a high-throughput and economical method for detecting multiple genetic variations [5]. Many studies have used NGS to analyze genetic variation, tumor mutation burden, and microsatellite instability in solid tumors such as colorectal cancer, gastric cancer, and breast cancer [6,7]. Target sequencing is also used for the identification of variations in genes causing lung cancer. Based on these NGS data, several important genes related to lung cancer have been identified, for exampletumor protein P53 (TP53), phosphatase and tensin homolog (PTEN), epidermal growth factor receptor (EGFR), KRAS proto-oncogene, GTPase (KRAS), neurofibromin 1 (NF1), ATM serine/threonine kinase (ATM), phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA), and fibroblast growth factor receptor 4 (FGFR4) [8-13]. However, the knowledge of genetic variation in NSCLC remains limited in the Chinese population. Existing studies have focused on a small range of genes. For example, Wen et al. performed NGS of 37 cancer-related genes and selected introns of eight genes [14]. Tsoulos et al. focused on a custom panel comprising 23 genes [13,15]. Therefore, a broader panel containing NSCLC-related genes of great significance for the diagnosis and precise treatment of NSCLC is still needed. Here, we established a panel to detect somatic mutations in 206 samples from Chinese patients. To include as many NSCLC-related genes as possible, the panel comprised 565 genes that were associated with sensitivity and side effects of commonly used chemotherapeutic drugs in clinic and cancer risk. Our study expected to provide an overview of the characteristics of tumor genetic variation in Chinese patients with NSCLC, and provide clues for the clinical diagnosis to enable individualized therapy and find new therapeutic targets of NSCLC.

Materials and methods

Patient and DNA isolation

Surgically resected tumor tissues or venous blood samples were collected from 206 NSCLC patients. Genomic DNA was isolated from tissues or blood using the QIAGEN DNeasy Blood & Tissue Kit (#69504, Qiagen, Germany). All patients gave written informed consent to participate in this study.

Whole-exome next-generation and targeted gene sequencing

DNA libraries for whole-exome NGS were prepared using NEBNext® Ultra™ DNA Library Prep Kit (NEB #E7645, NEB, USA) for Illumina, in accordance with the manufacturer’s instructions. Whole-exome capture was performed using TruSeq Exome Enrichment kit (Illumina # 20020183, USA). For targeted gene sequencing, a panel comprising 565 tumor-related genes was prepared. Targeted genes were enriched with the TruSeq Custom Enrichment kits (Illumina). Samples were sequencing using the HiSeq X TEN platform (Illumina).

Bioinformatic analysis

The adapter sequence in the raw data was removed by cutadapt, after which high-quality reads were aligned to the human reference genome (hg19) using BWA [16] with the default parameters. Somatic mutations were detected by MuTect [17] based on the alignment. Somatic SNVs with high confidence were called if the following criteria were met: (I) both tumor and normal samples should have coverage of ≥10× at the genomic position; and (II) the variants should be supported by at least 5% of the total reads in the tumor. Copy number variation (CNV) for each tumor sample was determined by ADTEx [18]. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses of mutated genes were performed using KOBAS [19]. Enriched terms were defined as those with FDR of <0.01.

Statistical analysis

The difference in Tumor mutation burden (TMB) between male and female and adenocarcinoma and squamous carcinoma were analyzed using Student’s t-test method. Correlation between TMB and age were analyzed using Pearson Correlation Coefficient method.

Results

Analyses of the underlying mechanism of NSCLC from a genetic level may provide clues for studying new therapeutic targets for drugs in NSCLC treatment; however, the knowledge of the genetic variation of NSCLC remains limited in Chinese population. Moreover, NGS is a widely used approach for the characterization of genetic characteristics. In this study, we established a panel containing 565 genes that were associated with sensitivity and side effects of commonly used chemotherapeutic drugs in clinic and cancer risk to detect somatic mutations in samples from 206 Chinese patients. A total of 18,749 mutations were identified using targeted NGS and 85.3% of them were missense mutations. Among the mutations, conversions between pyrimidine and purine were dominant, and C > T/G > A was the most common substitution type. High frequencies of mutations were noted in TP53 (47.6%), EGFR (41.7%), CREB binding protein (CREBBP) (23.1%), lysine methyltransferase 2 C (KMT2C) (16.9%), Mucin 2 (MUC2) (16.6%), DNA methyltransferase 3 alpha (DNMT3A) (15.5%), LDL receptor related protein 1B (LRP1B) (15.5%), Mucin 4 (MUC4) (15.5%), cell division cycle 27 (CDC27 (15.2%), and KRAS (12.8%). EGFR and KRAS mutations were mutually exclusive. The tumor mutation load showed BCL6 corepressor like 1 (BCORL1) and a-raf proto-oncogene (ARAF) have the highest copy number amplification, whereas lysine demethylase 6A (KDM6A) and RNA binding motif protein 10 (RBM10) showed the highest copy number deletion. GO and KEGG analyses indicated that high-frequency mutations and CNV genes were concentrated in the tumor-related PI3K-Akt, FoxO, and Ras signaling pathway.

Overview of somatic mutation in patients with NSCLC

To obtain an overview of somatic mutation in Chinese patients with NSCLC patients, we recruited 206 Chinese patients with NSCLC and performed targeted NGS. The mean age of the 206 enrolled patients with NSCLC was 65 years (range 54–86). Of these, 81 (39.3%) were male and 125 (60.7%) were female. Individual clinical information is listed in Table 1. To obtain the somatic mutation spectrum of the 206 patients, next-generation sequencing-based technology was used to capture 565 genes from tumor tissues and peripheral blood of patients with NSCLC. As shown in Figure 1(a), the coverage depth of the captured regions of most genes was at least 50×, with an average coverage depth of 914× (Table 1) (Figure 1(a)).
Table 1.

Clinical information of the 206 NSCLC patients

SampleIDGenderAgeClinical diagnosis
P1Female63Non-small cell lung cancer
P2Male65Adenocarcinoma
P3Male79Adenocarcinoma
P4Male54Adenocarcinoma
P5Female68Adenocarcinoma
P6Male71Squamous
P7Female63Squamous
P8Male72Squamous
P9Male63Squamous
P10Male54Squamous
P11Female74Non-small-cell lung cancer
P12Male59Non-small-cell lung cancer
P13Male69Non-small-cell lung cancer
P14Male44Non-small-cell lung cancer
P15Male68Adenocarcinoma
P16Female49Non-small-cell lung cancer
P17Male57Adenocarcinoma
P18Female61Non-small-cell lung cancer
P19Male56Squamous
P20Male65Adenocarcinoma
P21Male63Squamous
P22Female64Non-small-cell lung cancer
P23Female57Non-small-cell lung cancer
P24Female45Adenocarcinoma
P25Female51Adenocarcinoma
P26Female50Adenocarcinoma
P27Male82Non-small-cell lung cancer
P28Male56Non-small-cell lung cancer
P29Female64Adenocarcinoma
P30Male71Non-small-cell lung cancer
P31Male46Adenocarcinoma
P32Male52Squamous
P33Female48Non-small-cell lung cancer
P34Male61Non-small-cell lung cancer
P35Male35Squamous
P36Male69Small cell lung cancer
P37Female69Non-small-cell lung cancer
P38Male64Non-small-cell lung cancer
P39Male65Non-small-cell lung cancer
P40Female75Non-small-cell lung cancer
P41Female58Adenocarcinoma
P42Female38Adenocarcinoma
P43Female63Non-small-cell lung cancer
P44Male62Non-small-cell lung cancer
P45Male79Non-small-cell lung cancer
P46Male51Non-small-cell lung cancer
P47Female60Adenocarcinoma
P48Male62Non-small-cell lung cancer
P49Male68Non-small-cell lung cancer
P50Female69Non-small-cell lung cancer
P51Female53Non-small-cell lung cancer
P52Female57Adenocarcinoma
P53Female61Adenocarcinoma
P54Male58Non-small-cell lung cancer
P55Male54Neuroendocrine
P56Female77Non-small-cell lung cancer
P57Female35Non-small-cell lung cancer
P58Female70Adenocarcinoma
P59Male79Non-small-cell lung cancer
P60Male66Non-small-cell lung cancer
P61Male68Non-small-cell lung cancer
P62Male68Non-small-cell lung cancer
P63Male61Non-small-cell lung cancer
P64Male80Non-small-cell lung cancer
P65Male70Non-small-cell lung cancer
P66Female39Adenocarcinoma
P67Female50Adenocarcinoma
P68Male67Non-small-cell lung cancer
P69Male49Non-small-cell lung cancer
P70Male72Adenocarcinoma
P71Male54Non-small-cell lung cancer
P72Male52Adenocarcinoma
P73Female68Non-small-cell lung cancer
P74Female73Non-small-cell lung cancer
P75Male69Adenocarcinoma
P76Female71Adenocarcinoma
P77Female66Non-small-cell lung cancer
P78Male69Adenocarcinoma
P79Male62Squamous
P80Male54Non-small-cell lung cancer
P81Female47Non-small-cell lung cancer
P82Male76Non-small-cell lung cancer
P83Male86Non-small-cell lung cancer
P84Male73Non-small-cell lung cancer
P85Male72Non-small-cell lung cancer
P86Male43Adenocarcinoma
P87Female67Adenocarcinoma
P88Male55Non-small-cell lung cancer
P89Male77Small cell lung cancer
P90Male57Non-small-cell lung cancer
P91Female54Adenocarcinoma
P92Male65Neuroendocrine
P93Female72Adenocarcinoma
P94Male62Squamous
P95Female45Non-small-cell lung cancer
P96Female45Non-small-cell lung cancer
P97Female51Adenocarcinoma
P98Female65Non-small-cell lung cancer
P99Male61Adenocarcinoma
P100Male79Squamous
P101Female64Adenocarcinoma
P102Male75Non-small-cell lung cancer
P103Male67Adenocarcinoma
P104Male72Non-small-cell lung cancer
P105Male79Adenocarcinoma
P106Female51Non-small-cell lung cancer
P107Female78Non-small-cell lung cancer
P108Male58Non-small-cell lung cancer
P109Female69Adenocarcinoma
P110Male82Non-small-cell lung cancer
P111Male76Non-small-cell lung cancer
P112Male61Adenocarcinoma
P113Female64Adenocarcinoma
P114Female69Non-small-cell lung cancer
P115Male85Adenocarcinoma
P116Male56Non-small-cell lung cancer
P117Female62Non-small-cell lung cancer
P118Male62Squamous
P119Male56Squamous
P120Male68Squamous
P121Male63Adenocarcinoma
P122Male58Non-small-cell lung cancer
P123Male64Adenocarcinoma
P124Male68Non-small-cell lung cancer
P125Male59Adenocarcinoma
P126Male67Non-small-cell lung cancer
P127Female78Non-small-cell lung cancer
P128Female66Non-small-cell lung cancer
P129Female67Non-small-cell lung cancer
P130Female57Non-small-cell lung cancer
P131Female74Non-small-cell lung cancer
P132Male55Non-small-cell lung cancer
P133Male62Squamous
P134Male66Squamous
P135Female56Non-small-cell lung cancer
P136Male60Non-small-cell lung cancer
P137Male81Non-small-cell lung cancer
P138Male63Non-small-cell lung cancer
P139Female49Adenocarcinoma
P140Male56Non-small-cell lung cancer
P141Male74Non-small-cell lung cancer
P142Female49Non-small-cell lung cancer
P143Male65Non-small-cell lung cancer
P144Female52Adenocarcinoma
P145Male40Non-small-cell lung cancer
P146Male66Adenocarcinoma
P147Female65Small cell lung cancer
P148Female68Large cell lung cancer
P149Male41Adenocarcinoma
P150Male54Adenocarcinoma
P151Female53Non-small-cell lung cancer
P152Male76Non-small-cell lung cancer
P153Female49Non-small-cell lung cancer
P154Female71Adenocarcinoma
P155Male69Non-small-cell lung cancer
P156Male60Adenocarcinoma
P157Male52Non-small-cell lung cancer
P158Female68Non-small-cell lung cancer
P159Male62Adenocarcinoma
P160Male75Non-small-cell lung cancer
P161Male65Non-small-cell lung cancer
P162Male65Non-small-cell lung cancer
P163Male55Non-small-cell lung cancer
P164Male68Non-small-cell lung cancer
P165Male57Adenocarcinoma
P166Female48Neuroendocrine
P167Male73Adenocarcinoma
P168Male62Adenocarcinoma
P169Female70Adenocarcinoma
P170Female61Non-small-cell lung cancer
P171Male65Adenocarcinoma
P172Male75Non-small-cell lung cancer
P173Male53Non-small-cell lung cancer
P174Female53Non-small-cell lung cancer
P175Male75Adenocarcinoma
P176Male40Non-small-cell lung cancer
P177Male65Non-small-cell lung cancer
P178Female67Non-small-cell lung cancer
P179Male70Non-small-cell lung cancer
P180Male55Non-small-cell lung cancer
P181Female68Small cell lung cancer
P182Male56Adenocarcinoma
P183Male66Non-small-cell lung cancer
P184Female70Non-small-cell lung cancer
P185Male62Squamous
P186Female55Adenocarcinoma
P187Female71Adenocarcinoma
P188Female63Adenocarcinoma
P189Female69Non-small-cell lung cancer
P190Female51Adenocarcinoma
P191Female46Adenocarcinoma
P192Female74Non-small-cell lung cancer
P193Female61Non-small-cell lung cancer
P194Male47Non-small-cell lung cancer
P195Male68Squamous
P196Male49Non-small-cell lung cancer
P197Female83Non-small-cell lung cancer
P198Female66Adenocarcinoma
P199Male56Squamous
P200Male54Non-small-cell lung cancer
P201Male51Non-small-cell lung cancer
P202Female70Adenocarcinoma
P203Female62Non-small-cell lung cancer
P204Male66Adenocarcinoma
P205Male56Non-small-cell lung cancer
P206Male63Adenocarcinoma
Figure 1.

Overview of the mutation status of the 206 patients with NSCLC based on next-generation sequencing. (a) Coverage depth for gene regions. Distribution of gene mutation types (b) and single mutation types (c) in the 206 patients with NSCLC. (d) Schematic showing 30 genes with the highest mutation frequency

Clinical information of the 206 NSCLC patients Overview of the mutation status of the 206 patients with NSCLC based on next-generation sequencing. (a) Coverage depth for gene regions. Distribution of gene mutation types (b) and single mutation types (c) in the 206 patients with NSCLC. (d) Schematic showing 30 genes with the highest mutation frequency A total of 18,749 mutations were identified, and the dominant mutation type was missense mutation (85.3%) (Figure 1(b), Table 2). Single-mutation variation analysis revealed that the dominant base mutations predominantly involved purines (Figure 1(c)) and that C > T/G > A was the most common substitution type. Of the mutated genes, 79 had a mutation frequency of more than 5%. Among these, the top ten most frequently mutated genes were TP53 (47.6%), EGFR (41.7%), CREBBP (23.1%), KMT2C (16.9%), MUC2 (16.6%), DNMT3A (15.5%), LRP1B (15.5%), MUC4 (15.5%), CDC27 (15.2%), and KRAS (12.8%) (Figure 1(d)).
Table 2.

Mutation information of the 206 NSCLC patients

TypeNumberPercentage
Synonymous320.17%
Missense15,98585.26%
Nonsense9965.31%
Readthough380.20%
Splicing6533.48%
Frameshift deletion5853.12%
Frameshift insertion1991.06%
In-frameshift deletion2141.14%
In-frameshift insertion470.25%
Total18,749100.00%
Mutation information of the 206 NSCLC patients

TMB analysis in patients with NSCLC

TMB has been proved to be an immunotherapy biomarker in clinical oncology, including NSCLC. To explore the association between TMB and NSCLC in Chinese patients, we performed comparative analysis of the sexes and different tumor subtypes showed that TMB in females was lower than that in males (Figure 2(a)). The median TMB for men is 6.6 Mutations/Mb, and the median TMB for women is 3.7 Mutations/Mb. The median TMB for men is 1.78 times that for women (Figure 2(a)).
Figure 2.

TMB analysis of 206 patients with NSCLC. (a) Differences in TMB by sex. (b) Differences in TMB by tumor type. (c) Correlation between TMB and age. TMB, tumor mutation burden

TMB analysis of 206 patients with NSCLC. (a) Differences in TMB by sex. (b) Differences in TMB by tumor type. (c) Correlation between TMB and age. TMB, tumor mutation burden Significantly higher TMB was observed in squamous carcinoma than that in adenocarcinoma (Figure 2(b)). The median TMB of lung adenocarcinoma is 4.3 Mutations/Mb, and the median TMB of lung squamous is 11.1 Mutations/Mb, 2.58 times that of lung adenocarcinoma (Figure 2(b)). To investigate the association between TMB and age, we compared TMB (range, 0–52.2 Mutations/Mb; median, 5.3 Mutations/Mb) and patient age (range, 35–86 years; median, 63 years). Correlation analysis showed that the correlation between the two was not significant (correlation coefficient R = 0.160, P = 0.074) (Figure 2(c)).

Analysis of most commonly mutated genes in patients with NSCLC

Gene mutation has been proved to be closely associated with tumor development, and identification of the isoform of gene mutation might benefit therapy. We analyzed the ten most frequently mutated genes in tumor tissues of patients with NSCLC and found that all patients had at least one high-frequency mutation. Of the 206 cases, no KRAS mutation was observed in patients with EGFR mutations (Figure 3). MutationMapper analysis showed that, in addition to DNMT3A, the mutation sites of the other nine high-frequency mutation genes were R249S/M, L858R, Q1950P, R886 C, T1488I/P, S2589, S2704P, C115R, G12 CN/D. Out of these nine, mutant hotspots of TP53 (R249S/M), EGFR (L858R), and KRAS (G12 CN/D) were located P53 DNA-binding domain, Protein tyrosine kinase domain and Ras family domain respectively (Figure 4).
Figure 3.

The top 10 high-frequency mutation genes of 206 patients were visualized by OncoPrinter. Each gray box from left to right represents the mutation of a sample

Figure 4.

Diagram showing mutant sites and frequency of the top 10 genes harboring high-frequency mutations

The top 10 high-frequency mutation genes of 206 patients were visualized by OncoPrinter. Each gray box from left to right represents the mutation of a sample Diagram showing mutant sites and frequency of the top 10 genes harboring high-frequency mutations GO and KEGG enrichment analyses showed that the top 10 high-frequency mutant genes were mainly enriched in terms of organelle lumen, membrane-enclosed lumen, intracellular organelle lumen, cellular macromolecule metabolic process, aromatic compound biosynthetic process (Figure 5(a)), and pathways including microRNAs in cancer, pathway in cancer, Notch signaling pathway, and FoxO signaling pathway (Figure 5(b)).
Figure 5.

GO (a) and KEGG (b) enrichment analyses for genes with mutation frequency of >5%. GO, Gene Ontology, KEGG, Kyoto Encyclopedia of Genes and Genomes

GO (a) and KEGG (b) enrichment analyses for genes with mutation frequency of >5%. GO, Gene Ontology, KEGG, Kyoto Encyclopedia of Genes and Genomes

Analysis of copy number variations in patients with NSCLC

Because CNV may indicate dysregulated gene and protein expression that may ultimately affect development and progression of NSCLC, we further explored gene CNV in Chinese patients with NSCLC. CNV analysis showed that 110 genes had copy number amplification. Among these, BCORL1, ARAF, GATA binding protein 1 (GATA1), bruton tyrosine kinase (BTK), and P21 (RAC1) activated kinase 3 (PAK3) were the genes with the highest copy number amplification (Figure 6(a)). These genes are mainly concentrated in the terms of protein binding, positive regulation of macromolecule metabolic process, regulation of cellular process, positive regulation of metabolic process, and regulation of macromolecule metabolic process (Figure 6(b)). KEGG analysis revealed that, for the genes with the highest copy number amplification, their predicted functions were enriched in transcriptional dysregulation in cancer, pathway in cancer, PI3K-Akt signaling pathway, and Ras signaling pathway (Figure 6(c)).
Figure 6.

Of enrichment analysis of the 30 genes (a) and their GO (b) and KEGG enrichment analysis results (c) with the highest copy number increase in 206 samples. GO, Gene Ontology, KEGG, Kyoto Encyclopedia of Genes and Genomes

Of enrichment analysis of the 30 genes (a) and their GO (b) and KEGG enrichment analysis results (c) with the highest copy number increase in 206 samples. GO, Gene Ontology, KEGG, Kyoto Encyclopedia of Genes and Genomes A total of 54 genes had copy number deletion. The genes with the highest copy number deletions were KDM6A, RBM10, TATA-box binding protein associated factor 1 (TAF1), ARAF, and stromal antigen 2 (STAG2) (Figure 7(a)). They were predicted to be enriched in terms of cellular macromolecule metabolic process, macromolecule modification, regulation of cellular process, macromolecule metabolic process, and cellular protein modification process (Figure 7(b)). The most enriched pathways were pathway in cancer, PI3K-Akt signaling pathway, and cell cycle (Figure 7(c)).
Figure 7.

The top 30 genes with the highest copy number deletion in 206 samples (a) and their GO (b) and KEGG (c) enrichment analysis results. GO, Gene Ontology, KEGG, Kyoto Encyclopedia of Genes and Genomes

The top 30 genes with the highest copy number deletion in 206 samples (a) and their GO (b) and KEGG (c) enrichment analysis results. GO, Gene Ontology, KEGG, Kyoto Encyclopedia of Genes and Genomes

Discussion

The purpose of this study was to identify the mutational characteristics of 206 Chinese patients with NSCLC. We identified 18,749 mutations by using targeted NGS. Among these mutations, missense mutations were dominant. Base mutations were dominated by pyrimidine and purine conversions. The ten most frequently mutated genes were obtained. Notably, EGFR and KRAS mutations were mutually exclusive. There were differences in TMB between the sexes and pathological subtypes; however, TMB was not associated with age. Finally, 110 genes and 54 genes showed copy number amplification and copy number deletion, respectively. These genes were specifically enriched in the NSCLC-associated pathways. Based on the targeted NGS, we determined the most frequently mutated genes in Chinese patients with NSCLC. These genes were TP53, EGFR, CREBBP, KMT2C, MUC2, DNMT3A, LRP1B, MUC4, CDC27, and KRAS. Mutations in these genes have been reported previously in NSCLC [20]. Interestingly, the genes with the highest mutation frequency differed in their rankings compared with the findings of a study on the American population. In the study, they showed that the most frequently mutated gene in this report is KRAS, followed by EGFR [10]. However, our results are also consistent with the results in some reports. For example, a study in Lebanon showed that mutations of TP53 are common molecular changes, occurring in over 50% of tumors [21,22]. In an NSCLC study with a small sample size, TP53 was also found to be the most frequently mutated gene in the Chinese population [15]. These indicate that TP53 mutation might be one of the genes affected in Chinese patients with NSCLC. In addition, our results also support the idea reported in a previous study that the mutant hotspot area of TP53 is located in the common R249 area [23]. It has been accepted that TP53 is an important tumor suppressor and the most commonly mutated gene in most cancers. As a prognostic factor in NSCLC, the presence of TP53 mutation suggested an aggressive feature and poor clinical outcome [24]. Our results show that EGFR ranks second in terms of the mutation frequency, at a rate of 41.7%. Based on previous studies, the mutation rate of EGFR in Chinese patients with NSCLC is approximately 30%–50% [23,25]. The frequency of EGFR mutations that we obtained is also in this . It is worth mentioning that we found the hotspot mutation L858R of the EGFR gene, which is also considered to be a high-frequency mutation in Asia [26,27]. There is evidence that patients harboring common EGFR mutations exhibit approximately 10 months progression free survival time after EGFR tyrosine kinase inhibitor (TKI) therapy, whereas those with uncommon EGFR mutations exhibit less response to EGFR TKI [28-30]. Therefore, our findings indicate that most Chinese patients with NSCLC might benefit from EGFR TKI treatment. However, in those NSCLC harboring dual TP53/EGFR mutations, especially missense mutations, low response is frequently observed [31]. In addition to TP53 and EGFR, KRAS is also a commonly mutated gene in NSCLC. In some reports, it is described that the frequency of conversion of KRAS in the Chinese is approximately 8% [25,32]. Here, we report a mutation rate of the KRAS gene of 12.8% [33]. In contrast to the widely reported high-frequency mutated genes mentioned above, CREBBP (23.1%), KMT2C (16.9%), MUC2 (16.6%), DNMT3A (15.5%), LRP1B (15.5%), MUC4 (15.5%), and CDC27 (15.2%) are currently reported less in the Chinese population, although mutations in DNMT3A and KMT2C have been identified in some studies [20,33-35]. Our results suggest some aspects of the mutational characteristics of these genes in Chinese NSCLC, suggesting functions of these genes in the etiology and treatment of NSCLC. It is worth mentioning that we observed that patients with NSCLC having EGFR mutations have no KRAS mutations, and vice versa. This is consistent with the previous assertion that EGFR and KRAS mutations are mutually exclusive in NSCLC, although some cases of EGFR and KRAS mutations being present together in some Asian populations, including in China, have been reported [25,36]. The genome in NSCLC is unstable and exhibits a wide range of gene CNVs. Because CNV is closely related to the expression of mRNA and protein, copy number amplification or deletion may affect the expression of tumor-related genes and the patient’s sensitivity to treatment and survival [37]. Analysis of the variation of copy number is helpful for learning underlying mechanisms and functions of related genes in patients with NSCLC. Our results show that the genes with the most increased copy number were BCORL1, ARAF, and GATA1, while those with the greatest deletion of copy number were KDM6A, RBM10, TAF1, ARAF, and STAG2. Among these genes, evidence suggests that patients with high expression of BCORL1 have a shorter 3-year survival than patients with its low expression [38]. In addition, RBM10 functions to inhibiting the proliferation of non-adenocarcinoma cells [39]. We speculate that the increase in BCORL1 copy number and deletion of RBM10 copy number may suggest their roles in the pathogenesis of NSCLC. The results of GO and KEGG enrichment analyses of genes with frequent mutations and CNV suggest that the mutant genes are enriched in tumor-related terms and signaling pathways. These pathways include the PI3K-Akt signaling pathway, FoxO signaling pathway, and Ras signaling pathway. The correlation between activation of the Notch signaling pathway and poor prognosis of NSCLC has been confirmed [40,41]. PI3K-Akt is an important signaling pathway that regulates tumor formation, survival and metastasis [42,43]. One of its downstream factors is the FoxO signaling pathway. Akt promotes the phosphorylation of FoxO and inhibits the transcriptional function of FoxO, potentially resulting in the induction of apoptosis, which is involved in biological processes such as NSCLC radiosensitization and tumor growth inhibition [44-46]. Moreover, the Ras signaling pathway is a proto-cancer pathway. Multiple tumor-promoting factors and drugs have been found to modulate tumor progression through this pathway [47-49]. Based on KEGG analysis, we suggest that the high frequency of mutation genes and CNV genes are associated with these tumor-related pathways. Inhibitors targeting these pathways may thus have clinical significance. It is interesting to find that TMB was higher in men than in women. Since we were unable to correlate the current data such as TMB with the treatment outcomes of men and women, the clinical prognostic value of genetic mutations could not be derived. Subsequent research on the links between the mutant genes and the clinical data of this patient population will further enrich the clinical value of the mutant gene.

Conclusion

The most common gene mutations in Chinese patients with NSCLC are missense mutations, and TP53, EGFR, CREBBP, KMT2C, MUC2 genes are the most frequently mutated genes. Several genes exhibited copy number amplification and copy number deletion. There were differences in TMB between the sexes and pathological subtypes; however, TMB was not associated with age. Our findings indicate that the panel is a good method for tumor molecular characterization In addition, our results are expected to provide clues for interpreting the etiology of NSCLC and performing drug target screening for this condition.
  45 in total

Review 1.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

2.  Prognostic and predictive effects of TP53 co-mutation in patients with EGFR-mutated non-small cell lung cancer (NSCLC).

Authors:  Catherine Labbé; Michael Cabanero; Grzegorz J Korpanty; Pascale Tomasini; Mark K Doherty; Céline Mascaux; Kevin Jao; Bethany Pitcher; Rick Wang; Melania Pintilie; Natasha B Leighl; Ronald Feld; Geoffrey Liu; Penelope Ann Bradbury; Suzanne Kamel-Reid; Ming-Sound Tsao; Frances A Shepherd
Journal:  Lung Cancer       Date:  2017-06-24       Impact factor: 5.705

Review 3.  Notch signaling in lung cancer.

Authors:  Paola Galluzzo; Maurizio Bocchetta
Journal:  Expert Rev Anticancer Ther       Date:  2011-04       Impact factor: 4.512

4.  Precision medicine in age-specific non-small-cell-lung-cancer patients: Integrating biomolecular results into clinical practice-A new approach to improve personalized translational research.

Authors:  Tiziana Vavalà; Valentina Monica; Marco Lo Iacono; Teresa Mele; Simone Busso; Luisella Righi; Mauro Papotti; Giorgio Vittorio Scagliotti; Silvia Novello
Journal:  Lung Cancer       Date:  2016-05-31       Impact factor: 5.705

5.  Screening for epidermal growth factor receptor mutations in lung cancer.

Authors:  Rafael Rosell; Teresa Moran; Cristina Queralt; Rut Porta; Felipe Cardenal; Carlos Camps; Margarita Majem; Guillermo Lopez-Vivanco; Dolores Isla; Mariano Provencio; Amelia Insa; Bartomeu Massuti; Jose Luis Gonzalez-Larriba; Luis Paz-Ares; Isabel Bover; Rosario Garcia-Campelo; Miguel Angel Moreno; Silvia Catot; Christian Rolfo; Noemi Reguart; Ramon Palmero; José Miguel Sánchez; Roman Bastus; Clara Mayo; Jordi Bertran-Alamillo; Miguel Angel Molina; Jose Javier Sanchez; Miquel Taron
Journal:  N Engl J Med       Date:  2009-08-19       Impact factor: 91.245

6.  Somatic Mutation Spectrum of Non-Small-Cell Lung Cancer in African Americans: A Pooled Analysis.

Authors:  Luiz H Araujo; Philip E Lammers; Velmalia Matthews-Smith; Rosana Eisenberg; Adriana Gonzalez; Ann G Schwartz; Cynthia Timmers; Konstantin Shilo; Weiqiang Zhao; Thanemozhi G Natarajan; Jianying Zhang; Ayse Selen Yilmaz; Tom Liu; Kevin Coombes; David P Carbone
Journal:  J Thorac Oncol       Date:  2015-10       Impact factor: 15.609

Review 7.  Effect and biomarker of Nivolumab for non-small-cell lung cancer.

Authors:  Li Wang; Deze Zhao; Kang Qin; Faisal Ul Rehman; Xiaochun Zhang
Journal:  Biomed Pharmacother       Date:  2019-07-05       Impact factor: 6.529

8.  A prospective, molecular epidemiology study of EGFR mutations in Asian patients with advanced non-small-cell lung cancer of adenocarcinoma histology (PIONEER).

Authors:  Yuankai Shi; Joseph Siu-Kie Au; Sumitra Thongprasert; Sankar Srinivasan; Chun-Ming Tsai; Mai Trong Khoa; Karin Heeroma; Yohji Itoh; Gerardo Cornelio; Pan-Chyr Yang
Journal:  J Thorac Oncol       Date:  2014-02       Impact factor: 15.609

9.  Integrative analysis of genome-wide gene copy number changes and gene expression in non-small cell lung cancer.

Authors:  Verena Jabs; Karolina Edlund; Helena König; Marianna Grinberg; Katrin Madjar; Jörg Rahnenführer; Simon Ekman; Michael Bergkvist; Lars Holmberg; Katja Ickstadt; Johan Botling; Jan G Hengstler; Patrick Micke
Journal:  PLoS One       Date:  2017-11-07       Impact factor: 3.240

10.  Frequency of Somatic TP53 Mutations in Combination with Known Pathogenic Mutations in Colon Adenocarcinoma, Non-Small Cell Lung Carcinoma, and Gliomas as Identified by Next-Generation Sequencing.

Authors:  Zahra Shajani-Yi; Francine B de Abreu; Jason D Peterson; Gregory J Tsongalis
Journal:  Neoplasia       Date:  2018-02-16       Impact factor: 5.715

View more
  4 in total

1.  Identification of CDK2-Related Immune Forecast Model and ceRNA in Lung Adenocarcinoma, a Pan-Cancer Analysis.

Authors:  Ting-Ting Liu; Rui Li; Chen Huo; Jian-Ping Li; Jie Yao; Xiu-Li Ji; Yi-Qing Qu
Journal:  Front Cell Dev Biol       Date:  2021-07-30

2.  Syringin exerts anti-breast cancer effects through PI3K-AKT and EGFR-RAS-RAF pathways.

Authors:  Fei Wang; Chong Yuan; Bo Liu; Yan-Fang Yang; He-Zhen Wu
Journal:  J Transl Med       Date:  2022-07-06       Impact factor: 8.440

Review 3.  KRAS mutation: from undruggable to druggable in cancer.

Authors:  Lamei Huang; Zhixing Guo; Fang Wang; Liwu Fu
Journal:  Signal Transduct Target Ther       Date:  2021-11-15

4.  An advanced non-small cell lung cancer patient with EGFR and KRAS mutations, and PD-L1 positive, benefited from immunotherapy: a case report.

Authors:  Ronghua Yang; Dong Wang; Xia Li; Kaiping Mao; Jinglong Wang; Peng Li; Xiaoliang Shi; Shanshan Zhang; Yongjie Wang
Journal:  Ann Transl Med       Date:  2022-03
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.