Literature DB >> 32638549

Spectrum of gene mutations identified by targeted next-generation sequencing in Chinese leukemia patients.

Hongxia Yao1, Congming Wu1, Yueqing Chen2, Li Guo1, Wenting Chen1, Yanping Pan1, Xiangjun Fu1, Guyun Wang1, Yipeng Ding3.   

Abstract

BACKGROUND: Despite targeted sequencing have identified several mutations for leukemia, there is still a limit of mutation screening for Chinese leukemia. Here, we used targeted next-generation sequencing for testing the mutation patterns of Chinese leukemia patients.
METHODS: We performed targeted sequencing of 504 tumor-related genes in 109 leukemia samples to identify single-nucleotide variants (SNVs) and insertions and deletions (INDELs). Pathogenic variants were assessed based on the American College of Medical Genetics and Genomics (ACMG) guidelines. The functional impact of pathogenic genes was explored through gene ontology (GO), pathway analysis, and protein-protein interaction network in silico.
RESULTS: We identified a total of 4,655 SNVs and 614 INDELs in 419 genes, in which PDE4DIP, NOTCH2, FANCA, BCR, and ROS1 emerged as the highly mutated genes. Of note, we were the first to demonstrate an association of PDE4DIP mutation and leukemia. Based on ACMG guidelines, 39 pathogenic and likely pathogenic mutations in 27 genes were found. GO annotation showed that the biological process including gland development, leukocyte differentiation, respiratory system development, myeloid leukocyte differentiation, mesenchymal to epithelial transition, and so on were involved.
CONCLUSION: Our study provided a map of gene mutations in Chinese patients with leukemia and gave insights into the molecular pathogenesis of leukemia.
© 2020 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals LLC.

Entities:  

Keywords:  INDELs; Leukemia; SNVs; gene ontology; pathway analysis

Mesh:

Substances:

Year:  2020        PMID: 32638549      PMCID: PMC7507579          DOI: 10.1002/mgg3.1369

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Leukemia is malignant disorders of the blood and bone marrow (Musharraf, Siddiqui, Shamsi, Choudhary, & Rahman, 2016). In China, there were 75,300 new cases of leukemia and 53,400 death of leukemia in 2015 (Chen et al., 2016). The incidence and mortality of leukemia in males is higher than those in females, and myeloid leukemia has significantly higher levels of incidence and mortality than lymphoid leukemia (Liu, Zhao, Chen, & Chen, 2013). Acute myeloid leukemia (AML), acute lymphoblastic leukemia (ALL), acute promyelocytic leukemia (APL), chronic myeloid leukemia (CML), and chronic lymphocytic leukemia (CLL) are the common types of leukemia (Juliusson & Hough, 2016). Acute leukemia is composed of primary undifferentiated cells; while chronic leukemia, the malignant cells are more differentiated. Exposure to environmental radiation and solvents were reported as predisposing factors for leukemia (Schuz & Erdmann, 2016). However, the direct cause of leukemia has not been found. Recently, mutation profiling of genes provides prognostic prediction and treatment guidance for patients with leukemia (Itzykson et al., 2013; Shin et al., 2016). Reports on mutational patterns of Chinese leukemia patients are limited. The application of next‐generation sequencing (NGS) technique can better achieve testing for a larger group of mutational markers. NGS is a massively parallel high‐throughput DNA sequencing approach (Metzker, 2010). The major advantages of this approach are that simultaneously screen a large number of genes and samples using very low amount of nucleic acids, and have high sensitivity for mutation detection (Meldrum, Doyle, & Tothill, 2011). Types of NGS include whole‐genome sequencing, whole‐exome sequencing, whole‐transcriptome sequencing, and targeted regions sequencing (Ross & Cronin, 2011). Targeted regions sequencing for multiple specific genomic regions have been widely employed in many fields to identify genetic variants related to disease pathogenesis and prognosis (Mansouri et al., 2014). Despite targeted sequencing have identified several mutations for leukemia, there is still a limit of mutation screening for Chinese leukemia. Here, we performed targeted regions sequencing containing these 504 genes in 109 patients with leukemia among Chinese Han population to explore the genetic basis of Chinese leukemia patients.

MATERIALS AND METHODS

Study population

A total of 109 patients diagnosed with leukemia at Hainan General Hospital were enrolled. All of the patients were genetically unrelated ethnic Han Chinese. Patients with leukemia were diagnosed according to 2016 WHO classification criteria (Sabattini, Bacci, Sagramoso, & Pileri, 2010). The samples were confirmed by bone marrow microscopy, flow immunophenotyping, chromosome screening, and fusion gene detection. Our study was approved by the ethical review board of Hainan General Hospital, and complied with the Declaration of Helsinki. Informed written consent was obtained from all patients enrolled in this study.

DNA extraction and sequencing

Peripheral blood (5 ml) was collected from each subject into EDTA‐coated vacutainer tubes. Genomic DNA was isolated using a commercially available DNA extraction kit (GoldMag Co. Ltd.), and then quantified using a NanoDrop 2000 Spectrophotometer (NanoDrop Technologies). The sample prepared using a Truseq DNA Sample preparation Kit (Illumina) following the standard protocol. Agilent SureDesign website (https://earray.chem.agilent.com/‐suredesign/home.htm) was used to design capture oligos for 504 cancer‐related genes. Paired‐end libraries were prepared following the Illumina protocol. Hybridization reactions were performed on AB 2720 Thermal Cycler (Life Technologies Corporation). The hybridization mixture was captured using magnetic beads (Invitrogen) and Agilent Custom Sureselect Enrichment Kit according to the manufacturer's instructions. Sequencing (2 × 150 bp reads) was carried out on Illumina HiSeq2500 platform (Illumina). Sequencing reads were aligned to the human reference genome (UCSC Genome Browser hg19, http://genome.ucsc.edu/) using the Burrows‐Wheeler Aligner (Li & Durbin, 2010). Picard software (https://github.com/broadinstitute/picard) was employed to remove duplicate PCR reads and evaluate the quality of variants by attaining effective reads, effective base, and average coverage depth. Sequencing quality controls were as follows: (a) Q20 and Q30 more than 90% and 80%, respectively; (b) coverage of target regions more than 99%; and mapping rate no less than 95%. single‐nucleotide variant (SNV) calling was performed using GATK and Varscan programs (Koboldt et al., 2012; McKenna et al., 2010). Variants were annotated using ANNOVAR (http://annovar.openbioinformatics.org/en/latest/).

In silico analysis

Pathogenic or likely pathogenic SNVs was assessed by the 1000 Genomes Project, Sorting Tolerant From Intolerant (SIFT), PolyPhen, MutationTaster, and Combined Annotation Dependent Depletion (CADD) based on the American College of Medical Genetics and Genomics (ACMG) guidelines (Richards et al., 2015). Pathway enrichment analysis of gene ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) for the candidate pathogenic genes was carried out using R clusterprofiler software package (http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html). GO terminology was annotated on the following three aspects: cellular component (GO‐CC), molecular function (GO‐MF), and biological process (GO‐BP). Pathway enrichment based on KEGG pathway database (https://www.kegg.jp/kegg/) was applied for pathway annotation. GO terms and KEGG pathway with p < .05 were considered to be significant. Protein–protein interaction (PPI) network was predicted by STRING online software (https://string‐db.org/). GEPIA (http://gepia.cancer‐pku.cn/) database was used to evaluate the expression and prognostic of candidate genes in leukemia.

RESULTS

Clinical characteristics of patients with leukemia

A total of 109 patients with leukemia were identified. The median age of the studied cohort was 39.83 years (range: 11–77 years), included 58 (53.2%) males and 51 (46.8%) females (Table 1). Among these, 30 patients diagnosed as AML, 19 patients as ALL, 24 patients as APL, 28 patients as CML, and 8 patients as CLL.
Table 1

Characteristics of patients with leukemia

VariableTotal (n = 109)AML (n = 30)ALL (n = 19)APL (n = 24)CML (n = 28)CLL (n = 8)
Age, years39.8 (11–77)29.5 (12–74)36 (13–73)41 (16–62)42 (11–75)60.5 (51–77)
Gender
Male, n 58131011186
Female, n 5117913102
SNVs and INDELs
SNVs, n 4,6552,4351,8702,5702,7012,661
Indels, n 614359309388408393
Blast (%, IQR)16.00 (2.50–47.50)36.50 (26.25–59.00)47.50 (14.00–80.00)2.25 (1.50–2.88)1.25 (1.00–4.50)3.00 (2.13–6.75)
Lymphocytes population (%, IQR)16.00 (6.65–41.42)13.60 (8.18–18.43)23.90 (14.90–44.30)31.86 (14.28–43.65)6.90 (1.95–23.4)79.25 (64.00–86.18)
BCR‐ABL1 transcript positive (n)24

Abbreviations: ALL, acute lymphoblastic leukemia; AML, acute myeloid leukemia; APL, acute promyelocytic leukemia; CLL, chronic lymphocytic leukemia; CML, chronic myeloid leukemia; INDELs, insertions and deletions; IQR, interquartile range; SNV, single‐nucleotide variants.

Characteristics of patients with leukemia Abbreviations: ALL, acute lymphoblastic leukemia; AML, acute myeloid leukemia; APL, acute promyelocytic leukemia; CLL, chronic lymphocytic leukemia; CML, chronic myeloid leukemia; INDELs, insertions and deletions; IQR, interquartile range; SNV, single‐nucleotide variants.

Spectrum of gene variants

The targeted capture deep sequencing of 504 tumor‐related genes in 109 leukemia samples revealed 4,655 SNVs and 614 insertions and deletions (INDELs) in 419 genes (Table S1). There were 208 genes with greater than 10 mutations, 78 genes with greater than 20 mutations. Top 50 mutated genes across the leukemia patients were described in Figure 1. Among them, the most commonly mutated gene was PDE4DIP (128), followed by NOTCH2 (59), FANCA (55), BCR (53), ROS1 (51), NACA (47), KDM5A (44), CLTCL1 (43), AKAP9 (43), MYH11 (42), PCM1 (42), NOTCH1 (41), COL1A1 (40), and so on. According to GEPIA database, PDE4DIP was down‐regulated in AML compared with normal samples (p < .01), and high expression of PDE4DIP was associated with poor AML prognosis (HR = 2.3, logrank p = .0027, Figure S1). STRING database was used to predict the potential interacting protein with PDE4DIP, as Figure S2 and Table S2. The results showed that PDE4DIP protein might interact with PDE4D, PRKAR2A, and AKAP9, which can bind to cAMP or protein kinase A (PKA), suggesting that PDE4DIP may participate in cAMP/PKA signaling pathway.
Figure 1

Landscape of mutations in 109 leukemia patients (top 50 mutated genes). The number of SNVs and INDELs for each gene is shown (left). The number of SNVs and INDELs for each patient is shown (top). Each vertical column represented an individual. Shaded bars indicate the number of mutations in genes. INDELs, insertions and deletions; SNVs, single‐nucleotide variants

Landscape of mutations in 109 leukemia patients (top 50 mutated genes). The number of SNVs and INDELs for each gene is shown (left). The number of SNVs and INDELs for each patient is shown (top). Each vertical column represented an individual. Shaded bars indicate the number of mutations in genes. INDELs, insertions and deletions; SNVs, single‐nucleotide variants We analyzed single‐nucleotide changes of these detected SNVs and found that transitions as C/G>T/A and A/T>G/C were more prevalent than transversions including C/G>G/C, C/G>A/T, A/T>C/G, and A>T/T>A in all leukemia patients, as shown in Figure 2a. In addition, we also analyzed the regions of these SNVs. Among these variants (Figure 2b), exonic variants (54.55%) were the most frequent, followed in order by intronic variants (34.92%), splicing variants (6.14%), and UTR variants (4.39%). Additionally, we detected the genetic effects of these variants in exonic region, with missense variants, synonymous variants, stopgain/loss variants, and unknown variants.
Figure 2

Patterns of somatic SNVs by targeted next‐generation sequencing. (a) The percentages of distinct transitions and transversions of SNVs. (b) Proportions of SNVs types according to their regions in the gene. SNVs, single‐nucleotide variants

Patterns of somatic SNVs by targeted next‐generation sequencing. (a) The percentages of distinct transitions and transversions of SNVs. (b) Proportions of SNVs types according to their regions in the gene. SNVs, single‐nucleotide variants There were 2,435 SNVs and 359 INDELs in AML, 1,870 SNVs and 309 INDELs in ALL, 2,570 SNVs and 388 INDELs in APL, 2,701 SNVs and 408 INDELs in CML, and 2,661 SNVs and 393 INDELs in CLL. We then compared the frequency of mutations and INDELs in five subgroups of leukemia. Mutations or INDELs in 50 genes (PDE4DIP, NOTCH2, FANCA, BCR, CLTCL1, MYH11, NOTCH1, COL1A1, CAMTA1, ALK, NIN, CIITA, CARD11, RECQL4, USP6, NF1, NUP214, KIAA1549, MYH9, FLT3, NSD1, PCSK7, XPC, PMS2, SETBP1, SRGAP3, ASXL1, PDGFRA, ATIC, RALGDS, PRDM16, TSHR, FGFR2, EGFR, CDH11, DNM2, SYK, MUTYH, BCOR, FNBP1, RAC1, IL7R, CNOT3, RNF43, TP53, KCNJ5, MEN1, SH2B3, SRSF2, CREB3L1) were shared across all patients with leukemia. The different pathological classification of leukemia might have distinct patterns of variants and INDELs. Table S3 showed the top 50 significantly different genes. In ALL, the mutation frequencies of XPA, CDK4, and BCL11B were higher than other subgroups, but the frequencies of SMARCE1 and FEV were lower. The high prevalence of GNAQ, ELF4, HOXD13, and COX6C mutations in patients with APL were noteworthy, SMARCE1 variant predominated in patients with CML, variant of PER1 and ETV6 had higher frequencies in patients with AML. The variants of PIK3CA, CHCHD7, FEV, and PHF6 variants were significantly enriched in CLL. These data suggested a strong functional role of these genes in the different types of leukemia.

The filtrate pathogenic and likely pathogenic genes

Based on ACMG guideline, we identified 39 pathogenic and likely pathogenic mutations in 27 genes including SNVs or INDELs in exonic and splicing regions among 32 leukemia patients (Figure 3 and Table 2). In AML, 21 pathogenic or likely pathogenic germline mutations in 17 genes were identified, of which 10 mutant genes only in AML patients. ALL patients had six pathogenic or likely pathogenic germline mutations, especially PBRM1 (c.2819_2829del, p.L940fs) and SUZ12 (c.1716_1717insG, p.L572fs). There were six pathogenic or likely pathogenic germline mutations among APL patients, of which PAX8 (c.G201T, p.E67D) only in APL patients. Two pathogenic mutations in ALDH2 (rs540073928, p.A175D) and FBXW7 (rs866987936, p.R361Q) were found in CLL patients. Among CML patients, seven pathogenic or likely pathogenic germline mutations were also detected, of which five mutant genes just in CML patients. These results hinted us that these mutations might play an important role in the pathogenesis of the different leukemia subgroup. The pathologic mutations for leukemia patients were shown in Table S4.
Figure 3

Overview of the distribution of the 27 pathogenic and likely pathogenic gene according to ACMG guidelines in five subgroup of leukemia. ACMG, American College of Medical Genetics and Genomics

Table 2

Summary of ACMG Likely Pathogenic mutations

GeneLeukemiaPriorityChr: PositionSNV/IndelREFALTRegionTranscript IDExonic FunctionNucleotide changeAA change
ALDH2AML, CLLH12:112228350rs540073928CAexonicNM_001204889missense SNVc.C524Ap.A175D
ASXL1AMLH20:31021634.CCAexonicNM_015338frameshift insertionc.1633_1634insAp.R545fs
ATICAMLH2:216182882rs575560797ATexonicNM_004044missense SNVc.A149Tp.D50V
CANT1AMLH17:76993297.CACexonicNM_001159772frameshift deletionc.407delTp.L136fs
CBLAMLH11:119155730.CCCGCGCTTTCTTexonicNM_005188frameshift insertionc.1483_1484insCGCGCTTTCTTp.P495fs
CBLCMLH11:119148892rs387906666AGexonicNM_005188missense SNVc.A1112Gp.Y371C
CEBPAAMLH19:33793130.ATAexonicNM_00128743frameshift deletionc.295delAp.I99fs
CEBPAAMLH19:33793153.GCAGATGCCGCCGexonicNM_001287435frameshift deletionc.262_272delp.G88fs
CEBPAAPLH19:33793092.AACTexonicNM_001287435frameshift insertionc.333_334insAGp.F112fs
FANCGCMLH9:35077398rs376732298TTGsplicingNM_004629.
FBXW7ALLH4:153249385rs867384286GAexonicNM_018315missense SNVc.C1039Tp.R347C
FBXW7CLLH4:153247366rs866987936CTexonicNM_018315missense SNVc.G1082Ap.R361Q
FLT3AML, APLH13:28592642rs121913488CAexonicNM_004119missense SNVc.G2503Tp.D835Y
FLT3AMLH13:28592640rs121913487ACexonicNM_004119missense SNVc.T2505Gp.D835E
FOXP1CMLH3:71037162.TGTexonicNM_001244810frameshift deletionc.828delCp.P276fs
IDH1AMLH2:209113113rs121913499GCexonicNM_001282387missense SNVc.C394Gp.R132G
KIAA1549AMLH7:138602332.TGATexonicNM_020910frameshift deletionc.2038_2039delp.S680fs
KRASAMLH12:25398281rs112445441CTexonicNM_004985missense SNVc.G38Ap.G13D
MYH9AMLH22:36715582rs372016779TCsplicingNM_002473.
NFIBCMLH9:14398522.CAsplicingNM_001190738.
NRASAMLH1:115258744rs121434596CAexonicNM_002524missense SNVc.G38Tp.G13V
NRASAMLH1:115256529rs11554290TGexonicNM_002524missense SNVc.A182Cp.Q61P
NRASALLH1:115258747rs121913237CTexonicNM_002524missense SNVc.G35Ap.G12D
PAX5AMLH9:36966685.AGCGAGTGAexonicNM_001280552frameshift deletionc.310_316delp.H104fs
PAX8APLH2:114002192.CAexonicNM_013953missense SNVc.G201Tp.E67D
PBRM1ALLH3:52623221.GTAAGCCTGAGAGexonicNM_018313frameshift deletionc.2819_2829delp.L940fs
RUNX1CMLH21:36171607.GAexonicNM_001001890stopgainc.C877Tp.R293X
SETBP1CMLH18:42531913rs267607040GAexonicNM_015559missense SNVc.G2608Ap.G870S
SETD2AMLH3:47155452.CCCGGTCCAAexonicNM_014159frameshift insertionc.4628_4629insTTGGACCGp.R1543fs
SMOAMLH7:128846423.TCexonicNM_005631missense SNVc.T1259Cp.I420T
SMOALLH7:128848655.GCexonicNM_005631missense SNVc.G1320Cp.K440N
SUZ12ALLH17:30322703.AAGexonicNM_015355frameshift insertionc.1716_1717insGp.L572fs
TFRCALL, APLH3:195791279rs184956956CTexonicNM_003234missense SNVc.G1219Ap.A407T
TFRCCMLH3:195785460rs772017482TCexonicNM_003234missense SNVc.A1580Gp.N527S
TP53AMLH17:7578406rs28934578CTexonicNM_001126115missense SNVc.G128Ap.R43H
WT1APLH11:32413557.GTexonicNM_024426missense SNVc.C1342Ap.H448N
WT1APLH11:32417920.GGAGTCGGGGCTACTCCAGGCexonicNM_024426frameshift insertionc.1080_1081insGCCTGGAGTAGCCCCGACTp.L361fs
WT1AMLH11:32417909.CCGACAexonicNM_024426frameshift insertionc.1091_1092insTGTCp.S364fs
WT1AMLH11:32417942.AAGexonicNM_024426frameshift insertionc.1058_1059insCp.R353fs

Abbreviations: ACMG, American College of Medical Genetics and Genomics; ALL, acute lymphoblastic leukemia; ALT, alter; AML, acute myeloid leukemia; APL, acute promyelocytic leukemia; CLL, chronic lymphocytic leukemia; CML, chronic myeloid leukemia; INDELs, insertions and deletions; REF, reference; SNV, single nucleotide variants.

Overview of the distribution of the 27 pathogenic and likely pathogenic gene according to ACMG guidelines in five subgroup of leukemia. ACMG, American College of Medical Genetics and Genomics Summary of ACMG Likely Pathogenic mutations Abbreviations: ACMG, American College of Medical Genetics and Genomics; ALL, acute lymphoblastic leukemia; ALT, alter; AML, acute myeloid leukemia; APL, acute promyelocytic leukemia; CLL, chronic lymphocytic leukemia; CML, chronic myeloid leukemia; INDELs, insertions and deletions; REF, reference; SNV, single nucleotide variants.

GO annotation for pathogenic and likely pathogenic genes

Gene ontology annotation and pathway analyses were conducted for 27 pathogenic and likely pathogenic genes. The possible BP of these overlapping genes were related to the gland development, leukocyte differentiation, respiratory system development, myeloid leukocyte differentiation, mesenchymal to epithelial transition, lung development, skeletal system development, positive regulation of mesonephros development, respiratory tube development, vasculogenesis, and so on (Figure 4a). The results of GO‐CC annotation suggested that these genes were involved in CC including RNA polymerase II transcription factor complex, nuclear transcription factor complex, tertiary granule, PcG protein complex, tertiary granule lumen, caveola, transcription factor complex, membrane raft, membrane microdomain, and plasma membrane raft (Figure 4b). Moreover, the MF of these pathogenic and likely pathogenic genes were mainly correlated with the DNA‐binding transcription activator activity, RNA polymerase II‐specific, RNA polymerase II proximal promoter sequence‐specific DNA binding, proximal promoter sequence‐specific DNA binding, protein self‐association, cadherin binding, nuclear hormone receptor binding, histone‐lysine N‐methyltransferase activity, protein phosphorylated amino acid binding, hormone receptor binding, promoter‐specific chromatin binding (Figure 4c).
Figure 4

Top 10 enrichment scores in GO enrichment analysis for 27 pathogenic and likely pathogenic genes. (a) Biological process of pathogenic and likely pathogenic genes; (b) Cellular component of pathogenic and likely pathogenic genes; (c) Molecular function of pathogenic and likely pathogenic genes. GO, gene ontology

Top 10 enrichment scores in GO enrichment analysis for 27 pathogenic and likely pathogenic genes. (a) Biological process of pathogenic and likely pathogenic genes; (b) Cellular component of pathogenic and likely pathogenic genes; (c) Molecular function of pathogenic and likely pathogenic genes. GO, gene ontology

Pathway analysis and prediction of PPI

Further enrichment analysis based on the KEGG database showed that these pathogenic and likely pathogenic genes were highly enriched in leukemia and cancer‐related pathways, as shown in Table 3. The pathways included (a) AML and CML; (b) cancer‐related pathways, such as transcriptional misregulation, central carbon metabolism, and proteoglycans; (c) various cancers including thyroid cancer, bladder cancer, and endometrial cancer.
Table 3

Annotation of pathways for pathogenic and likely pathogenic genes

IDDescriptionGene RatioBg RatioP valueQ valueGene IDCount
hsa05221Acute myeloid leukemia5/2157/70105.48 × 10−7 2.83 × 10−5 CEBPA/FLT3/KRAS/NRAS/RUNX15
hsa05202Transcriptional misregulation in cancer7/21180/70105.62 × 10−7 2.83 × 10−5 CEBPA/FLT3/PAX5/PAX8/RUNX1/TP53/WT17
hsa05200Pathways in cancer9/21397/70108.76 × 10−7 2.83 × 10−5 CBL/CEBPA/FLT3/KRAS/NRAS/PAX8/RUNX1/SMO/TP539
hsa05230Central carbon metabolism in cancer5/2167/70101.24 × 10−6 2.83 × 10−5 FLT3/IDH1/KRAS/NRAS/TP535
hsa05216Thyroid cancer4/2129/70101.35 × 10−6 2.83 × 10−5 KRAS/NRAS/PAX8/TP534
hsa05220Chronic myeloid leukemia5/2173/70101.91 × 10−6 3.33 × 10−5 CBL/KRAS/NRAS/RUNX1/TP535
hsa05219Bladder cancer3/2141/70100.000230.003444KRAS/NRAS/TP533
hsa05205Proteoglycans in cancer5/21203/70100.000270.003546CBL/KRAS/NRAS/SMO/TP535
hsa05213Endometrial cancer3/2152/70100.0004660.005437KRAS/NRAS/TP533
hsa05223Non‐small cell lung cancer3/2156/70100.000580.00609KRAS/NRAS/TP533
hsa05214Glioma3/2165/70100.0008980.008575KRAS/NRAS/TP533
hsa05218Melanoma3/2171/70100.0011620.010167KRAS/NRAS/TP533
hsa04012ErbB signaling pathway3/2187/70100.0020890.016725CBL/KRAS/NRAS3
hsa05215Prostate cancer3/2189/70100.002230.016725KRAS/NRAS/TP533
hsa04211Longevity regulating pathway—mammal3/2194/70100.0026070.018248KRAS/NRAS/TP533
hsa04660T cell receptor signaling pathway3/21104/70100.0034740.022799CBL/KRAS/NRAS3
hsa04919Thyroid hormone signaling pathway3/21118/70100.0049570.028711KRAS/NRAS/TP533
hsa04722Neurotrophin signaling pathway3/21120/70100.0051950.028711KRAS/NRAS/TP533
hsa04071Sphingolipid signaling pathway3/21120/70100.0051950.028711KRAS/NRAS/TP533
hsa05160Hepatitis C3/21133/70100.0069160.036307KRAS/NRAS/TP533
hsa04530Tight junction3/21139/70100.0078120.036373KRAS/MYH9/NRAS3
hsa04910Insulin signaling pathway3/21139/70100.0078120.036373CBL/KRAS/NRAS3
hsa04210Apoptosis3/21140/70100.0079670.036373KRAS/NRAS/TP533
hsa05161Hepatitis B3/21146/70100.0089410.039115KRAS/NRAS/TP533
Annotation of pathways for pathogenic and likely pathogenic genes STRING database was used to construct the PPI network for 27 pathogenic and likely pathogenic genes. String map displayed these genes containing 27 nodes and 88 edges, in which nodes representing proteins and edges depicting associated interactions (Figure 5). PPI analysis showed that CEBPA, FLT3, PAX5, PAX8, RUNX1, TP53, and WT1 genes located in network hub appeared in Transcriptional misregulation in cancer.
Figure 5

PPI network for 27 pathogenic and likely pathogenic genes. PPI, protein–protein interaction

PPI network for 27 pathogenic and likely pathogenic genes. PPI, protein–protein interaction

DISCUSSION

In this study, we performed a targeted capture deep sequencing of 504 tumor‐related genes in 109 leukemia samples. We identified a total of 4,655 SNVs and 614 INDELs in 419 genes, in which PDE4DIP, NOTCH2, FANCA, BCR, ROS1, NACA, KDM5A, CLTCL1, AKAP9, MYH11, PCM1, NOTCH1, COL1A1 had more than 40 mutations. We compared the frequency of mutations and INDELs in five subgroup of leukemia (ALL, APL, AML, CLL, and CML). Moreover, ACMG pathogenic analysis identified 27 pathogenic and likely pathogenic genes. GO enrichment analysis, pathway analysis, and PPI network were performed on the pathogenic and likely pathogenic genes. Our results might provide some molecular data on mutations in leukemia to map the genetic variations of Chinese patients with leukemia. Mutational analysis was used to map the genetic variants in leukemia, and found that the highest mutations occurred in PDE4DIP, followed by NOTCH2, FANCA, BCR, and ROS1 in almost all leukemia patients. Phosphodiesterase 4D interacting protein (PDE4DIP) anchored PDE4D at the centrosome‐Golgi cell region, which was involved in signal transduction and hydrolyze cGMP and cAMP to energize several reactions in the cell, including related to immune cell activation, hormone secretion, smooth vascular muscle action, and platelet aggregation (Shapshak, 2012). The protein is found to interact with a phosphodiesterase superfamily protein member (Vinayagam et al., 2011). The PDE4DIP mutations have also been previously identified in various cancers including lung cancer, medullary thyroid cancer, and ovarian cancer (Chang et al., 2018; Er et al., 2016; Y. Li et al., 2018), but not been reported in leukemia previously. Our study first reported that PDE4DIP mutations (128) in leukemia patients, suggesting PDE4DIP would be involved in pathogenesis of leukemia. Moreover, we found that PDE4DIP was downregulated in AML base on TCGA database, and the high expression was associated with poor AML prognosis. These suggested that PDE4DIP might be a tumor suppressor gene in leukemia. Results of STRING database showed that PDE4DIP protein might interact with PDE4D, PRKAR2A, and AKAP9, which can bind to cAMP or PKA, suggesting that PDE4DIP may participate in cAMP/PKA signaling pathway. In addition, PDE4DIP, or myomegalin, is a dual‐specificity AKAP known to colocalize with AKAP9 and PKA at the centrosome, which could play important roles in the localization and function of the AKAP/PKA complex in microtubule dynamics (Schmoker et al., 2018). The cAMP‐dependent PKA signaling pathway was reported to involve in many fundamental cellular processes in leukemia, including migration and proliferation (Murray & Insel, 2013; Xu et al., 2016). These studies hinted that PDE4DIP might play a role in leukemia by participating in the cAMP/PKA signaling pathway, but more convincing studies were needed to validate. Previously, NOTCH2, FANCA, BCR, and ROS1 have been described to play an important role in leukemia. For example, Notch2 controlled nonautonomous Wnt‐signaling in CLL CLL (Mangolini, Gotte, & Moore, 2018). FANCA dysfunction might promote cytogenetic instability in adult acute myelogenous leukemia (Lensch et al., 2003). BCRABL1 fusion genes were leukemogenic, causing CML or ALL (Baccarani et al., 2019). ROS1 revealed a central oncogenic role in CMML, which might represent a molecular target (Cilloni et al., 2013). In our study, we found NOTCH2, FANCA, BCR, and ROS1 were significantly mutated in 109 Chinese patients with leukemia. We used a methodology based on ACMG variant classification guidelines in 419 mutation genes, and identified 39 pathogenic and likely pathogenic mutations in 27 genes, which might be the cause of leukemia pathogenesis in these individuals. Particularly, there were 21 pathogenic or likely pathogenic gene in AML patients and 7 genes in CML patients, indicates that the pathogenesis of AML and CML might be more complicated. PBRM1 (c.2819_2829del, p.L940fs) and SUZ12 (c.1716_1717insG, p.L572fs) were only identified in ALL patients, ALDH2 (rs540073928, p.A175D) and FBXW7 (rs866987936, p.R361Q) only in CLL patients, and CANT1 (c.407delT, p.L136fs) and PAX8 (c.G201T, p.E67D) only in APL patients. These results suggested that the pathogenesis of different leukemia subgroups might be different. In addition, there were no studies of CANT1, KIAA1549, and NFIB on leukemia in previous literatures; therefore, the association between these genes and leukemia should be further investigated. Subsequently, GO and KEGG analysis were performed to further understand the role of pathogenic mutation genes. GO annotation showed that the BP including gland development, leukocyte differentiation, respiratory system development, myeloid leukocyte differentiation, mesenchymal to epithelial transition, and so on were involved, which might provide further insight into the occurrence and development of leukemia. Moreover, the enriched KEGG pathway was found to be involved in leukemia (hsa05221 and hsa05220) and cancer‐related pathways (hsa05200, hsa05202, and hsa05230), which was consistent with findings from other previous studies on the pathogenesis of leukemia (McClure et al., 2018; de Noronha, Mitne‐Neto, & Chauffaille, 2017). Seven key genes (CEBPA, FLT3, PAX5, PAX8, RUNX1, TP53, and WT1) were obtained from PPIs network, most of which were reported to play a critical role in carcinogenesis and tumor progression (Junk et al., 2019; Rhodes, Vallikkannu, & Jayalakshmi, 2017; Slattery, Herrick, & Mullany, 2017). Inevitably, this study has several drawbacks. First, this is a single‐center study, and a multi‐institutional large study will be necessary to verify the results. Second, due to insufficient data of leukemia patients, we could not evaluate the correlation of the mutations and clinical and prognostic data of leukemia. Third, the potential function and pathways were only predicted by bioinformatics and needed experimental verification.

CONCLUSION

Taken together, our study provided a map of gene mutations in Chinese patients with leukemia and enriched an understanding of the pathogenesis of leukemia. Of note, we are the first to demonstrate an association between PDE4DIP mutation and leukemia. Furthermore, we screened some pathogenic genes based on ACMG guidelines and performed GO analysis, pathway analysis, and PPI network. However, further investigations with larger cohorts and experimental research are warranted to further explore the potential mechanisms.

CONFLICTS OF INTEREST

The authors declare that there are no conflicts of interest.

AUTHOR CONTRIBUTIONS

The work presented here was carried out in collaboration between all authors. Hongxia Yao and Congming Wu carried out the molecular genetic studies and drafted the manuscript. Yueqing Chen, Li Guo, and Wenting Chen designed the methods and experiments, performed the statistical analyses and interpreted the results. Yanping Pan, Xiangjun Fu, and Guyun Wang collected clinical samples and information about patients. Hongxia Yao and Yipeng Ding conceived of the study, worked on associated data collection and their interpretation, participated in the design and coordination of the study, and funded the study. All authors read and approved the final manuscript. Fig S1 Click here for additional data file. Fig S2 Click here for additional data file. Table S1‐S4 Click here for additional data file.
  33 in total

1.  Next-generation sequencing for cancer diagnostics: a practical perspective.

Authors:  Cliff Meldrum; Maria A Doyle; Richard W Tothill
Journal:  Clin Biochem Rev       Date:  2011-11

2.  Feasibility of targeted next-generation sequencing of the TP53 and ATM genes in chronic lymphocytic leukemia.

Authors:  L Mansouri; L-A Sutton; V Ljungström; E F Sörqvist; R Gunnarsson; K E Smedby; G Juliusson; K Stamatopoulos; M Nilsson; R Rosenquist
Journal:  Leukemia       Date:  2013-10-31       Impact factor: 11.528

3.  Targeted next-generation sequencing for molecular diagnosis of endometriosis-associated ovarian cancer.

Authors:  Tze-Kiong Er; Yu-Fa Su; Chun-Chieh Wu; Chih-Chieh Chen; Jing Wang; Tsung-Hua Hsieh; Marta Herreros-Villanueva; Wan-Tzu Chen; Yi-Ting Chen; Ta-Chih Liu; Hung-Sheng Chen; Eing-Mei Tsai
Journal:  J Mol Med (Berl)       Date:  2016-02-27       Impact factor: 4.599

Review 4.  Clinical Significance of DNA Variants in Chronic Myeloid Neoplasms: A Report of the Association for Molecular Pathology.

Authors:  Rebecca F McClure; Mark D Ewalt; Jennifer Crow; Robyn L Temple-Smolkin; Mrudula Pullambhatla; Rachel Sargent; Annette S Kim
Journal:  J Mol Diagn       Date:  2018-08-20       Impact factor: 5.568

5.  Aberrant activation of ROS1 represents a new molecular defect in chronic myelomonocytic leukemia.

Authors:  Daniela Cilloni; Sonia Carturan; Enrico Bracco; Valentina Campia; Valentina Rosso; Davide Torti; Chiara Calabrese; Valentina Gaidano; Pimjai Niparuck; Alessandra Favole; Elisabetta Signorino; Ilaria Iacobucci; Annalisa Morano; Luciana De Luca; Pellegrino Musto; Francesco Frassoni; Giuseppe Saglio
Journal:  Leuk Res       Date:  2013-02-12       Impact factor: 3.156

6.  Cancer statistics in China, 2015.

Authors:  Wanqing Chen; Rongshou Zheng; Peter D Baade; Siwei Zhang; Hongmei Zeng; Freddie Bray; Ahmedin Jemal; Xue Qin Yu; Jie He
Journal:  CA Cancer J Clin       Date:  2016-01-25       Impact factor: 508.702

7.  Prognostic score including gene mutations in chronic myelomonocytic leukemia.

Authors:  Raphaël Itzykson; Olivier Kosmider; Aline Renneville; Véronique Gelsi-Boyer; Manja Meggendorfer; Margot Morabito; Céline Berthon; Lionel Adès; Pierre Fenaux; Odile Beyne-Rauzy; Norbert Vey; Thorsten Braun; Torsten Haferlach; François Dreyfus; Nicholas C P Cross; Claude Preudhomme; Olivier A Bernard; Michaela Fontenay; William Vainchenker; Susanne Schnittger; Daniel Birnbaum; Nathalie Droin; Eric Solary
Journal:  J Clin Oncol       Date:  2013-05-20       Impact factor: 44.544

8.  Recurrently Mutated Genes Differ between Leptomeningeal and Solid Lung Cancer Brain Metastases.

Authors:  Yingmei Li; Boxiang Liu; Ian David Connolly; Bina Wasunga Kakusa; Wenying Pan; Seema Nagpal; Stephen B Montgomery; Melanie Hayden Gephart
Journal:  J Thorac Oncol       Date:  2018-03-29       Impact factor: 15.609

9.  Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

Authors:  Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm
Journal:  Genet Med       Date:  2015-03-05       Impact factor: 8.822

10.  Mutation profiling of 19 candidate genes in acute myeloid leukemia suggests significance of DNMT3A mutations.

Authors:  Sang-Yong Shin; Seung-Tae Lee; Hee-Jin Kim; Eun Hae Cho; Jong-Won Kim; Silvia Park; Chul Won Jung; Sun-Hee Kim
Journal:  Oncotarget       Date:  2016-08-23
View more
  3 in total

Review 1.  Ontologies and Knowledge Graphs in Oncology Research.

Authors:  Marta Contreiras Silva; Patrícia Eugénio; Daniel Faria; Catia Pesquita
Journal:  Cancers (Basel)       Date:  2022-04-10       Impact factor: 6.575

Review 2.  cAMP Signaling in Cancer: A PKA-CREB and EPAC-Centric Approach.

Authors:  Muhammad Bilal Ahmed; Abdullah A A Alghamdi; Salman Ul Islam; Joon-Seok Lee; Young-Sup Lee
Journal:  Cells       Date:  2022-06-24       Impact factor: 7.666

3.  Spectrum of gene mutations identified by targeted next-generation sequencing in Chinese leukemia patients.

Authors:  Hongxia Yao; Congming Wu; Yueqing Chen; Li Guo; Wenting Chen; Yanping Pan; Xiangjun Fu; Guyun Wang; Yipeng Ding
Journal:  Mol Genet Genomic Med       Date:  2020-07-07       Impact factor: 2.183

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.