| Literature DB >> 26315111 |
Zhenli Li1,2, Dan Yu1,2, Meifu Gan3, Qiaonan Shan4, Xiaoyang Yin4, Shunli Tang4, Shuai Zhang1,2, Yongyong Shi5, Yimin Zhu6, Maode Lai1,2, Dandan Zhang1,2.
Abstract
Colorectal cancer (CRC) is a complex disease with an estimated heritability of approximately 35%. However, known CRC-related common single nucleotide polymorphisms (SNPs) can only explain ~0.65% of the heritability. This "missing heritability" may be explained partially by rare copy number variants (CNVs). In this study, we performed a genome-wide scan using Illumina Human-Omni Express BeadChip, 694 sporadic CRC cases and 1641 controls were eventually included in our analysis after quality control. The global burden analysis revealed a 1.53-fold excess of rare CNVs in CRC cases compared with controls (P < 1 × 10(-6)), and the difference being more pronounced for genic rare CNVs and CNVs overlapped with coding regions (1.65-fold and 1.84-fold, respectively, both P < 1 × 10(-6)). Interestingly, both the cases in the lowest and middle tertile of age carried a higher burden of rare CNVs comparing to the highest tertile. Furthermore, 639 CNV-disrupted genes exclusive to CRC cases were found to be significantly enriched in gene ontology (GO) terms concerning nucleosome assembly and olfactory receptor activity. Our study was the first to evaluate the burden of rare CNVs in sporadic CRC and suggested that rare CNVs contributed to the missing heritability of CRC.Entities:
Keywords: colorectal cancer; genome-wide scan; nucleosome assembly; rare CNVs
Mesh:
Substances:
Year: 2015 PMID: 26315111 PMCID: PMC4694911 DOI: 10.18632/oncotarget.4621
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Basic characteristics of the study subjects
| CRC cases | Controls | ||||
|---|---|---|---|---|---|
| All | Colon | Rectal | |||
| 694 | 336 | 340 | 1641 | ||
| 382/312 | 195/141 | 175/165 | 850/791 | 0.151 | |
| 62.4 ± 12.3 | 62.9 ± 12.8 | 62.2 ± 11.5 | 57.4 ± 11.6 | <0.001 | |
As 18 of the CRC cases diagnosed with both colon and rectal cancer, they were excluded in the following stratification analysis for tumor site.
The P value for gender was calculated by χ2 test between all the cases and controls, while the P value of the age between the two groups was derived from independent T test.
Figure 1Outline of CNV discovery and CNV analysis
A total of 1004 sporadic CRC cases and 1994 controls were genotyped using Illumina Human-Omni Express BeadChip. aQC for SNP array data, individuals with call rate <95% or outliers were removed. bQC for sample and CNV calls by PennCNV and QuantiSNP.
Global burden of rare CNVs between colorectal cases and controls
| Category | Controls ( | CRC ( | Fold Change | Colon ( | Fold Change | Rectal ( | Fold Change | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Total | 2275 | 1471 | 751 | 682 | ||||||
| Deletion | 1184 | 845 | 431 | |||||||
| Duplication | 1091 | 626 | 320 | 293 | ||||||
| Total | 1.39 | 2.12 | 1.53 | 2.24 | 1.61 | 2.01 | 1.45 | |||
| Deletion | 0.72 | 1.22 | 1.69 | 1.28 | 1.78 | 1.14 | 1.59 | |||
| Duplication | 0.66 | 0.90 | 1.36 | 0.95 | 1.43 | 0.86 | 1.30 | |||
| Total | 0.74 | 0.80 | 1.08 | 0.80 | 1.09 | 0.79 | 1.07 | |||
| Deletion | 0.49 | 0.58 | 1.17 | 0.59 | 1.19 | 0.56 | 1.14 | |||
| Duplication | 0.47 | 0.51 | 1.09 | 0.49 | 1.04 | 0.30 | 0.54 | 1.15 | ||
| Total | 234.70 | 258.40 | 1.10 | 0.53 | 251.40 | 1.07 | 0.23 | 265.50 | 1.13 | 0.10 |
| Deletion | 139.70 | 157.30 | 1.13 | 0.57 | 171.90 | 1.23 | 0.11 | 141.20 | 1.01 | 0.41 |
| Duplication | 221.90 | 224.50 | 1.01 | 0.58 | 206.40 | 0.93 | 0.74 | 243.10 | 1.10 | 0.19 |
Empirical p-values between cases and controls were calculated using 1000,000 permutations by PLINK, and all the P values were shown in bold if reached statistical significance (P < 0.05).
Fold change of CRC/colon/rectal cases vs controls.
Global burden of genic rare CNVs between colorectal cases and controls
| Category | Controls ( | CRC ( | Fold Change | Colon ( | Fold Change | Rectal ( | Fold Change | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Total | 1271 | 887 | 451 | 415 | ||||||
| Deletion | 576 | 449 | 231 | 207 | ||||||
| Duplication | 695 | 438 | 220 | 208 | ||||||
| Total | 0.77 | 1.28 | 1.65 | 1.34 | 1.73 | 1.22 | 1.58 | |||
| Deletion | 0.35 | 0.65 | 1.84 | 0.69 | 1.96 | 0.61 | 1.73 | |||
| Duplication | 0.42 | 0.63 | 1.49 | 0.65 | 1.55 | 0.61 | 1.44 | |||
| Total | 0.52 | 0.63 | 1.20 | 0.63 | 1.21 | 0.63 | 1.20 | |||
| Deletion | 0.28 | 0.38 | 1.35 | 0.40 | 1.45 | 0.35 | 1.26 | |||
| Duplication | 0.33 | 0.39 | 1.15 | 0.36 | 1.08 | 0.20 | 0.41 | 1.23 | ||
| Total | 202.30 | 206.60 | 1.02 | 0.40 | 178.30 | 0.88 | 0.83 | 236.70 | 1.17 | 0.11 |
| Deletion | 124.00 | 119.50 | 0.96 | 0.48 | 121.80 | 0.98 | 0.38 | 114.00 | 0.92 | 0.51 |
| Duplication | 213.40 | 220.50 | 1.03 | 0.36 | 175.50 | 0.82 | 0.93 | 263.20 | 1.23 | 0.05 |
Empirical p-values between cases and controls were calculated using 1000,000 permutations by PLINK, and all the P values were shown in bold if reached statistical significance (P < 0.05).
Fold change of CRC/colon/rectal cases vs controls.
Figure 2Genome-wide burden of rare non-genic CNVs and genic CNVs
Genome-wide frequency of rare genic CNVs and rare non-genic CNVs were calculated for controls, rectal cancer patients and colon cancer patients respectively. Rate (Y axis) represents the number of rare genic/non genic CNVs per individual.
Figure 3Genome-wide burden of rare CNVs overlapped with coding regions
Frequency (Y axis) of all rare CNVs, rare deletions and rare duplications overlapped with coding region were calculated separately. Each cluster consisted of three bars representing controls, rectal cancer patients and colon cancer patients.
Enriched GO functional terms of exclusively disrupted genes in CRC cases
| Category | Term | Count | % | Bonferroni | |
|---|---|---|---|---|---|
| GOTERM_CC | GO:0000786~nucleosome | 15 | 2.60 | 7.95E-09 | 2.80E-06 |
| GOTERM_CC | GO:0032993~protein-DNA complex | 17 | 2.94 | 1.05E-08 | 3.68E-06 |
| GOTERM_BP | GO:0031497~chromatin assembly | 16 | 2.77 | 6.79E-08 | 1.30E-04 |
| GOTERM_BP | GO:0006333~chromatin assembly or disassembly | 19 | 3.29 | 7.81E-08 | 1.50E-04 |
| GOTERM_BP | GO:0065004~protein-DNA complex assembly | 16 | 2.77 | 1.26E-07 | 2.42E-04 |
| GOTERM_BP | GO:0006334~nucleosome assembly | 15 | 2.60 | 2.89E-07 | 5.53E-04 |
| GOTERM_BP | GO:0006323~DNA packaging | 17 | 2.94 | 6.80E-07 | 1.30E-03 |
| GOTERM_BP | GO:0034728~nucleosome organization | 15 | 2.60 | 1.04E-06 | 2.00E-03 |
| GOTERM_CC | GO:0000785~chromatin | 22 | 3.81 | 1.60E-06 | 5.64E-04 |
| GOTERM_CC | GO:0005694~chromosome | 35 | 6.06 | 4.41E-06 | 1.55E-03 |
| GOTERM_BP | GO:0007608~sensory perception of smell | 32 | 5.54 | 1.36E-05 | 2.57E-02 |
| GOTERM_BP | GO:0034622~cellular macromolecular complex assembly | 26 | 4.50 | 2.17E-05 | 4.07E-02 |
| GOTERM_MF | GO:0004984~olfactory receptor activity | 31 | 5.36 | 3.59E-05 | 2.15E-02 |
| GOTERM_CC | GO:0044427~chromosomal part | 29 | 5.02 | 4.35E-05 | 1.52E-02 |
| GOTERM_CC | GO:0045095~keratin filament | 12 | 2.08 | 1.05E-04 | 3.64E-02 |
| GOTERM_CC | GO:0032993~protein-DNA complex | 16 | 4.78 | 6.16E-11 | 1.69E-08 |
| GOTERM_CC | GO:0000786~nucleosome | 14 | 4.18 | 1.32E-10 | 3.62E-08 |
| GOTERM_BP | GO:0006333~chromatin assembly or disassembly | 18 | 5.37 | 2.66E-10 | 4.02E-07 |
| GOTERM_BP | GO:0031497~chromatin assembly | 15 | 4.48 | 8.81E-10 | 1.33E-06 |
| GOTERM_CC | GO:0000785~chromatin | 21 | 6.27 | 1.25E-09 | 3.45E-07 |
| GOTERM_BP | GO:0065004~protein-DNA complex assembly | 15 | 4.48 | 1.63E-09 | 2.46E-06 |
| GOTERM_BP | GO:0006323~DNA packaging | 16 | 4.78 | 5.65E-09 | 8.54E-06 |
| GOTERM_BP | GO:0006334~nucleosome assembly | 14 | 4.18 | 5.85E-09 | 8.83E-06 |
| GOTERM_BP | GO:0034728~nucleosome organization | 14 | 4.18 | 2.08E-08 | 3.15E-05 |
| GOTERM_CC | GO:0044427~chromosomal part | 25 | 7.46 | 3.15E-07 | 8.67E-05 |
| GOTERM_CC | GO:0005694~chromosome | 27 | 8.06 | 6.24E-07 | 1.72E-04 |
| GOTERM_BP | GO:0034622~cellular macromolecular complex assembly | 22 | 6.57 | 7.35E-07 | 1.11E-03 |
| GOTERM_CC | GO:0045095~keratin filament | 12 | 3.58 | 7.91E-07 | 2.18E-04 |
| GOTERM_BP | GO:0034621~cellular macromolecular complex subunit organization | 22 | 6.57 | 4.64E-06 | 6.98E-03 |
| GOTERM_BP | GO:0065003~macromolecular complex assembly | 30 | 8.96 | 2.56E-05 | 3.79E-02 |
| GOTERM_CC | GO:0043228~non-membrane-bounded organelle | 74 | 22.09 | 1.58E-04 | 4.24E-02 |
| GOTERM_CC | GO:0043232~intracellular non-membrane-bounded organelle | 74 | 22.09 | 1.58E-04 | 4.24E-02 |
| none | |||||
BP, biological process; CC, cellular component; MF, molecular function.
Count, number of DAVID gene IDs identified in specific GO terms. Note that the number may be different with the number of Ensembl gene IDs as DAVID incorporates some functionally similar Ensembl gene IDs into one DAVID gene ID according to DAVID Knowledgebase.
%, (Count of involved genes / Total number of genes within a particular term) *100.
Figure 4Rate differences of rare CNVs among colorectal cases across different age groups
CRC cases were divided into three groups according to case age tertile (T1, T2, T3) (X axis). Rate (Y axis) represents the number of rare CNVs per sample. The P values between different age groups were calculated by PLINK.