| Literature DB >> 31525256 |
Claudio Toma1,2, Marcos Díaz-Gay3, Sebastià Franch-Expósito3, Coral Arnau-Collell3, Bronwyn Overs1,2, Jenifer Muñoz3, Laia Bonjoch3, Yasmin Soares de Lima3, Teresa Ocaña3, Miriam Cuatrecasas4, Antoni Castells3, Luis Bujanda5, Francesc Balaguer3, Joaquín Cubiella6, Trinidad Caldés7, Janice M Fullerton1,2, Sergi Castellví-Bel3.
Abstract
Colorectal cancer (CRC) is a complex disorder for which the majority of the underlying germline predisposition factors remain still unidentified. Here, we combined whole-exome sequencing (WES) and linkage analysis in families with multiple relatives affected by CRC to identify candidate genes harboring rare variants with potential high-penetrance effects. Forty-seven affected subjects from 18 extended CRC families underwent WES. Genome-wide linkage analysis was performed under linear and exponential models. Suggestive linkage peaks were identified on chromosomes 1q22-q24.2 (maxSNP = rs2134095; LODlinear = 2.38, LODexp = 2.196), 7q31.2-q34 (maxSNP = rs6953296; LODlinear = 2.197, LODexp = 2.149) and 10q21.2-q23.1 (maxSNP = rs1904589; LODlinear = 1.445, LODexp = 2.195). These linkage signals were replicated in 10 independent sets of random markers from each of these regions. To assess the contribution of rare variants predicted to be pathogenic, we performed a family-based segregation test with 89 rare variants predicted to be deleterious from 78 genes under the linkage intervals. This analysis showed significant segregation of rare variants with CRC in 18 genes (weighted p-value > 0.0028). Protein network analysis and functional evaluation were used to suggest a plausible candidate gene for germline CRC predisposition. Etiologic rare variants implicated in cancer germline predisposition may be identified by combining traditional linkage with WES data. This approach can be used with already available NGS data from families with several sequenced members to further identify candidate genes involved germline predisposition to disease. This approach resulted in one candidate gene associated with increased risk of CRC but needs evidence from further studies.Entities:
Keywords: colorectal cancer; genetic predisposition to disease; linkage analysis; whole-exome sequencing
Year: 2019 PMID: 31525256 PMCID: PMC7004061 DOI: 10.1002/ijc.32683
Source DB: PubMed Journal: Int J Cancer ISSN: 0020-7136 Impact factor: 7.396
Figure 1Pedigree structures of the 18 CRC multiplex and extended families examined in our study. Males are indicated with squares, females with circles and diagnosis are shown by dark shading (full, patients diagnosed with colorectal cancer, CRC, left quarter, patients diagnosed with high‐risk adenoma; half, patients diagnosed with any other type of cancer detailed in the figure legend; unshaded, unaffected individuals or unknown). Patients analyzed by whole‐exome sequencing are indicated by an asterisk, and all subjects with DNA available are underlined.
Figure 2Results of the genome‐wide linkage analysis. Nonparametric linkage analysis was performed under the linear (black line) and exponential (red line) models in 18 multiplex/extended CRC families. Each chromosome is represented in a separate plot, including the X chromosome. A linkage signal with LOD > 2 was observed at chromosomes 1q22–q24.2 with a maximum linear LOD score at marker rs10753668 of 2.11 (linear model), 7q31.2–q34 with a maximum linear LOD score at marker rs2075371 of 2.023 (linear model) and 10q21.2–q23.1 with a maximum linear LOD score at marker rs1904589 of 2.118 (exponential model). Additional markers were subsequently added to fine‐map these linkage peaks.
Figure 3Schematic of the linkage intervals and the gene content between the proximal and distal boundaries on chromosome 1q22–q24.2 (a), 7q31.2–q34 (b) and 10q21.2–q23.1 (c) after fine mapping with 12 additional SNPs. The maximum LOD score under linear and exponential models are shown at each locus. The locations of known protein‐coding genes in the linkage interval are provided in the images below which are generated using the UCSC genome browser (https://genome.ucsc.edu). Final candidate genes for CRC, after gene network analysis, colon gene expression evaluation and sequence quality are highlighted using a red box.
Family‐based association test of rare variants under the linkage peaks
| Gene | SNVs/Seg‐SNV | Family (patients with Seg‐SNV) |
| Weighted | Specific LOD at locus |
|---|---|---|---|---|---|
|
| 1/1 | CRC‐23 (3) | 1.28E−08 | 2.00E−07 | 0.49 |
|
| 1/1 | CRC‐20 (2) | 1.28E−08 | 2.00E−07 | 0.20 |
|
| 1/1 | CRC‐1 (2) | 2.56E−08 | 1.00E−07 | 0.29 |
|
| 1/1 | CRC‐11 (2) | 2.56E−08 | 4.00E−07 | 0.30 |
|
| 1/1 | CRC‐11 (2) | 2.56E−08 | 4.00E−07 | 0.30 |
|
| 1/1 | CRC‐9 (2) | 1.99E−06 | 5.00E−06 | 0.47 |
|
| 1/1 | CRC‐19 (3) | 3.94E−06 | 4.90E−05 | 0.14 |
|
| 1/1 | CRC‐9 (2) | 2.18E−05 | 2.60E−05 | 0.47 |
|
| 1/1 | CRC‐10 (2) | 3.16E−05 | 6.00E−05 | 0.27 |
|
| 1/1 | CRC‐23 (3) | 3.55E−05 | 3.90E−04 | 0.29 |
|
| 1/1 | CRC‐7 (2) | 4.74E−05 | 4.20E−04 | 0.29 |
|
| 1/1 | CRC‐5 (3) | 1.26E−04 | 4.50E−04 | −0.003 |
|
| 1/1 | CRC‐13 (3) | 1.77E−04 | 6.90E−04 | 0.14 |
|
| 1/1 | CRC‐4 (2) | 3.06E−04 | 3.08E−03 | 0.20 |
|
| 4/1 | CRC‐8 (3) | 2.97E−04 | 3.00E−03 | 0.59 |
|
| 1/1 | CRC‐11 (2) | 4.71E−04 | 4.74E−03 | 0.30 |
|
| 1/1 | CRC‐20 (2) | 7.52E−04 | 7.47E−03 | 0.19 |
|
| 2/1 | CRC‐10 (2) | 9.15E−04 | 2.88E−03 | 0.30 |
Results of 89 rare variants (SNVs) segregating with patients (Seg‐SNV) across the 18 CRC families (n = 47 patients), after simulations and weight corrections. Only significant genes (p‐value < 0.05) are reported.
Abbreviations: SNVs, number of SNVs regardless segregation in CRC patients; Seg‐SNV, number of SNVs segregating in all CRC patients in this family. Specific LOD, linkage contribution from this gene to a specific peak.
Candidate genes within regions with positive linkage on chromosomes 1, 7 and 10 after considering hereditary cancer networks
| Gene | Variant | Family | Chromosomal region | Colon gene expression (RPKM) | IGV | Gene function/OMIM |
|---|---|---|---|---|---|---|
|
| c.1718C>T (p.Ser573Leu) | CRC‐9 | 1q22–q24.2 | 52 | + | Muscular dystrophies |
|
| c.1921C>G (p.Pro641Ala) | CRC‐13 | 7q31.2–q34 | 5.4 | + | Familial or sporadic basal cell carcinoma/Curry–Jones syndrome |
|
| c.699G>A (Asp239Tyr) | CRC‐10 | 7q31.2–q34 | 4.6 | + | Neuronal development |
|
| c.4885A>C (p.Ile1629Leu) | CRC‐8 | 10q21.2–q23.1 | 0.6 | + | Deafness |
|
| c.590C>T (p.Thr197Ile) | CRC‐10 | 10q21.2–q23.1 | 101 | + | Cardiomyopathy |
Information about the identified genetic variant, the CRC family, gene expression level in colon, sequence quality, gene function and previous involvement in hereditary conditions are listed.
Abbreviations: CRC, colorectal cancer; IGV, integrative genomics viewer: + validated, − not validated; OMIM, online Mendelian inheritance in man, http://www.omim.org; RPKM, reads per kilobase per million mapped reads (from the Human Protein Atlas, GTEx dataset‐colon); Curry‐Jones syndrome, craniofacial malformations, polysyndactyly, abnormal skin and gut development.