| Literature DB >> 30871259 |
Marcos Díaz-Gay1, Sebastià Franch-Expósito2, Coral Arnau-Collell3, Solip Park4, Fran Supek5, Jenifer Muñoz6, Laia Bonjoch7, Anna Gratacós-Mulleras8, Paula A Sánchez-Rojas9, Clara Esteban-Jurado10, Teresa Ocaña11, Miriam Cuatrecasas12, Maria Vila-Casadesús13, Juan José Lozano14, Genis Parra15, Steve Laurie16, Sergi Beltran17, Antoni Castells18, Luis Bujanda19, Joaquín Cubiella20, Francesc Balaguer21, Sergi Castellví-Bel22.
Abstract
Colorectal cancer (CRC) shows aggregation in some families but no alterations in the known hereditary CRC genes. We aimed to identify new candidate genes which are potentially involved in germline predisposition to familial CRC. An integrated analysis of germline and tumor whole-exome sequencing data was performed in 18 unrelated CRC families. Deleterious single nucleotide variants (SNV), short insertions and deletions (indels), copy number variants (CNVs) and loss of heterozygosity (LOH) were assessed as candidates for first germline or second somatic hits. Candidate tumor suppressor genes were selected when alterations were detected in both germline and somatic DNA, fulfilling Knudson's two-hit hypothesis. Somatic mutational profiling and signature analysis were also performed. A series of germline-somatic variant pairs were detected. In all cases, the first hit was presented as a rare SNV/indel, whereas the second hit was either a different SNV (3 genes) or LOH affecting the same gene (141 genes). BRCA2, BLM, ERCC2, RECQL, REV3L and RIF1 were among the most promising candidate genes for germline CRC predisposition. The identification of new candidate genes involved in familial CRC could be achieved by our integrated analysis. Further functional studies and replication in additional cohorts are required to confirm the selected candidates.Entities:
Keywords: colorectal cancer; computational genomics; germline–tumor analysis; mutational signatures; predisposition to disease; whole-exome sequencing
Year: 2019 PMID: 30871259 PMCID: PMC6468873 DOI: 10.3390/cancers11030362
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Description of the genes carrying a potentially disruptive germline SNV (single nucleotide variant) and a different SNV in the matched-tumor sample.
| Gene | Family | RefSeq Transcript | Hit | Genetic Variant | Path. Tools | DAMpred | ExAC Freq. | Protein Domain | Protein Function |
|---|---|---|---|---|---|---|---|---|---|
|
| FAMN4 | NM_001115.2 | 1st | c.1747G>A | 5/6 | − | 21/60,697 | Adenylyl cyclase class-3/4/guanylyl cyclase domain | Biosynthesis of cAMP from ATP |
| 2nd | c.458C>T | 4/6 | + | 0/60,706 | Interaction with | ||||
|
| FAM23 | NM_005529.7 | 1st | c.3148G>A | 3/6 | − | 3/60,456 | Laminin IV type A domain | Component of vascular extracellular matrix, regulation of angiogenesis and cell growth |
| 2nd | c.7406C>T | 4/6 | − | 0/60,706 | Immunoglobulin-like C2-type domain |
Abbreviations: DAMpred, disease-associated mutation prediction, affects protein structure (+), no effect on protein structure (−); ExAC, Exome Aggregation Consortium; Freq., frequency; Path., pathogenicity, cAMP: cyclic AMP.
Candidate genes for germline colorectal cancer (CRC) predisposition selected after the two-hit prioritization strategy. In all cases, a first single nucleotide variant (SNV)/indel hit was present in the germline and a second loss of heterozygosity (LOH) hit was identified in the matched-tumor sample.
| Gene | Family | RefSeq Transcript | Genetic Variant | Path. Tools | DAMpred | ExAC Freq. | Protein Domain | Protein Function |
|---|---|---|---|---|---|---|---|---|
|
| FAM20 | NM_000059.3 | c.4963delT | FS | n.a. | 0/60,706 | - | Double-strand break repair via homologous recombination, inherited predisposition to breast and ovarian cancer |
|
| FAMN4 | NM_000057.4 | c.2069C>T | 6/6 | + | 1/60,570 | Helicase ATP-binding domain | DNA helicase, double-strand break repair via homologous recombination, regulation of cell cycle and apoptosis, DNA replication, telomere maintenance |
|
| H458 | NM_000400.3 | c.688G>A | 4/6 | − | 0/60,706 | Helicase ATP-binding domain | DNA helicase, transcription-coupled nucleotide excision repair, regulation of cell cycle |
|
| FAMN3 | NM_001447.2 | c.1643T>C | 5/6 | − | 0/60,706 | Cadherin domain | Regulation of cell proliferation, cell adhesion |
|
| H466 | NM_000876.3 | c.232G>A | 6/6 | + | 1/60,684 | - | Positive regulation of apoptosis |
|
| H460 | NM_014572.3 | c.337G>A | 5/6 | − | 1/56,138 | Ubiquitin-associated domain | Positive regulation of apoptosis, regulation of cell cycle |
|
| FAM20 | NM_005484.3 | c.910G>C | 3/6 | − | 3/60,208 | Poly(ADP-ribose) polymerase (PARP) alpha-helical domain | Base excision repair, extrinsic apoptotic signaling pathway |
|
| H469 | NM_002813.6 | c.361A>T | 3/6 | − | 30/60,148 | PDZ domain | Subunit of 26S proteasome, regulation of apoptosis and cell cycle, regulation of ubiquitin-protein ligase activity |
|
| H460 | NM_201431.2 | c.779C>T | 6/6 | − | 53/60,475 | Ras-associating domain | Positive regulation of apoptosis |
|
| H466 | NM_002907.4 | c.221_225delinsAATGT p.(Pro74_Trp75delinsGlnCys) | 6/6 | + | 0/60,706 | - | DNA helicase, double-strand break repair via homologous recombination, DNA replication |
|
| H466 | NM_024730.3 | c.362T>C | 6/6 | + | 54/60,446 | - | Unknown (closely related to |
|
| FAM3 | NM_002912.4 | c.559A>T | 5/6 | − | 0/60,706 | Exonuclease domain (family B of DNA polymerases) | DNA repair, translesion DNA synthesis |
|
| H460 | NM_018151.4 | c.4262G>A | 4/6 | + | 5/59,938 | - | Double-strand break repair via nonhomologous end joining, telomere maintenance |
|
| H470 | NM_032985.5 | c.531G>C | 4/6 | − | 1/60,706 | Sec23/Sec24 trunk domain | Intracellular protein transport, associated with inherited cancer predisposition Cowden Syndrome |
|
| FAM3 | NM_003072.3 | c.295C>T | 5/6 | − | 1/60,196 | - | Regulation of cell growth, regulation of cell cycle, chromatin remodeling |
|
| H470 | NM_052902.4 | c.1214C>T | 5/6 | − | 51/59,930 | - | Interaction with |
Abbreviations: DAMpred: disease-associated mutation prediction, affects protein structure (+), no effect on protein structure (−); n.a., not available; ExAC, Exome Aggregation Consortium; Freq., frequency; FS, frameshift; Path., pathogenicity.
Figure 1Somatic mutational profile analysis performed with the Mutational Signatures in Cancer (MuSiCa) tool in 16 germline–tumor paired samples. (a) Mutational prevalence (number of mutations per sequenced Mb). Hypermutated samples (≥90 mutations/Mb) are marked with an asterisk (*); (b) mutational signature refitting analysis showing the contributions of the 30 Catalogue of Somatic Mutations in Cancer (COSMIC) reference mutational signatures in the mutational catalogues of the samples of the study.
Figure 2Methodology schematic for variant identification, showing the software used in each analysis step for the different classes of genetic variation considered. WES, whole-exome sequencing; BWA-MEM, Burrows-Wheeler Alignment Tool; GATK, Genome Analysis Toolkit; SNV, single nucleotide variant; indels, insertion and deletion variants; CNV, copy number variant; LOH, loss of heterozygosity; MuTect2, somatic SNV and indel variants caller.