| Literature DB >> 26092435 |
Ioannis Karageorgos1, Clint Mizzi2,3, Efstathia Giannopoulou4, Cristiana Pavlidis1, Brock A Peters5,6, Zoi Zagoriti1, Peter D Stenson7, Konstantinos Mitropoulos8, Joseph Borg9,10, Haralabos P Kalofonos4, Radoje Drmanac5,6, Andrew Stubbs3, Peter van der Spek3, David N Cooper7, Theodora Katsila1, George P Patrinos11,12.
Abstract
Cancer, like many common disorders, has a complex etiology, often with a strong genetic component and with multiple environmental factors contributing to susceptibility. A considerable number of genomic variants have been previously reported to be causative of, or associated with, an increased risk for various types of cancer. Here, we adopted a next-generation sequencing approach in 11 members of two families of Greek descent to identify all genomic variants with the potential to predispose family members to cancer. Cross-comparison with data from the Human Gene Mutation Database identified a total of 571 variants, from which 47 % were disease-associated polymorphisms, 26 % disease-associated polymorphisms with additional supporting functional evidence, 19 % functional polymorphisms with in vitro/laboratory or in vivo supporting evidence but no known disease association, 4 % putative disease-causing mutations but with some residual doubt as to their pathological significance, and 3 % disease-causing mutations. Subsequent analysis, focused on the latter variant class most likely to be involved in cancer predisposition, revealed two variants of prime interest, namely MSH2 c.2732T>A (p.L911R) and BRCA1 c.2955delC, the first of which is novel. KMT2D c.13895delC and c.1940C>A variants are additionally reported as incidental findings. The next-generation sequencing-based family genomics approach described herein has the potential to be applied to other types of complex genetic disorder in order to identify variants of potential pathological significance.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26092435 PMCID: PMC4499216 DOI: 10.1186/s40246-015-0034-2
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Fig. 1Pedigrees of the two families (a, b) of Greek descent whose genomes were analyzed
Fig. 2Outline of the concept of the family genomics approach implemented in this study
Fig. 3Summary of the genomic variants identified, grouped per variant classes in the HGMD database. HGMD variant classes were as follows: DP disease-associated polymorphisms, DFP disease-associated polymorphisms with additional supporting functional evidence, FP functional polymorphisms with in vitro/laboratory or in vivo supporting evidence but no known disease association DM disease-causing mutations, DM? putative disease-causing mutations but where there is some residual doubt as to pathological significance
The DM (disease-causing mutations) and DM? (putative disease-causing mutations but where there is some residual doubt as to pathological significance) mutations identified in each family
| Cancer type | HGMD tag | |
|---|---|---|
| Family A, annotated variants | ||
| dbsnp.129:rs56250729 | Prostate cancer | DM? |
| dbsnp.125:rs28756990 | Endometrial cancer | DM? |
| dbsnp.131:rs77067228 | Cancer | DM? |
| dbsnp.89:rs1799966 | Breast and/or ovarian cancer | DM/DM? |
| dbsnp.113:rs4986852 | Breast cancer | DM? |
| COSMIC:mut:148278;dbsnp.86:rs799917 | Breast and/or ovarian cancer | DM |
| dbsnp.89:rs1799950 | Breast cancer | DM |
| dbsnp.98:rs2229995 | Adenomatous polyposis coli | DM? |
| dbsnp.127:rs41545019 | Colorectal cancer | DM? |
| dbsnp.98:rs1805324 | Colorectal cancer, non-polyposis | DM? |
| dbsnp.129:rs61753720 | Acute lymphoblastic leukemia | DM |
| Family B, Annotated variants | ||
| dbsnp.126:rs33927012 | Medullary thyroid carcinoma | DM? |
| dbsnp.98:rs2229992 | Adenomatous polyposis coli | DM |
| dbsnp.76:rs41115 | Adenomatous polyposis coli | DM |
| dbsnp.79:rs169547 | Breast cancer | DM? |
| dbsnp.103:rs3092994 | Breast cancer | DM? |
| dbsnp.89:rs1799966 | Breast cancer | DM |
| dbsnp.86:rs1060915 | Breast cancer | DM |
| dbsnp.125:rs28897689 | Breast and/or ovarian cancer | DM |
| COSMIC:mut:148278;dbsnp.86:rs799917 | Breast and/or ovarian cancer | DM |
| dbsnp.60:rs16940 | Breast cancer | DM |
Fig. 4Variants identified by cancer type in all 11 genomes from the two families under study
Fig. 5Variants identified per family in all 11 genomes from the two families under study. The DM (disease-causing mutations) and DM? (putative disease-causing mutations but where there is some residual doubt as to pathological significance) mutations identified as well as our two variants of prime interest, MSH2 c.2732T>A and BRCA1 c.2955delC, are shown
SIFT PROVEAN analysis outcome of the two variants of prime interest identified in the 11 family members
| HUGO gene symbol | Chromosome | HGVS description of variant | HGMD tag | Aminoacid position | Reference residue | Alternative residue | Variation type | PROVEAN prediction | SIFT prediction | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | Prediction | Score | Prediction | ||||||||
|
| 2 | c.2732T>A | NA | 911 | L | Q | Single AA Change | −4.14 | Deleterious | 0.001 | Damaging |
|
| 17 | c.2955delC | DM | 986 | Frameshift | NA | NA | NA | NA | ||
Source: PROVEAN v1.1.3 (PROVEAN human genome variants tool, http://provean.jcvi.org/genome_submit_2.php?species=human).PROVEAN was developed in order to predict whether a given protein sequence variation (single or multiple amino acid substitutions, micro-insertions, micro-deletions) affects protein function. To achieve this, PROVEAN introduces a delta alignment score based on the reference and variant versions of a protein query sequence with respect to sequence homologs (NCBI NR protein database through BLAST, http://www.ncbi.nlm.nih.gov/). The default score threshold was set at −2.5 for binary classification (deleterious<−2.5 vs. neutral>−2.5). Similarly (through PSI-BLAST, http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE=Proteins&PROGRAM=blastp&RUN_PSIBLAST=on), SIFT (http://sift.jcvi.org/) may be applied to naturally occurring non-synonymous polymorphisms. SIFT score ranges from 0 to 1. A SIFT score of ≤0.05 corresponds to a “damaging” prediction, whereas a SIFT score >0.05 predicts that the variant is likely to be “tolerated”
NA not available (where the precomputed score was not available in the database, the precomputed homologous protein identifiers for the query protein were retrieved in order to bypass the BLAST search and clustering, and the score was computed based on the homologs)
CRAVAT functional analysis outcome of the two variants of prime interest identified in the 11 family members
| HUGO gene symbol | HGVS description of variant | HGMD tag | Sequence ontology | Driver genes | Target | Occurrences in COSMIC by primary sites (gene mutated) |
|---|---|---|---|---|---|---|
|
| c.2732T>A | NA | Missense variant | TSG | NA | Cervix(1), large intestine (106), autonomic ganglia (2), central nervous system (2), liver(1), small intestine (1), haematopoietic and lymphoid tissue (7), endometrium (18), urinary tract (1), lung (14), breast (7), skin (4), stomach (1), esophagus (1), ovary (4), NS (2), prostate (2), kidney (6), pancreas (1) |
|
| c.2955delC | DM | Frameshift deletion | TSG | PARP inhibitor | Cervix (1), large intestine (66), stomach (8), central nervous system (2), pancreas (1), meninges (1), haematopoietic and lymphoid tissue (3), endometrium (23), urinary tract (5), lung (42), liver (5), skin (6), oesophagus (4), ovary (39), NS (2); prostate (2), kidney (5), breast (33) |
Source: CRAVAT tool. In the case of MSH2, cancer driver analyses were also performed, selecting “colon” as the tissue type of interest. A driver score of 0.27 was obtained (a driver score close to zero means an increased probability of the mutation being a cancer driver)
TSG tumor suppressor gene, NA not available