| Literature DB >> 23409019 |
Francisco Javier Gracia-Aznarez1, Victoria Fernandez, Guillermo Pita, Paolo Peterlongo, Orlando Dominguez, Miguel de la Hoya, Mercedes Duran, Ana Osorio, Leticia Moreno, Anna Gonzalez-Neira, Juan Manuel Rosa-Rosa, Olga Sinilnikova, Sylvie Mazoyer, John Hopper, Conchi Lazaro, Melissa Southey, Fabrice Odefrey, Siranoush Manoukian, Irene Catucci, Trinidad Caldes, Henry T Lynch, Florentine S M Hilbers, Christi J van Asperen, Hans F A Vasen, David Goldgar, Paolo Radice, Peter Devilee, Javier Benitez.
Abstract
The identification of the two most prevalent susceptibility genes in breast cancer, BRCA1 and BRCA2, was the beginning of a sustained effort to uncover new genes explaining the missing heritability in this disease. Today, additional high, moderate and low penetrance genes have been identified in breast cancer, such as P53, PTEN, STK11, PALB2 or ATM, globally accounting for around 35 percent of the familial cases. In the present study we used massively parallel sequencing to analyze 7 BRCA1/BRCA2 negative families, each having at least 6 affected women with breast cancer (between 6 and 10) diagnosed under the age of 60 across generations. After extensive filtering, Sanger sequencing validation and co-segregation studies, variants were prioritized through either control-population studies, including up to 750 healthy individuals, or case-control assays comprising approximately 5300 samples. As a result, a known moderate susceptibility indel variant (CHEK2 1100delC) and a catalogue of 11 rare variants presenting signs of association with breast cancer were identified. All the affected genes are involved in important cellular mechanisms like DNA repair, cell proliferation and survival or cell cycle regulation. This study highlights the need to investigate the role of rare variants in familial cancer development by means of novel high throughput analysis strategies optimized for genetically heterogeneous scenarios. Even considering the intrinsic limitations of exome resequencing studies, our findings support the hypothesis that the majority of non-BRCA1/BRCA2 breast cancer families might be explained by the action of moderate and/or low penetrance susceptibility alleles.Entities:
Mesh:
Year: 2013 PMID: 23409019 PMCID: PMC3568132 DOI: 10.1371/journal.pone.0055681
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Summary of the data analysis pipeline followed in the present study.
Raw sequencing data was screened for common artifacts prior to the alignment step in a first quality control phase (QC1). High quality (QC2) genome matches were analyzed for variants, in the form of departures from a consensus reference genome. Subsequently, variants were filtered by keeping those common to both members in each family and then discarding variants present in HapMap controls and dbSNP130. Further filtering by variant consequence, score and gene function (see material and methods for details) resulted in a list of 67 snps and 14 indel candidate variants.
SNP Filtering Summary.
| Sample | DetectedheterozygousSNPs | Commonwithin a family | Not present in 7 HapMap Controls | Not present in dbSNP130 | After variant consequencefilter | After score and genefunction filters |
| 07S240 (Pedigree 49) | 29549 | 29549 | 9451 | 3838 | 1000 | 67 |
| DAD_1 (Pedigree 694) | 28697 | 28697 | 9144 | 3666 | 899 | |
| F2887_13 (Pedigree2887) | 28978 | 9271 | 1681 | 404 | 156 | |
| F2887_24 (Pedigree2887) | 30691 | |||||
| F3311_5 (Pedigree3311) | 26774 | 8776 | 1686 | 328 | 100 | |
| F3311_43 (Pedigree3311) | 26468 | |||||
| I_1408 (Pedigree 531) | 28841 | 28841 | 9022 | 3553 | 971 | |
| RUL036_2 (Pedigree RUL036) | 27060 | 11405 | 2746 | 776 | 188 | |
| RUL036_7 (Pedigree RUL036) | 27032 | |||||
| RUL153_2 (Pedigree RUL153) | 26908 | 8295 | 1373 | 197 | 53 | |
| RUL153_3 (Pedigree RUL153) | 27406 | |||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Variant filtering representation through the number of SNPs remaining after the various filtering steps.
The Average and Percentage remaining rows represent the average number of variants and percentage of variants remaining per family.
Intronic, intergenic and synonymous variants were discarded. See methods.
Detailed criteria for these filters is reported on the methods section.
Number of variants shared between the two individuals in the family.
Total number of final variants for all the individuals in this study.
INDEL Filtering Summary.
| Sample | Detected INDELs | Common within a family | Not present in 7 HapMap Controls | Not present in dbSNP130 | After variant consequence filter | After score and gene function filters |
| 07S240 (Pedigree 49) | 36189 | 36189 | 26077 | 24580 | 11387 | 14 |
| DAD_1 (Pedigree 694) | 35606 | 35606 | 25741 | 24204 | 11046 | |
| F2887_13 (Pedigree2887) | 34081 | 12299 | 5314 | 4650 | 1354 | |
| F2887_24 (Pedigree2887) | 31445 | |||||
| F3311_5 (Pedigree3311) | 30983 | 12442 | 4982 | 4340 | 579 | |
| F3311_43 (Pedigree3311) | 26441 | |||||
| I_1408 (Pedigree 531) | 36131 | 36131 | 26082 | 24591 | 11352 | |
| RUL036_2 (Pedigree RUL036) | 25162 | 13042 | 5087 | 4398 | 587 | |
| RUL036_7 (Pedigree RUL036) | 25652 | |||||
| RUL153_2 (Pedigree RUL153) | 25698 | 11878 | 4445 | 3834 | 506 | |
| RUL153_3 (Pedigree RUL153) | 26045 | |||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Variant filtering representation through the number of INDELs remaining after the various filtering steps.
The Average and Percentage remaining rows represent the average number of variants and percentage of variants remaining per family.
Intronic and intergenic variants were discarded. See methods.
Detailed criteria for these filters is reported on the methods section.
Number of variants shared between the two individuals in the family.
Total number of final variants for all the individuals in this study.
Final Candidate Variants.
| Pedigree | Chr | Position | Consequence | Alleles Reference/Variant | Gene | Description | Allele frequency cases | Allele frequency controls | P value | OR | 95% CI |
|
| 14 | 44737671 | Stopgain SNV | C/T | FANCM | Fanconi anemia, complementation groupM [Source:HGNC Symbol;Acc:23168] | 0.00147 | 0.00077 | 0.13 | 2.29 | 0.71–8.54 |
| 5 | 137448086 | exonic; splicing | G/T | WNT8A | wingless-type MMTV integration site family, member 8A [Source:HGNC Symbol;Acc:12788] | 0 | 0 | N/A | N/A | N/A | |
| 9 | 127361802 | NS | G/A | MAPKAP1 | mitogen-activated protein kinase associated protein 1 [Source:HGNC Symbol;Acc:18752] | 0.004278 | 0.001966 | 0.03709 | 2.181 | 1.037–4.587 | |
| 9 | 116707958 | NS | C/T | TNFSF8 | tumor necrosis factor (ligand) superfamily, member 8provided by HGNC | 0 | 0 | N/A | N/A | N/A | |
|
| 1 | 43858431 | NS | C/A | PTPRF | protein tyrosine phosphatase, receptor type,F [Source:HGNC Symbol;Acc:9670] | 0.00186 | 0.0007877 | 0.1818 | 2.364 | 0.7409–7.542 |
| 3 | 69199787 | NS | C/T | UBA3 | ubiquitin-like modifier activating enzyme 3 [Source:HGNC Symbol;Acc:12470] | 0.000186 | 0 | 1 | N/A | N/A | |
|
| 16 | 278080 | UTR3 | G/A | AXIN1 | axin 1 [Source:HGNC Symbol;Acc:903] | 0.0007443 | 0.0001967 | 0.3758 | 3.786 | 0.423–33.89 |
| 22 | 31584009 | NS | G/A | TIMP3 | TIMP metallopeptidase inhibitor 3 [Source:HGNC Symbol;Acc:11822] | 0.0001859 | 0 | 1 | N/A | N/A | |
| 4 | 1665238 | exonic; splicing | G/A | SLBP | stem-loop binding protein [Source:HGNC Symbol;Acc:10904] | N/A | 0 | N/A | N/A | N/A | |
| 17 | 7789855 | NS | G/A | CNTROB | centrobin, centrosomal BRCA2 interacting protein [Source:HGNC Symbol;Acc:29616] | N/A | 0 | N/A | N/A | N/A | |
|
| 9 | 90806008 | NS | G/A | S1PR3 | sphingosine-1-phosphate receptor 3 [Source:HGNC Symbol;Acc:3167] | 0.000186 | 0 | 1 | N/A | N/A |
List of final candidate variants passing all filters.
Chromosome in which the variant was mapped.
Position according to the coordinate system (HG18).
Variant consequence: NS = non-synonymous variant UTR3 = 3′ untranslated region variant.
Fisher’s Exact Test P value.
Odds Ratio.
95% confidence interval for the Odds Ratio.
N/A = not available.