| Literature DB >> 29144510 |
Rashesh V Sanghvi1, Christian J Buhay1, Bradford C Powell2, Ellen A Tsai3,4, Michael O Dorschner5, Celine S Hong6, Matthew S Lebo3,7, Ariella Sasson8,9, David S Hanna5, Sean McGee10, Kevin M Bowling11, Gregory M Cooper11, David E Gray11, Robert J Lonigro12,13, Andrew Dunford14, Christine A Brennan12,13, Carrie Cibulskis14, Kimberly Walker1, Mauricio O Carneiro14, Joshua Sailsbery15, Lucia A Hindorff16, Dan R Robinson12,13, Avni Santani17,18, Mahdi Sarmady17,18, Heidi L Rehm3,7,14, Leslie G Biesecker6, Deborah A Nickerson10, Carolyn M Hutter16, Levi Garraway14,19,20,21, Donna M Muzny22, Nikhil Wagle23,24,25.
Abstract
PURPOSE: As massively parallel sequencing is increasingly being used for clinical decision making, it has become critical to understand parameters that affect sequencing quality and to establish methods for measuring and reporting clinical sequencing standards. In this report, we propose a definition for reduced coverage regions and describe a set of standards for variant calling in clinical sequencing applications.Entities:
Keywords: clinical sequencing; exome; genome; sequencing standards
Mesh:
Year: 2017 PMID: 29144510 PMCID: PMC6456263 DOI: 10.1038/gim.2017.192
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Overview of the sequencing approach used at each center. The sample sizes used for this analysis is 50 patients per center. As shown, eight of the ten centers used exome sequencing (ES) for their projects, while two centers used genome sequencing (GS).
| SITE A | SITE B | SITE C | SITE D | SITE E | SITE F | SITE G | SITE H | SITE I | SITE J | |
|---|---|---|---|---|---|---|---|---|---|---|
| ES | ES | ES | ES | ES | ES | ES | ES | GS | GS | |
| Blood | Blood | Blood | Blood | Blood | Blood | Blood | Blood | Blood | Blood | |
| HGSC VCRome2.1 custom exome chip | Agilent SureSelect Human All Exon V5 | Agilent SureSelect Human All Exon V4 | NimbleGen SeqCap EZ Exome v3.0 | Agilent SureSelect Human All Exon V2 | Agilent SureSelect Human All Exon V4 | Illumina TruSeq | NimbleGen SeqCap EZ Exome v3.0 | N/A | N/A | |
| Illumina | Illumina | Illumina | Illumina | Illumina | Illumina | Illumina | Illumina | Illumina | Illumina | |
| 2 × 101 bp | 2 × 101 bp | 2 × 101 bp | 2 × 101 bp | 2 × 101 bp | 2 × 125 bp | 2 × 101 bp | 2 × 101 bp | 2 × 101 bp | 2 × 101 bp | |
| 35.4 | 50.6 | 51.3 | 64 | 33 | 51 | 61.8 | 64 | N/A | NA | |
| 250X | 40X | 100X | 100X | 75X | 150X | 100X | 65X | 30X | 30x | |
| 1 microgram | 1 microgram | 6 micrograms | 1 microgram | 0.1 micrograms | 3 micrograms | 1 microgram | 2.5 micrograms | 1.1 micrograms | 1 microgram | |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.33 | N/A | N/A | |
| 0.33 | 0.25 | 0.25 | 1 | 0.3 | 0.4 | 0.33 | 0.25 | 3 | 3 | |
| BWA | BWA | NovoAlign | BWA | BWA | NovoAlign | Novoalign | BWA | BWA | BWA | |
| ATLAS | GATK | GATK | GATK | GATK | VarScan | MPG | GATK | GATK | GATK |
Figure 1.Reduced coverage regions in the GeneTests List.
(A) Comparison of reduced coverage bases among all centers. (B) Comparison of GeneTests exons affected by the reduced coverage bases among all centers. (C) Comparison of GeneTests exons affected by the reduced coverage bases among all centers. (D) Pairwise comparisons of reduced coverage regions between any two centers. Absolute values represent the reduced coverage bases common to two centers. Percentages represent the overlap in reduced coverage bases between two centers as compared to the union of reduced coverage bases at the two centers. High correlation existed between the two GS centers (I and J) and among the ES centers using same capture design. (E) Disease-associated ClinVar variants overlapping the reduced coverage bases in each center.
Figure 2.Reduced coverage bases in GeneTests.
(A) Comparison of reduced coverage regions among eight ES centers. (B) The percent of total bases, total whole exons, and total whole genes amongst the 4,656 GeneTests list successfully covered at one or more centers. To be included, every base in an exon or gene must have been a usable base (coverage ≥20X, mapping quality ≥20, base quality ≥20).
Genes containing reduced coverage bases found in all ten centers or all eight ES centers that overlap with known disease-associated variants in ClinVar or HGMD.
| Genes | Genes with at least one reduced coverage base common to all ten centers | Genes with at least one reduced coverage base common to all eight ES centers | Clinvar or HGMD variant overlapping with reduced coverage bases | Present in OMIM | OMIM phenotype |
|---|---|---|---|---|---|
| ✓ | ✓ | ✓ | ✓ | Spondyloepiphyseal dysplasia, Kimberley type, 608361 (3); Spondyloepimetaphyseal dysplasia, aggrecan type, 612813 (3); Osteochondritis dissecans, short stature, and early-onset osteoarthritis, 165800 (3) | |
| ✓ | ✓ | ✓ | ✓ | Epileptic encephalopathy, early infantile, 1, 308350 (3); Lissencephaly, X-linked 2, 300215 (3); Mental retardation, X-linked 29 and others, 300419 (3); Proud syndrome, 300004 (3); Partington syndrome, 309510 (3); Hydranencephaly with abnormal genitalia, 300215 (3) | |
| ✓ | ✓ | ✓ | ✓ | Spondyloepimetaphyseal dysplasia with joint laxity, type 1, with or without fractures, 271640 (3); Ehlers-Danlos syndrome, progeroid type, 2, 615349 (3) | |
| ✓ | ✓ | ✓ | ✓ | Heterotaxy, visceral, 2, autosomal, 605376 (3); Double-outlet right ventricle, 217095 (3); Transposition of the great arteries, dextro-looped 2, 613853 (3) | |
| ✓ | ✓ | No Reported Phenotype | |||
| ✓ | ✓ | ✓ | ✓ | CR1 deficiency (1); {?SLE susceptibility} (1); [Blood group, Knops system], 607486 (3); {Malaria, severe, resistance to}, 611162 (3) | |
| ✓ | ✓ | ✓ | ✓ | No Reported Phenotype | |
| ✓ | ✓ | ✓ | ✓ | Ellis-van Creveld syndrome, 225500 (3); Weyers acrodental dysostosis, 193530 (3) | |
| ✓ | ✓ | ✓ | ✓ | {Colorectal cancer, susceptibility to, 1}, 608812 (3) | |
| ✓ | ✓ | ✓ | ✓ | Thalassemia, alpha-, 604131 (3); Heinz body anemia, 140700 (3); Erythrocytosis (3); Hypochromic microcytic anemia (3); Hemoglobin H disease, nondeletional, 613978 (3) | |
| ✓ | ✓ | No Reported Phenotype | |||
| ✓ | ✓ | ✓ | ✓ | Incontinentia pigmenti, 308300 (3); Ectodermal dysplasia, hypohidrotic, with immune deficiency, 300291 (3); Ectodermal, dysplasia, anhidrotic, lymphedema and immunodeficiency, 300301 (3); Immunodeficiency, isolated, 300584 (3); Immunodeficiency 33, 300636 (3); Invasive pneumococcal disease, recurrent isolated, 2, 300640 (3) | |
| ✓ | ✓ | No Reported Phenotype | |||
| ✓ | ✓ | ✓ | ✓ | Osteoporosis-pseudoglioma syndrome, 259770 (3); [Bone mineral density variability 1], 601884 (3); Hyperostosis, endosteal, 144750 (3); van Buchem disease, type 2, 607636 (3); Osteosclerosis, 144750 (3); {Osteoporosis}, 166710 (3); Exudative vitreoretinopathy 4, 601813 (3); Osteopetrosis, autosomal dominant 1, 607634 (3) | |
| ✓ | ✓ | ✓ | ✓ | Currarino syndrome, 176450 (3) | |
| ✓ | ✓ | No Reported Phenotype | |||
| ✓ | ✓ | ✓ | ✓ | Colorblindness, protan, 303900 (3); Blue cone monochromacy, 303700 (3) | |
| ✓ | ✓ | ✓ | ✓ | Colorblindness, deutan, 303800 (3); Blue cone monochromacy, 303700 (3) | |
| ✓ | ✓ | ✓ | No Reported Phenotype | ||
| ✓ | ✓ | ✓ | ✓ | Polycystic kidney disease, adult type I, 173900 (3) | |
| ✓ | ✓ | ✓ | ✓ | Diamond-Blackfan anemia 4, 612527 (3) | |
| ✓ | ✓ | ✓ | ✓ | Muscular dystrophy, rigid spine, 1, 602771 (3); Myopathy, congenital, with fiber-type disproportion, 255310 (3) | |
| ✓ | ✓ | No Reported Phenotype | |||
| ✓ | ✓ | ✓ | ✓ | Short stature, idiopathic familial, 300582 (3); Leri-Weill dyschondrosteosis, 127300 (3); Langer mesomelic dysplasia, 249700 (3) | |
| ✓ | ✓ | ✓ | ✓ | Spinal muscular atrophy-1, 253300 (3); Spinal muscular atrophy-2, 253550 (3); Spinal muscular atrophy-3, 253400 (3); Spinal muscular atrophy-4, 271150 (3) | |
| ✓ | ✓ | ✓ | ✓ | {Spinal muscular atrophy, type III, modifier of}, 253400 (3) | |
| ✓ | ✓ | ✓ | ✓ | Deafness, autosomal recessive 16, 603720 (3) | |
| ✓ | ✓ | No Reported Phenotype |
Figure 3.Analysis of Reduced Coverage Regions common to all Centers
(A) Overall, there were 735 missing intervals totaling 66.4 Kbp in the intersection of exome and genome centers. Forty-two percent of all missing intervals had lengths that were 5 bp or shorter. The remainder of missing intervals lengths ranged widely and occured with less frequency, but they accounted for 65.7 Kbp or 97.7% of the total length of all missing intervals combined. (B) Of the >67K exons accounting for 4,656 genes in GeneTests, 533 had reduced coverage regions. These regions fell into two distinct groups—either a small part (<20%, with the vast majority less than 10%) of the entire exon had reduced coverage, or most of an exon (>90%) had reduced coverage. (C) Comparison of GC% distribution between the GeneTests baseline and the reduced coverage regions in all centers.