| Literature DB >> 24307375 |
Christopher M Watson1, Laura A Crinnion, Joanne E Morgan, Sally M Harrison, Christine P Diggle, Julian Adlard, Helen A Lindsay, Nick Camm, Ruth Charlton, Eamonn Sheridan, David T Bonthron, Graham R Taylor, Ian M Carr.
Abstract
Targeted hybridization enrichment prior to next-generation sequencing is a widespread method for characterizing sequence variation in a research setting, and is being adopted by diagnostic laboratories. However, the number of variants identified can overwhelm clinical laboratories with strict time constraints, the final interpretation of likely pathogenicity being a particular bottleneck. To address this, we have developed an approach in which, after automatic variant calling on a standard unix pipeline, subsequent variant filtering is performed interactively, using AgileExomeFilter and AgilePindelFilter (http://dna.leeds.ac.uk/agile), tools designed for clinical scientists with standard desktop computers. To demonstrate the method's diagnostic efficacy, we tested 128 patients using (1) a targeted capture of 36 cancer-predisposing genes or (2) whole-exome capture for diagnosis of the genetically heterogeneous disorder primary ciliary dyskinesia (PCD). In the cancer cohort, complete concordance with previous diagnostic data was achieved across 793 variant genotypes. A high yield (42%) was also achieved for exome-based PCD diagnosis, underscoring the scalability of our method. Simple adjustments to the variant filtering parameters further allowed the identification of a homozygous truncating mutation in a presumptive new PCD gene, DNAH8. These tools should allow diagnostic laboratories to expand their testing portfolios flexibly, using a standard set of reagents and techniques.Entities:
Keywords: exome sequencing; mutation detection; sequence analysis; software
Mesh:
Substances:
Year: 2014 PMID: 24307375 PMCID: PMC4285299 DOI: 10.1002/humu.22490
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Informatics pipeline for the analysis of targeted capture data. A: The high-throughput pipeline will typically be run centrally, with dissemination of the processed data to diagnostic end-users (B) for further filtering and interpretation.
A Summary of Known Deletion and Insertion Variants in the Phase 1 Hereditary Cancer Samples
| Gene | Transcript | Variant | Codon change | I/D | Number affected bases |
|---|---|---|---|---|---|
| NM_007294.3 | c.2475del | p.Asp825Glufs*21 | D | 1 | |
| NM_007294.3 | c.3005del | p.Asn1002Thrfs*22 | D | 1 | |
| NM_007294.3 | c.4806del | p.Gln1604Asnfs*2 | D | 1 | |
| NM_000249.3 | c.207+1del | D | 1 | ||
| NM_000249.3 | c.1489del | p.Arg497Glyfs*11 | D | 1 | |
| NM_000059.3 | c.3545_3546del | p.Phe1182* | D | 2 | |
| NM_000059.3 | c.6275_6276del | p.Leu2092Profs*7 | D | 2 | |
| NM_000059.3 | c.7673_7674del | p.Glu2558Valfs*7 | D | 2 | |
| NM_000251.1 | c.1226_1227del | p.Gln409Argfs*7 | D | 2 | |
| NM_000249.3 | c.1744_1746del | p.Leu582del | D | 3 | |
| NM_000251.1 | c.1786_1788del | p.Asn596del | D | 3 | |
| NM_001127510.2 | c.1875_1878del | p.Asn627Leufs*2 | D | 4 | |
| NM_000059.3 | c.6944_6947del | p.Ile2315Lysfs*12 | D | 4 | |
| NM_144997.5 | c.890_893del | p.Glu297Alafs*25 | D | 4 | |
| NM_000251.1 | c.1457_1460del | p.Asn486Thrfs*10 | D | 4 | |
| NM_000314.4 | c.956_959del | p.Thr319Lysfs*24 | D | 4 | |
| NM_001127510.2 | c.3183_3187del | p.Gln1062* | D | 5 | |
| NM_000251.1 | c.2502_2508del | p.Asn835Leufs*4 | D | 7 | |
| NM_000059.3 | c.8736_8744del | p.Asp2913_Ala2915del | D | 9 | |
| NM_000249.3 | c.197_207+20del | p.Thr66Lysfs*9 | D | 31 | |
| NM_007294.3 | c.1175_1214del | p.Leu392Glnfs*5 | D | 40 | |
| NM_144997.5 | c.347dup | p.Leu117Alafs*16 | I | 1 | |
| NM_000179.2 | c.4001+4_4001+8dup | I | 5 | ||
| NM_000059.3 | c.7762_7764delinsTT | p.Ile2588Phefs*60 | D/I | 3 |
I, insertion variant; D, deletion variant.
Figure 2The AgileExomeFilter user interface. Variants are displayed in rows, with the column headers outlining the annotation information for each variant. The lower panel allows selection of filtering criteria such as zygosity status, variant location, and allele frequency. Boxed numbers indicate the reduction in variant count following the selected filtering criteria (blue arrows), which lead to the identification of the pathogenic homozygous LRRC6 mutation (NM_012472.3) c.630del (p.Trp210Cysfs*12).
Figure 3A: The AgilePindelFilter user interface. Reads are shown with respect to the reference sequence for a 510-bp BRCA1 exon 22 deletion. The identified variants can be restricted to specific genes and their location within the gene. Quality parameters can be adjusted to display variants that meet a required number of reads. B: The variant shown in panel A is visible in the IGV browser by the drop in coverage and the substantial number of soft clipped reads. There is, however, no method by which the variant can be filtered out from the other sequence reads.
The Average Reduction in Variant Count for PCD Patients After Filtering Using AgileExomeFilter
| Filtering parameters | Mean | Range ( |
|---|---|---|
| Total variants | 33,143 | 31,929–34,075 |
| Retain if located in known PCD genes | 94 | 68–119 |
| Retain if exonic or splice site | 66 | 49–80 |
| Exclude if dbSNP minor allele frequency ≥0.10 | 15 | 7–21 |
| Exclude if nonsynonymous variant | 10 | 5–15 |
| Retain if biallelic or homozygous | 7 | 3–12 |
Splice site is defined as 10 bp flanking the exon.