| Literature DB >> 25496018 |
Dorin Manase1,2, Lisa C A D'Alessandro3,4, Ashok Kumar Manickaraj5,6, Saeed Al Turki7, Matthew E Hurles8, Seema Mital9,10.
Abstract
BACKGROUND: Given the growing use of whole-exome sequencing (WES) for clinical diagnostics of complex human disorders, we evaluated coverage of clinically relevant cardiac genes on WES and factors influencing uniformity and depth of coverage of exonic regions.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25496018 PMCID: PMC4272796 DOI: 10.1186/s12920-014-0067-8
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Clinically relevant cardiac gene list
|
|
| |
|---|---|---|
|
| ▪ Hypertrophic cardiomyopathy |
|
| ▪ Dilated cardiomyopathy | ||
| ▪ Catecholaminergic polymorphic ventricular tachycardia |
| |
| ▪ Arrhythmogenic right ventricular cardiomyopathy |
| |
| ▪ Romano–Ward long QT syndrome |
| |
| ▪ Brugada syndrome | ||
| ▪ Familial hypercholesterolemia |
| |
| ▪ Ehlers–Danlos syndrome, vascular type |
| |
| ▪ Marfan syndrome |
| |
| ▪ Loeys–Dietz syndromes | ||
| ▪ Familial thoracic aortic aneurysms and dissections | ||
|
| ▪ Genes associated with congenital heart disease |
|
Gene list includes American College of Medical Genetics list for reporting of incidental findings (n = 31) and genes associated with congenital heart disease (CHD) for which clinical testing is available (n = 19). All genes are listed according symbol.to their HUGO Gene Nomenclature Committee (HGNC).
Figure 1Agilent capture kit target size. Graphical representation of the proportion of bases captured by the Agilent 44MB_V2 (green), 50MB_V3 (purple), 51MB_V4 (orange), and 50MB_V5 (blue) capture kits. The grey wedges indicate the proportion not captured by the kits. A. Percentage of the total number of bases from CCDS genomic locations for all 50 clinically relevant cardiac genes. B. Percentage of the total number Known Gene genomic locations for all 50 clinically relevant cardiac genes.
Figure 2Cumulative coverage with Agilent capture kit versions. Graph showing proportion of CCDS and Known Gene datasets covered at various read depths by four Agilent capture kit versions. Coverage of CCDS coordinates is shown in blue while coverage of Known Gene coordinates is depicted in red. There was good coverage, 92-99% of CCDS target regions, at a minimum read depth of 3X but only 55-64% coverage of Known Gene target regions at 3X. A. Proportion of CCDS and Known Gene read depths for the Agilent 44MB_V2 capture kit. B. Proportion of CCDS and Known Gene read depths for the Agilent 50MB_V3 kit. This capture kit version demonstrated the highest overall observed read depth. C. Proportion of CCDS and Known Gene read depths for the Agilent 51MB_V4 kit. D. Proportion of CCDS and Known Gene read depths for the Agilent 50MB_V5 kit.
Figure 3Gene by gene coverage of cardiac genes. A. Box-and-whisker plots showing median read depth and 25% to 75% interquartile ranges (IQR) for the 50 cardiac genes in 94 exome samples captured using Agilent V3 that had high observed sequencing depth, 80X. Plots represent the read depth coverage for transcripts found within the CCDS. There was variability in depth of coverage between genes and variable coverage within genes as shown by the wide IQR. Based on the CCDS dataset, 10 of the 50 genes had median read depth below 30X and only 32 genes had an IQR above 30X. B. Box-and-whisker plots showing median read depth and 25% to 75% interquartile ranges (IQR) for the 50 cardiac genes in 20 exome samples captured using Agilent V2 that had the lowest observed sequencing depth, 30.5X. Based on the CCDS dataset, 30 of the 50 genes had median read depth below 30X and only 5 genes had an IQR above 30X.
Figure 4Inter-sample variability of observed read depths. Plot of the 95% confidence interval per captured CCDS base across samples for the Agilent 44MB_V2 (green) 50MB_V3 (purple), 51MB_V4 (orange), and 50MB_V5 (blue) capture kits. For V2, V3, and V4 kit versions, over 86% of captured bases varied less than ±5 reads between samples, with the largest variation coming at higher read depths. Given that V5 had only six exomes, the variability is more pronounced due to the small sample size, however, over 85% of captured bases varied by more than ±10 reads between samples. The largest variability was seen at higher read depths (>30X). The depth of sequencing and version of the capture kit did not affect inter-sample variability.
Figure 5GC content analysis. A. GC content (blue) and median coverage (red) for each of the 50 cardiac genes was plotted in decreasing order of from highest to lowest coverage. The linear average was plotted as the dark line in blue for GC content and red for median coverage. These linear averages show that as median coverage decreased from left to right, GC content increased. B. For each of the 50 genes, the median coverage and GC content percentage was plotted. There was a negative correlation between median coverage and total GC content of the 50 cardiac genes (r2 = 0.547; p = 8.66 × 10−10). C. Percent GC content of the top five well covered genes in green and the bottom five poorly covered genes in red showed a significant difference (p = 1.48 × 10−5).