| Literature DB >> 23687434 |
Samuel P Strom1, Michael B Gorin.
Abstract
BACKGROUND: Many genes have been reported as harboring autosomal dominant mutations causing retinal dystrophy. As newly available gene panel sequencing and whole exome sequencing will open these genes up to greater scrutiny, we assess the rate of rare coding variation in these genes among unaffected individuals to provide context for variants that will be discovered when clinical subjects are sequenced.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23687434 PMCID: PMC3654847
Source DB: PubMed Journal: Mol Vis ISSN: 1090-0535 Impact factor: 2.367
Coding length, coverage, and variant summaries for 36 autosomal dominant retinal dystrophy genes culled from the EVS6500 database.
| Gene | CDS | % Covered | Rare Variants | Private Variants | ||
|---|---|---|---|---|---|---|
| % | % | % | % | |||
| Length | ≥20x | Missense | Nonsense | Missense | Nonsense | |
| 1,152 | 100 | 0.43 | 0.05 | 0.29 | 0.03 | |
| 1,812 | 91.4 | 0.69 | 0.02 | 0.45 | 0.02 | |
| 936 | 0.29 | 0 | 0.18 | 0 | ||
| 82.7 | 0.05 | 0.91 | 0.05 | |||
| 897 | 100 | 0.31 | 0 | 0.2 | 0 | |
| 1,479 | 100 | 0.4 | 0 | 0.31 | 0 | |
| 942 | 100 | 0.22 | 0 | 0.18 | 0 | |
| 1,548 | 100 | 0.75 | 0.02 | 0.51 | 0.02 | |
| 603 | 0.17 | 0 | 0.05 | 0 | ||
| 600 | 94.8 | 0.26 | 0.05 | 0.18 | 0.03 | |
| 3,309 | 100 | 0 | 0.74 | 0 | ||
| 16,905 | 0.11 | 0.09 | ||||
| 1,797 | 100 | 0.46 | 0.03 | 0.26 | 0.03 | |
| 1,758 | 0.26 | 0 | 0.23 | 0 | ||
| 1,737 | 100 | 0.86 | 0.02 | 0.6 | 0.02 | |
| 1,230 | 0.49 | 0.02 | 0.31 | 0.02 | ||
| 711 | 0.18 | 0 | 0.09 | 0 | ||
| 2,922 | 90.7 | 0.86 | 0 | 0.63 | 0 | |
| 2,595 | 96.9 | 0.05 | 0.65 | 0.03 | ||
| 2,049 | 100 | 0.23 | 0 | 0.22 | 0 | |
| 1,497 | 93 | 0.32 | 0 | 0.25 | 0 | |
| 2,823 | 99 | 0.35 | 0 | 0.28 | 0 | |
| 7,005 | 99.9 | 0.4 | 0.02 | 0.32 | 0.02 | |
| 1,038 | 100 | 0.22 | 0 | 0.14 | 0 | |
| 1,044 | 100 | 0.49 | 0.02 | 0.31 | 0.02 | |
| 88.5 | 0.02 | 0.78 | 0 | |||
| 1,053 | 97 | 0.48 | 0.02 | 0.22 | 0.02 | |
| 100 | 0.06 | 0.06 | ||||
| 89.9 | 0.11 | 0.09 | ||||
| 663 | 0.12 | 0 | 0.08 | 0 | ||
| 1,599 | 100 | 0.45 | 0.03 | 0.34 | 0.03 | |
| 2,283 | 96.6 | 0.63 | 0 | 0.42 | 0 | |
| 99.8 | 0.85 | 0.02 | 0.65 | 0.02 | ||
| 633 | 81.2 | 0.06 | 0 | 0.02 | 0 | |
| 3,135 | 98.2 | 0.82 | 0 | 0.6 | 0 | |
| 720 | 0.23 | 0.02 | 0.18 | 0.02 | ||
Rare is defined as having a minor allele frequency of <0.1%, including private variants (observed in only one individual). Boxes containing very long genes (95th percentile), coverage rates of <80%, and variant rates >1% are marked with bold text. These data suggest that when performing DNA sequencing of the complete coding sequence of the highlighted genes, rare missense variants are likely to be identified regardless of disease state. For such variants, their impact on disease will thus be difficult to interpret. Abbreviations: CDS, coding sequence; kb, kilobases; 20x, percentage of coding bases covered on average by at least 20 independent sequence reads.
Figure 1Rare Variant Identification is Correlated With Coding Sequence Length. Coding sequence (CDS) length in nucleotides is plotted against the proportion of individuals carrying a very rare (MAF <0.1%) missense variant. A very strong positive correlation is observed (Pearson’s correlation coefficient r = 0.89), indicating that genes with long coding sequences are more likely to have a high rate of rare missense variants independent of the functional impact of those variants.