| Literature DB >> 23484092 |
Daniel F Schorderet1, Alexandra Iouranova, Tatiana Favez, Leila Tiab, Pascal Escher.
Abstract
The molecular diagnosis of retinal dystrophies is difficult because of the very important number of genes implicated and is rarely helped by genotype-phenotype correlations. This prompted us to develop IROme, a custom designed in solution-based targeted exon capture assay (SeqCap EZ Choice library, Roche NimbleGen) for 60 retinitis pigmentosa-linked genes and three candidate genes (942 exons). Pyrosequencing was performed on a Roche 454 GS Junior benchtop high-throughput sequencing platform. In total, 23 patients affected by retinitis pigmentosa were analyzed. Per patient, 39.6 Mb were generated, and 1111 sequence variants were detected on average, at a median coverage of 17-fold. After data filtering and sequence variant prioritization, disease-causing mutations were identified in ABCA4, CNGB1, GUCY2D, PROM1, PRPF8, PRPF31, PRPH2, RHO, RP2, and TULP1 for twelve patients (55%), ten mutations having never been reported previously. Potential mutations were identified in 5 additional patients, and in only 6 patients no molecular diagnosis could be established (26%). In conclusion, targeted exon capture and next-generation sequencing are a valuable and efficient approach to identify disease-causing sequence variants in retinal dystrophies.Entities:
Mesh:
Year: 2012 PMID: 23484092 PMCID: PMC3591198 DOI: 10.1155/2013/198089
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
List of genes enriched by targeted sequence capture (IROme).
| Gene | Alias | Chr | Chr location | Exons | Pathology |
|---|---|---|---|---|---|
|
| RP19, STGD1, CORD3, and ARMD2 | 1 | 94458391-94586688 (rs) | 50 | ADRP, ARRP, ARCRD, and ARMD |
|
| LCA4 | 17 | 6327057-6338519 (rs) | 6 | ARLCA, ADCRD |
|
| RP50, BMD, and VMD2 | 11 | 61717293-61732987 (fs) | 11 | ADRP, ARRP, and ADMD |
|
| RP54 | 2 | 29284556-29297127 (rs) | 2 | ARRP |
|
| RP17 | 17 | 58227302-58236902 (fs) | 8 | ADRP |
|
| CSNB2B | 11 | 67219877-67226699 (fs) | 7 | ARLCA, ARCSNB |
|
| LCA10, BBS14, and NPHP6 | 12 | 88442794-88535993 (rs) | 53 | ARLCA, ARBBS |
|
| RP26 | 2 | 182401403-182545392 (rs) | 14 | ARRP, ARCRD |
|
| RP61, USH3A | 3 | 150643950-150690786 (rs) | 3 | ARRP |
|
| RP49 | 4 | 47937994-48018689 (rs) | 13 | ARRP |
|
| X | 150906923-150913776 (fs) | 6 | ||
|
| RP45 | 16 | 57917847-58005020 (rs) | 33 | ARRP |
|
| LCA8, RP12 | 1 | 197170592-197447585 (fs) | 12 | ARRP, ARLCA |
|
| LCA7, CORD2 | 19 | 48325097-48364769 (fs) | 4 | ADRP, ADLCA, ARLCA, and ADCRD |
|
| RP59 | 1 | 26758773-26797785 (fs) | 9 | ARRP |
|
| RP25 | 6 | 64429876-66417118 (rs) | 43 | ARRP |
|
| RP28 | 2 | 62051989-62081278 (rs) | 6 | ARRP |
|
| RP30 | 17 | 79495422-79504156 (fs) | 5 | ADRP, ADMD |
|
| RP48, GCAP2 | 6 | 42152139-42162694 (rs) | 4 | ADRP, ADMD |
|
| LCA1, CORD6 | 17 | 7905988-7923658 (fs) | 20 | ARLCA, ADCRD |
|
| LCA11, RP10 | 7 | 128032331-128050306 (rs) | 17 | ADRP, ADLCA |
|
| RP56, sparcan | 3 | 100945570-101039404 (rs) | 20 | ARRP |
|
| NPHP5 | 3 | 121488610-121553926 (rs) | 15 | ARLCA |
|
| RP42 | 7 | 23145353-23215040 (fs) | 12 | ADRP |
|
| Lebercilin | 6 | 80194708-80247175 (rs) | 8 | ARLCA |
|
| AYTL2 | 5 | 1456595-1524092 (rs) | 14 | ARLCA |
|
| LCA14 | 4 | 155548097-155674270 (fs) | 4 | ARRP, ARLCA |
|
| RP38 | 2 | 112656056-112787138 (fs) | 19 | ARRP |
|
| RP37, PNR | 15 | 72084977-72110559 (fs) | 8 | ADRP, ARRP, and ARESCS |
|
| RP27 | 14 | 24549316-24584223 (rs) | 3 | ADRP, ARRP, and ARESCS |
|
| RP23 | X | 13752832-13787480 (fs) | 23 | XRP |
|
| 14 | 57267426-57277197 (rs) | 5 | ADLCA | |
|
| RP43 | 5 | 149237519-149324356 (rs) | 22 | ARRP |
|
| RP40, CSNBAD2 | 4 | 619373-664571 (fs) | 23 | ARRP, ADCSNB |
|
| RP57 | 17 | 79617489-79623607 (rs) | 4 | ARRP |
|
| RP36 | 17 | 74523871-74541458 (fs) | 5 | ARRP |
|
| RP41, STGD4, CORD12, and MCDR2 | 4 | 15964699-16086001 (rs) | 28 | ARRP, ADCRD, and ADMD |
|
| RP18 | 1 | 150293925-150325671 (fs) | 16 | ADRP |
|
| RP60 | 20 | 62612488-62664453 (fs) | 21 | ADRP |
|
| RP13 | 17 | 1553923-1588154 (rs) | 43 | ADRP |
|
| RP11 | 19 | 54618837-54635140 (fs) | 14 | ADRP |
|
| RDS, RP7 | 6 | 42664340-42690312 (rs) | 3 | ADRP, ADMD, ADCRD, and digenic |
|
| IRBP | 10 | 48381487-48390991 (rs) | 4 | ARRP |
|
| LCA13, RP53 | 14 | 68168603-68201169 (fs) | 8 | ADRP, ARLCA |
|
| RP44 | 10 | 86004809-86019716 (fs) | 7 | ADRP, ARRP, and ADCA |
|
| RP4, CSNBAD1 | 3 | 129247483-129254012 (fs) | 5 | ADRP, ARRP, and ADCSNB |
|
| CRALBP | 15 | 89753098-89764922 (rs) | 9 | ARRP |
|
| 11 | 62379194-62382592 (fs) | 3 | ADRP, digenic | |
|
| 8 | 55471729-55682531 (fs) | 4 | ADRP, ARRP | |
|
| X | 46696375-46741793 (fs) | 5 | XRP | |
|
| PAP1 | 7 | 33134409-33149013 (rs) | 7 | ADRP |
|
| LCA2, RP20 | 1 | 68894505-68915642 (rs) | 14 | ARRP, ARLCA |
|
| RP3, CORDX1 | X | 38128424-38186817 (rs) | 19 | XRP, XCRD, XMD |
|
| LCA6, CORD13 | 14 | 21756098-21819460 (fs) | 24 | ARLCA, ARCRD |
|
| RP47, Arrestin | 2 | 234216309-234255701 (fs) | 16 | ARRP, ARCSNB |
|
| RP35, CORD10 | 1 | 156117157-156147543 (fs) | 16 | ADRP, ARRP, and ADCRD |
|
| RP33 | 2 | 96940074-96971297 (rs) | 45 | ADRP |
|
| LCA3 | 14 | 88851268-88936694 (fs) | 12 | ARLCA |
|
| RP31 | 9 | 32540542-32552551 (rs) | 3 | ADRP |
|
| 11 | 8040791-8127659 (fs) | 13 | ||
|
| LCA15, RP14 | 6 | 35465651-35480715 (rs) | 15 | ARRP, ARLCA |
|
| RP39 | 1 | 215796236-216596738 (rs) | 73 | ARRP |
|
| RP58 | 2 | 27600098-27603657 (rs) | 4 | ARRP |
Genes are listed alphabetically according to their official gene symbol, and, in addition, gene aliases commonly used in ophthalmic research provided. Chromosomal (chr) location is based on the Homo sapiens high-coverage assembly GRCh37, yielding in the UCSC hg19 database (fs: forward strand; rs: reverse strand). For each gene the number of exons is listed. Targeted sequence capture was directed against genes causing autosomal dominant (AD), autosomal recessive (AR) X-linked (X), retinitis pigmentosa (RP), and Leber congenital amaurosis (LCA). Other retinopathies caused by a given gene are also indicated: cone or cone-rod dystrophy (CRD), macular degeneration (MD), congenital stationary night blindness (CSNB), Bardet-Biedl syndrome (BBS), enhanced S-cone syndrome (ARESCS), and chorioretinal atrophy (CA). Heterozygote ROM1 and PRPH2 mutations cause digenic disease. ORF15 of RPGR was not included in the assay.
Figure 1Workflow of the custom-designed targeted exome liquid hybridization capture assay IROme. Genomic DNA from patients was fragmented by nebulization and used for shotgun library construction (454 Roche GS Titanium Rapid Library). Upon adapter ligation, target enrichment is achieved by hybridizing the processed genomic DNA to biotinylated probes (Roche NimbleGen SeqCap EZ Choice). After biotin-streptavidin-based capture and washing, DNA was amplified by emulsion PCR and sequenced on a454 Roche GS Junior Sequencer. Sequencing data was aligned and mapped with the Roche 454 Reference Mapper program. Figure adapted from Roche NimbleGen technical information (http://www.nimblegen.com/products/seqcap/index.html).
Figure 2Workflow of data analysis and filtering. The sff (sequence file format) files generated by 454 Roche GS Junior sequencing were imported either into Reference Mapper or Sequence Pilot software. The coding sequence variants were selected from the 454_HCDiffs.txt files that contained all sequence variants. During filtering, coding sequence variants reported in dbSNP were removed, and missense and nonsense mutations kept. The remaining coding sequence variants were prioritized according to known reported mutations, the mode of inheritance, the percentage of sequence reads reporting the variant (threshold of 20%), and the predicted effect on the protein (PolyPhen score).
Validation of IROme by APEX.
| Pat | nt tested | nt detected | Mean | % reads |
|---|---|---|---|---|
| 1 | 557 | 100% | 25 | 98.9 |
| 2 | 558 | 100% | 26 | 99.4 |
| 3 | 558 | 100% | 22 | 99.2 |
| 4 | 547 | 100% | 20 | 98.3 |
The nucleotides (nt) tested by APEX represent validated RP-linked mutations or variants. The mean coverage (cvg) refers to the average of the coverage of all exons where the mutations are located. The percentage of sequence reads generated by IROme and correctly calling the nucleotides at homozygous state are indicated.
Synopsis of molecular diagnostic on RP patients by IROme.
| Pat | Total | Read | Median | Total | cds | filt. | prio. | Test/val | Potential | cvg | mut | Cosegregate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 47 | 453 | 21.3 | 1206 | 114 | 51 | 8 | 2/2 | p.C2ORF71-R571_P576del | 38 | 55.3 | M het norm |
| 2 | 47.6 | 433 | 20.9 | 1217 | 98 | 44 | 7 | 2/2 | p.PDE6B-H337R | 21 | 100 | ? |
| 3 | 42.2 | 416 | 17.0 | 1085 | 78 | 39 | 6 | 1/1 | p.CLRN1-P134L | 19 | 68.4 | ? |
| 4 | 44.6 | 395 | 21.6 | 1173 | 95 | 42 | 5 | 3/3 | p.RHO-Y191C | 39 | 38.5 | yes |
| 5 | 24.4 | 429 | 13.8 | 894 | 104 | 45 | 1 | 1/1 | p.TULP1-F529_A530dup | 6 | 100 | yes |
| 6 | 31.7 | 422 | 16.2 | 1039 | 85 | 47 | 1 | 1/1 | p.RHO-R252P | 22 | 54.5 | ? |
| 7 | 20.1 | 281 | 13.3 | 789 | 77 | 38 | 4 | 2/2 | p.SAG-E11K | 30 | 56.7 | no |
| 8 | 13.9 | 445 | 9.1 | 736 | 70 | 33 | 2 | 1/1 | p.RP2-D161Y | 22 | 45.5 | yes |
| 9 | 29 | 297 | 17.3 | 832 | 80 | 39 | 9 | 1/1 | g.ABCA4-ex45-47del | 0 | 0 | yes |
| 10 | 37.3 | 443 | 16.7 | 1247 | 93 | 46 | 3 | 3/3 | p.PROM1-R373C | 32 | 50 | yes |
| 11 | 50.2 | 440 | 21.5 | 1151 | 92 | 46 | 2 | 1/1 | p.RP2-E20X | 28 | 67.8 | yes |
| 12 | 49.1 | 394 | 23.8 | 1116 | 94 | 39 | 9 | 4/4 | p.CNGB1-R765C | 30 | 100 | yes |
| 13 | 33 | 436 | 14.6 | 1017 | 85 | 33 | 3 | 2/2 | p.GUCY2D-V887G | 18 | 94.4 | yes |
| 14 | 32.6 | 443 | 15.3 | 1205 | 93 | 42 | 3 | 1/1 | p.CRX-Q105X | 17 | 58.8 | M het norm |
| 15 | 32.7 | 442 | 14.2 | 1026 | 86 | 40 | 3 | 1/1 | p.USH2A-P2630R | 25 | 40 | no |
| 16 | 69.4 | 434 | 28.4 | 1246 | 87 | 41 | 1 | 1/1 | p.PRPF31-E183_ins33bp | 74 | 40 | yes |
| 17 | 16.7 | 452 | 9.8 | 861 | 82 | 43 | 2 | 2/2 | p.PRPH2-L39P | 18 | 50 | yes |
| 18 | 39.6 | 429 | 17.2 | 1826 | 85 | 35 | 3 | 1/1 | ||||
| 19 | 66.5 | 449 | 26.4 | 1298 | 103 | 45 | 5 | 2/2 | p.PRPH2-S217_dup16bp | 71 | 39.4 | yes |
| 20 | 47.1 | 358 | 19.8 | 1171 | 91 | 47 | 3 | 2/1 | p.C2ORF71-L889P | 23 | 39.1 | ? |
| 21 | 47 | 363 | 17.3 | 1197 | 102 | 48 | 3 | 1/0 | ||||
| 22 | 36.6 | 354 | 14.7 | 1072 | 86 | 45 | 2 | 2/1 | p.EYS-D2930G | 38 | 60.5 | ? |
| 23 | 53.3 | 393 | 22.4 | 1164 | 92 | 40 | 7 | 5/5 | p.PRPF8-E2331X | 38 | 44.7 | yes |
For each patient, the total number of Mb (106 bp) sequenced on the Roche 454 GS Junior (total seq Mb) and the average read length (read length bp) are indicated. The median fold coverage (cvg) was extracted from the unique depth information. From all the sequence variants (total seq var), first only the sequence variants located in coding sequences were analyzed (cds seq var), with filtering (filt seq var) and prioritizing (prio seq var) according to Figure 2. The sequence variants eventually tested and validated by Sanger sequencing (test/val seq var) are also indicated. For each potential mutation, the coverage (cvg pot mut) and the percentage of sequence reads reporting the potential mutation (mut reads %) are indicated. For cosegregation analysis, “?” indicates absence of available family members and/or simplex cases. For patients 1 and 14, the mother (M) and/or the father (F) are healthy heterozygous carriers (het norm).
Figure 3Fold coverage of targeted sequences. For each patient the unique depth data provided by column 5 of the 454_AlignmentInfo.tsv file was used to estimate the coverage per targeted bp. The onefold coverage data corresponding to reference genome sequences used for alignment purposes, but not targeted by IROme, were removed. The coverage data is represented as cumulative percentage; that is, indicating what percentage of targeted bp has a minimal coverage of x-fold (x axis represents the fold coverage). The average coverage for all patients is represented as a black dashed line, and the median coverage for all patients is 17-fold.