| Literature DB >> 26214305 |
Maria Delio1, Kunjan Patel1, Alex Maslov1, Robert W Marion2, Thomas V McDonald3, Evan M Cadoff4, Aaron Golden1, John M Greally1, Jan Vijg1, Bernice Morrow1, Cristina Montagna5.
Abstract
BACKGROUND: While next generation sequencing (NGS) is a useful tool for the identification of genetic variants to aid diagnosis and support therapy decision, high sequencing costs have limited its application within routine clinical care, especially in economically depressed areas. To investigate the utility of a multi-disease NGS based genetic test, we designed a custom sequencing assay targeting over thirty disease-associated areas including cardiac disorders, intellectual disabilities, hearing loss, collagenopathies, muscular dystrophy, Ashkenazi Jewish genetic disorders, and complex Mendelian disorders. We focused on these specific areas based on the interest of our collaborative clinical team, suggesting these diseases being the ones in need for the development of a sequencing-screening assay.Entities:
Mesh:
Year: 2015 PMID: 26214305 PMCID: PMC4516357 DOI: 10.1371/journal.pone.0133742
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Proof of Principle Cohort: Samples with previously identified sequenced variants (samples 1–5) and previously identified copy number changes (samples 6–7).
| Sample number | ID | Disorder | Gene | Validated mutation | Zygosity | Depth |
|---|---|---|---|---|---|---|
| 1 | TG322.001 | Microphthalmia, syndromic 2 |
| c.4540C>T, p.R1480X | Het | 1401 |
| 2 | TG472.001 | Focal Segmental Glomerulosclerosis |
| c.2409+7dup | Het | 786 |
| 3 | TG478.001 | Cystic Fibrosis |
| c.3120+1G>A | Het | 642 |
| 4 | AJ177 | Glycogen Storage Disease |
| c.247C>T, p.R83C | Het | 1757 |
| AJ177 | Tay-Sachs |
| c.1274_1277dupTATC, p.Tyr427IlefsTer5 | N/A | 920 | |
| 5 | JS6.200 | Hearing loss |
| c.556C>T, p.Arg186Trp | Het | 1396 |
| 6 | TG17.001 | Multiple congenital anomalies |
| arr 18q22.3 (71791767–71878869)x3 | N/A | N/A |
| 7 | TG33.001 | Multiple congenital anomalies |
| arr 8p23.2 (2524538–3288481)x3 | N/A | N/A |
*Proof-of-principle samples 6 and 7 were included in the sequencing test cohort.
Demographic, clinical features, and sequencing results of the sixteen patients in test cohort.
| Sample number | ID | Sex | Ethnicity | Clinical Information | Genetic tests and results performed (clinically) before enrollment in study | Results from Multi-Disease Target Panel | |
|---|---|---|---|---|---|---|---|
| Gene | Protein Change | ||||||
|
|
| M | AA | DD DF | Chromosome microarray (18q22.3 interstitial duplication) Rett syndrome ( | None to report | None to report |
|
|
| F | N/A | MCA DD Chronic lung disease Osteopathia Striata with cranial sclerosis | Chromosome microarray (8p23.2 duplication maternally inherited) Xq11.2 deletion, X-inactivation 63:37) FISH for subtelomeric rearrangements FISH for VCFS, CHARGE ( | PAX6 | p.S213* (rs201251689) p.I341T no ExAC p.N1224K (rs371475225) |
|
|
| M | Cau/His | MCA FSGS Ticks and possible Tourette Syndrome Speech delay Fine motor delay Marfan Syndrome-like features | Cystic Fibrosis ( | FBN1 | p.G1334D |
|
|
| M | Cau/His | MCA FSGS Ticks and possible Tourette Syndrome Speech delay Fine motor delay Marfan Syndrome-like features | Cystic Fibrosis ( | FBN1 | p.G1334D |
|
|
| F | Cau | DD DFID Epilepsy Williams-Beuren Syndrome-like characteristics Moderate anxiety | Chromosome microarray |
| p.Y463fs p.R106G (rs143163513) |
|
|
| M | His | VSD SD Bilateral cleft lip and cleft palate Duplicated renal collecting system | Chromosomes and chromosome microarray |
| p.Y121C p.L104F p.G576A (rs18000098) p.G190G |
|
|
| F | Cau/Isr | DD Noonan Syndrome-like characteristics Clinodactyly Brachydactyly Left inguinal hernias, Epicanthal folds Suborbital fullness | Hearing loss ( | SPRED1 NLGN4X | p.N10K (rs201692618) p.R101* |
|
|
| M | AA | PDD Tall stature Mild dysmorphism Macrocephaly Hypertelorism Flat nasal bridge | Fragile X Chromosome microarray | NXF5 | p.R320* (rs140252282) |
|
|
| F | His | DCM | None |
| p.R340Q (rs191009474) p.V118M (rs35507268) p.T3469M p.R220C |
|
|
| F | His | DCM | None |
| p.A672T (rs199473140) p.F212S p.T3469M p.V118M (rs35507268) |
|
|
| F | His | Brugada Syndrome LQTS | None |
| p.D232N (rs121908338) |
|
|
| F | Ben | CPVT | None |
| p.R67* c.532+1G>A |
|
|
| F | Cau | SPOH | None |
|
|
|
|
| F | AA | HCM | None |
| p.R469W (rs141502002) |
|
|
| F | AA | DCM with conduction disease | None |
| p.I1583T (rs367566833) |
|
|
| M | Mex | Brugada Syndrome | None | None to report | None to report |
Legend: Ethnicitiy: AA = African American, Cau = Caucasian, His = Hispanic, Isr = Israeli, Ben = Bengali, Mex = Mexican, N/A = Not Available; Clinical Information: DD = Developmental Delay, DF = Dyspmorphic Features, MCA = Multiple Congenital Anomalies, FSGS = Focal segmental glomerulosclerosis, ID = Intellectual Disability, VSD = Ventricular Septal Defect, SD = Speech Delay, PDD = Pervasive Developmental Delay, DCM = Dilated Cardiomyopathy, LQTS = Long QT Syndrome, CPVT = Catecholaminergic Polymorphic Ventricular Tachycardia, SPOH = Severe Postural Orthostatic Hypertension, HCM = Hypertrophic Cardiomyopathy.
Fig 1Custom panel design.
The pie chart illustrates the percent of genes included in the custom design categorized based on specific diseases/abnormalities. Of note the Ashkenazi Jews variant disorders have been kept separate because they represent an ethnic division commonly associated with specific disease and genetic variants.
Fig 2Project pipeline.
Sample quality is conducted in the laboratory (blue box), subsequent library prep and sequencing is conducted in the Epigenomics Shared Facility (pink box). The sequence reads automatically progress into the WASP pipeline for quality control parameters, alignment to reference sequence through BWA and duplicate removal through Picard is performed (yellow boxes). Local Realignment, base quality recalibration, variant discovery and annotation take place via GATK or VarScan2 (somatic) (green boxes). Clinically relevant variants are prioritized through Alamut and visualized through IGV (orange boxes). Validation is then performed by Sanger sequencing and results are visualized through Sequencher 4.0.1 (purple box).
Fig 3Variant Calling Pipeline.
The vcf file created through the project quality control pipeline is filtered for frequency using 1000Genome, dbSNP, ExAC, and NHLBI Exome Project. All variants with a MAF <1% are then prioritized by gene function and genomic region. Variants associated with the patient's phenotype mapping to exons are further prioritized based on functional annotation. If protein-changing variants are identified in a gene of interest, further analysis of non-protein changing variants, UTRs, and intronic regions are analyzed to consider the possibility of compound heterozygous variants. Variants are visualized and reviewed though IGV and In silico prediction tools, and taken into consideration to estimate the possible impact of each variant at the protein level. Variants are also manually investigated through Alamut v2.2 software (Interactive Biosoftware, San Diego, CA) to aid in variant interpretation and pathogenicity predictions. Of note, we follow the ACMG guidelines for incidental findings (grey boxes).
Summary of sequencing coverage and detected variants for test cohort.
| Samples | Total Reads | Avg. Depth (X) | % Bases at ≥ 20X coverage | Total no. of SNVs | Total no. of SNVs at ≤ 1% MAF | Total no. of Indels | Total no. of Indels at ≤ 1% MAF |
|---|---|---|---|---|---|---|---|
|
| 48581252 | 496 | 81 | 4306 | 402 | 790 | 458 |
|
| 46810778 | 498 | 79 | 3593 | 186 | 674 | 313 |
|
| 61418574 | 582 | 80 | 3616 | 186 | 652 | 312 |
|
| 41042110 | 448 | 77 | 3517 | 131 | 662 | 321 |
|
| 53239778 | 546 | 88 | 3662 | 205 | 685 | 367 |
|
| 47030442 | 514 | 88 | 3670 | 189 | 685 | 375 |
|
| 47609338 | 491 | 88 | 4481 | 751 | 776 | 386 |
|
| 44839312 | 416 | 89 | 3630 | 245 | 646 | 310 |
|
| 37422180 | 263 | 86 | 3905 | 223 | 691 | 348 |
|
| 68089872 | 627 | 78 | 3323 | 150 | 643 | 304 |
|
| 39783474 | 298 | 73 | 3354 | 135 | 648 | 304 |
|
| 40765488 | 446 | 75 | 4075 | 328 | 784 | 371 |
|
| 60697664 | 533 | 77 | 4093 | 363 | 758 | 371 |
|
| 53780084 | 327 | 73 | 3298 | 140 | 626 | 284 |
|
| 42699994 | 417 | 75 | 3310 | 154 | 667 | 312 |
|
| 127605702 | 1217 | 91 | 3787 | 255 | 696 | 322 |
|
|
|
|
|
|
|
|
* indicates samples that were multiplexed together.
TG471.002 was added to another lane for logistic reasons.
Fig 4Mutational pedigree of TG471.001 and TG472.002, sisters with DCM.
Affected sisters are indicated in black. Associated variants are provided under each family member.
Cost comparison of target sequencing panel Einstein_v1 versus Whole Exome Sequencing.
| Einstein _v1 | Whole Exome Sequencing | |
|---|---|---|
|
| Roche Nimblegen SeqCap EZ | Roche Nimblegen SeqCap EZ Exome v3.0 |
|
| 4.98Mb | 64 Mb |
|
| 26,046 | >2,100,000 |
|
| Biotinylated DNA bait | Biotinylated DNA bait |
|
| 1ug | 1 ug |
|
| 4,093 SNVs/758 InDel | 6,350SNVs/2064 InDel |
|
| 734 SNVs | 1,190 SNVs |
|
| $37.5 | $600 |
|
| $87.5 (16 samples/lane) | $466 (3 samples/lane) |
|
| $200 | $200 |
|
|
|
|
* The number of SNVs/InDels identified was based on samples used in the current analysis (n = 2 for WES and matching target sequencing).
** Based on estimated $1,400/lane 150 bp pair end sequencing on Illumina 2500.