Literature DB >> 26783550

RNAseq analysis for the diagnosis of muscular dystrophy.

Hernan Gonorazky1, Minggao Liang2, Beryl Cummings3, Monkol Lek3, Johann Micallef4, Cynthia Hawkins5, Raveen Basran5, Ronald Cohn6, Michael D Wilson2, Daniel MacArthur3, Christian R Marshall5, Peter N Ray7, James J Dowling8.   

Abstract

The precise genetic cause remains elusive in nearly 50% of patients with presumed neurogenetic disease, representing a significant barrier for clinical care. This is despite significant advances in clinical genetic diagnostics, including the application of whole-exome sequencing and next-generation sequencing-based gene panels. In this study, we identify a deep intronic mutation in the DMD gene in a patient with muscular dystrophy using both conventional and RNAseq-based transcriptome analyses. The implications of our data are that noncoding mutations likely comprise an important source of unresolved genetic disease and that RNAseq is a powerful platform for detecting such mutations.

Entities:  

Year:  2015        PMID: 26783550      PMCID: PMC4704476          DOI: 10.1002/acn3.267

Source DB:  PubMed          Journal:  Ann Clin Transl Neurol        ISSN: 2328-9503            Impact factor:   4.511


Introduction

Neurogenetic diseases are a common cause of severe disability, associated with multiple morbidities, early mortality, and significant economic and societal costs.1, 2 Muscular dystrophies (MDs) are prototypical neurogenetic disorders in that they are clinically and genetically heterogeneous but united by a common set of clinical and diagnostic observations (muscle weakness, elevated Creatine Phosphokinase (CPK), and dystrophic changes on muscle biopsy).3 A critical existing issue in the MD field is that a significant number of children (~50%) with suspected disease do not yet have identified gene mutation(s) despite extensive genetic analysis.4 This fact presents a major barrier to clinical care by prolonging the diagnostic odyssey for patients and families, by limiting care recommendations and prognostic information, and by exposing patients to potentially unnecessary testing.5 It also hinders research into disease pathogenesis and therapy development. Whole‐exome sequencing (WES), along with next‐generation sequencing‐based gene panels, represent a significant technical advance that has greatly improved genetic diagnostics.6 However, there remains a large group of patients whose genetic cause has not been uncovered despite thorough investigation with these modalities.4, 7 One potential and underexplored source of mutation is sequence variation that alters RNA expression and/or processing. Such mutations may occur at exon/intron boundaries, and thus be captured by WES, or occur outside of the standard exome (such as in deep intronic and intergenic regions). To date, while 15% of mutations in the Human Gene Mutation Database are predicted to alter splicing and/or gene expression, nearly all of these are described at splice‐site junctions. Only a small number of cases exist where mutations in intronic or intergenic sequences have been identified as the cause of disease.8, 9, 10 These aspects of the genome are seldom examined in the context of neurogenetic disease; furthermore, next‐generation sequencing based on RNA transcript analysis (RNAseq) has yet to be studied in this context. Here, we present a case of a deep intronic mutation as the cause of Duchenne MD, and show that RNAseq is an effective modality for detecting this mutation.

Methods and Results

We encountered a 6‐year‐old boy with the insidious onset of gait abnormalities. He had been normally developing until age 2.5 years when he began experiencing increased trips and falls. After assessment by a family practitioner, a serum CPK study was sent, and was found to be elevated at >16,000 μ/L (normal 55–300). His physical examination was notable for shoulder and calf muscle hypertrophy, limb‐girdle muscle weakness, mild bilateral ankle contractures, a positive Gowers sign, and a waddling gait. On the basis of these features, we initiated clinical genetic testing for Duchenne MD,11 including multiplex ligation‐dependent probe amplification‐based deletion/duplication analysis of the DMD gene and direct Sanger sequencing of DMD coding sequence and intron/exon boundaries. These investigations did not reveal a causative mutation. Because his clinical picture was consistent with the diagnosis of MD, we next sent a next‐generation sequencing‐based gene panel that examined all coding exons of the known limb‐girdle MD genes (35 genes, Emory Genetics Laboratory).4 No causative mutation was detected using this strategy. We then performed a diagnostic muscle biopsy, which revealed histologic changes consistent with a MD (Fig. 1A). Immunostaining showed absent dystrophin expression in the majority of fibers (Fig. 1B and C), suggestive of a mutation at the DMD locus. We therefore used the remaining muscle biopsy material to analyze the DMD transcript, using both a standard approach and RNA‐seq.
Figure 1

Deep intronic mutation in the gene as a cause of muscular dystrophy. (A–C) Diagnostic muscle biopsy results. (A) Hematoxylin and Eosin staining revealed a typical dystrophic pattern (areas of fibrosis, fatty infiltrate, and degenerating and regenerating fibers). (B) IHC for dystrophin showing absent expression, with the exception of some revertant fibers (arrow). (C) Inmunohistochemistry (IHC) for a‐sarcoglycan showing a normal staining pattern. (D) RT‐PCR analysis using cDNA from patient muscle and overlapping primer sets that span multiple DMD exons. The primer sets including both exons 37 and 38 yielded larger than expected bands (arrows). (E) Sanger sequencing of an RT‐PCR fragment with exons 37/38 revealed a 51 base pair insertion of intron 37 sequence. (F) Schematic of the mutation and its consequences.

Deep intronic mutation in the gene as a cause of muscular dystrophy. (A–C) Diagnostic muscle biopsy results. (A) Hematoxylin and Eosin staining revealed a typical dystrophic pattern (areas of fibrosis, fatty infiltrate, and degenerating and regenerating fibers). (B) IHC for dystrophin showing absent expression, with the exception of some revertant fibers (arrow). (C) Inmunohistochemistry (IHC) for a‐sarcoglycan showing a normal staining pattern. (D) RT‐PCR analysis using cDNA from patient muscle and overlapping primer sets that span multiple DMD exons. The primer sets including both exons 37 and 38 yielded larger than expected bands (arrows). (E) Sanger sequencing of an RT‐PCR fragment with exons 37/38 revealed a 51 base pair insertion of intron 37 sequence. (F) Schematic of the mutation and its consequences. For conventional analysis, we performed overlapping RT‐PCR reactions covering the entire DMD gene (Fig. 1D). We detected altered cDNA products (size greater than predicted) with all primer sets that included both exons 37 and 38. We sequenced one of the altered products and uncovered a novel 51 base pair insertion between exons 37 and 38 corresponding to a discrete sequence fragment within intron 37 (Fig. 1E). We next analyzed genomic DNA corresponding to the area surrounding the inserted sequence, and identified a single sequence variant in intron 37 (g.chrX:32,366,860 A>C [c.5326‐215 T>G]). This variant creates a novel splice acceptor site, which then pairs with a cryptic splice donor site in the intron to create an aberrant 51 base pair exon (Fig. 1D). This aberrant exon creates a frameshift and premature stop codon in the DMD coding sequence (Fig. 1F), and is consistent with the absent protein expression seen by immunostaining. On the basis of this, we concluded that this sequence variant is causative for disease in our patient. RNAseq analysis proved to be equally as informative. We isolated RNA from ~15 mg of biopsy from our patient and from three controls (RNA integrity value (RIN) values ranging from 7.2 to 9.5). We generated libraries with an Illumina, San Diego, CA, USA. TruSeq V2 mRNA library kit and performed RNAseq on an Illumina HiSeq 2000 to average depth of 50–100 million paired end reads of transcript sequence per sample.12 Reads were aligned to the hg19 reference assembly using RNA‐STAR13 and Gencode Release 19 transcriptome annotations.14 To detect unannotated transcripts and isoforms, we performed unguided transcriptome assembly for each sample using Cufflinks15; these were then annotated against the reference transcriptome. Unsupervised cluster analysis based on gene expression levels definitively distinguished the patient sample from controls (Fig. 2A). Examination of the patient's transcriptome identified altered transcript levels (>threefold) in 1197 genes relative to controls (FDR‐adjusted P > 0.05), with DMD the third most significantly reduced transcript (Fig. 2B). Examination of the DMD transcript showed relatively uniform reduction in all exons (Fig. 2C), whereas dedicated interrogation of DMD transcript identified a single novel change corresponding to an insertion of transcribed sequence from intron 37 (Fig. 2C and D). We then determined the specific location and length of this sequence, and it corresponded exactly to the altered fragment we identified by the conventional analysis. In all, RNA‐seq rapidly and correctly identified the causative abnormality in the DMD transcript.
Figure 2

RNA‐seq analysis of muscle from a patient with an intronic mutation. (A) Pairwise correlation heatmap of RNA‐seq samples based on the Pearson correlation of log gene expression values for all genes. Gene expression (counts) was determined and normalized by effective library size using DESeq2. (B) Counts for the top four downregulated genes in patient versus controls as ranked by significance. Bar plots show counts for the patient (blue) and sample means for controls (red). Error bars represent 95% confidence intervals. Circles represent counts for individual samples. (C) Per‐exon read counts and differential exon usage for in patient versus control samples. The novel DMD exon was detected as the most differently expressed exon between the patient and controls (E049 at chrX:32366809‐32366856; marked by arrow). Additional novel exons were detected (E050, E090, E091) but likely represent transcriptional noise. (D) UCSC genome browser screenshot of raw RNA‐seq signal normalized to library size at the locus corresponding to the shaded region in (C) (upper panel). A zoom‐in of the boxed region including the novel exon is shown in the lower panel (novel exon marked by arrow).

RNA‐seq analysis of muscle from a patient with an intronic mutation. (A) Pairwise correlation heatmap of RNA‐seq samples based on the Pearson correlation of log gene expression values for all genes. Gene expression (counts) was determined and normalized by effective library size using DESeq2. (B) Counts for the top four downregulated genes in patient versus controls as ranked by significance. Bar plots show counts for the patient (blue) and sample means for controls (red). Error bars represent 95% confidence intervals. Circles represent counts for individual samples. (C) Per‐exon read counts and differential exon usage for in patient versus control samples. The novel DMD exon was detected as the most differently expressed exon between the patient and controls (E049 at chrX:32366809‐32366856; marked by arrow). Additional novel exons were detected (E050, E090, E091) but likely represent transcriptional noise. (D) UCSC genome browser screenshot of raw RNA‐seq signal normalized to library size at the locus corresponding to the shaded region in (C) (upper panel). A zoom‐in of the boxed region including the novel exon is shown in the lower panel (novel exon marked by arrow). To test the potential utility of RNAseq as a clinical diagnostic technique, we then analyzed the data in a semiblinded fashion. RNAseq source files from our patient and controls were given random numbers and sent blinded (without clinical or genetic information) for analysis to the McArthur laboratory. We (the McArthur group) then independently and unambiguously identified the DMD transcript abnormality, including detection of the unique sequence variant in the intron that was found by genomic DNA sequencing to be the causative gene mutation (Fig. 3A and B). As we were interested in RNAseq as a potential clinical diagnostic tool, we additionally documented comprehensive coverage of several major muscle specific transcripts, including DMD and other large, complex and likely low abundance transcripts (ex: LAMA2, RYR1 and NEB, Fig. 3C). We also determined that at our read depth of 100 million reads we found in our patient sample ~20 reads supporting the intron 37 inclusion DMD transcript (as opposed to zero in the controls or 150 samples from GTEx). These additional data thus confirm that RNA‐seq can identify a disease associated transcript variant in a nonbiased fashion, and provide baseline measurements that suggest RNAseq can serve as a clinical diagnostic tool for a range of muscle disorders.
Figure 3

RNA‐seq as a tool for mutation detection in muscular dystrophy. (A) RNA‐seq reads for the transcript from the patient and a representative control. Note the existence of an aberrant transcript fragment in all reads from the DMD patient (arrow). (B) Examination at the resolution of the individual base pair of the aberrant transcript fragment. Direct evaluation reveals the causative sequence variant in the patient (box). (C) exon usage data plot (normalized as reads per million) for four large, muscle‐specific transcripts. Note that there is adequate coverage of these large transcripts throughout the gene. The exception is , which shows uniformly low expression in the patient sample (control in blue, patient in red).

RNA‐seq as a tool for mutation detection in muscular dystrophy. (A) RNA‐seq reads for the transcript from the patient and a representative control. Note the existence of an aberrant transcript fragment in all reads from the DMD patient (arrow). (B) Examination at the resolution of the individual base pair of the aberrant transcript fragment. Direct evaluation reveals the causative sequence variant in the patient (box). (C) exon usage data plot (normalized as reads per million) for four large, muscle‐specific transcripts. Note that there is adequate coverage of these large transcripts throughout the gene. The exception is , which shows uniformly low expression in the patient sample (control in blue, patient in red).

Discussion

This case study illustrates two important concepts. First, it adds to a small but growing group of cases where noncoding mutations lying outside the intron/exon boundary are identified as the cause of neurogenetic disease.8, 9, 10, 16 This provides support to our prediction that a significant fraction of the currently “unsolved” cohort of patients with MD and other similar genetic diseases are due to mutations that alter RNA processing and/or expression. Identification of such mutations will likely require analysis of nonexomic sequence sources, including RNA (as was done in this case) and genomic noncoding DNA. Second, our study demonstrates for the first time the potential utility of RNAseq for the identification of mutations that alter RNA transcript processing and/or expression.17 Using RNAseq, we were able to quickly and accurately detect a noncoding mutation in DMD. Given that essentially all‐known muscle disease genes are adequately captured and represented by RNAseq, the technology is likely applicable across the broad spectrum of muscle disease. This is particularly true given that muscle biopsy material is available for analysis in many cases. Also, it is possible to perform RNA analysis on myotubes derived from myoD‐driven transdifferentiation of fibroblasts,18 therefore obviating the requirement for muscle biopsy material for generating usable RNAseq data. This opens the broader possibility of application of RNAseq to a range of neurogenetic disorders, using for example, neurons derived in vitro either through direct reprogramming19 or from induce Plupripotent stem (iPS) cells.20 Lastly, of note, the specific mutation identified in our case is unusual as it occurs deep in an intron and it creates a novel exon with splice acceptor and donor independent from the surrounding exons. Importantly, investigation of this genomic variant by combined annotation‐dependent depletion (CADD) analysis places it only in the 10th percentile in terms of pathogenicity, meaning that it would likely be ranked only in the top 300,000 of disease‐relevant variants in our patient's genome. Based on this, it is possible that this mutation would not have been considered pathogenic in the absence of transcript data, and instead would have been coded as a VOUS or even a benign variant. Given the potentially high rate of VOUS in the noncoding genome, this raises the question as to whether whole‐genome sequencing on its own will be adequate to uncover such mutations, or if RNA analysis will be a critical required element for establishing pathogenicity of noncoding mutations. With the speed and potential accuracy of RNAseq (as well as the price, which is equivalent to WES), it is possible that it may represent a linked or even preferred modality.

Author Contributions

H. G. and J. M. presented the clinical data, helped with RNA‐seq analysis, and assisted in manuscript writing. M. L., M. W., B. C., M. L., and D. McA. all worked on RNA‐seq analysis and interpretation. C. R. M., R. B., and P. D. R. performed conventional RNA analysis on the patient biopsy. C. H. interpreted the muscle biopsy. R. C. helped with data interpretation and clinical evaluation. J. J. D. conceived the study, helped with all data interpretation, and wrote the manuscript. All authors assisted in manuscript editing.

Conflict of Interest

None declared.
  20 in total

Review 1.  Cost of illness and economic evaluation in rare diseases.

Authors:  Julio López-Bastida; Juan Oliva-Moreno
Journal:  Adv Exp Med Biol       Date:  2010       Impact factor: 2.622

Review 2.  Forward engineering neuronal diversity using direct reprogramming.

Authors:  Rachel K Tsunemoto; Kevin T Eade; Joel W Blanchard; Kristin K Baldwin
Journal:  EMBO J       Date:  2015-04-23       Impact factor: 11.598

3.  Using complementary DNA from MyoD-transduced fibroblasts to sequence large muscle genes.

Authors:  Leigh B Waddell; Nicole Monnier; Sandra T Cooper; Kathryn N North; Nigel F Clarke
Journal:  Muscle Nerve       Date:  2011-08       Impact factor: 3.217

4.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

5.  A mixed methods study of age at diagnosis and diagnostic odyssey for Duchenne muscular dystrophy.

Authors:  Siaw H Wong; Belinda J McClaren; Alison Dalton Archibald; Alice Weeks; Tess Langmaid; Monique M Ryan; Andrew Kornberg; Sylvia A Metcalfe
Journal:  Eur J Hum Genet       Date:  2015-01-28       Impact factor: 4.246

6.  A comprehensive genomic approach for neuromuscular diseases gives a high diagnostic yield.

Authors:  Arunkanth Ankala; Cristina da Silva; Francesca Gualandi; Alessandra Ferlini; Lora J H Bean; Christin Collins; Alice K Tanner; Madhuri R Hegde
Journal:  Ann Neurol       Date:  2014-12-17       Impact factor: 10.422

7.  A highly conserved program of neuronal microexons is misregulated in autistic brains.

Authors:  Manuel Irimia; Robert J Weatheritt; Jonathan D Ellis; Neelroop N Parikshak; Thomas Gonatopoulos-Pournatzis; Mariana Babor; Mathieu Quesnel-Vallières; Javier Tapial; Bushra Raj; Dave O'Hanlon; Miriam Barrios-Rodiles; Michael J E Sternberg; Sabine P Cordes; Frederick P Roth; Jeffrey L Wrana; Daniel H Geschwind; Benjamin J Blencowe
Journal:  Cell       Date:  2014-12-18       Impact factor: 41.582

8.  Becker muscular dystrophy caused by an intronic mutation reducing the efficiency of the splice donor site of intron 26 of the dystrophin gene.

Authors:  Berivan Baskin; Brenda Banwell; Reem Al Khater; Cynthia Hawkins; Peter N Ray
Journal:  Neuromuscul Disord       Date:  2009-02-18       Impact factor: 4.296

9.  Differential analysis of gene regulation at transcript resolution with RNA-seq.

Authors:  Cole Trapnell; David G Hendrickson; Martin Sauvageau; Loyal Goff; John L Rinn; Lior Pachter
Journal:  Nat Biotechnol       Date:  2012-12-09       Impact factor: 54.908

Review 10.  Diagnosis and management of Duchenne muscular dystrophy, part 1: diagnosis, and pharmacological and psychosocial management.

Authors:  Katharine Bushby; Richard Finkel; David J Birnkrant; Laura E Case; Paula R Clemens; Linda Cripe; Ajay Kaul; Kathi Kinnett; Craig McDonald; Shree Pandya; James Poysky; Frederic Shapiro; Jean Tomezsko; Carolyn Constantin
Journal:  Lancet Neurol       Date:  2009-11-27       Impact factor: 44.182

View more
  33 in total

1.  Improving genetic diagnosis in Mendelian disease with transcriptome sequencing.

Authors:  Beryl B Cummings; Jamie L Marshall; Taru Tukiainen; Monkol Lek; Sandra Donkervoort; A Reghan Foley; Veronique Bolduc; Leigh B Waddell; Sarah A Sandaradura; Gina L O'Grady; Elicia Estrella; Hemakumar M Reddy; Fengmei Zhao; Ben Weisburd; Konrad J Karczewski; Anne H O'Donnell-Luria; Daniel Birnbaum; Anna Sarkozy; Ying Hu; Hernan Gonorazky; Kristl Claeys; Himanshu Joshi; Adam Bournazos; Emily C Oates; Roula Ghaoui; Mark R Davis; Nigel G Laing; Ana Topf; Peter B Kang; Alan H Beggs; Kathryn N North; Volker Straub; James J Dowling; Francesco Muntoni; Nigel F Clarke; Sandra T Cooper; Carsten G Bönnemann; Daniel G MacArthur
Journal:  Sci Transl Med       Date:  2017-04-19       Impact factor: 17.956

Review 2.  Non-Coding Loss-of-Function Variation in Human Genomes.

Authors:  Zachary Zappala; Stephen B Montgomery
Journal:  Hum Hered       Date:  2017-01-12       Impact factor: 0.444

3.  Clinical Utility Gene Card for: Becker muscular dystrophy.

Authors:  David Coote; Mark R Davis; Macarena Cabrera; Merrilee Needham; Nigel G Laing; Kristen J Nowak
Journal:  Eur J Hum Genet       Date:  2018-02-21       Impact factor: 4.246

4.  Rescue of recurrent deep intronic mutation underlying cell type-dependent quantitative NEMO deficiency.

Authors:  Bertrand Boisson; Yoshitaka Honda; Masahiko Ajiro; Jacinta Bustamante; Matthieu Bendavid; Andrew R Gennery; Yuri Kawasaki; Jose Ichishima; Mitsujiro Osawa; Hiroshi Nihira; Takeshi Shiba; Takayuki Tanaka; Maya Chrabieh; Benedetta Bigio; Hong Hur; Yuval Itan; Yupu Liang; Satoshi Okada; Kazushi Izawa; Ryuta Nishikomori; Osamu Ohara; Toshio Heike; Laurent Abel; Anne Puel; Megumu K Saito; Jean-Laurent Casanova; Masatoshi Hagiwara; Takahiro Yasumi
Journal:  J Clin Invest       Date:  2018-12-18       Impact factor: 14.808

Review 5.  Deep intronic mutations and human disease.

Authors:  Rita Vaz-Drago; Noélia Custódio; Maria Carmo-Fonseca
Journal:  Hum Genet       Date:  2017-05-12       Impact factor: 4.132

Review 6.  Opportunities, resources, and techniques for implementing genomics in clinical care.

Authors:  Teri A Manolio; Robb Rowley; Marc S Williams; Dan Roden; Geoffrey S Ginsburg; Carol Bult; Rex L Chisholm; Patricia A Deverka; Howard L McLeod; George A Mensah; Mary V Relling; Laura Lyman Rodriguez; Cecelia Tamburro; Eric D Green
Journal:  Lancet       Date:  2019-08-05       Impact factor: 79.321

7.  Whole-transcriptome sequencing in blood provides a diagnosis of spinal muscular atrophy with progressive myoclonic epilepsy.

Authors:  Kristin D Kernohan; Laure Frésard; Zachary Zappala; Taila Hartley; Kevin S Smith; Justin Wagner; Hongbin Xu; Arran McBride; Pierre R Bourque; Care Rare Canada Consortium; Steffany A L Bennett; David A Dyment; Kym M Boycott; Stephen B Montgomery; Jodi Warman Chardon
Journal:  Hum Mutat       Date:  2017-03-28       Impact factor: 4.878

8.  A combined RNA-seq and whole genome sequencing approach for identification of non-coding pathogenic variants in single families.

Authors:  Revital Bronstein; Elizabeth E Capowski; Sudeep Mehrotra; Alex D Jansen; Daniel Navarro-Gomez; Mathew Maher; Emily Place; Riccardo Sangermano; Kinga M Bujakowska; David M Gamm; Eric A Pierce
Journal:  Hum Mol Genet       Date:  2020-04-15       Impact factor: 6.150

Review 9.  [Application of RNA sequencing in clinical diagnosis of Mendelian disease].

Authors:  Hui Xiao; Wen-Hao Zhou
Journal:  Zhongguo Dang Dai Er Ke Za Zhi       Date:  2020-10

10.  Comparative evaluation of cDNA library construction approaches for RNA-Seq analysis from low RNA-content human specimens.

Authors:  T L Masters; C A Hilker; P R Jeraldo; A V Bhagwate; K E Greenwood-Quaintance; B W Eckloff; N Chia; A D Hanssen; M P Abdel; J Z Yao; J Jen; R Patel
Journal:  J Microbiol Methods       Date:  2018-10-14       Impact factor: 2.363

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.