| Literature DB >> 28630177 |
Gilad D Evrony1,2,3, Dwight R Cordero4, Jun Shen5,6, Jennifer N Partlow1,2,3, Timothy W Yu1,2,3, Rachel E Rodin1,2,3, R Sean Hill1,2,3, Michael E Coulter1,2,3, Anh-Thu N Lam1,2,3, Divya Jayaraman1,2,3, Dianne Gerrelli7, Diana G Diaz7, Chloe Santos7, Victoria Morrison7, Antonella Galli8, Ulrich Tschulena9, Stefan Wiemann9, M Jocelyne Martel10, Betty Spooner11, Steven C Ryu1,2,3, Princess C Elhosary1,2,3, Jillian M Richardson1,2,3, Danielle Tierney1,2,3, Christopher A Robinson12, Rajni Chibbar12, Dana Diudea12, Rebecca Folkerth5, Sheldon Wiebe13, A James Barkovich14, Ganeshwaran H Mochida1,2,3,15, James Irvine11,16, Edmond G Lemire17, Patricia Blakley17, Christopher A Walsh1,2,3.
Abstract
While next-generation sequencing has accelerated the discovery of human disease genes, progress has been largely limited to the "low hanging fruit" of mutations with obvious exonic coding or canonical splice site impact. In contrast, the lack of high-throughput, unbiased approaches for functional assessment of most noncoding variants has bottlenecked gene discovery. We report the integration of transcriptome sequencing (RNA-seq), which surveys all mRNAs to reveal functional impacts of variants at the transcription level, into the gene discovery framework for a unique human disease, microcephaly-micromelia syndrome (MMS). MMS is an autosomal recessive condition described thus far in only a single First Nations population and causes intrauterine growth restriction, severe microcephaly, craniofacial anomalies, skeletal dysplasia, and neonatal lethality. Linkage analysis of affected families, including a very large pedigree, identified a single locus on Chromosome 21 linked to the disease (LOD > 9). Comprehensive genome sequencing did not reveal any pathogenic coding or canonical splicing mutations within the linkage region but identified several nonconserved noncoding variants. RNA-seq analysis detected aberrant splicing in DONSON due to one of these noncoding variants, showing a causative role for DONSON disruption in MMS. We show that DONSON is expressed in progenitor cells of embryonic human brain and other proliferating tissues, is co-expressed with components of the DNA replication machinery, and that Donson is essential for early embryonic development in mice as well, suggesting an essential conserved role for DONSON in the cell cycle. Our results demonstrate the utility of integrating transcriptomics into the study of human genetic disease when DNA sequencing alone is not sufficient to reveal the underlying pathogenic mutation.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28630177 PMCID: PMC5538549 DOI: 10.1101/gr.219899.116
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Microcephaly-micromelia syndrome phenotype and pedigrees. (A) Photographs of an affected individual (12601) illustrating the severe microcephaly, facial dysmorphisms, and limb anomalies characteristic of microcephaly-micromelia syndrome. (B) Photograph of a foot (individual 15204) and photograph and X-ray of a hand (individual 12601) showing both pre-axial (malformed toe and absent thumb) and post-axial (underdeveloped fifth metatarsal bone, and hypoplastic fifth digit arising from bifid fourth metacarpal bone) abnormalities. (C) Brain MRIs of two affected individuals showing the common structural brain abnormalities of MMS: profound microcephaly, simplified gyral pattern, markedly diminished white matter volume and myelination, hypoplastic or absent corpus callosum, aqueductal stenosis, and a small pons. Note the large dorsal interhemispheric cysts in both individuals, which was present in nearly every affected individual examined by MRI or autopsy to date. Cortical thickness is grossly normal and the cerebellar hemispheres are relatively large compared to the rest of the brain. Head circumferences (HC) and z-scores (number of standard deviations [SD] from the mean of newborns of the same gestational age at birth) are shown. MRI sequences were as follows: 12601: axial T2 (top left), mid-sagittal T1 (top right), left-sagittal T1 (bottom right), coronal T2-FIESTA (bottom left); 15204: axial T2 (top), coronal T2-HASTE (bottom). White scale bars = 1 cm. (D) Brain histology of MMS cases. (Left) Low-power Nissl-stained brain section of a child who died at 3 mo of age showing simplified gyral pattern and reduced white matter (CG) cingulate gyrus, (CC) corpus callosum, (LV) lateral ventricle, (Cd) caudate, (P) putamen, (GP) globus pallidus, (OT) optic tract). (Top right) Cresyl violet-stained brain section of a 35-wk-gestation newborn at the angle of the lateral ventricle (LV) showing decreased cells in the subventricular zone (SVZ; arrow). Bar = 100 μm. (Middle right) Hematoxylin- and eosin-stained section of the cerebral cortex of a 41-wk-gestation newborn demonstrating disorganized clusters of neurons (arrows) separated by cell-free zones in superficial layers. Bar = 100 μm. (Bottom right) Cresyl violet-stained section from a full-term newborn cerebral cortex demonstrating the persistence of radial columns of neurons separated by cell-sparse regions. Bar = 500 μm. (E) Pedigrees of the families with MMS profiled in this study. Individual IDs are labeled for individuals whose samples were profiled. The pedigree at the top left can be linked via individuals VII:7 and VIII:2 to the larger pedigree in the original description of the syndrome by Ives and Houston (Ives and Houston 1980). Gray symbol (top right pedigree) represents a child that died in infancy with limb anomalies, but the specific diagnosis of MMS was not confirmed. Deceased status is indicated with crossed-out symbols for affected individuals only and not for unaffected individuals. For simplicity, not all individuals of the pedigrees are illustrated. See Supplemental Data 1 for a list of all case samples in this study.
Figure 2.Linkage and homozygosity analysis identifies a locus on Chromosome 21q22.11. (A) Linkage analysis using SNP-microarray genotypes of affected individuals and unaffected parents (see Supplemental Data 1 for list of genotyped individuals) identified a locus associated with the disease at Chromosome 21q22.11 with maximum LOD score of 9.2 (purple star; interval: Chr 21: 33,344,469–34,196,070; GRCh38/hg38). (B) SNP-microarray genotypes in the interval defined by linkage analysis (Fig. 2A). Each line represents an individual (unaffected parents on top and affected individuals labeled in bold on bottom). Each column in the SNP ideogram represents a SNP, with homozygous alleles in red or blue and heterozygous alleles in green. Affected individuals 421 and 12601 define a minimal region of overlapping homozygosity (ROH) at 21q22.11 (blue box and line; Chr 21: 33,364,965–34,029,433; GRCh38/hg38). RefSeq gene annotations are shown above. Low quality SNP calls are omitted. Note: samples 418 and 419 are shown here but these were not used for linkage analysis since they did not pass quality control filters (see Supplemental Methods).
Variants identified in the MMS region of homozygosity by genomic DNA sequencing
Figure 3.RNA-seq identifies an intron-retention splicing defect associated with an intronic variant in DONSON. (A) RNA-seq read coverage for representative samples shows aberrant retention of intron 6 of DONSON (red bar) in affected individuals associated with the c.1047-9A>G noncoding mutation (Chr 21: g.33582064:T>C; red asterisk). The interval shown is Chr 21: 33,580,994–33,583,594 (GRCh38/hg38), illustrated in the reverse strand direction. The genotype of each sample is shown on the right. Homozygous MMS, heterozygous parent, and wild-type control sample names are colored red, purple, and black, respectively. Read coverage graph Y-axes are scaled (numbers on right side of Y-axis) to show the maximum coverage of each sample in the interval. (B) RNA-seq quantification of intron retention for each intron of DONSON, based upon pooling all the RNA-seq samples of each genotype (see Supplemental Methods for details). Cells are shaded green to red according to the percentile between the minimum and maximum values in the table. The aberrant retention of intron 6 is highlighted. The table also shows the mild increase in intron 6 retention in heterozygous parents relative to wild-type controls and the baseline low-level retention of intron 6 in controls relative to other introns. MMS individuals also showed a trend of increased retention of other introns upstream of intron 6, suggesting that impaired splicing of intron 6 might affect splicing of other introns; however, the mechanism by which this would occur is unclear. (C) RNA-seq quantification of intron 6 retention calculated as in Figure 3B. Error bars are 95% confidence intervals (see Supplemental Methods). All group comparisons were significant: controls versus parents: P = 0.03; controls versus affected: P < 10−15; parents versus affected: P < 10−15 (Fisher's exact test with Holm multiple comparisons adjustment). (D) Schematic of the intron retention splicing defect caused by the c.1047-9A>G (Chr 21: 33582064 T>C) mutation in intron 6 in microcephaly-micromelia syndrome. Retention of intron 6 would lead to either nonsense-mediated decay of the transcript due to the stop codon within intron 6 or to a truncated protein. On the right are the predicted wild-type and truncated mutant proteins and their amino acid (AA) lengths. Translation of the first part of the aberrantly retained intron 6 creates 17 amino acids followed by a premature stop codon. (E) RT-PCR spanning from exon 6 to exon 7 of DONSON (top schematic) in various tissues confirms increased retention of intron 6 in MMS samples, which are homozygous for the Chr 21: 33582064 T>C mutation, compared to heterozygous parents and wild-type controls (unspliced transcript with intron 6: 230 bp; spliced transcript: 121 bp). Shown here are six representative samples for each genotype. (Note that the variant is A>G in the DONSON transcript strand and T>C in the genomic plus strand). See Supplemental Figure 2A for RT-PCR gel images of all assayed samples. The exon 7 PCR primer contains a FAM fluorescent label (green circle) for quantification of PCR products (Fig. 3E). Wild-type (T/T) samples: 1- FU-009 umbilical cord; 2- FU-006 umbilical cord; 3- FU-004 umbilical cord; 4- fetal liver; 5- fetal brain; 6- cerebellum. Heterozygous (T/C) parent samples: 1- 15603 cell line; 2- 15602 cell line; 3- 15202 cell line; 4- 15201 cell line; 5- 15202 blood sample a; 6- 15202 blood sample b. Homozygous (C/C) MMS samples: 1- 15204 brain (RNAlater); 2- 15204 brain (fresh-frozen sample a); 3- 15204 brain (fresh-frozen sample b); 4- 15204 heart (RNAlater); 5- 15204 heart (fresh-frozen); 6- 15204 kidney (RNAlater). (F) Quantification of the RT-PCR intron 6 retention assay products. RT-PCR was performed with the PCR primer for exon 7 containing a FAM-fluorescent label allowing quantification of the RT-PCR products with a capillary electrophoresis DNA analyzer (see Methods). Percentage unspliced RT-PCR product was calculated as [Area of unspliced band]/[Area of unspliced band + Area of spliced band], and averaged across all samples of each genotype (number of samples in each group is shown on top). Groups were significantly different from each other (Controls versus Parents, P = 0.005; Controls vs. Affected, P < 10−22; Parents vs. Affected, P < 10−7; two-tailed unpaired t-test). Importantly, note that this measurement can be used to evaluate relative splicing differences between genotypes but is not an absolute measurement of splicing, since the PCR amplification efficiencies of the unspliced and spliced products differ. See Supplemental Figure 2A for percentage unspliced RT-PCR product of all assayed samples and Supplemental Figure 2B for percentage unspliced RT-PCR product summarized by tissue type for brain, umbilical cord, and cell lines/blood leukocytes.
Figure 4.DONSON expression in human and mouse brain development. (A) DONSON expression by in situ hybridization in a coronal section of a 9-wk-gestation human fetal brain. Expression is prominent in the neocortex subventricular zone, which contains progenitor cells, and in the cortical plate, where newly born neurons reside. Expression is also seen in the ganglionic eminences, which give rise to the basal ganglia and interneurons that migrate into the neocortex. Scale bar for main image (left) is 500 µm; scale bars for other images are 100 µm. Hybridization was performed with antisense probe 1 (see Supplemental Methods). (Ctx) Neocortex, (GE) ganglionic eminence, (v) ventricular zone, (sv) subventricular zone, (i) intermediate zone, (c) cortical plate, (m) marginal zone. (B) DONSON expression by in situ hybridization in a sagittal section of a 12-wk-gestation human fetal brain. Expression is prominent in the basal ganglia (BG) and the ventricular and subventricular zones and cortical plate of the neocortex (Ctx), mesencephalon (M, midbrain), and rhombencephalon (R, hindbrain). Scale bar for main image (left) is 1000 µm; scale bars for other images are 100 µm. Hybridization was performed with antisense probe 1. (C) DONSON expression by in situ hybridization in a coronal section of a hemisphere of a 20-wk-gestation human fetal brain. Expression is evident in the ventricular and subventricular zones, intermediate zone, and cortical plate of the neocortex (Ctx). Scale bar for main image (left) is 1000 µm; scale bar for other image is 100 µm. Hybridization was performed with antisense probe 2. (D) DONSON expression by in situ hybridization in a coronal section of an E15.5 mouse brain. Expression is evident in the ventricular (v) and subventricular zones (sv), intermediate zone (i), and cortical plate (c) of the neocortex. Scale bar for top and bottom images are 500 and 100 µm, respectively. Hybridization was performed with human antisense probe 3. For all above tissue sections, negligible signal was observed with sense sequence probes in adjacent sections (Supplemental Fig. 3D), confirming specificity of the antisense probe staining.
Donson loss of function is lethal in early embryonic mouse development