Literature DB >> 32552793

Analysis of transcript-deleterious variants in Mendelian disorders: implications for RNA-based diagnostics.

Sateesh Maddirevula1, Hiroyuki Kuwahara2, Nour Ewida1, Hanan E Shamseldin1, Nisha Patel1, Fatema Alzahrani1, Tarfa AlSheddi1, Eman AlObeid1, Mona Alenazi1, Hessa S Alsaif1, Maha Alqahtani1, Maha AlAli1, Hatoon Al Ali1, Rana Helaby1, Niema Ibrahim1, Firdous Abdulwahab1, Mais Hashem1, Nadine Hanna3, Dorota Monies1, Nada Derar4, Afaf Alsagheir5, Amal Alhashem6,7, Badr Alsaleem8, Hamoud Alhebbi6, Sami Wali6, Ramzan Umarov2, Xin Gao9, Fowzan S Alkuraya10,11,12.   

Abstract

BACKGROUND: At least 50% of patients with suspected Mendelian disorders remain undiagnosed after whole-exome sequencing (WES), and the extent to which non-coding variants that are not captured by WES contribute to this fraction is unclear. Whole transcriptome sequencing is a promising supplement to WES, although empirical data on the contribution of RNA analysis to the diagnosis of Mendelian diseases on a large scale are scarce.
RESULTS: Here, we describe our experience with transcript-deleterious variants (TDVs) based on a cohort of 5647 families with suspected Mendelian diseases. We first interrogate all families for which the respective Mendelian phenotype could be mapped to a single locus to obtain an unbiased estimate of the contribution of TDVs at 18.9%. We examine the entire cohort and find that TDVs account for 15% of all "solved" cases. We compare the results of RT-PCR to in silico prediction. Definitive results from RT-PCR are obtained from blood-derived RNA for the overwhelming majority of variants (84.1%), and only a small minority (2.6%) fail analysis on all available RNA sources (blood-, skin fibroblast-, and urine renal epithelial cells-derived), which has important implications for the clinical application of RNA-seq. We also show that RNA analysis can establish the diagnosis in 13.5% of 155 patients who had received "negative" clinical WES reports. Finally, our data suggest a role for TDVs in modulating penetrance even in otherwise highly penetrant Mendelian disorders.
CONCLUSIONS: Our results provide much needed empirical data for the impending implementation of diagnostic RNA-seq in conjunction with genome sequencing.

Entities:  

Keywords:  Mapping; Mendelian; Negative WES; RNA-based diagnostics; Transcriptomics

Mesh:

Year:  2020        PMID: 32552793      PMCID: PMC7298854          DOI: 10.1186/s13059-020-02053-9

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Introduction

Genome sequencing, enabled by the advent of next-generation sequencing (NGS) technologies, has changed the landscape of diagnostics in the Mendelian diseases space [1]. Whole-exome sequencing (WES) is the most popular NGS diagnostic application and has achieved a diagnostic rate of 25–52% across the spectrum of Mendelian disorders, although higher figures have been reported for certain phenotypic categories [2-5]. The minimal boost of diagnostic yield offered by whole-genome sequencing (WGS) over WES suggests that the bottleneck is not in the capture/calling of the causal variants in the sequencing stage but rather in their interpretation [6, 7]. This notion is supported by studies showing the value of careful reinterpretation of “negative” WES and how misinterpreting the causal variants in WES is a major challenge that cannot be circumvented by WGS [7, 8]. Therefore, there is a growing interest in exploring transcriptomics to improve variant interpretation [9]. Indeed, published data suggest an enrichment of “negative” WES cases for cryptic splice-altering variants that are not easily predicted in silico [10, 11]. Coding genomic variants modulate phenotypes through their effect on proteins while non-coding variants (NCV) mediate their effects through RNA either directly (transcript-level) or indirectly (chromatin-level). In the context of Mendelian diseases, estimates vary widely on the contribution of variants that affect splicing to the overall mutation pool (15–60% of disease-causing variants) [12]. Two major challenges preclude accurate estimation of this important class of disease-causing mutations. First, many “coding” variants that are presumed to exert their pathogenicity at the protein level are in fact splicing variants whose effect on splicing was never empirically determined. These not only include single base-pair substitutions that may or may not alter the amino acid sequence (nonsynonymous and synonymous missense), but also include protein-truncating variants [13]. Another major challenge is the clear reporting bias in the literature where variants that impact consensus splicing codes are more likely to be tested and reported. Deep intronic, UTR and promoter/enhancer variants are far less likely to be uncovered by conventional Sanger or WES and, even when captured by WGS, are very difficult to interpret using in silico tools despite their clear contribution to Mendelian diseases [14-17]. Transcriptomics, therefore, holds a promising role in delineating Mendelian phenotypes that are caused by variants that are deleterious at the transcript level [18]. These include variants that reduce the abundance of the transcript, e.g., nonsense-mediated decay (NMD), as well as those that create aberrant splicing. Early experience with RNA-Seq (massively parallel sequencing of RNA) suggests its potential to reveal variants that have been missed at the sequencing stage as well as those that have been missed at the interpretation stage [10, 11, 19–21]. It is also clear from these studies, however, that there are unique computational challenges to this technology, and although several computational tools have been developed, there is a growing need for a deeper understanding of the nature of transcript-deleterious variants to inform better tools. We have previously shown in a pilot study the power of positional mapping as a tool that is agnostic to the underlying class of mutation to provide unbiased estimate of NCVs [8]. In this study, we provide based on comprehensive positional mapping of 5647 families with suspected Mendelian phenotypes a detailed overview of transcript-level deleterious variants and their contribution to Mendelian phenotypes in humans. We then interrogate the translational potential of that knowledge by exploring the role of RNA-based approaches in patients with “negative” clinical WES results.

Materials and methods

Human subjects

Subjects described in this study represent combined cohorts recruited under individual IRB-approved research protocols (KFSHRC RAC# 2121053, 2080006 and 2070023). In each of these protocols, we selectively recruited individuals with at least one of the following features: (a) positive family history consistent with a Mendelian inheritance of the disorder and (b) phenotypic presentation consistent with a previously published Mendelian disease. Informed consent was obtained from all subjects prior to their enrollment. Phenotypic data were collected from all subjects. Blood was collected in EDTA tubes for DNA extraction and in sodium heparin tubes for the establishment of lymphoblastoid cell lines (LCL). Occasionally, blood collected in PAXGene tubes was the only source of RNA. In a subset of cases, cultured skin-derived fibroblasts and urine-derived renal epithelial cells were also obtained as an additional source of RNA.

Positional mapping, WES, and variant identification

The method of combining positional mapping and variant identification using WES has been described elsewhere [1, 22]. Briefly, all samples were genotyped on an Axiom SNP platform, and the regions of homozygosity (ROH) were determined to guide the search for the likely causal variant whenever the phenotype and family history are compatible with autosomal recessive inheritance. WES was performed as described before, and the resulting variants were filtered by the autozygome coordinates [3, 23]. Variants were filtered using gnomAD and a local population database (2379 exomes) for allele frequency of < 0.001 and were interpreted by following the ACMG guidelines [24] to determine the likely causal variants. Although protein-truncating variants may exert their pathogenic effect at the level of the final transcript via NMD, we have chosen to exclude them because it is very difficult to disentangle their effect on protein from that on RNA. Variants were highlighted as candidate transcript-deleterious variants (TDVs) if they were compatible with pathogenicity potential in terms of frequency and segregation, and involved one of the following six categories: (a) canonical splice donor or acceptor sites (the first and last 2 bp of each intron), (b) the first or last base pair of an exon, (c) non-canonical splice site intronic variants, i.e., other than the first and last 2 bp of an intron, (d) coding exons other than the first or last base pair (regardless of whether the resulting missense is synonymous or nonsynonymous), (e) UTR (5′ and 3′), and (f) promoter/enhancer elements. Variants in categories c, d, e, and f were only considered if no alternate candidate variants were identified. A small subset of cases for which no candidate variants were identified, were subjected to RNA-Seq (see below).

RTPCR

Variants suspected to be deleterious at the transcript level were interrogated by RTPCR using cDNA-specific primers and RNA from blood (LCL or PAXgene) and/or skin fibroblasts. When the index who is homozygous for variant was unavailable, we attempted to test the obligate heterozygous parents. RTPCR followed a standard number of 35 cycles and 2000 ng of RNA as a template. If this standard protocol resulted in a visible band on a gel, the gene was considered “expressed.” If additional cycles or higher amount of RNA were needed, the gene was considered “poorly expressed,” otherwise, the gene was labeled as “not expressed.” The products were analyzed by Sanger sequencing directly and if there was evidence of multiple products, cloning was pursued followed by Sanger sequencing. In cases where no evidence of aberrant splicing was identified, we attempted quantifying the transcript using q-RTPCR.

RNA-Seq and computational analysis

RNA samples of the subjects were prepared at KFSHRC and sent to the KAUST core lab for RNA sequencing. The quality of each RNA sample was determined based on its RNA Integrity Number (RIN) using Agilent 2100 BioAnalyzer. Those samples that scored RIN < 6.0 were not considered further. The sequencing libraries were prepared using Illumina TruSeq Stranded mRNA. Paired-end 150 bp reads were generated on Illumina NovaSeq6000. GTEx RNA-Seq samples [25] for blood and skin tissue types were downloaded from the Database of Genotypes and Phenotypes (dbGaP) and transformed into the fastq format using SRA Toolkit (https://www.ncbi.nlm.nih.gov/sra/docs/toolkitsoft/). Samples with RIN < 8.0 were not included in our GTEx controls. RNA-Seq reads from both patients and GTEx were also aligned to hg38 (GENCODE 25) using STAR 2.6 [26] with the two-pass option. Only reads mapped to chromosomes 1–22 and X were considered. SAMtools [27] and BEDTools [28] were applied to the BAM files to quantify the occurrence of annotated and unannotated splicing junctions, as well as to count nonsplit reads mapped to intronic regions. Splicing junctions with < 5 read supports were filtered out. To quantify the transcript abundance levels, RNA-Seq reads were also mapped to the reference transcript sequences for hg38 (GENCODE 25) using Kallisto [29]. Using the generated BAM files and transcript abundance levels, error-free normal transcript abundance levels were estimated with the omega quantification [30]. Briefly, omega computes an adjusted count per million (CPM) value for each coding gene g, ω, as follows: where T is the set of mRNA transcripts for gene g, w is the rate to express annotated, normal transcript t based on the RNA splicing data, and x is the CPM level of transcript t. Thus, low values of ω can indicate low abundance outliers or splicing outliers that escaped NMD. From the GTEx data, RNA-Seq datasets of the "Cells - EBV-transformed lymphocytes" and "Cells - Cultured fibroblasts" tissue types were selected as the control for cases derived from blood and skin tissue types, respectively. To ensure the use of an appropriate set of samples in control for each patient, we measured the median of the ω values of each coding gene for all the blood and skin tissue types in the GTEx datasets and confirmed that the selected tissue type gave the highest level of correlation with the patient data. Based on the second percentile of the ω values in the corresponding control, two scores, α and β, were measured to analyze the severity of transcriptional aberrations in gene g for each patient. Let ω (i) and ω (k) represent the value of ω for patient i and for the second percentile value of the corresponding control k, respectively. Then, score α(i) was computed as follows: where ε is a small factor set to 0.001 to avoid division by zero. The α score measures the significance of genes as the low abundance outliers or splicing outliers. The other score, β, was derived by first computing the fraction of normal transcripts, ρ which is defined to be the ratio of ω to Given this definition, score β(i) for patient i can be expressed as: Thus, the β score measures the significance of likeliness to express transcripts with splicing error. A high alpha score means that the abundance level of normal transcripts of a given gene is lower compared with a lower-end abundance of the same gene in the control set. Similarly, a high beta score means that the fraction of the normal transcripts of a given gene is lower compared with a lower end of the same gene in the control set. With these scores, each coding gene g was selected as a causative candidate for each patient i if all of the following criteria are met: Either α(i) ≥ 3.0 or β(i) ≥ 3.0. For all the other patients j with RNA samples being the same cell type, α(i) < α(j). Note that criterion 2 is based on 11 RNA-Seq datasets from RNA samples with RIN > 8.5 (4 from fibroblasts and 7 from LCL) and that this criterion was set specifically for comparison based on a small number of patients. To visualize splicing events, BAM files were first converted into the hg19 coordinate using CrossMap [31] and Integrative Genomic Viewer [32] was used. We have also attempted to compare our method to previously published methods as explained in Additional file 1: Supplemental file 1.

Results

Quantifying the contribution of transcript-deleterious variants

Our cohort included 5647 families with suspected Mendelian phenotypes (Fig. 1). The vast majority (94% and 91%) are consanguineous and multiplex, i.e., > 1 affected member, respectively. A likely causal variant was identified in 2438 of these families (n = 1807 non-redundant variants), 272 (15%) of which represent TDVs (TDVs are listed in Additional file 2: Table S1, and their population frequencies are summarized in Additional file 3: Table S2). One limitation of this estimate is the potential for bias against the identification of more challenging classes of transcript-deleterious variants. Therefore, we decided to exploit the agnostic nature of positional mapping to derive unbiased estimate of transcript-deleterious variants. We singled out all families in which we were able to map their recessive Mendelian phenotype to a single locus (n = 157) since these lend themselves more readily to focused and thorough investigations to reveal the underlying variant including the most challenging ones. Indeed, each of these loci was thoroughly interrogated and this resulted in the identification of a likely causal variant (n = 148, two variants were observed in four cases) in 95.5% of cases (150 out of 157, Fig. 2 and Additional file 4: Table S3). The breakdown of variant classes within these loci shows that TDVs accounted for 18.9% of variants (28 out of 148), which suggests that the above figure of 15% may indeed represent an underestimate based on bias against more challenging transcript-deleterious variants. Interestingly, only 2% of the 18.9% are variants not expected to be captured by WES (> 50 bp from the nearest exon), which suggests that, at least in the case of recessive phenotypes in consanguineous families, the overwhelming majority of causal variants are captured by WES pipelines.
Fig. 1

A flow chart of the entire study with its different components

Fig. 2

Unbiased estimate of the contribution of TDVs to recessive Mendelian mutations based on 157 families that map to a single locus each

A flow chart of the entire study with its different components Unbiased estimate of the contribution of TDVs to recessive Mendelian mutations based on 157 families that map to a single locus each

RNA as a tool to solve “negative” WES cases

In order to investigate the contribution of RNA analysis to solving “negative” WES cases, we recruited 155 cases for which clinical WES did not reveal a likely causal variant (Fig. 3, Table 1 and Additional file 5: Table S4). A likely causal variant was subsequently identified in 60.6% (88 unique variants in 94 out of 155 cases). Additional file 5: Table S4 shows that many of these cases harbored a likely deleterious variant in a gene that was novel at the time of clinical reporting, i.e., cases unlikely to have benefited from RNA analysis. TDVs accounted for 22.7% of all identified variants (20 out 88 unique variants). As expected, class (a) variants (those affecting the canonical splice sites) were under-represented (21% vs. 64% in the original cohort, see below) since these would have been readily flagged at the time of reporting. On the other hand, the more challenging classes were over-represented (79% vs. 36% in the original cohort, see below). These include a very deep (+ 335) variant in ABCB4 causing cholestatic disease in all available affected members of an extended family (see Additional file 6: Figure S1). Thus, the hypothetical diagnostic yield of RNA-Seq in the setting of a “negative” WES is 13.5% (21 out 155 cases), at least in the setting of recessive phenotypes. To test this empirically, we set out to investigate the six cases (no RNA was available from the seventh case) whose autosomal recessive Mendelian phenotypes map to single loci, and have “negative” WES, using RNASeq. First, we aimed to establish the sensitivity of our RNA-Seq pipeline and comparing it to previously published pipelines by testing five cases with established transcript-deleterious variants and found that 100% were correctly called, i.e., the mutated gene was chosen among the top or only candidate gene for each of the five cases (Additional file 6: Figure S2 and Additional file 7: Table S5) as follows:
Fig. 3

The results of our reanalysis of “WES-negative” cases to estimate the hypothetical yield of RNA-Seq in this clinical setting

Table 1

Summary of the negative clinical WES cases and genetic findings. For a full list of cases including negative and previously published, please refer to Additional file 5: Table S4. The symbol “a” indicates a novel gene (no assigned OMIM phenotype) for condition to be reported elsewhere

IDPhenotypeGeneMutation HGVS nomenclatureZygosityType of mutationClass of transcript-deleterious variantsNotes
17DG0527Global developmental delay, hypotonia, epilepsy, postnatal microcephaly, strabismus and choreoathetosisaHomozygousFrameshift indelNovel gene for this condition
15DG1507Epilepsy and global developmental delayaHomozygousTranscript-deleterious variantcNovel gene for this condition
18DG0320Multiple congenital anomaliesaHomozygousMissenseNovel gene for this condition
18DG0989Neonatal adrenoleukodystrphyaHomozygousMissenseNovel gene for this condition
19DG0509Undefined epileptic encephalopathyaHomozygousNonsenseNovel gene for this condition
18DG0669Microcephaly, atrial septal defect, ventricular septal defectaHomozygousTranscript-deleterious variantaNovel gene for this condition
17DG0738Chronic interstitial kidney disease with small kidneysaHomozygousNonsenseNovel gene for this condition
19DG0230Griscelli syndromeaHomozygousMissenseUnpublished
17DG0872Hydrocephalus, agenesis of corpus callosum, macrocephalyaHomozygousTranscript-deleterious variantbNovel gene for this condition
17DG1071Cholestasis, progressive familial intrahepatic 3ABCB4NM_000443.3:c.286+335A>G:p.(Val96Glyfs*11)HomozygousTranscript-deleterious variantcUnpublished
16DG0145Intellectual disabilityADAT3NM_138422.1:c.382G>A:p. (Val128Met)HomozygousMissenseUnpublished
16DG1223Oligohydramios, short long bones and echogenic kidneysANKS3NM_133450:c.352G>A:p.(Ala118Thr)HomozygousMissensePMID:27417436
18DG0295Joubert syndromeARL3NM_004311.3:c.445C>T:p.(Arg149Cys)HomozygousMissensePMID: 30269812
15DG2104Joubert SyndromeARMC9NM_025139.3:c.51+5G>T:p.?, r.1_51delHomozygousTranscript-deleterious variantcPMID: 27431290
15DG2485Asparagine synthetase deficiencyASNSNM_133436.2:c.28A>C:p.(Ser10Arg)HomozygousMissensePMID:30214071
15DG0357Bardet-Biedl syndromeBBS1NM_024649.4:r. [1232_3423del]HomozygousLarge deletionPMID: 27894351
16DG1620OsteopetrosisCLCN7NM_001287.5:c.739-18G>A;(p.Met250Argfs*6)HomozygousTranscript-deleterious variantcPMID: 29620724
19DG1262Multiple congenital anomaliesCOG6NM_020751.2:c.695-8T>GHomozygousTranscript-deleterious variantcUnpublished
PSMMC0118Short stature on growth hormone replacement, subclinical hypothyroidism, grade 1 hydronephrosis (Lt), delayed bone age, IVF pregnancy, first of a twin, delayed developmentCREBRFNM_001168393.2:c.475delT;p.(Ser159Hisfs*57)HeterozygousFrameshift indelPMID: 31130284
17DG0967Cholestasis with high GGT and renal failureDCDC2NM_001195610.1:c.223_293del:p.(Arg75Leufs*16)HomozygousFrameshift indelUnpublished
17DG0996CholestasisDCDC2NM_001195610.1:c.223_293del:p.(Arg75Leufs*16)HomozygousFrameshift indelUnpublished
13DG2237Warsaw breakage syndromeDDX11NM_004399.2: c.2426T>G:p. (Val809Gly)HomozygousMissensePMID: 30214071
17DG0022Chronic unexplained diarrheaDGAT1NM_012079.5:c.836T>C:p.(Leu279Pro)HomozygousMissenseUnpublished
16DG0357AcromesomeliaDIP2CNM_014974.2:c.3283C>T:p.(Arg1095Trp)HomozygousMissensePMID:29620724
17DG0756Congenital disorder of glycosylationFUT8NM_178155.2:c.943C>T:p.(Arg315*)HomozygousNonsensePMID: 30237576
16DG0733Severe progressive microcephaly, global developmental delay and epilepsyGPR56NM_005682.5:c.1503C>A;p.(Tyr501*)HomozygousNonsensePMID: 27431290
PSMMC0115Unexplained macrocephaly, epilepsy, short stature and developmental delayKCND1NM_004979.6:c.1883G>A:p.(Arg628Lys)HemizygousMissenseUnpublished
15DG2234Microcephaly, cerebral white matter abnormality and intellectual disabilityKCTD3NM_016121.3 c.1036_1073del:p.(P346Tfs*4)HomozygousTranscript-deleterious variantdUnpublished
13DG2107Psychomotor retardation and seizuresKCTD3NM_016121.3 c.1036_1073del:p.(P346Tfs*4)HomozygousTranscript-deleterious variantdPMID:25558065
17DG0404High GGT neonatal cholestasis/sclerosing cholangitisKIF12NM_138424.1:c.610G>A:p.(Val204Met)HomozygousMissensePMID: 30250217
18DG0966Methylmalonic aciduria and homocystinuriaLMBRD1NM_018368.4:c.1156C>T:p.(Arg386*)HomozygousNonsenseUnpublished
16DG0559Joubert syndromeLRRC34NM_001172779:c.199A>T:p.(Lys67*)HomozygousNonsenseIn press
17DG0731Disseminated tuberculosis, hypogammaglobulinemia, nearly all T and B cells are naiveMAP3K14/NIKNM_003954.3:c.916delT: p.(Cys306Valfs*2)HomozygousFrameshift indeldoi.org/10.1016/j.jaci.2018.11.003
15DG2492Short stature, global developmental delay, dysmorphism, congenital heart disease , PUJ obstruction and partial agenesis of corpus callosumMFSD11NM_001242532.1:c.143G>C:p.(Gly48Ala)HomozygousMissensePMID: 28940097
16DG0621Severe neurodevelopmental disorderMICU2NM_152726.3:c.42G>A:p.(Trp14*)HomozygousNonsensePMID: 29053821
17DG1094MegacystisMYH11NM_022844.2:c.1033+1G>AHomozygousTranscript-deleterious variantaPMID: 30237576
12DG2078Klippel-Feil syndrome and myopathyMYO18BNM_032608.5:c.6905C A:p.(Ser2302*)HomozygousNonsensePMID:25748484
18DG0176Microcephaly, developmental delay, visual impairment, hyponatremia, failure to thrive, choreoathetoid movement, seizuresNUP214NM_005085:c.461:p.(Asp154Gly)HomozygousMissensePMID:30758658
16DG1424Diarrhea, failure to thrive, intestinal failure and TPN dependencePERCC1Deletion of regulatory element (chr16:1480850_1483950del)HomozygousTranscript-deleterious variantfUnpublished
18DG0670ErythrokeratodermaPERPNM_022121.4:c.466G>A:p.(Gly156Arg)HomozygousMissensePMID: 31898316
16DG1048Peroxisome biogenesis disorder 12A (Zellweger)PEX19NM_001193644.1:c.161C>T:p. (Ser54Leu)HomozygousMissensePMID: 30561787
13DG0810Congenital MicrocephalyPPFIBP1NM_001198915.1:c.960_961del:p.(Glu320Aspfs*3HomozygousFrameshift indelPMID: 30214071
16DG0201Short stature, brachydactyly, intellectual disability and seizuresPRMT7NM_019023.2:c. 190C>T:P. (Gln64*)HomozygousNonsensePMID: 28940097
15DG2427Syndromic cataractRIC1NM_020829.3: c.3794G>C:p.(Arg1265Pro)HomozygousTranscript-deleterious variantbPMID: 27878435
13DG1181Primary microcephalyRTTNNM_173630.3:c.5746-20A>G:p.1917_1942delHomozygousTranscript-deleterious variantcPMID: 30214071
17DG1005Bardet-Biedl syndromeSCLT1NM_144643.2:c.290+2T>C:p.(Lys79Valfs*4)HomozygousTranscript-deleterious variantaPMID: 30237576
16DG0760Epilepsy, generalized, with febrile seizures plus, type 1SCN1BNM_001037.3:c.355T>G:p.(Tyr119Asp)HomozygousMissensePMID: 28218389
PSMMC0210Hypotonia, global developmental delay, cardiac disease, leukodystophySCN3ANM_001081676.1:c.1485T>G:p.(Ser495Arg)HeterozygousMissenseUnpublished
18DG0278Congenital insensitivity to painSCN9ANM_002977.3:c.2311-14T>GHomozygousTranscript-deleterious variantcUnpublished
14DG0045Renal failure, morbid obesity, intellectual disability, retinitis pigmentosa (sibling of 14DG0047, see Table S5)SDCCAG8NM_006642.2:c.741-152G>A, p.Arg247Serfs*23; NM_006642.2: r.740_741ins741-202_741-1HomozygousTranscript-deleterious variantcIn press
16DG0276Tricho-Hepato-Enteric SyndromeSKIV2LNM_006929.5:c.3561_3581del; p.(Ser1189_Leu1195del)HomozygousNon-frameshift indelUnpublished
16DG0815Tricho-Hepato-Enteric SyndromeSKIV2LNM_006929.5:c.3561_3581del; p.(Ser1189_Leu1195del)HomozygousNon-frameshift indelUnpublished
17DG0977Tricho-Hepato-Enteric SyndromeSKIV2LNM_006929.4:c.3561_3581del,p.(Ser1189_Leu1195del)HomozygousNon-frameshift indelUnpublished
18DG0594Pseudovaginal perineoscrotal hypospadiasSRD5A2NM_000348:c.682G>A:p.(Ala228Thr)HomozygousMissenseUnpublished
17DG0821Congenital adrenal hyperplasia (CAH)STARNM_000349.2:c.201_202del:p.(Tyr68Glnfs*2)HomozygousFrameshift indelUnpublished
18DG0512Osteogenesis imperfecta, type XIVTMEM38BNM_018112.2:c.455_542del;p.(Gly152Alafs*5)HomozygousFrameshift indelUnpublished
16DG0114Muscular dystrophy-dystroglycanopathyTMEM5NM_014254.3:c.686A>G:p.(Tyr229Cys)HomozygousMissenseUnpublished
16DG1117LeukodystrophyTRAK1NM_001042646: c.287-2A>GHomozygousTranscript-deleterious variantaPMID:28940097
16DG0659Muscular dystrophy, limb-girdle, autosomal recessive 18TRAPPC11NM_021942.6:c.464C>T:p.(Ser155Leu)HomozygousMissenseUnpublished
16DG1614Global developmental delay and epilepsyUFC1NM_016406.3:c.317C>T:p. (Thr106Ile)HomozygousMissensePMID: 29868776
16DG0018Osteogenesis imperfectaWNT3ANM_033131.3:c.254G>A:p.(Arg85Gln)HomozygousMissensePMID: 29620724
14DG0613Primary microcephalyYARSNM_003680.3:c.789C>A:p.(Phe263Leu)HomozygousMissensePMID: 28383543/30214071
15DG2661DysmorphismZFATNM_020863.3:c.1199G>A:p.(Arg400Gln)HomozygousMissensePMID: 28640246/28940097
10DG0840 (a case of Troyer syndrome and a class (d) variant in SPG20, see Additional file 2: Table S1): The RNA-Seq-based prediction generated 167 candidates. Among them, SPG20 was ranked 157th on the alpha score and 25th on the beta score. With the autozygome coordinate-based filtering, SPG20 was found to be the only candidate. 11DG0165 (a case of congenital muscular dystrophy and a class (c) variant in POMT2, see Additional file 2: Table S1): The RNA-Seq-based prediction generated 195 candidates. Among them, POMT2 was ranked 2nd on the alpha score and 13th on the beta score. With the autozygome coordinate-based filtering, POMT2 was found to be the top among the 14 final candidates. 15DG2154 (a case of microcephalic primordial dwarfism and a class (c) variant in DONSON, see Additional file 2: Table S1): The RNA-Seq-based prediction generated 324 candidates. Among them, DONSON was ranked 200th on the alpha score and 281st on the beta score. With the autozygome coordinate-based filtering, DONSON was found to be the only candidate. 16DG1048 (a case of peroxisomal disorder and a class (d) variant in PEX19, see Additional file 2: Table S1): The RNA-Seq-based prediction generated 129 candidates. Among them, PEX19 was ranked 17th on the alpha score and 120th on the beta score. With the autozygome coordinate-based filtering, PEX19 was found to be the only candidate. 16DG1620 (a case of osteopetrosis and a class (c) variant in CLCN7, see Additional file 2: Table S1): The RNA-Seq-based prediction generated 112 candidates. Among them, CLCN7 was ranked 79th on the alpha score and 42nd on the beta score. With the autozygome coordinate-based filtering, CLCN7 was found to be the top of the three remaining candidates. The results of our reanalysis of “WES-negative” cases to estimate the hypothetical yield of RNA-Seq in this clinical setting Summary of the negative clinical WES cases and genetic findings. For a full list of cases including negative and previously published, please refer to Additional file 5: Table S4. The symbol “a” indicates a novel gene (no assigned OMIM phenotype) for condition to be reported elsewhere To test this empirically, we set out to investigate six cases whose autosomal recessive Mendelian phenotypes map to single loci with “negative” WES and for whom RNA sources were available. While no likely causal variant was identified in five of these cases, RNA-Seq analysis of blood-derived RNA on patient 15DG2234 (microcephaly, abnormality of the cerebral white matter and intellectual disability) highlighted KCTD3 as the only likely candidate within the candidate autozygome (139 candidates were highlighted prior to the autozygome filter). Indeed, subsequent RTPCR confirmed that this pattern was created by a partial exonic deletion of 38 bps (NM_016121.3:c.1036_1073del:p.(Pro346Thrfs*4)) that was missed by WES and led to the creation of an additional aberrant band in which the involved exon was completely skipped (Additional file 6: Figure S2).

The landscape of transcript-deleterious variants in Mendelian diseases

Additional file 2 Table S1 lists all likely causal TDVs (272 unique variants) identified through our detailed analysis of 5647 families with suspected Mendelian phenotypes. The breakdown of the six classes of TDVs is summarized in Fig. 4 and is described below in detail.
Fig. 4

(Left) Pie chart showing the breakdown of variant types in a large cohort of families with suspected Mendelian disorders. (Right) Pie chart showing the distribution of all identified transcript-deleterious variants identified across the entire cohort. Classes a, b, c, d, e, and f represent the first or last 2 bp of introns, the first or last 1 bp of exons, non-canonical splice site intronic variants, non-canonical splice site exonic variants, UTR (5′ and 3′), and promoter variants, respectively.

Class (a) variants: a total of 175 (representing 64.3% of all TDVs) unique variants involving the canonical intronic splice sites were identified. RTPCR data were available for 93 (4 from literature and 89 from this cohort). Although this class is generally classified as “loss of function,” we note that several resulted in in-frame rather than frameshift indel (Additional file 2: Table S1). More concerning was the finding of canonical splicing variants in established disease genes with no resulting phenotype, i.e., non-penetrance (Additional file 8: Table S6). For example, the variant NM_001172818.1:c.300 + 1G > A in PGM1, was identified in homozygosity in individuals with no phenotype despite its deleterious effect on splicing (confirmed by RTPCR), which explains its high population frequency. Similarly, we have identified an individual with ambiguous genitalia who is homozygous for LRP4 (NM_002334.2:c.796+2T>C) but lacks all features of established LRP4-related syndromes. On the other hand, the finding of ARHGAP31 (NM_020754.2:c.539+1G>A) in asymptomatic individuals despite its deleterious effect on splicing (confirmed by RTPCR) can be attributed to the fact that previously reported mutations in this gene were proposed to be gain-of-function. The SBDS founder variant (NM_016038.2:c.258+2T>C) is also worth highlighting since this is the most commonly reported variant in Schwachman-Diamond syndrome (SDS) and yet we identified it in homozygosity in at least three individuals who lack SDS features. Upon further investigation, we found that this is a leaky splicing variant and that all previously reported SDS patients were compound heterozygous for a more severe truncating variant (Additional file 8: Table S6). Finally, we note the unusual result of normal RTPCR on a patient with Marfan syndrome and a de novo FBN1 (NM_000138.4:c.6872-1G>A) variant, which suggests that the effect of splicing may be tissue-specific (Additional file 8: Table S6). Class (b) variants: RNA was available for 11 of the 13 variants involving the first or last bp of an exon, and in each of these cases an aberrant transcript was observed. This includes a variant in TMX2 in case 19DG2556 with microcephaly and lissencephaly, which represents an independent confirmation of the very recently described TMX2-related disorder [33]. We suggest that this class should be combined with class (a) as canonical splice site variants. This is further supported by the consistently pathogenic prediction these variants received in silico (see below). Class (c) variants: The range of non-canonical splice site intronic variants was remarkable ranging from 3 bp to 649 bp deep in our cohort. Since current capture techniques in WES usually capture < 50 bp of the flanking intronic sequence, we divided class c variants into those amenable for capture by WES, i.e., within 50 bp (n = 65) and those that are not, i.e., more than 50 bp from the nearest exon/intron junction (n = 8). The challenging nature of these variants is amplified when the phenotype is atypical (Additional file 8: Table S6). For instance, the NM_020751.2:c.1167-24A > G and NM_020751.2:c.695-8 T > G variants in COG6 resulted in a phenotype sufficiently different from CDG that it is listed in OMIM as a separate disorder, i.e., Shaheen syndrome [34]. Similarly, we note the surprising finding of NM_182894.2:c.456-6C > G variant in VSX2 causing ectopia lentis rather than the established microphthalmia phenotype, which supports a previously published case report [35]. Perhaps most surprising was the finding of a homozygous NF1 variant (NM_001128147.2:c.586 + 5G>A) in a young child with juvenile myelomonocytic leukemia but the parents did not have any manifestations of neurofibromatosis (Additional file 8: Table S6). In an example of the challenge in proving the pathogenicity of this class of variants, we note that the previously published COL6A2 (NM_001849.3:c.1459-63G>A) variant, which fully segregated with the expected phenotype of Ullrich muscular dystrophy, did not show abnormal RTPCR pattern suggesting the possibility of a tissue-specific splicing effect. Class (d) variants: A total of 6 (2 exonic variants (excluding the first and last bp) were tested by RTPCR and found to be indeed transcript-deleterious. These include 3 that predict silent changes at the protein level. Class (e) variants: Only three UTR (1.1%) variants were identified in the entire cohort (two 3′ UTR and one 5′ UTR mutation), suggesting their rarity, which is further supported by our unbiased analysis of families that map to single loci (Additional file 2: Table S1). Class (f) variants: Only two variants (0.73%) were identified in the promoter or other regulatory regions of genes. The first is a TATA box mutation in UGT1A1 (NM_000463.2:c.-41_-40dupTA) [36]. The second is a deletion (chr16:1480850_1483950del) in a patient with unexplained diarrhea, and this deletion was reported very recently to be the cause of chronic diarrhea secondary to its regulatory effect on PERCC1 [37]. (Left) Pie chart showing the breakdown of variant types in a large cohort of families with suspected Mendelian disorders. (Right) Pie chart showing the distribution of all identified transcript-deleterious variants identified across the entire cohort. Classes a, b, c, d, e, and f represent the first or last 2 bp of introns, the first or last 1 bp of exons, non-canonical splice site intronic variants, non-canonical splice site exonic variants, UTR (5′ and 3′), and promoter variants, respectively.

The role of in silico prediction

We have applied four (SpliceAI, TraP-score, S-CAP-score, and CADD) [38-41] in silico prediction tools to all variants that have been empirically tested for their transcript-deleterious effect in this cohort (n = 169, including 4 from the literature) (Additional file 9: Table S7). To simplify the analysis, we used the default cutoff value suggested in each of these tools to classify variants as “deleterious” or “non-deleterious.” We found that none of these tools achieved > 71% sensitivity in predicting the pathogenic nature of the variants we tested at the RNA level (SpliceAI (65%), TraP-score (63%), S-CAP-score (61%), and CADD (71%) and that at least one of the four tools failed to predict the pathogenicity of 25% of the variants. However, the yield of these tools was widely different between the different classes. Only 8% of class (a) variants compared to 44.8% of the other classes combined (18% for class b, 46% for class c, 33% for class d, 10% for class e, neither of the two class f variants was empirically tested in this study) received inconsistent prediction in silico. In agreement with our suggestion that class (b) variants should be lumped with class (a) (for the purpose of assigning a canonical splicing score on the ACMG classification), we show that in no instance did the four tools disagree on classifying these variants as pathogenic.

In search of tissue-specific aberrant transcripts

In addition to the 169 variants for which patient RNA material was available and tested, we also tested the expression of the genes containing the remaining 103 TDVs, in blood, skin, and urine (renal epithelial cells, see “Materials and methods”) derived RNA, since these are the readily available sources of RNA clinically. Please note that blood-derived RNA was extracted from PAXGene and/or LCL and these are listed separately in Additional file 2: Table S1. We found that 84.1% (195 out of 232) of the tested genes are expressed in the blood-derived RNA, 85.8% (199 out of 232) in fibroblast-derived RNA and 90% (209 out of 232) in the renal epithelial cells-derived RNA. The majority of genes were expressed in all three sources of RNA (75.5%), while only 2.6% (6 out of 232 genes) were not expressed in any of these sources. We were able to detect the aberrant transcript associated with TDVs in controls who lack the respective variant in only 11/169 (6.5%) of those that were empirically tested. In all these instances, the aberrant transcript was much less abundant in controls, and in none of these cases was the aberrant transcript listed in Ensembl or UCSC Genome Browser. In the 12 patients for whom we had both skin- and blood-derived RNA, we found no instance of an aberrant transcript that was solely present in one but not the other whenever the gene was expressed in both (n = 11, Additional file 2: Table S1). However, we did encounter two instances of pathogenic variants that did not reveal aberrant transcripts in blood-derived RNA (FBN1:NM_000138.4:c.6872-1G>A and COL6A2:NM_001849.3:c.1459-63G>A). We conclude that the deleterious effects of these variants may be tissue-specific.

Discussion

RNA has long been exploited to investigate the effect of variants suspected to alter the final transcript. However, unbiased sequencing of all transcripts in an RNA sample (RNA-Seq) was only possible recently thanks to technological advancements. It is not surprising, therefore, that there is much enthusiasm about RNA-Seq as a supplemental test to genome sequencing to diagnose Mendelian conditions, among other indications. Although the effect of noncoding variants with GWAS significance on splicing is increasingly appreciated, the goal of this study was to study variants only in the context of Mendelian diseases since this is the area that stands to benefit most from the current applications of RNA-Seq [10, 19, 42–44]. Unlike the relatively homogeneous DNA, RNA is highly heterogeneous spatially and temporally. In addition, there is marked variability in the abundance of different transcripts even in a given cell. Finally, the effect of pathogenic variants on RNA is far more nuanced than the simple “present” or “absent” that characterizes DNA variants (even mosaic DNA variants are either present or absent in a given cell). These factors make the use of RNA-Seq in clinical diagnostics challenging and highlight the need for empirical data, e.g., mapping splicing variations in clinically accessible tissue, that inform the development of computational tools that unlock the full potential of this technology [45, 46]. This study is an attempt to contribute to the literature on RNA-based diagnosis of Mendelian diseases. The large volume of our cohort (2438 molecularly characterized Mendelian families) spanning 1807 Mendelian genes, and our unique resource of families that map to single loci and thus offer an unbiased window into the breakdown of disease-causing variants in Mendelian diseases, allowed us to draw several conclusions. First, we estimate the contribution of TDVs to be at least 15% of the overall Mendelian mutation pool, although our unbiased estimate based on single locus families suggests a higher contribution of 18.9%. This has important implications because it suggests that RNA-Seq has a great potential in solving Mendelian phenotypes. Unfortunately, it is not possible to compare this hypothetical yield to what has been achieved in the few reported studies since those studies heavily focused on cases that could not be diagnosed by WES or WGS [10, 11], including a recent study involving 94 individuals with undiagnosed rare diseases that suggested a diagnostic rate of 16.7% [19]. This yield is similar to our estimated yield (13.5%) based on extensive positional mapping and RNA analysis of 155 Mendelian cases that could not be diagnosed by WES. Second, our finding of aberrant transcripts not described in databases that are detected in controls, despite their very low frequency, recapitulates the challenge described in previous studies in identifying the signal from noise when interpreting RNA-Seq. We suggest that while greater in magnitude, this challenge is no different in principle from the challenge of identifying the candidate causal variant in WES/WGS, and that filters that improve the signal/noise ratio are even more acutely needed in RNA-Seq. For example, we show in this study that the use of autozygome coordinates drastically reduces the search space in RNA-Seq (up to a factor of 300 in one case). While we acknowledge this filter is not always applicable, it should be pursued even in the absence of clear history of consanguinity since its integration into existing WES/WGS is straightforward as has been shown before [3]. Third, despite the significant investment in the development of in silico prediction tools, these remain far from perfect and our data clearly show that at least 25% of transcript-deleterious variants would be missed by tested tools. This suggests that these tools cannot replace RNA-Seq, which will likely become a standard clinical test for cases with negative WES/WGS. Fourth, and reassuringly, our data also seem to alleviate concerns about access to the relevant tissue since only < 10% of the tested genes were not expressed at all in the three sources of RNA available to us. Whenever the gene was expressed, we were able to demonstrate the effect of splicing in at least one of the two sources of RNA, with only two instances where a clearly pathogenic splicing variant did not result in aberrant transcript in the only tissue available for the respective patient, i.e., blood, despite abundant expression. It should be emphasized here that the overwhelming majority of the tested variants involved brain pathologies in their phenotypic expression. Fifth, we show several instances of abnormal splicing with no resulting phenotype as well as normal splicing with resulting phenotype for well-established disease genes. The apparent non-penetrance in the former scenario could be alternatively explained by a tissue-specific effect, which could also explain the latter scenario. Fortunately, these appear to be the exception; however, they are useful reminders of the expected limitation of RNA-Seq on clinically accessible samples. Sixth, although our study did not specifically aim to compare splicing to other classes of variants, we think that the examples we encountered with respect to the phenotypic expression of homozygous vs compound heterozygous regulatory variants is noteworthy. This phenomenon, first described in the context of thrombocytopenia-absent radius (TAR) syndrome, has only rarely been invoked since, e.g., SNORD118-related cerebral microangiopathy leukoencephalopathy with calcifications and cysts [47], and TXNL4A-related Burn-McKeown syndrome [48]. We have previously shown that a non-canonical splice-site variant in DONSON causes microcephalic primordial dwarfism when inherited in trans with a hypomorphic variant, but results in an embryonically lethal microcephaly-micromelia syndrome when homozygous [49]. Here, we show that homozygosity for the most common disease-causing mutation in SBDS is not sufficient to cause SDS and that its inheritance in trans with a more severe mutation seems necessary. This calls for caution in inferring pathogenicity of a previously reported and confirmed pathogenic variants depending on their zygosity, and we suggest that regulatory and splicing variants may be particularly prone to this phenomenon. In conclusion, we report the largest cohort of Mendelian phenotypes with comprehensive analysis of their underlying transcript-deleterious variants. The lessons learned from this cohort expand our knowledge of this class of variants and provide much needed empirical data for the clinical implementation of RNA-Seq as a promising supplemental tool to genome sequencing. Additional file 1. Supplemental file 1. Causal gene prediction comparison for RNA-Seq data. Additional file 2. Table S1. Full listing of the 272 transcript-deleterious variants identified in our cohort. * indicates genes that required more amplification cycles to detect expression. Additional file 3. Table S2. Frequency of all transcript deleterious variants are reported in the current study. Additional file 4. Table S3. List of families that map to a single locus and the outcome of their investigation. Additional file 5. Table S4. Full list of negative clinical WES cases and the outcome of their analysis by RNA studies and other tools. ‘A’ indicates a novel gene/novel gene for the condition. Additional file 6. Figure S1. Pedigree of a family which is mapped to a single locus and identified a very deep (+ 335) variant in ABCB4 causing cholestatic disease. Figure S2. A) A sashimi plot showing base-level densities of reads mapped to a genomic region containing exons 11 and 12 of KCTD3 transcripts from three samples. The x-axis represents the genomic coordinate in hg19. The y-axis represents per-base read counts, and the range is specified in the upper-left corner of the plot for each sample. Arcs connecting exons represent splice junction reads. The horizontal bar lines on the bottom indicate isoforms (exons as rectangle boxes and introns as line with arrow heads). The distribution in blue shows the sample with aberrant KCTD3 transcript, while the other two distributions are from randomly selected samples of lymphocytes (red from a patient and green from the GTEx cohort). B) A sashimi plot showing base-level densities of reads mapped to a genomic region in the positive control cases. Additional file 7. Table S5. The performance comparison results of our RNA-Seq pipeline. Additional file 8. Table S6. Transcript-deleterious variants with unusual phenotypic consequences. Additional file 9. Table S7. In silico (using SpliceAI, TraP, S-CAP, and CADD) predictions for all transcript-deleterious variants in this cohort that were empirically tested. Additional file 10. Review history.
  47 in total

1.  Genetic testing: The diagnostic power of RNA-seq.

Authors:  Katharine H Wrighton
Journal:  Nat Rev Genet       Date:  2017-05-08       Impact factor: 53.242

2.  Clinical Utility of Transcriptome Sequencing: Toward a Better Diagnosis for Mendelian Disorders.

Authors:  Samya Chakravorty; Madhuri Hegde
Journal:  Clin Chem       Date:  2017-11-02       Impact factor: 8.327

3.  Lens subluxation and retinal dysfunction in a girl with homozygous VSX2 mutation.

Authors:  Arif O Khan; Mohammed A Aldahmesh; Jawaher Noor; Ahmed Salem; Fowzan S Alkuraya
Journal:  Ophthalmic Genet       Date:  2013-09-03       Impact factor: 1.803

Review 4.  Deep intronic mutations and human disease.

Authors:  Rita Vaz-Drago; Noélia Custódio; Maria Carmo-Fonseca
Journal:  Hum Genet       Date:  2017-05-12       Impact factor: 4.132

Review 5.  The application of next-generation sequencing in the autozygosity mapping of human recessive diseases.

Authors:  Fowzan S Alkuraya
Journal:  Hum Genet       Date:  2013-08-02       Impact factor: 4.132

6.  Clinical exome sequencing for genetic identification of rare Mendelian disorders.

Authors:  Hane Lee; Joshua L Deignan; Naghmeh Dorrani; Samuel P Strom; Sibel Kantarci; Fabiola Quintero-Rivera; Kingshuk Das; Traci Toy; Bret Harry; Michael Yourshaw; Michelle Fox; Brent L Fogel; Julian A Martinez-Agosto; Derek A Wong; Vivian Y Chang; Perry B Shieh; Christina G S Palmer; Katrina M Dipple; Wayne W Grody; Eric Vilain; Stanley F Nelson
Journal:  JAMA       Date:  2014-11-12       Impact factor: 56.272

7.  Disease-associated mutations that alter the RNA structural ensemble.

Authors:  Matthew Halvorsen; Joshua S Martin; Sam Broadaway; Alain Laederach
Journal:  PLoS Genet       Date:  2010-08-19       Impact factor: 5.917

8.  TMX2 Is a Crucial Regulator of Cellular Redox State, and Its Dysfunction Causes Severe Brain Developmental Abnormalities.

Authors:  Laura V Vandervore; Rachel Schot; Chiara Milanese; Daphne J Smits; Esmee Kasteleijn; Andrew E Fry; Daniela T Pilz; Stefanie Brock; Esra Börklü-Yücel; Marco Post; Nadia Bahi-Buisson; María José Sánchez-Soler; Marjon van Slegtenhorst; Boris Keren; Alexandra Afenjar; Stephanie A Coury; Wen-Hann Tan; Renske Oegema; Linda S de Vries; Katherine A Fawcett; Peter G J Nikkels; Aida Bertoli-Avella; Amal Al Hashem; Abdulmalik A Alwabel; Kalthoum Tlili-Graiess; Stephanie Efthymiou; Faisal Zafar; Nuzhat Rana; Farah Bibi; Henry Houlden; Reza Maroofian; Richard E Person; Amy Crunk; Juliann M Savatt; Lisbeth Turner; Mohammad Doosti; Ehsan Ghayoor Karimiani; Nebal Waill Saadi; Javad Akhondian; Maarten H Lequin; Hülya Kayserili; Peter J van der Spek; Anna C Jansen; Johan M Kros; Robert M Verdijk; Nataša Jovanov Milošević; Maarten Fornerod; Pier Giorgio Mastroberardino; Grazia M S Mancini
Journal:  Am J Hum Genet       Date:  2019-11-14       Impact factor: 11.025

9.  Mutations in SNORD118 cause the cerebral microangiopathy leukoencephalopathy with calcifications and cysts.

Authors:  Emma M Jenkinson; Mathieu P Rodero; Paul R Kasher; Carolina Uggenti; Anthony Oojageer; Laurence C Goosey; Yoann Rose; Christopher J Kershaw; Jill E Urquhart; Simon G Williams; Sanjeev S Bhaskar; James O'Sullivan; Gabriela M Baerlocher; Monika Haubitz; Geraldine Aubert; Kristin W Barañano; Angela J Barnicoat; Roberta Battini; Andrea Berger; Edward M Blair; Janice E Brunstrom-Hernandez; Johannes A Buckard; David M Cassiman; Rosaline Caumes; Duccio M Cordelli; Liesbeth M De Waele; Alexander J Fay; Patrick Ferreira; Nicholas A Fletcher; Alan E Fryer; Himanshu Goel; Cheryl A Hemingway; Marco Henneke; Imelda Hughes; Rosalind J Jefferson; Ram Kumar; Lieven Lagae; Pierre G Landrieu; Charles M Lourenço; Timothy J Malpas; Sarju G Mehta; Imke Metz; Sakkubai Naidu; Katrin Õunap; Axel Panzer; Prab Prabhakar; Gerardine Quaghebeur; Raphael Schiffmann; Elliott H Sherr; Kanaga R Sinnathuray; Calvin Soh; Helen S Stewart; John Stone; Hilde Van Esch; Christine E G Van Mol; Adeline Vanderver; Emma L Wakeling; Andrea Whitney; Graham D Pavitt; Sam Griffiths-Jones; Gillian I Rice; Patrick Revy; Marjo S van der Knaap; John H Livingston; Raymond T O'Keefe; Yanick J Crow
Journal:  Nat Genet       Date:  2016-08-29       Impact factor: 38.330

10.  CADD: predicting the deleteriousness of variants throughout the human genome.

Authors:  Philipp Rentzsch; Daniela Witten; Gregory M Cooper; Jay Shendure; Martin Kircher
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  13 in total

1.  ASHG 2020 Curt Stern Award introduction: Fowzan Sami Alkuraya.

Authors:  Cynthia C Morton
Journal:  Am J Hum Genet       Date:  2021-03-04       Impact factor: 11.025

2.  2020 Curt Stern Award address: a more perfect clinical genome-how consanguineous populations contribute to the medical annotation of the human genome.

Authors:  Fowzan S Alkuraya
Journal:  Am J Hum Genet       Date:  2021-03-04       Impact factor: 11.025

Review 3.  Transcriptome analysis provides critical answers to the "variants of uncertain significance" conundrum.

Authors:  Mackenzie D Postel; Julie O Culver; Charité Ricker; David W Craig
Journal:  Hum Mutat       Date:  2022-05-18       Impact factor: 4.700

4.  CI-SpliceAI-Improving machine learning predictions of disease causing splicing variants using curated alternative splice sites.

Authors:  Yaron Strauch; Jenny Lord; Mahesan Niranjan; Diana Baralle
Journal:  PLoS One       Date:  2022-06-03       Impact factor: 3.752

5.  A novel, de novo intronic variant in POGZ causes White-Sutton syndrome.

Authors:  Ashanta Merriweather; David R Murdock; Jill A Rosenfeld; Hongzheng Dai; Shamika Ketkar; Lisa Emrick; Sarah Nicholas; Richard A Lewis; Carlos A Bacino; Daryl A Scott; Brendan Lee; Vernon Reid Sutton; Lorraine Potocki; Lindsay C Burrage
Journal:  Am J Med Genet A       Date:  2022-04-09       Impact factor: 2.578

6.  Recessive, Deleterious Variants in SMG8 Expand the Role of Nonsense-Mediated Decay in Developmental Disorders in Humans.

Authors:  Fatema Alzahrani; Hiroyuki Kuwahara; Yongkang Long; Mohammed Al-Owain; Mohamed Tohary; Moeenaldeen AlSayed; Mohammed Mahnashi; Lana Fathi; Maha Alnemer; Mohamed H Al-Hamed; Gabrielle Lemire; Kym M Boycott; Mais Hashem; Wenkai Han; Almundher Al-Maawali; Feisal Al Mahrizi; Khalid Al-Thihli; Xin Gao; Fowzan S Alkuraya
Journal:  Am J Hum Genet       Date:  2020-11-25       Impact factor: 11.025

7.  Interpretable prioritization of splice variants in diagnostic next-generation sequencing.

Authors:  Daniel Danis; Julius O B Jacobsen; Leigh C Carmody; Michael A Gargano; Julie A McMurry; Ayushi Hegde; Melissa A Haendel; Giorgio Valentini; Damian Smedley; Peter N Robinson
Journal:  Am J Hum Genet       Date:  2021-07-21       Impact factor: 11.025

8.  Exploiting the Autozygome to Support Previously Published Mendelian Gene-Disease Associations: An Update.

Authors:  Sateesh Maddirevula; Hanan E Shamseldin; Amy Sirr; Lama AlAbdi; Russell S Lo; Nour Ewida; Mashael Al-Qahtani; Mais Hashem; Firdous Abdulwahab; Omar Aboyousef; Namik Kaya; Dorota Monies; May H Salem; Naffaa Al Harbi; Hesham M Aldhalaan; Hamad Alzaidan; Hadeel M Almanea; Abrar K Alsalamah; Fuad Al Mutairi; Samira Ismail; Ghada M H Abdel-Salam; Amal Alhashem; Ali Asery; Eissa Faqeih; Amal AlQassmi; Waleed Al-Hamoudi; Talal Algoufi; Mohammad Shagrani; Aimée M Dudley; Fowzan S Alkuraya
Journal:  Front Genet       Date:  2020-12-31       Impact factor: 4.599

9.  Clinical implementation of RNA sequencing for Mendelian disease diagnostics.

Authors:  Vicente A Yépez; Mirjana Gusic; Robert Kopajtich; Christian Mertes; Nicholas H Smith; Charlotte L Alston; Rui Ban; Skadi Beblo; Riccardo Berutti; Holger Blessing; Elżbieta Ciara; Felix Distelmaier; Peter Freisinger; Johannes Häberle; Susan J Hayflick; Maja Hempel; Yulia S Itkis; Yoshihito Kishita; Thomas Klopstock; Tatiana D Krylova; Costanza Lamperti; Dominic Lenz; Christine Makowski; Signe Mosegaard; Michaela F Müller; Gerard Muñoz-Pujol; Agnieszka Nadel; Akira Ohtake; Yasushi Okazaki; Elena Procopio; Thomas Schwarzmayr; Joél Smet; Christian Staufner; Sarah L Stenton; Tim M Strom; Caterina Terrile; Frederic Tort; Rudy Van Coster; Arnaud Vanlander; Matias Wagner; Manting Xu; Fang Fang; Daniele Ghezzi; Johannes A Mayr; Dorota Piekutowska-Abramczuk; Antonia Ribes; Agnès Rötig; Robert W Taylor; Saskia B Wortmann; Kei Murayama; Thomas Meitinger; Julien Gagneur; Holger Prokisch
Journal:  Genome Med       Date:  2022-04-05       Impact factor: 11.117

10.  Tricho-hepato-enteric syndrome: Retrospective multicenter experience in Saudi Arabia.

Authors:  Badr M Alsaleem; Mohammed Hasosah; Amna Basheer M Ahmed; Maher M Al Hatlani; Aziz Helal Alanazi; Abdulrahman Al-Hussaini; Ali T Asery; Khalid A Alghamdi; Muhanad M AlRuwaithi; Musa Ali M Khormi; Ahmed Al Sarkhy; Ali S Alshamrani
Journal:  Saudi J Gastroenterol       Date:  2022 Mar-Apr       Impact factor: 2.485

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.