| Literature DB >> 24136861 |
Roddy Walsh1, Nicholas S Peters, Stuart A Cook, James S Ware.
Abstract
BACKGROUND: Distinguishing genetic variants that cause disease from variants that are rare but benign is one of the principal challenges in contemporary clinical genetics, particularly as variants are identified at a pace exceeding the capacity of researchers to characterise them functionally.Entities:
Keywords: Cardiovascular Medicine; Clinical Genetics; Congenital Heart Disease; Genetics
Mesh:
Substances:
Year: 2013 PMID: 24136861 PMCID: PMC3888601 DOI: 10.1136/jmedgenet-2013-101917
Source DB: PubMed Journal: J Med Genet ISSN: 0022-2593 Impact factor: 6.318
Figure 1An overview of paralogue annotation. (1) Paralogues (evolutionarily related genes, with homologous sequence and protein domain structures) are identified for a gene of interest. A subset of paralogues for SCN5A is shown here for illustration. (2) Protein sequences of paralogues are aligned, identifying functionally equivalent amino acids across the protein family. (3) Disease-causing variants in paralogues are identified from previous literature reports, and their locations are mapped to the gene of interest. Variants at these sites have a high probability of pathogenicity.
SCN5A paralogues
| Paralogue | HGMD missense mutations | HGMD missense residues | Major diseases | |
|---|---|---|---|---|
| 410 | 327 | 393 | Epilepsy, Dravet syndrome, Hemiplegic migraine | |
| 26 | 26 | 24 | Epilepsy, Neonatal-infantile seizures | |
| 1 | 1 | 1 | Epilepsy | |
| 67 | 53 | 67 | Hyperkalaemic periodic paralysis, myotonia, paramyotoniacongenita, periodic paralysis | |
| 0 | 0 | 0 | ||
| 1 | 1 | 1 | Infantile epileptic encephalopathy | |
| 53 | 50 | 45 | Congenital indifference to pain, primary erythermalgia, paroxysmal extreme pain disorder, Small fibre neuropathy | |
| 0 | 0 | 0 | ||
| 0 | 0 | 0 | ||
| 63 | 61 | 54 | Episodic ataxia 2, hemiplegic migraine | |
| 0 | 0 | 0 | ||
| 9 | 9 | 4 | BrS, LQTS | |
| 1 | 1 | 0 | ||
| 1 | 1 | 0 | ||
| 28 | 28 | 27 | Night blindness | |
| 2 | 2 | 0 | Juvenile myoclonic epilepsy | |
| 25 | 25 | 13 | Epilepsy, autism spectrum disorder | |
| 0 | 0 | 0 | ||
| 13 | 8 | 13 | Hypokalaemic periodic paralysis, malignant hyperthermia |
Nineteen paralogues of SCN5A were used in this study, including voltage-gated sodium channels (SCN-A) and the homologous voltage-gated calcium channels. For each paralogue, the table shows the total number of distinct missense variants reported in HGMD, the number of distinct amino acid residues affected by these mutations and the number of mutations that were mapped to equivalent residues in SCN5A. Some of the most prominent diseases associated with these paralogue mutations are also highlighted: these are typically diseases attributable to abnormalities of membrane excitability in a range of tissues.
BrS, Brugada syndrome; HGMD, Human Gene Mutation Database; LQTS, long QT syndrome.
Figure 2Genetic variation in cohorts with inherited arrhythmia syndromes. RYR2 or SCN5A were sequenced in 2266 patients with an inherited arrhythmia syndrome, as shown. Three hundred and fifty-seven variants were identified in 511 patients, of which 153 were novel missense variants. In the absence of segregation or functional data, these would typically be reported as variants of unknown significance. However, paralogue annotation provided additional information for 65 (36%) variants, that would further inform a clinical genetic report. BrS, Brugada syndrome; CPVT, catecholaminergic polymorphic ventricular tachycardia.
Paralogue annotation of novel variants identified in RYR2
| CDS | Protein | Region | Cases (n=155) | Exon | Paralogue | Paralogue variant | Paralogue disease | Consensus |
|---|---|---|---|---|---|---|---|---|
| c.527G>A | p.R176Q | N-terminal hotspot | 1 | 8 | R163L | Malignant hyperthermia | 9 | |
| R163C | Central core disease | 9 | ||||||
| c.994C>T | p.R332W | N-terminal hotspot | 1 | 12 | R316L | Malignant hyperthermia | 8 | |
| c.1069G>A | p.G357S | N-terminal hotspot | 1 | 13 | G341R | Malignant hyperthermia | 9 | |
| G341R | Malignant hyperthermia | 9 | ||||||
| c.1646C>T | p.A549V | Outside hotspots | 1 | 17 | A537T | Congenital myopathy | 9 | |
| c.6504C>G | p.H2168Q | Outside hotspots | 2 | 42 | H2204Q | Multiminicore disease | 9 | |
| c.7258A>T | p.R2420W | Central hotspot | 1 | 48 | R2454C | Malignant hyperthermia | 9 | |
| R2454H | Malignant hyperthermia | 9 | ||||||
| c.11989A>G | p.K3997E | Channel hotspot | 1 | 90 | R4041W | Malignant hyperthermia | 9 | |
| c.14369G>A | p.R4790Q | Channel hotspot | 1 | 100 | R4861C | Central core disease | 9 | |
| R4861H | Central core disease | 9 | ||||||
| c.14414A>G | p.K4805R | Channel hotspot | 1 | 100 | K4876R | Malignant hyperthermia | 9 | |
| c.14465G>A | p.R4822H | Channel hotspot | 1 | 101 | R4893W | Central core disease | 9 | |
| R4893Q | Central core disease | 9 | ||||||
| R4893P | Central core disease | 9 | ||||||
Ten out of 31 novel missense variants identified in 155 unrelated CPVT patients10 were annotated. This provides strong evidence of pathogenicity for these variants. In addition, five of the 29 RYR2 missense variants previously reported to be pathogenic were annotated (see supplementary table S1, available online only).
RYR2 coordinates given with respect to transcripts NM_001035/NP_001026 (Refseq), ENST00000366574/ENSP00000355533 (Ensembl), LRG_402t1/LRG_402p1 (Locus Reference Genomic).
CDS, coding DNA sequence; CPVT, catecholaminergic polymorphic ventricular tachycardia.
Paralogue annotation of novel variants identified in SCN5A
| CDS | Protein | Region | Cases (n=2111) | Exon | Paralogue | Paralogue variant | Paralogue disease | Consensus |
|---|---|---|---|---|---|---|---|---|
| c.278T>C | p.F93S | N-terminus | 1 | 3 | F90S | Myoclonic epilepsy of infancy | 5 | |
| c.281T>G | p.I94S | N-terminus | 1 | 3 | I91T | Myoclonic epilepsy of infancy | 5 | |
| c.362G>A | p.R121Q | N-terminus | 2 | 3 | R118S | Myoclonic epilepsy of infancy | 9 | |
| c.533C>G | p.A178G | TM domain 1 | 1 | 5 | A175V | Dravet syndrome | 9 | |
| A175T | Myoclonic epilepsy of infancy | 9 | ||||||
| c.659C>T | p.T220I | TM domain 1 | 2 | 6 | T217K | Myoclonic epilepsy of infancy | 9 | |
| c.694G>A | p.V232I | TM domain 1 | 2 | 6 | R212R | Autism spectrum disorder | 9 | |
| c.1100G>T | p.R367L | TM domain 1 | 1 | 9 | R377Q | Generalised epilepsy with febrile seizures | 9 | |
| R377L | Dravet syndrome | 9 | ||||||
| c.1120T>G | p.W374G | TM domain 1 | 1 | 9 | W384R | Dravet syndrome | 9 | |
| W384X | Myoclonic epilepsy of infancy | 9 | ||||||
| c.1157G>A | p.G386E | TM domain 1 | 2 | 10 | G396E | Dravet syndrome | 8 | |
| c.1156G>A | p.G386R | TM domain 1 | 1 | 10 | G396E | Dravet syndrome | 8 | |
| c.1187T>C | p.V396A | TM domain 1 | 1 | 10 | V406F | Dravet syndrome | 9 | |
| c.1186G>C | p.V396L | TM domain 1 | 1 | 10 | V406F | Dravet syndrome | 9 | |
| c.2047T>G | p.C683G | Interdomain Linker I-II | 1 | 14 | C699Y | Dravet syndrome | 3 | |
| R744Q | Childhood absence epilepsy | 3 | ||||||
| c.2150C>T | p.P717L | TM Domain 2 | 1 | 14 | P768L | Myoclonic epilepsy of infancy | 8 | |
| c.2553C>A | p.F851L | TM Domain 2 | 1 | 16 | F902C | Myoclonic epilepsy of infancy | 9 | |
| c.2633G>A | p.R878H | TM Domain 2 | 5 | 16 | R931C | Myoclonic epilepsy of infancy | 8 | |
| R896Q | Congenital indifference to pain | 8 | ||||||
| R931H | Epilepsy | 8 | ||||||
| c.2657A>C | p.H886P | TM Domain 2 | 1 | 16 | H939Q | Myoclonic epilepsy of infancy | 9 | |
| H939Y | Dravet syndrome | 9 | ||||||
| W962C | Autism spectrum disorder | 9 | ||||||
| c.2677C>T | p.R893C | TM Domain 2 | 2 | 16 | R946S | Generalised epilepsy of infancy | 9 | |
| R946C | Myoclonic epilepsy of infancy | 9 | ||||||
| R946H | Myoclonic epilepsy of infancy | 9 | ||||||
| c.2678G>A | p.R893H | TM Domain 2 | 3 | 16 | R946S | Generalised epilepsy of infancy | 9 | |
| R946C | Myoclonic epilepsy of infancy | 9 | ||||||
| R946H | Myoclonic epilepsy of infancy | 9 | ||||||
| c.2701G>A | p.E901K | TM Domain 2 | 3 | 16 | E954K | Dravet syndrome | 9 | |
| c.3695G>A | p.R1232Q | TM Domain 3 | 1 | 21 | R1245Q | Myoclonic epilepsy of infancy | 7 | |
| c.3758A>G | p.E1253G | TM Domain 3 | 1 | 21 | E1266A | Dravet syndrome C | 9 | |
| c.3813G>C | p.W1271C | TM Domain 3 | 1 | 21 | W1284S | Dravet syndrome | 9 | |
| c.3968T>G | p.V1323G | TM Domain 3 | 1 | 23 | V1299F | Paroxysmal extreme pain disorder | 9 | |
| c.4057G>A | p.V1353M | TM Domain 3 | 2 | 23 | V1366I | Generalised epilepsy with febrile seizures | 9 | |
| c.4079T>G | p.F1360C | TM Domain 3 | 1 | 23 | F1404C | Episodic ataxia | 9 | |
| c.4226A>G | p.Y1409C | TM Domain 3 | 1 | 23 | Y1422C | Myoclonic epilepsy of infancy | 9 | |
| c.4234C>T | p.L1412F | TM Domain 3 | 1 | 23 | L1079P | Night blindness | 9 | |
| c.4255A>G | p.K1419E | TM Domain 3 | 1 | 24 | E1115K | BrS | 9 | |
| c.4258G>C | p.G1420R | TM Domain 3 | 1 | 24 | G1433R | Dravet syndrome | 9 | |
| G1433E | Myoclonic epilepsy of infancy | 9 | ||||||
| G1433V | Dravet syndrome | 9 | ||||||
| c.4283C>T | p.A1428V | TM Domain 3 | 1 | 24 | A1441P | Myoclonic epilepsy of infancy | 9 | |
| c.4321G>C | p.E1441Q | TM Domain 3 | 1 | 25 | G1483R | Episodic ataxia | 9 | |
| E1454K | Dravet syndrome | 9 | ||||||
| c.4342A>C | p.I1448L | TM Domain 3 | 1 | 25 | L1461I | Myoclonic epilepsy of infancy | 9 | |
| c.4343T>C | p.I1448T | TM Domain 3 | 1 | 25 | L1461I | Myoclonic epilepsy of infancy | 9 | |
| c.4346A>G | p.Y1449C | TM Domain 3 | 1 | 25 | Y1462C | Myoclonic epilepsy of infancy | 9 | |
| F1491S | Episodic ataxia | 9 | ||||||
| Y1462H | Dravet syndrome | 9 | ||||||
| c.4387A>T | p.N1463Y | TM Domain 3 | 1 | 25 | N1476K | Dravet syndrome | 9 | |
| c.4402G>T | p.V1468F | TM Domain 3 | 1 | 25 | V1293I | Paramyotoniacongenita | 9 | |
| c.4573G>A | p.V1525M | TM Domain 4 | 1 | 27 | V1538I | Dravet syndrome | 9 | |
| c.4642G>A | p.E1548K | TM Domain 4 | 3 | 27 | E1561K | Dravet syndrome | 9 | |
| c.4747C>T | p.R1583C | TM Domain 4 | 2 | 27 | R1596C | Cryptogenic focal epilepsy | 9 | |
| R1596L | Dravet syndrome | 9 | ||||||
| R1596H | Generalised epilepsy with febrile seizures | 9 | ||||||
| c.4748G>A | p.R1583H | TM Domain 4 | 1 | 27 | R1596C | Cryptogenic focal epilepsy | 9 | |
| R1596L | Dravet syndrome | 9 | ||||||
| R1596H | Generalised epilepsy with febrile seizures | 9 | ||||||
| c.4981G>C | p.G1661R | TM Domain 4 | 1 | 28 | G1674R | Myoclonic epilepsy of infancy | 9 | |
| c.4981G>A | p.G1661R | TM Domain 4 | 2 | 28 | G1674R | Myoclonic epilepsy of infancy | 9 | |
| c.5015C>A | p.S1672Y | TM Domain 4 | 2 | 28 | A1685D | Myoclonic epilepsy of infancy | 9 | |
| A1685V | Febrile seizures | 9 | ||||||
| c.5134G>A | p.G1712S | TM Domain 4 | 1 | 28 | G1725C | Dravet syndrome | 9 | |
Forty-five out of 122 novel missense variants identified in 2111 unrelated BrS patients5 were annotated. In addition, 26 of the 70 SCN5A missense variants previously reported to be pathogenic were annotated (see supplementary table S2, available online only).
SCN5A coordinates given with respect to transcripts NM_198056/NP_932173 (Refseq), ENST00000333535/ENSP00000328968 (Ensembl), LRG_289t1/LRG_289p1 (Locus Reference Genomic).
BrS, Brugada syndrome; CDS, coding DNA sequence; TM, transmembrane.
Paralogue annotation accurately identifies disease-associated residues in RYR2 and SCN5A
| Protein | Pathogenic | Benign | Uncertain | Unannotated | Total | |
|---|---|---|---|---|---|---|
| Published observations | 368 | 28 | 60 | 1560 | 2016 | |
| Paralogue annotations: observed (expected) | 113 (81) | 1 (6) | 6 (13) | 321 (341) | 441 | |
| Published observations | 139 | 20 | 71 | 4737 | 4967 | |
| Paralogue annotations: observed (expected) | 35 (8) | 1 (1) | 1 (4) | 238 (262) | 275 |
Distribution of paralogue annotations across the amino acid residues of the SCN5A and RYR2 proteins. ‘Published observations’ shows the number of amino acid residues with known missense variants, classified as pathogenic, benign or uncertain, and the number of residues at which missense variation has not previously been observed (unannotated). ‘Uncertain’ refers to dbSNP variants without clinical information or residues with variants with conflicting reports as to pathogenicity. ‘Paralogue annotations observed’ shows the number of residues of each class that are annotated by variants in paralogues, and which would therefore be expected to be sites of pathogenic variation. ‘Paralogue annotations expected’ shows the number of residues in each class that would be expected to be annotated if paralogue annotation was random, with no predictive value. Variants annotated in this way are highly enriched for pathogenic variation in both genes (2×2 Fisher's exact test p=0.0009), with a positive predictive value (PPV) of 98.7%. 559 previously unannotated residues (321 in SCN5A, 238 in RYR2) are identified as putative disease-associated residues.
Annotation across protein domains of RYR2 and SCN5A
| Known missense variants | Paralogue mappings | |||||
|---|---|---|---|---|---|---|
| Gene | Protein domains | Pathogenic | Benign | Uncertain | Actual | Enrichment |
| N-terminus | 27 | 2 | 6 | 14 | 0.51 | |
| Transmembrane | 277 | 8 | 26 | 355 | 1.57 | |
| Interlinker domains | 99 | 25 | 35 | 41 | 0.31 | |
| C-terminus | 40 | 5 | 16 | 31 | 0.58 | |
| Total | 443 | 40 | 83 | 441 | ||
| Hotspots | 134 | 10 | 17 | 157 | 1.52 | |
| Outside hotspots | 20 | 10 | 59 | 118 | 0.69 | |
| Total | 154 | 20 | 76 | 275 | ||
Distribution of known variants and paralogue mutation annotations across protein domains of SCN5A and mutation hotspots of RYR2. There is significant enrichment of both known pathogenic mutations and paralogue mutation mappings in the protein regions recognised to be susceptible to pathogenic variation, that is, the trans-membrane domains of SCN5A and the three mutation hotspots of RYR2.
Figure 3Disease-causing variation in human ryanodine receptors, RYR1 and RYR2. An alignment of RYR1 and RYR2 reveals the structural similarity of the proteins and homologous clustering of pathogenic variation in these related proteins. The protein is represented in light grey, with reported ‘mutation hotspots’ marked in dark grey, and exon boundaries highlighted. The locations of missense variants previously reported to be pathogenic are shown with black lines above and below the protein graphic—longer lines indicate more than one pathogenic DNA variant affecting the same protein residue.
Figure 4A web-based application makes paralogue annotation easily accessible for genes causing inherited arrhythmia syndromes. A web-based application is available at http://cardiodb.org/Paralogue_Annotation/. Users enter the position of a novel variant using complementary DNA or protein coordinates: in this example a substitution has been found in RYR2, affecting glycine residue 357.1 This residue maps to RYR1 residue 341, and two cDNA variants at that location (c.1021G>A and c.1021G>C) that each cause substitution of Arg for Gly at that position have been reported to cause malignant hyperthermia. Users should check the alignment quality—here the mapping quality is high: the surrounding region is highly homologous, the reference amino acid is the same in both proteins, and the alignment has a high consensus score.2 Pubmed links give access to the reports relating to the paralogue mutation(s), 3 allowing users to assess the quality of evidence for pathogenicity. Here functional characterisation has been performed on this variant in the highlighted publication, adding confidence that the variant is disease causing and the residue is intolerant of variation in both RYR1 and its paralogue RYR2.