Literature DB >> 29961767

Genome sequencing as a first-line genetic test in familial dilated cardiomyopathy.

Andre E Minoche1, Claire Horvat2, Renee Johnson2, Velimir Gayevskiy1, Sarah U Morton3, Alexander P Drew1, Kerhan Woo4, Aaron L Statham4, Ben Lundie4, Richard D Bagnall5,6, Jodie Ingles5,6,7, Christopher Semsarian5,6,7, J G Seidman8,9, Christine E Seidman9,10, Marcel E Dinger1,4,11, Mark J Cowley1,11, Diane Fatkin12,13,14.   

Abstract

PURPOSE: We evaluated genome sequencing (GS) as an alternative to multigene panel sequencing (PS) for genetic testing in dilated cardiomyopathy (DCM).
METHODS: Forty-two patients with familial DCM underwent PS and GS, and detection rates of rare single-nucleotide variants and small insertions/deletions in panel genes were compared. Loss-of-function variants in 406 cardiac-enriched genes were evaluated, and an assessment of structural variation was performed.
RESULTS: GS provided broader and more uniform coverage than PS, with high concordance for rare variant detection in panel genes. GS identified all PS-identified pathogenic or likely pathogenic variants as well as two additional likely pathogenic variants: one was missed by PS due to low coverage, the other was a known disease-causing variant in a gene not included on the panel. No loss-of-function variants in the extended gene set met clinical criteria for pathogenicity. One BAG3 structural variant was classified as pathogenic.
CONCLUSION: Our data support the use of GS for genetic testing in DCM, with high variant detection accuracy and a capacity to identify structural variants. GS provides an opportunity to go beyond suites of established disease genes, but the incremental yield of clinically actionable variants is limited by a paucity of genetic and functional evidence for DCM association.

Entities:  

Keywords:  Familial dilated cardiomyopathy; Genetic testing; Molecular diagnosis; Panel sequencing; genome sequencing

Mesh:

Year:  2018        PMID: 29961767      PMCID: PMC7271716          DOI: 10.1038/s41436-018-0084-7

Source DB:  PubMed          Journal:  Genet Med        ISSN: 1098-3600            Impact factor:   8.822


INTRODUCTION

Dilated cardiomyopathy (DCM) is a common heritable heart muscle disorder that frequently has a genetic etiology.[1] Although long lists of disease-associated genes have been compiled,[1] genetic testing yields positive results in relatively few individuals and has not been recommended as part of routine patient care.[2] Knowing genotype status has enormous potential benefit for families, permitting tailored surveillance strategies and early detection of individuals at risk.[3] The lack of results for most DCM families represents an unmet clinical need and a major road-block for implementation of personalized therapy. In recent years, next-generation sequencing has facilitated genetic testing by enabling high throughput evaluation of multiple genes, including under-investigated large genes with hundreds of coding exons. Available methods include multi-gene panel sequencing (PS), whole-exome sequencing (WES) and whole-genome sequencing (WGS). PS uses libraries enriched for protein-coding regions of disease-associated genes. There is generally high sequence coverage and comparative studies with Sanger sequencing have found excellent reproducibility for variant detection.[4] PS is widely used by clinical diagnostic laboratories with cardiomyopathy panel sizes increasing over time from <20 genes to >100 genes.[5,6] The finite number of genes is a major limitation, necessitating re-design of panels as new disease genes are discovered and additional costs if a second test is required in PS-negative cases. In contrast to PS, WES is not limited to specific genes and looks at all protein-coding sequences.[7] Variability in breadth of coverage is a potential limiting factor for WES-based clinical testing and may necessitate extensive follow-up Sanger sequencing to fill in gaps, particularly for high-probability disease genes. In addition to protein-coding sequences, WGS uniquely provides information about the vast tracts of non-coding sequences that are increasingly implicated in human disease and enables high-resolution structural variant (SV) detection.[8,9] However, since WGS typically has a lower overall sequencing depth than PS and WES, its potential sensitivity for mutation screening has been questioned.[10] Golbus and colleagues[11] have recently reported promising results for WGS in a pilot study of 11 DCM patients. A detailed appraisal of the role of WGS for DCM genetic testing is now timely and warranted. Here we compare PS and WGS in a cohort of patients with familial DCM. We determined the concordance of PS and WGS for rare variant detection, evaluated loss-of-function (LOF) variants in an extended gene panel, and performed the first comprehensive evaluation of SV in DCM. Our data show that WGS is a reliable method for screening established DCM disease genes as well as providing a wealth of sequence information for ongoing data-mining in “unsolved” cases.

MATERIALS AND METHODS

Study subjects

Forty-two patients (19 [45%] males), aged 18 to 82 (mean 50) years with familial DCM were recruited from St Vincent’s Hospital and by referral from collaborating physicians. The clinical characteristics of study probands are provided in Supplementary Table S1. Familial DCM was defined by the presence of DCM and/or early (<35 years) sudden unexplained death in two or more individuals in the absence of another heritable cardiac or systemic cause. Probands and participating first-degree relatives provided informed written consent and were evaluated by history and physical examination, ECG and transthoracic echocardiography. All study subjects were of self-reported European ancestry. Protocols were approved by St Vincent’s Hospital Human Research Ethics Committee.

DNA sequencing and variant calling

See Supplementary Methods for expanded sequencing methods. Briefly, 42 patient DNA samples were newly sequenced by WGS, or previously sequenced using a custom capture panel for 67 or 69 DCM genes.[12] WES data were generated in-house from the NA12878 cell line (Coriell Institute for Medical Research, Camden, NJ) using the SureSelectXT Human AllExon V5 ([SSv5], Agilent Technologies, Santa Clara, CA, n=1) and the Clinical Research Exome V2 ([CREv2], Agilent, n=13) capture kits. We also re-analyzed published data from samples that used SureSelectXT Human AllExon V6 ([SSv6], Agilent, n=6) capture kit.[13] All genomic data, including previously published data, were analyzed using a GATK best practices analysis pipeline.[14] Short variants were annotated, filtered and prioritized using Seave.[15] Structural variants (SV) including copy number variants were identified using ClinSV (Minoche et al, manuscript in preparation) which uses a combination of discordantly mapping read pairs, split-mapping reads, and depth of coverage changes. A genomic position was defined as “covered” if the sequencing depth had ≥15 high quality reads.[16] Selected variants were confirmed in probands and evaluated in family members using Sanger sequencing and/or PCR.

Variant concordance analysis

Variants that passed filters and were located within the genomic regions targeted in PS were included in this analysis. Variant concordance was assessed using bcftools (v1.2) and vcfeval from RTG-Core (Real Time Genomics, v3.4.4). In individual patients, sites at which the frequency of the two predominant alleles was < 95% (allowing for sequencing errors) were considered non bi-allelic. The concordance analysis was performed for all SNVs and indels, then separately for the subset that were annotated as high- or medium-impact.

Variant filtration and prioritization

Rare stop gain, splice donor or acceptor site loss, frame-shift indels (defined as LOF variants) and missense variants were included if the maximal minor allele frequency (MAF) in the 1000 Genomes Project, Exome Sequencing Project or Exome Aggregation Consortium (ExAC) databases was <1%. Missense variants were then excluded if they were predicted to be benign by both SIFT and PolyPhen2, or annotated as benign in ClinVar. SV were included if they had population allele frequencies <1% and overlapped with exonic regions from genes of interest (Supplementary Methods). Variants were classified into one of 5 categories: pathogenic, likely pathogenic, uncertain significance (VUS), likely benign, benign, according to recommendations for clinical reporting from the American College of Medical Genetics and Genomics (ACMG).[17]

Gene sets analyzed

Three sets of genes were evaluated (Supplementary Table S2). The first set included 67 “panel genes” evaluated by PS. An “extended gene set” was comprised of 406 genes, including reported DCM-associated genes that were not represented on the PS panel, genes with presumptive links to cardiac and skeletal myopathies, and cardiac-enriched genes from the human protein atlas.[18] The third set included a list of 57 genes compiled by the ACMG in which incidental findings are deemed clinically-reportable.[19]

RESULTS

PS-identified SNV and indels

PS was performed in 42 probands with familial DCM. Seventy-eight rare (MAF <1%) LOF or potentially damaging missense variants were identified in 37 probands (Table 1). All 78 PS-identified variants were confirmed to be present in probands, and were also investigated in family members, using Sanger sequencing (Supplementary Methods). Twenty-one variants in 21 families (50%) were subsequently deemed pathogenic or likely-pathogenic based on ACMG criteria,[17] including 14 LOF TTN variants that we have reported previously.[12]
Table 1.

Yield of rare potentially deleterious variants† in 42 probands with familial DCM.

Proband IDGeneVariantVariant TypeACMG ClassificationDetection method
PSWGS
AA-II-3TTNc.72824A>T, p.K24275ISNV; missenseVUSX
BAG3Chr10:121436136–121436726 del,SV; deletion/duplicationPathogenicX
Chr10:121436139–121436799 dup
AF-III-6MYH6c.5002G>A, p.D1668NSNV; missenseLikely BenignXX
TTNc.94553 T>C, p.V31518ASNV; missenseVUSX
SGCBc.1_2delAT, p.M1GfsINDEL; frameshift deletionLikely PathogenicX
AJ-II-2TTNc.95722T>C, p.Y31908HSNV; missenseVUSXX
RYR2c.8209_−4T>CSNV; spliceVUSX
AM-III-7DMDc.3816G>C, p.L1272FSNV; missenseVUSXX
TTNc.59351delT, p.E19785fsSNV; deletionPathogenicXX
ADRA1Ac.1022_1035delGACGCCACCAGGCC, p.A469fsINDEL; in-frame deletionVUSX
PLECc.7477C>T, p.Q2493*SNV; stop codonVUSX
AP-III-4TTNc.44818C>T, p.P14940SSNV; missenseVUSXX
TTNc.47692C>T, p.R15898*SNV; stop codonLikely PathogenicXX
AT-III-1DMDc.6T>G, p.S2RSNV; missenseVUSXX
NEBc.8683G>A, p.D2895NSNV; missenseVUSXX
MYO18Bc.1692+2_1692+29delTGAGTCCCCTGTCCCGCCGTCCCCCCCAGCAINDEL; splice deletionVUSX
AV-IV-2TTNc.41860T>C, p.Y13954HSNV; missenseVUSXX
TTNc.78095_78098delGAAA, p.R26032fsINDEL; frameshift deletionPathogenicXX
TTNc.81038G>A, p.R27013QSNV; missenseVUSXX
NEBc.16561G>A, p.D5521NSNV; missenseVUSX
BA-III-3NEBLc.1258A>G, p.I420VSNV; missenseVUSXX
TTNc.13900G>T, p.E4634*SNV; stop codonPathogenicXX
TTNc.47191C>T, p.R15731CSNV; missenseVUSXX
SYNE1c.13909G>A, p.D4637NSNV; missenseVUSXX
SYNE1c.23315G>C, p.R7772QSNV; missenseVUSX
BF-III-1NEBc.2533A>G, p.K845ESNV; missenseVUSXX
BDKRB1c.844C>T, p.R282*SNV; stop codonVUSX
BG-III-1NEBc.25367C>T, p.T8456MSNV; missenseVUSXX
MYH7c. 4828G>C, p.E1610QSNV; missenseVUSXX
TTNc.3002T>G, p.M1001RSNV; missenseVUSXX
TTNc.30485C>T, p.T10162MSNV; missenseVUSXX
TRDNChr6:123509665–124330720dupSV; duplicationVUSX
BK-III-3ALMS1c.1343C>G, p.T448RSNV; missenseLikely BenignXX
MYH7c.2207T>C, p.I736TSNV; missenseLikely PathogenicXX
SYNMc. 499C>T, p.R167CSNV; missenseVUSX
TRIM63c.739C>T, p.Q247*SNV; stop codonVUSX
BL-III-2LAMP2c.183_+2T>CSNV; splicePathogenicXX
DSC2Chr18:28681558–28686269delSV; deletionVUSX
BM-III-25TTNc.56834delC, p.G18945fsSNV; deletionPathogenicXX
LAMA2c.1634T>A, p.L545QSNV; missenseVUSX
NEBc.5555TG, p.M1852RSNV; missenseVUSX
BP-III-3NEBc.21685G>C, p.D7229HSNV; missenseVUSXX
DSPc.5874_5789delAGAAAC, p.ET1929–1930delINDEL; in-frame deletionLikely BenignX
SCN4Ac.1800C>A, p.Y600*SNV; stop codonVUSX
RYR2c.14757–7_14757–6delTCinsATINDEL; spliceVUSX
BR-IV-1LAMA2c.4645A>T, p.N1549YSNV; missenseVUSXX
TTNc.54710T>C, p.L18237PSNV; missenseVUSXX
TTNc.78991C>T, p.R26331*SNV; stop codonPathogenicXX
SCN1BChr19:35531062–35531134delSV; deletionVUSX
MURCc.692_712delGAGAGAGGCTAAGGCAGTCAINDEL; in-frame deletionLikely BenignX
G, p.ERLRQSG231–238del
TLL2c.1232delT, p.V411*SNV; stop codonVUSX
BY-III-4BAG3c.361C>T, p.R121*SNV; stop codonPathogenicXX
NEBc.22813G>A, p.E7640KSNV; missenseVUSXX
SYNE1c.9148C>G, p.L3050VSNV; missenseVUSXX
DSG2c.473T>G, p.V158GSNV; missenseLikely BenignX
SYNE1c.1762delT, p.L588*SNV; stop codonVUSX
C-II-9NEBc.9352A>T, p.T3118SSNV; missenseLikely BenignXX
SCN5Ac.5872C>T, p.R1958*SNV; stop codonVUSXX
CTNNA3Chr10:68286145–68513397delSV; deletionVUSX
CI-II-2TTNc.88033A>G, p.T29345ASNV; missenseVUSXX
CS-III-12CAV3c.216C>G, p.C72WSNV; missenseLikely BenignXX
PSEN2c.211C>T, p.R104WSNV; missenseLikely BenignXX
TTNc.69491–69492delTG, p.V23164fsINDEL; frameshift deletionLikely PathogenicXX
ANO5c.191delA, p.N64fsSNV; deletionVUSX
CT-II-2TTNc.54380G>C, p.G18127ASNV; missenseVUSXX
TTNc.76116dupT, p.N25372fsSNV; deletionPathogenicXX
CZ-III-4FLT1c.2023A>G, p.S675GSNV; missenseVUSXX
MYH7c.1105G>A, p.R369QSNV; missensePathogenicXX
TTNc.54109C>T, p.R18037WSNV; missenseVUSXX
PDE4DIPc.54G>T, p.C18*SNV; stop codonVUSX
DD-III-4MYH7c.2620G>C, p.E874QSNV; missenseLikely PathogenicXX
SYNE1c.25856T>C, p.L8619PSNV; missenseVUSXX
XIRP2Chr2:167868183–168522158delSV; deletionVUSX
DF-III-1MYH7c.1578_+1G>ASNV; spliceVUSXX
SCN5Ac.3305C>A, p.S1102YSNV; missenseVUSXX
FLNCc.7170C>A, p.C2369*SNV; stop codonVUSX
DI-II-6TTNc.74880_74883dupAACA, p.P24962fsINDEL; frameshift insertionPathogenicXX
DO-II-4TTNc.95173T>C, p.C31725RSNV; missenseVUSXX
EMDc.428C>T, p.S143FSNV; missenseLikely BenignX
NKX2–5[#]c.552C>G, p.I184MSNV; missenseLikely PathogenicX
SYNE2c.18065insT, p.A6022A*SNV; deletionVUSX
EA-II-7NEXNc.1955A>G, p.Y652CSNV; missenseVUSXX
TTNc.78991C>T, p.R26331*SNV; stop codonPathogenicXX
TTNc.104878C>T, p.R34960CSNV; missenseVUSXX
ASB15c.1207G>T, p.G403*SNV; stop codonVUSX
FJ-II-3ABCC9c.3096_+1delGSNV; spliceLikely PathogenicXX
DTNAc.239G>A, p.R80HSNV; missenseVUSXX
FK-II-1ACTN2c.1192C>T, p.R398CSNV; missenseVUSXX
DESc.1193T>C, p.L398PSNV; missenseVUSXX
LDB3c.1414C>A, p.P472TSNV; missenseVUSXX
SYNE1c.14848A>G, p.N4590DSNV; missenseVUSXX
TTNc.102275G>A, p.R34092HSNV; missenseVUSXX
FRMD3Chr9:85906635–85962244delSV; deletionVUSX
FQ-III-11NEBc.21685G>C, p.D7229HSNV; missenseVUSXX
SCN5Ac.659C>T, p.T220ISNV; missenseVUSXX
TTNc.56206delA, p.T18736fsSNV; deletionLikely PathogenicXX
LAMA2c.4487C>T, p.A1496VSNV; missenseVUSX
AK1Chr9:130621906–130645663dupSV; duplicationVUSX
GE-III-7DSPc.8531G>C, p.G2844ASNV; missenseVUSXX
TTNc.95573A>G, p.N31858SSNV; missenseVUSXX
RYR1c.13328–13348dupGGGGGCCCCTTCCGGCCCGAA, p.GGPFRPE4443–4449dupINDEL; in-frame deletionLikely BenignX
GR-II-6No variants met criteria
GV-III-5No variants met criteria
GW-III-1SYNMc.3274C>T, p.R1092CSNV; missenseVUSXX
LMNAc.1567G>C, p.G523RSNV; missenseVUSX
HB-II-1HPS3Chr3:148874313–148882670delSV; deletionVUSX
HT-II-1No variants met criteria
HU-II-2SGCAc.929A>G, p.Y310CSNV; missenseVUSXX
TTNc.14486A>C, p.Q4829PSNV; missenseVUSXX
TTNc.75364G>A, p.V25122MSNV; missenseVUSXX
ID-II-3SYNE1c.19981C>A, p.Q6661KSNV; missenseVUSXX
KI-III-2TTNc.44284C>T, p.R14762*SNV; stop codonPathogenicXX
TTNc.48953T>C, p.I16318TSNV; missenseVUSXX
TTNc.97324G>A, p.A32442TSNV; missenseVUSX
KS-II-1RBM20c.1906C>T, p.R636CSNV; missensePathogenicXX
MO-II-2TTNc.49458G>A, p.W16486*SNV; stop codonPathogenicXX
ACADVLc.1077_+1G>TSNV; spliceVUSX
MYOM2c.52C>T, p.Q18*SNV; stop codonVUSX
R-IV-1NEBLc.2588C>G, p.S863CSNV; missenseVUSXX
TTNc.76116dupT, p.N25372fsSNV; insertionPathogenicXX
FLNCc.3791_−8G>ASNV; spliceVUSX
S-III-1LAMA2c.7415G>T, p.G2472VSNV; missenseVUSXX
PSEN2c.211C>T, p.R104WSNV; missenseLikely BenignXX
TTNc.30389G>A, p.R10130HSNV; missenseVUSXX
TTNc.100432T>G, p.W33478GSNV; missenseVUSXX
TTNc.100447G>C, p.E33482QSNV; missenseVUSX

Potential disease-causing variants are shown in bold;

variant identified by candidate gene screening.

ACMG, American College of Medical Genetics; INDEL, small insertion or deletion; PS, targeted panel sequencing; SNV, single nucleotide variant; SV, structural variant; VUS, variant of unknown significance; WGS, whole-genome sequencing

WGS sequencing depth and coverage

WGS had an average depth of 34x, covering 97% of the genome, 98% of all exons, and 99% of PS gene exons (Figure 1A). In comparison, PS had a much higher average read depth (486x) but covered only 91% of its targeted regions (Figure 1A). WGS coverage was compared to a WES data-set obtained in the human NA12878 cell line using the SSv5 capture kit. In this data-set, WES had an average read depth of 150x but covered only 69% of the PS targets (Figure 1A). Similar results were found in WES data-sets for which CREv2 and SSv6 capture kits had been used, with coverage of panel targets only 64% and 56%, respectively (Figure 1A, Supplementary Table S3). The poor coverage of WES relative to PS highlights the benefit of a comprehensive disease-focussed panel, compared to a generic WES design. The remaining causes of the lower coverage of panel genes with PS and WES when compared to WGS, were incompletely- or mis-targeted exons due to biases in probe design, synthesis and hybridization (Figure 1, C and D).
Figure 1.

Comparative coverage across panel genes. (A) Bar graphs showing breadth of coverage, that is, the median (± 1st and 3rd quartiles) percentage of positions covered by ≥15 sequence reads with WGS, PS and WES (SSv5 and CREv2) for targeted regions on the PS panel (“panel targets”) and (B) exons in Ensembl isoforms of these genes (“Ensembl exons”). (C) Histograms comparing WGS depth of read coverage across representative gene exons with PS and (D) WES (SSv5). Note lack of uniform coverage due to capture bias in (C) and discrepancy between actual exons and WES target regions in (D). (E) Overlap of protein-coding nucleotides contained within Ensembl isoforms with regions targeted on PS panel and (F) WES (SSv5), i.e. regions targeted by WES (“WES targets”) and targeted protein-coding exons (“WES exons”). (G) Breadth of coverage across panel genes with varying WES input in Gigabases (Gb) showing WES SSv5 exons (red) and Ensembl exons (orange); WGS breadth of coverage across the same Ensembl exons is indicated by dashed line (teal), assuming the ~34x genome-wide average coverage used in thus study.

We further interrogated the reference sets of gene isoforms used in the design and analysis of target regions. WGS data were analyzed with respect to protein-coding transcripts from the comprehensive Ensembl database, reporting 1–31 isoforms per panel gene. In comparison, PS targets included exons and conserved flanking sequences of isoforms from University of California Santa Cruz (UCSC) knownGene and RefSeq, while WES SureSelect targets were based on subsets of isoforms from UCSC, RefSeq, GENCODE, and CCDS databases. Coverage of Ensembl isoforms of panel genes was high (99%) with WGS, but was 86% with PS and ranged from 90% to 79% with WES (Figure 1B, Supplementary Table S3). We found that this was mainly due to an extra 67 kb of Ensembl isoform sequences that were not included on the PS panel and 59 kb not included in WES SSv5 SureSelect targets/exons (Figure 1, E and F). Even with WES at >300x average depth of coverage, sequencing breadth plateaued at 90% for Ensembl exons and 95% for SureSelect targeted exons which was less than the 99% achieved by WGS (Figure 1G).

Concordance between WGS and PS

WGS identified on average, 3.8 × 106 SNVs and 1.2 × 106 small indels per proband genome-wide, of which 24,000 were coding and 42 were LOF and rare (MAF<1%). Extensive Sanger sequencing of a subset of 115 SNV and indels in probands and family members showed that WGS had an overall very low false-positive rate (0.81%, Supplementary Results). For the variant concordance analysis between WGS and PS, we used the same analytical pipeline and investigated the detection rates of all SNVs and indels in PS-targeted regions irrespective of MAF and read depth. A median of 483 variants per proband were concordant between WGS and PS, with 131 variants identified exclusively by WGS and 104 variants exclusively by PS (Supplementary Table S4). 77% of WGS-specific variants were in positions that had <10 reads on PS, with the majority of these occurring at positions with no reads. In contrast, only 3% variants were missed by WGS for the same reason. Variants listed in dbSNP are more likely to be true-positives and this was the case for 87% of WGS-specific variants but only 31% of the PS-specific variants, suggesting that two-thirds of PS-specific variants were either novel or false-positives. Of these PS-specific variants, 22% were multi-allelic (vs 5% in WGS) and 53% were located at homopolymers and short tandem repeats, likely resulting from replication slippage. This can occur naturally but more often arises during PCR-mediated DNA replication.[20] To further investigate discordant variants, we looked at annotated LOF and missense variants, and manually inspected the read alignments. There was an average of 82 concordant variants per proband, with 11 variants exclusive to WGS and 23 variants exclusive to PS (Supplementary Table S5). The most frequent cause of WGS-specific variants was inadequate PS read coverage (Supplementary Figure S1A), followed by missed variants occurring in Ensembl exons that were not represented on the PS panel. An additional 16% of WGS-specific variants were present in the PS reads but missed by the variant caller. In contrast, most (84%) of the PS-specific variants were sequencing artefacts due to phasing (Supplementary Figure S1B). This can occur during cycle-based sequencing when some of the DNA strands fall behind or jump ahead of the cycle.[21] Other sources of discordance seen with both WGS and PS included: amplification of sequencing errors, incorrect variant calls (especially for indels), and ambiguous regions of low mapping quality (Supplementary Figure S1C). In total, there was an average of 10 likely-real LOF and missense variants per person that were missed by PS and 0.3 missed by WGS.

WGS detection of pathogenic/likely pathogenic variants in panel genes

WGS data underwent further filtering to identify rare LOF or potentially damaging missense variants in panel genes. WGS successfully detected the 78 prioritized PS variants, including all 21 of the pathogenic and likely-pathogenic variants. A frame-shift variant, SGCB p.M1Gfs, in Family AF (Table 1), identified by WGS and confirmed by Sanger sequencing was missed by PS due to lack of coverage.

LOF variants in an extended gene set

To further explore the WGS data, we compiled an extended set of 406 genes with proven and putative links to cardiomyopathy (Supplementary Table S2). WGS successfully identified a reported likely-pathogenic missense variant, p.I184M, in the NKX2–5 gene[22] (Table 1). An additional 17 LOF variants in 14 families were evaluated (Table 2), all of which were validated by Sanger sequencing, but none met ACMG criteria for pathogenicity.
Table 2.

WGS-identified LOF variants in the extended gene set.

Family IDGeneVariantExAC variant frequency; PLi scoreNotesSegregation
Families” unsolved” after PS
AJRYR2c.8209_−4T>CAbsent; 1.00Ryanodine receptor 2, SR protein involved in Ca2+ homeostasis. KO mice (−/−): embryonic lethal, (+/−): adults, normal baseline EF, ↓ contraction post-TAC; cKO mice (−/−): adults, transient ↓ contraction. CPVT families with exon 3 deletion: DCM in a subset of individuals.1/1 affected0/1 unaffected.
ATMYO18Bc.1692+1_1692+29delAbsent; 0Myosin 18B, Z-disc protein. KO mice (−/−): embryonic lethal, disordered myofibrils.1/2 affected0/3 unaffected.
BFBDKRB1p.R282*0.002883; 0.07Bradykinin receptor B1. KO mice (−/−): adults, mild ventricular dilation, normal EF.2/3 affected2/10 unaffected.
BGTTN[#]c.31763–1G>A0.0002733; 0Titin, sarcomeric protein. Titin-truncating transgenic mice (−/−): embryonic lethal, (+/−): adults, stress-induced DCM. Zebrafish (+/−): adults, spontaneous DCM.3/7 affected0/4 unaffected.
BPRYR2c.14757–7_14757–6delTCinsATAbsent; 1.00Ryanodine receptor 2 (see above).2/3 affected2/5 unaffected.
SCN4Ap.Y600*Absent; 0.01α4 subunit, voltage-gated sodium channel. KO mice (−/−): adults, no heart defects reported.1/3 affected0/5 unaffected.
MYBPHL[#]p.R255*0.001450; 0Myosin binding protein H-like, sarcomeric protein. KO mice (−/−, +/−): adults, DCM; p.R255* variant: altered MyBP-HL localization in transfected neonatal mouse cardiomyocytes.1/3 affected, 2/5 unaffected.
DFFLNCp.C2369*Absent; 1.00Filamin C, actin crosslinking protein in Z-disc and sarcolemma. KO mice (−/−): neonatal death; zebrafish MO: cardiac developmental defects.2/2 affected2/3 unaffected.
Families with identified pathogenic/likely-pathogenic variants
AMADRA1ARRHQA341–345*Absent; 0α1A adrenoceptor. KO mice (−/−): adults, normal EF.3/3 affected3/12 unaffected.
PLECp.Q2360*0.0002591; 0.02Plectin, cytoskeletal protein. KO mice (−/−): neonatal death, Z-disc disorganization.2/3 affected8/12 unaffected.
BKTRIM63p.Q247*0.000486; 0Tripartite motif containing 63, E3 ubiquitin ligase. p.Q247* variant: transfected cells, ↓auto-ubiquitination; transgenic mice, adults, ↑ LV mass, ↑ LVEDD, normal EF.2/2 affected0/1 unaffected.
BRTLL2p.V441*Absent; 0Tolloid like 2, zinc-dependent metalloprotease. KO mice (−/−): adults, no heart defects reported.1/2 affected1/9 unaffected.
CSANO5p.N64fs0.001027; 0Anoctamin 5, transmembrane protein. KO mice (−/−): adults, normal EF (baseline, isoproterenol stress).6/6 affected0/6 unaffected.
CZPDE4DIPp.C18*0.0000698; NAPhosphodiesterase 4D interacting protein, A-kinase anchoring protein, contributes to phosphorylation of cMyBPC and cTNI.4/4 affected0/2 unaffected.
DOSYNE2p.A6022A*Absent; 0Spectrin repeat-containing nuclear envelope protein 2, nuclear membrane protein. KO mice (−/−): adults, normal EF.2/3 affected1/2 unaffected.
EAASB15p.G403*0.0000906; 0Ankyrin repeat and SOCS box containing 15, involved in muscle differentiation and protein turnover.2/3 affected1/3 unaffected.
MOACADVLc.1077_+1G>TAbsent; 0Very long chain acyl-CoA dehydrogenase. c.1077_+1G>T variant: 36% residual enzyme activity (predicted to be asymptomatic).2/2 affected0/1 unaffected.
MYOM2p.Q18*0.0006355; 0Myomesin 2, sarcomeric protein. Neonatal rat cardiomyocytes (−/−): ↓ rate of contraction.1/2 affected0/1 unaffected.
RFLNCc.3791_−8G>A0.0001607; 1.00Filamin C (see above).1/5 affected1/2 unaffected.

Segregation analysis did not include phenotype-negative variant carriers aged less than 40 years.

Variant absent from proband, identified by WGS of additional family members.

CPVT, catecholaminergic polymorphic ventricular tachycardia; DCM dilated cardiomyopathy; EF, ejection fraction; ExAC, Exome Aggregation Consortium; KO, knockout (cKO, cardiac-specific knockout); LV, left ventricular, LVDD, LV end-diastolic diameter; SR, sarcoplasmic reticulum; TAC, tranverse aortic constriction

Five variants, in the FLNC, ANO5, ACADVL, TRIM63, and PDE4DIP genes, were present in all (2 or more) family members with DCM in the respective kindreds (Supplementary Figure S2). In Family DF (negative after PS testing), a FLNC p.C2369* variant was identified. Variants in FLNC, that encodes the actin-binding protein filamin C, have been associated with cardiac and skeletal myopathies.[23] Filamin C deficiency causes cardiac developmental defects in zebrafish and neonatal death in homozygous mice,[23,24] while heterozygous mice carrying a human FLNC p.W2710X mutation show skeletal myopathy.[25] To date, there is no definitive animal model evidence that heterozygous filamin C loss-of-function results in DCM. Although the nonsense FLNC variant was present in both affected individuals in Family DF (II-1, II-2), it was also present in two unaffected siblings aged 52 (II-4, II-5) and 45 years, respectively, and it was classified as a VUS. In Family CS, all affected individuals carried a likely-pathogenic LOF TTN variant and an ANO5 p.N64fs variant. ANO5 encodes anoctamin 5, a transmembrane protein with putative calcium-activated chloride channel activity. Homozygosity for LOF ANO5 variants in human subjects has been associated with limb girdle and Miyoshi muscular dystrophies.[26] This phenotype is not present in Ano5 knockout mice[27] and none of the CS family members had overt skeletal myopathy. Despite good family co-segregation, the ANO5 variant was called a VUS. In Family MO, individuals with a pathogenic LOF TTN variant also carried a splice acceptor site variant, ACADVL c.1077_+1G>T. The latter has been associated with a 36% reduction in very long chain acyl-CoA dehydrogenase activity but this level is predicted to be tolerated.[28] Affected individuals in Family BK had a likely-pathogenic missense MYH7 variant as well as a TRIM63 p.Q274* variant, both of which have proposed associations with hypertrophic cardiomyopathy.[29,30] The two variant carriers (II-3, II-4) clearly had DCM with one also showing left ventricular hypertrophy. This nonsense TRIM63 variant has been associated with reduced auto-ubiquitination in transfected cells, and increased left ventricular mass in transgenic mice.[30] The four affected individuals tested in Family CZ all carried a PDE4DIP p.C18* variant in addition to a pathogenic missense MYH7 variant. PDE4DIP encodes an A-kinase anchoring protein that is involved in phosphorylation of the sarcomeric proteins, cMyBPC and cTNNI.[31] This variant could plausibly modulate sarcomere kinetics but was classified as a VUS due to a lack of compelling evidence supporting PDE4DIP loss-of-function as a DCM mechanism.

Structural variants

Because WGS sequence coverage extends beyond exonic regions, comprehensive genome-wide evaluation of SV is possible. Using our in-house pipeline, ClinSV, we detected an average of 5379 SV, including 4470 copy number variants per proband, of which 232 were rare (MAF <1%), and 23 that were rare and overlapped genic regions. When the 67 panel genes were evaluated, one rare SV was identified. This was a complex BAG3 deletion/duplication that included the BAG domain and is likely to have a loss-of-function effect (Figure 2A, Table 3). This BAG3 SV was confirmed to be present in the proband using Sanger sequencing, segregated with DCM in Family AA (Supplementary Figure S2), and was deemed to be pathogenic. Numerous truncating BAG3 variants have been reported in DCM patients, with many co-segregating with disease in families.[32] Eight SV were found in the extended gene set, all of which were validated by independent sequencing methods (Supplementary Methods and Results), but none met ACMG criteria for pathogenicity or were likely primary causes of DCM (Figure 2B, Table 3). For example, in the proband from Family BG we identified a rare whole gene duplication of triadin (TRDN; Figure 2B), a developmentally regulated core member of the ryanodine receptor complex, whose copy number has been conserved in mammals.[33] Although present in 4 of the 6 other affected family members (Supplementary Figure S2), this variant was also seen in 4 unaffected family members, and was classified as a VUS. The remaining SV mostly showed incomplete segregation with disease, occurred in families with other identified pathogenic/likely-pathogenic variants, overlapped with SV reported in population databases, or were in genes with unknown relevance to DCM, and were all classified as VUS.
Figure 2.

Structural variants identified in DCM probands. Schematics showing protein locations and relative size of deletions and duplications in (A) BAG3, and (B) 8 genes in the extended gene set. Protein domains are named and highlighted with colors, dashed vertical lines denote exon boundaries, and codons are numbered.

Table 3.

Structural variants identified in panel genes and in the extended gene set.

Family IDGeneVariantDatabase of Genomic Variants*NotesSegregation
Families “unsolved” after PS
AABAG3Chr10:121436136–121436726 del, Chr10:121436139–121436799 dup (g.30254–30844 del, 30257–30917 dup); involves exon 4 (aa.357–533) including BAG domain.Overlap with 2 larger dup, 1 in unaffected subject.BCL associated athanogene 3, chaperone protein. KO mice (−/−): neonatal death. Human ventricle: decreased BAG3 expression in heart failure.3/3 affected, 1/1 unaffected
BGTRDNChr6:123509665–124330720 dup (821.1kb); includes TRDN (complete gene) and NKAIN2 (exon 1).Similar dup in >10 DD patients + unaffected subjects.Triadin, SR protein involved in Ca2+ homeostasis. Tg mice (5-fold overexpression): ↓ contraction.5/7 affected, 4/4 unaffected
CCTNNA3Chr10:68286145–68513397 del (227.3kb); loss of exon 10 (aa 428–458) including vinculin-like domain.>40 overlapping deletions, known CNV hotspot.α3 catenin, intercalated disc protein. KO mice (−/−): adult DCM.4/6 affected, 4/17 unaffected
FKFRMD3Chr9:85906635–85962244 del (55.6kb); loss of exons 5–12 (aa 125–357, includes partial FERM domain, peptide and lipid-binding regions).Overlap with 3 del/dup.FERM domain-containing protein 3, cytoskeletal protein.4/4 affected, 1/4 unaffected
HBHPS3Chr3:148874313–148882670 del (g.31943–40298 del, 8.4 kb); loss of exon 9–14 (aa.504–863), no identified domains,Overlap with 1 larger dup in 2 DD cases.Hermansky-Pudlak syndrome 3 protein.1/2 affected 1/2 unaffected
Families with identified pathogenic/likely-pathogenic variants
BLDSC2Chr18:28681558–28686269 del (g.37617–42327 del, 4.7kb); loss of first exon + promoter.AbsentDesmocollin 2, desmosomal protein. KO mice (−/−): embryonic lethal.2/2 affected
BRSCN1BChr19:35531062–35531134 del (g.14470–14543 del, 73b in 3’ UTR).Absentβ1 subunit, voltage-gated sodium channel.1/3 affected, 1/9 unaffected
DDXIRP2Chr2:167868183–168522158 del/dup (654 kb); del from exon 3/dup (includes whole gene + translocated del segment)Similar del (exon 4 on), multiple smaller CNV in same region.Xin actin binding repeat-containing protein 2, intercalated disc protein. KO mice (−/−): postnatal death, (+/−): normal EF. Human ventricle: expression levels increased (early) or reduced (late) with myocardial stress.2/2 affected, 1/2 unaffected
FQAK1Chr9:130621906–130645663 dup (238 kb); includes complete gene.Overlap with 3 full gene dup/multiple small del, in unaffected subjects.Adenylate kinase 1, enzyme involved in energy metabolism.2/2 affected

Database includes diverse patient and control cohorts;

Segregation analysis did not include phenotype-negative variant carriers aged less than 40 years.

CNV, copy number variant; DCM, dilated cardiomyopathy; DD, developmental delay; EF, ejection fraction; KO, knockout, PS, panel sequencing, SR, sarcoplasmic reticulum.

Value of WGS in additional family members

In two large kindreds that remained unsolved after PS and WGS testing of the proband, we undertook WGS in additional family members. In Family BP (Supplementary Figure S2), a MYBPHL p.R255* variant was identified in 3 family members but not in the proband. One of these, I-1, had been diagnosed with DCM and first degree atrioventricular block at 84 years of age. The other two variant carriers included an asymptomatic 56-year old male (II-1), and a 42-year old female with ventricular ectopy (II-6), neither of whom had DCM. MYBPHL has been recently described as a DCM disease gene, with homozygous and heterozygous knockout mice showing DCM and conduction-system abnormalities.[34] The same MYBPHL p.R255* variant was seen in a family with early-onset DCM and in an unrelated individual with left ventricular dilation.[34] The mutant MyBP-HL protein was function-altering, with reduced expression in human cardiomyocytes and abnormal myofilament localization in transfected neonatal mouse cardiomyocytes.[34] In Family BG (Supplementary Figure S2), WGS in two of the proband’s aunts (II-2, II-4) identified a novel LOF TTN variant. Sanger sequencing of this variant in all family members confirmed its absence from the proband and presence in three affected siblings.

Overall yield of WGS and reportable incidental findings

The yield of pathogenic/likely-pathogenic variants was increased from 21 families (50%) with PS, to 24 families (57%) following WGS analysis of panel genes and preliminary mining of the extended gene panel. Interrogation of the ACMG 57-gene list[17] (Supplementary Table S2) yielded three significant incidental findings. In the Family BG proband, we found a common function-altering GLA p.D313T variant associated with Fabry disease[35] which did not segregate with DCM in the family (Supplementary Figure S2). Two other probands were heterozygous carriers of common missense MUTYH variants that are annotated as pathogenic in ClinVar and associated with increased cancer risk.

DISCUSSION

Our data support WGS as a viable method for genetic testing in familial DCM with high detection accuracy for rare SNVs and indels in disease-associated genes. These findings concur with recently-reported results for WGS-based testing in hypertrophic cardiomyopathy[36] and extend pilot data for WGS in DCM.[11] A compelling argument in favor of WGS as a first-line testing method is its potential for ongoing data mining in unsolved cases. However, in an initial analysis we found that surprisingly few variants in an extended gene set data met clinical criteria for pathogenicity. Despite a lower overall sequencing depth than PS (and WES), WGS gave superior SNV and indel detection with less risk of variants being missed due to sequence gaps. WGS’s broad and uniform sequence coverage, even across exons targeted specifically by PS, results from sample preparation methods that avoid capture bias[37] and a lack of constraint to predefined sets of transcript isoforms and target regions. Some WGS-identified variants in panel genes were missed by PS because different gene reference sets were used. This limitation of PS could potentially be reduced by optimized probe design and comprehensive representation of tissue-specific isoforms.[6] The poor WES coverage observed over PS regions was recapitulated using three different exome capture kits, in data generated from two laboratories. Even at 300x depth, WES failed to achieve the same breadth of coverage as 34x WGS. Importantly we show that on target WES performance can be good, but often the target does not correspond to the nearby exon, resulting in potentially missed pathogenic variants. Coverage uniformity gives WGS a distinctive advantage for identification of SV, that account for 0.5–1% of heritable inter-individual sequence differences (compared to 0.1% for SNVs).[8] Although SV are increasingly implicated in human disorders,[8] we found relatively few pathogenic SV in cardiomyopathy-associated genes. These results may be skewed by ascertainment bias, since DCM associations of panel genes have predominantly been established by profiling SNVs and indels. SVs often involve pairs of flanking low copy repeats.[8] Known DCM-associated genes may lack these regions and hence be less susceptible to structural rearrangements. Extending our analysis to 406 cardiac-enriched genes resulted in the discovery of only 8 additional SV, none of which were clearly causative of DCM. One of the challenges in evaluating SV is assessment of their functional effects, especially for those variants that involve complex or partial duplications and deletions. Intuitively, changes in gene copy number would impact on gene “dose”, but this is not necessarily the case and a number of adaptive mechanisms may result in dosage compensation.[8] Even full gene or balanced duplications and deletions can have unpredictable effects if there is disruption of local or long-range genomic architecture that involves gene-regulatory sequences. To improve WGS yield, more genetic information is needed to identify patterns of differential variant prevalence in cases and controls, recurrent variant types, and mutation hotspots. Equally, more experimental data are required to identify function-altering variants and to show that these functional effects have plausible links to disease causation. In the ACMG criteria,[17] segregation of variants in affected family members provides support for DCM association, but the level of evidence can increase to moderate or strong with larger family sizes and/or multiple families. Segregation analysis in familial DCM relies on the assumption of single driver mutations but this is increasingly open to question.[38] As genetic evaluation has shifted from gene candidate screening studies to genome-wide analyses, it is not uncommon to find multiple rare variants in family probands[39] and, as seen with families BG and BP, the number of potential function-altering variants in any one family can increase as more individuals are studied. These combinations of variants may have additive, synergistic or neutralizing effects, and each person’s total burden of rare and common variants may determine threshold levels for myocardial dysfunction. Should WGS be used as a first-line genetic test for DCM or reserved for PS-negative cases? The answer to this question is currently unknown and involves a dynamic interplay between cost and yield (Supplementary Figure S3). Currently, the cost of WGS-based clinical testing is more than double that of PS. With increasing customer demand and technical efficiencies, the operational costs of both WGS and PS are likely to decrease over time. For PS, these reductions will be offset by the ongoing need to design and optimize new probe sets as new disease genes are discovered. The manpower costs of expert clinical reporting are essentially equivalent for PS and WGS since the same suites of disease genes are generally evaluated, but these costs may rise as the spectrum of reportable genes and variants expands. The turn-around time for WGS is equivalent or faster than routine PS (or WES), because of the time required for targeted sequence capture, together with delays incurred by waiting for samples to be pooled. Recent data have shown that fast-tracked WES-based genetic testing in acutely ill pediatric patients is however feasible and cost-saving, due to expedited diagnosis and management.[40] WGS is a storehouse of medically-relevant information that extends beyond identification of rare disease-causing variants, much of which cannot be obtained from PS or WES. For example, polygenic risk scores derived from suites of common variants (often in non-coding regions) may predict an individual’s risk of DCM complications, and pharmacogenomic associations may guide drug selection and doses. Comprehensive economic models need to be developed that consider variables such as cost per test, number of tests ordered, yield of tests, requirement for secondary testing of probands, cascade testing of family members, and the impact of genotype information on requirements for clinical surveillance of family members. Genotype-based early interventions may also impact more broadly on long-term health outcomes and costs, potentially impacting hospitalization rates, device implantation, drug administration, workforce productivity, and use of social services. Compelling health economics data would provide a powerful argument to insurance companies or governments for the overall benefits of genetic testing and the preferred testing modality.
  2 in total

1.  Multi-omics study revealing the complexity and spatial heterogeneity of tumor-infiltrating lymphocytes in primary liver carcinoma.

Authors:  Lijun Shi; Yang Zhang; Lin Feng; Liming Wang; Weiqi Rong; Fan Wu; Jianxiong Wu; Kaitai Zhang; Shujun Cheng
Journal:  Oncotarget       Date:  2017-05-23

2.  Detection and impact of rare regulatory variants in human disease.

Authors:  Xin Li; Stephen B Montgomery
Journal:  Front Genet       Date:  2013-05-31       Impact factor: 4.599

  2 in total
  18 in total

Review 1.  From Hypertrophy to Heart Failure: What Is New in Genetic Cardiomyopathies.

Authors:  Nosheen Reza; Kiran Musunuru; Anjali Tiku Owens
Journal:  Curr Heart Fail Rep       Date:  2019-10

Review 2.  Cardiovascular Genetics: The Role of Genetics in Predicting Risk.

Authors:  Jessica Chowns; Lily Hoffman-Andrews; Amy Marzolf; Nosheen Reza; Anjali Tiku Owens
Journal:  Med Clin North Am       Date:  2022-02-02       Impact factor: 5.456

3.  Exploring the Genetic Architecture of Spontaneous Coronary Artery Dissection Using Whole-Genome Sequencing.

Authors:  Ingrid Tarr; Stephanie Hesselson; Siiri E Iismaa; Emma Rath; Steven Monger; Michael Troup; Ketan Mishra; Claire M Y Wong; Pei-Chen Hsu; Keerat Junday; David T Humphreys; David Adlam; Tom R Webb; Anna A Baranowska-Clarke; Stephen E Hamby; Keren J Carss; Nilesh J Samani; Monique Bax; Lucy McGrath-Cadell; Jason C Kovacic; Sally L Dunwoodie; Diane Fatkin; David W M Muller; Robert M Graham; Eleni Giannoulatou
Journal:  Circ Genom Precis Med       Date:  2022-05-18

Review 4.  Clinical and genetic evaluation after sudden cardiac arrest.

Authors:  Stephanie L Harris; Steven A Lubitz
Journal:  J Cardiovasc Electrophysiol       Date:  2020-01-15

Review 5.  Advances in the role and mechanism of BAG3 in dilated cardiomyopathy.

Authors:  Leiling Liu; Kaijun Sun; Xiaojun Zhang; Ying Tang; Danyan Xu
Journal:  Heart Fail Rev       Date:  2021-01       Impact factor: 4.214

Review 6.  Next-generation sequencing in Charcot-Marie-Tooth disease: opportunities and challenges.

Authors:  Menelaos Pipis; Alexander M Rossor; Matilde Laura; Mary M Reilly
Journal:  Nat Rev Neurol       Date:  2019-10-03       Impact factor: 42.937

7.  Phenotypic Expression, Natural History, and Risk Stratification of Cardiomyopathy Caused by Filamin C Truncating Variants.

Authors:  Marta Gigli; Davide Stolfo; Sharon L Graw; Marco Merlo; Caterina Gregorio; Suet Nee Chen; Matteo Dal Ferro; Alessia PaldinoMD; Giulia De Angelis; Francesca Brun; Jean Jirikowic; Ernesto E Salcedo; Sylvia Turja; Diane Fatkin; Renee Johnson; J Peter van Tintelen; Anneline S J M Te Riele; Arthur A M Wilde; Neal K Lakdawala; Kermshlise Picard; Daniela Miani; Daniele Muser; Giovanni Maria Severini; Hugh Calkins; Cynthia A James; Brittney Murray; Crystal Tichnell; Victoria N Parikh; Euan A Ashley; Chloe Reuter; Jiangping Song; Daniel P Judge; William J McKenna; Matthew R G Taylor; Gianfranco Sinagra; Luisa Mestroni
Journal:  Circulation       Date:  2021-09-30       Impact factor: 29.690

Review 8.  Understanding the genetics of adult-onset dilated cardiomyopathy: what a clinician needs to know.

Authors:  Upasana Tayal; James S Ware; Neal K Lakdawala; Stephane Heymans; Sanjay K Prasad
Journal:  Eur Heart J       Date:  2021-06-21       Impact factor: 35.855

Review 9.  Importance of Genetic Testing in Dilated Cardiomyopathy: Applications and Challenges in Clinical Practice.

Authors:  Arsonval Lamounier Júnior; Filipe Ferrari; Renato Max; Luiz Eduardo Fonteles Ritt; Ricardo Stein
Journal:  Arq Bras Cardiol       Date:  2019-09-02       Impact factor: 2.000

Review 10.  Human Induced Pluripotent Stem-Cell-Derived Cardiomyocytes as Models for Genetic Cardiomyopathies.

Authors:  Andreas Brodehl; Hans Ebbinghaus; Marcus-André Deutsch; Jan Gummert; Anna Gärtner; Sandra Ratnavadivel; Hendrik Milting
Journal:  Int J Mol Sci       Date:  2019-09-06       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.