Literature DB >> 32969205

Evaluating systematic reanalysis of clinical genomic data in rare disease from single center experience and literature review.

Natalie B Tan1,2,3, Rachel Stapleton1, Zornitza Stark1,2, Martin B Delatycki1,2, Alison Yeung1,4,5, Matthew F Hunter4,5, David J Amor2,3,6, Natasha J Brown1,2,6,7, Chloe A Stutterd1,7, George McGillivray1, Patrick Yap8,9, Matthew Regan4,5, Belinda Chong1, Miriam Fanjul Fernandez1,2, Justine Marum1, Dean Phelan1, Lynn S Pais10, Susan M White1,2, Sebastian Lunke1,11, Tiong Y Tan1,2.   

Abstract

BACKGROUND: Our primary aim was to evaluate the systematic reanalysis of singleton exome sequencing (ES) data for unsolved cases referred for any indication. A secondary objective was to undertake a literature review of studies examining the reanalysis of genomic data from unsolved cases.
METHODS: We examined data from 58 unsolved cases referred between June 2016 and March 2017. First reanalysis at 4-13 months after the initial report considered genes newly associated with disease since the original analysis; second reanalysis at 9-18 months considered all disease-associated genes. At 25-34 months we reviewed all cases and the strategies which solved them.
RESULTS: Reanalysis of existing ES data alone at two timepoints did not yield new diagnoses. Over the same timeframe, 10 new diagnoses were obtained (17%) from additional strategies, such as microarray detection of copy number variation, repeat sequencing to improve coverage, and trio sequencing. Twenty-seven peer-reviewed articles were identified on the literature review, with a median new diagnosis rate via reanalysis of 15% and median reanalysis timeframe of 22 months.
CONCLUSION: Our findings suggest that an interval of greater than 18 months from the original report may be optimal for reanalysis. We also recommend a multi-faceted strategy for cases remaining unsolved after singleton ES.
© 2020 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals LLC.

Entities:  

Keywords:  exome sequencing; genome sequencing; rare disease; reanalysis

Mesh:

Year:  2020        PMID: 32969205      PMCID: PMC7667328          DOI: 10.1002/mgg3.1508

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

A recent scoping review of the literature on the clinical impact of genomic sequencing in pediatric patients reported a median diagnostic yield of 33.2% (Smith et al., 2019). A “negative” exome in the remaining two‐thirds of cases may be due to a number of factors. There are technical challenges such as imperfect sequencing, variant annotation, and filtering; together with difficulties in variant interpretation owing to incomplete knowledge of gene‐/variant‐disease associations, gene function; and the evolution of patient phenotype with time. One advantage of ES and genome sequencing (GS) is the capacity for reanalysis of existing data in unsolved cases. We applied different reanalysis strategies at two timepoints to evaluate the effectiveness of reanalyzing singleton ES data and compared against other strategies to reach a diagnosis in unsolved cases. To complement our reanalysis, we undertook a literature review of studies investigating the incremental diagnostic rate achieved via the reanalysis of ES/GS data. There has been no comprehensive literature review of reanalysis studies to date, making this a timely exercise in light of the recent statement by the American College of Medical Genetics and Genomics (ACMG) highlighting the need for reanalysis in undiagnosed cases (Deignan et al., 2019). Understanding the new diagnostic rate afforded by different reanalysis strategies will help to guide laboratories and clinicians considering reanalysis for their patients, as well as inform policy regarding funding schema for systematic reanalysis.

MATERIALS AND METHODS

Ethical compliance

Written consent was provided for each reanalysis case to be included in this research and additional research‐based analysis was undertaken through the Undiagnosed Diseases Program Victoria as approved by The Royal Children's Hospital Ethics Committee (HREC36291A).

Patient selection

We reanalyzed data from unsolved cases referred for ES to Victorian Clinical Genetics Services (VCGS) Clinical Genomics Laboratory over a 10‐month period between June 2016 and March 2017. Unsolved cases were defined as those with no diagnosis established on ES after analysis of known human Mendelian disorder genes, termed the “Mendeliome,” which comprised 3734 genes in July 2017. Selection criteria included a request for clinical genomic testing irrespective of indication and the provision of informed consent for genomic testing and reanalysis. The following data were collected: demographics, age, alive or deceased, and referral indication.

Study design

Our first reanalysis (“Tier 1,” Figure 1) involved reviewing only variants in genes newly associated with the disease since the time of original analysis. These genes are added as updates to the Mendeliome every 6 months (“Mendeliome Update”), and are generated from the review of current literature. Reanalysis using this list of updated Mendeliome genes was conducted in July 2017, between 4 and 13 months after the original ES report was generated. There were 570 genes added to the Mendeliome in July 2017, incorporating updates from July 2016, January 2017, and July 2017. Our second reanalysis (“Tier 2”) involved reviewing variants in all disease‐associated genes for each case. Interrogation of the existing exome data for the whole Mendeliome (4304 genes) occurred in November 2017, nine to 18 months after the initial ES report. Finally, we reviewed the status of all cases in April 2019 (25–34 months after the initial report) and collated all strategies that led to a diagnosis.
FIGURE 1

Study design

Study design

Exome sequencing, variant annotation, and interpretation

Singleton ES was performed at VCGS Pathology, an accredited (National Association of Testing Authorities, Australia) laboratory, using the Nextera Rapid Capture Exome kit (Illumina, San Diego, CA) on HiSeq 4000 sequencers (Illumina, San Diego, CA), with a target mean coverage of 100x and a minimum of 91% of bases sequenced to at least 15x. ES variant calls were reannotated in July 2017 (Tier 1 reanalysis) and November 2017 (Tier 2 reanalysis) using the latest respective version of our custom in‐house bioinformatics pipeline, Cpipe,(Sadedin et al., 2015) in order to generate annotated variant calls within the target region (coding exons ±2 base pairs), via alignment to the reference genome (GRCh37). Variants were annotated against all gene transcripts, with the curation of variants against the HGNC recommended transcript (according to HGVS nomenclature). Variant filtering was performed as previously described (Stark et al., 2016), with the data for each patient assessed and prioritized for classification by two clinical geneticists—the referring geneticist and a study geneticist (N.T.). Variant classification was performed using our in‐house variant classification scheme based on the American College of Medical Genetics and Genomics guidelines for variant interpretation (Richards et al., 2015). Class 3 variants (Variants of Uncertain Significance, VUS) were further sub‐classified as being of potential clinical relevance (class 3A), unknown clinical relevance (class 3B) or low clinical relevance (class 3C) based on the relevance of the variant to the patient phenotype and likelihood of being up‐ or down‐graded with additional evidence (e.g., segregation). Cases were selected for reanalysis if the original report was uninformative—being either a “negative” report (i.e., no clinically reportable variants were identified) or a report with class 3B or 3C variants. The process for patient selection and data analysis is summarized in Figure 2.
FIGURE 2

Summary of patient selection

Summary of patient selection

Literature search

PubMed, Embase, and Medline were searched in December 2019 for articles reporting reanalysis of ES/GS data in unsolved cases. Full search strategies are available as Supplementary Information. Complete reference details for all articles were imported into Endnote (Clarivate Analytics, Boston, MA) and duplicate records removed. Articles published after 1 January 2017 were selected for assessment of eligibility. Two independent reviewers (NT and TT) compared all titles and abstracts against eligibility criteria with any discrepancies resolved by consensus. One author (NT) reviewed each full‐text article and extracted relevant data. Articles that met the following criteria were included in the review: (1) peer‐reviewed original research article; (2) reanalysis was performed of ES or GS data for individuals who received no diagnosis at the time of reporting the original ES or GS result. Studies that reanalyzed patient data derived from targeted gene panels only were excluded. Studies of patients enrolled in a research protocol performing ES or GS for a clinical indication were included. Studies were limited to those published in English. No restrictions were placed on study design or the number of individuals reported. As a result, this literature review encompassed studies utilizing a number of different study methodologies across diverse patient populations. Therefore, results across studies were summarized and narratively described rather than statistically combined in a meta‐analysis.

RESULTS

Characteristics of the study population

One hundred and eleven patients in this cohort were referred for clinical ES during the 10‐month study period; 51 females and 60 males. At the time of the original analysis, pathogenic and likely pathogenic variants were identified for 36 cases, conferring a diagnostic rate of 32% (36/111). The remaining 75 cases received uninformative ES results. The median age at the time of the initial report date was similar for the diagnosed versus uninformative groups (3 vs. 3.5 years). The number of deceased individuals at the time of referral for ES was more than twice as high for the uninformative group than for the diagnosed group (Table 1). Whilst the referral indication varied, the most common phenotype tested was a syndromic neurodevelopmental disorder (56% of all referrals).
TABLE 1

Characteristics of the study population

Original ES report outcomeGenderMedian age at initial report dateDeceased at time of referralReferral indication
FemaleMaleSyndromic Neurodevelopmental DisorderNon‐Syndromic Neurodevelopmental DisorderMultiple Congenital AnomaliesSingle System Disorder
Diagnosed (36)18183 years (101 days to 43 years)1 (2.8%)25218 (ophthalmology 3, renal 3, dermatology 1, metabolic 1)
Undiagnosed (75)33423.5 years (31 days to 47 years)5 (6.7%)373728 (neurology 11, gastroenterology 4, ophthalmology 4, dermatology 2, oncology 2, respiratory 1, renal 1, immunology 1, endocrinology 1, growth 1)
Characteristics of the study population

Tier 1 reanalysis

Of the 75 original ES cases that were uninformative, 53 (42 negative cases and 11 class 3B/3C cases) underwent reanalysis applying only the Mendeliome Update genes in July 2017 (Figure 2). Class 3A cases were not examined at this stage as the majority were undergoing or had segregation testing performed, leading to the reclassification of these variants (Figure 2). Eighteen additional variants in 12 negative cases were curated, with no pathogenic or likely pathogenic variants ultimately identified. No additional variants of interest were proposed for curation in the 11 cases initially reported with 3B/3C variants. One case was solved at this stage via SNP array testing that identified an ATAD3 gene cluster deletion (patient S3 in Desai et al., 2017).

Tier 2 reanalysis

Fifty‐seven unsolved cases were fully reanalyzed after the Mendeliome was updated in November 2017. These cases comprised: 41 of the original 42 negative cases (owing to the solved ATAD3 case); the 11 cases with class 3B/3C variants; three cases that were originally reported with class 3A variants that were later downgraded to class 3B variants following segregation studies; and two cases where the referring clinician requested reanalysis despite class 3A variants being reported on the original ES. Forty‐nine variants in 20 cases were curated, the majority of which were found to be not diagnostic. One patient was identified as a heterozygous carrier of a pathogenic variant in GJB2, which is associated with autosomal recessive deafness (OMIM#220290). This condition was not associated with the patient's phenotype and was, therefore, an incidental carrier status finding. We identified two class 3A missense variants and one splice variant of interest in SYNE1 in a patient with congenital arthrogryposis. Segregation studies are being considered for these variants, as the phenotype is compatible with biallelic SYNE1 pathogenic variants (Attali et al., 2009). In a majority of cases (54%), no variants of interest were identified following this reanalysis.

Cases solved by additional strategies

By April 2019, 15 of the original 75 uninformative ES cases had been solved, conferring an additional diagnostic rate of 20%. The strategies leading to a resolution of these 15 cases were: (i) completion of segregation studies; (ii) completion of functional studies; (iii) enrolment in a gene discovery research program for trio ES/GS; and (iv) interrogation for genomic lesions either not tractable by ES or not well identified due to technical limitations of sequencing and variant filtration. Four diagnoses were established following segregation studies. Initial analysis of these cases had identified class 3A variants of potential clinical significance which were all upgraded to class 4 likely pathogenic variants following segregation testing. The segregation studies helped to establish: if a variant was de novo in a proband (NAA10, OMIM*300013); if a variant was inherited from an affected parent (CHD7, OMIM *608892); and if compound heterozygous variants were biparentally inherited (ADSL, OMIM*608222; SLC6A5, OMIM*604159). One case underwent functional studies (sedimentation profiling of mitochondrial protein complexes), together with segregation testing, to upgrade a homozygous variant in SLC25A46 (OMIM*610826) from class 3A to class 4 (Abrams et al., 2018). In five cases, variants in novel candidate genes were identified via the Undiagnosed Diseases Program Victoria (UDP‐Vic), a gene discovery research project in collaboration with the Broad Institute Center for Mendelian Genomics. Families recruited to this program underwent research‐based trio ES (of the proband and both biological parents). Twenty‐one of the 58 unsolved cases (36%) were enrolled in UDP‐Vic with seven diagnoses achieved (33%). Five of these potential diagnoses are in novel candidate genes that are strongly considered to be causative in light of preliminary functional studies and identification of two or more unrelated individuals with concordant phenotypes via the Matchmaker Exchange (Sobreira et al., 2017) initiative. Two diagnoses were made in established genes following family‐based ES via UDP‐Vic (cases 4 and 5 in Table 2).
TABLE 2

Results of solved cases—published genes

CaseMechanism to achieve diagnosisGene (*OMIM)Genomic variantPhenotype (#OMIM)Reason pathogenic variant not detected/selected in original singleton ES
1SNP array (Illumina Human CytoSNP‐12 v2.1) ATAD3B/3A (*612316/*612317) ATAD3B/3A 38,667 bp deletionCerebellar dysfunction with altered mitochondrial DNA and cholesterol metabolism (Desai et al., 2017)Not tractable by ES (gene cluster deletion)
2High‐density array (Illumina HumanOmni5 Exome‐4 v1.2) NIPBL (*608667)arr[hg19] 5p13.2(36791812_36895270)x1~2Cornelia de Lange syndrome (#122470)Not tractable by ES (mosaic intragenic deletion)
3Trio GS SCN8A (*600702)

NM_014191.3(SCN8A):c.2549G>A;

NP_055006.1(SCN8A):p.(Arg850Gln)

Early infantile epileptic encephalopathy 13 (#614558)Low coverage
4Trio ES CHD7 (*608892)

NM_017780.3(CHD7):c.5405‐7G>A;

NP_060250.2(CHD7):p.(=)

CHARGE syndrome (#214800)Low coverage
5Trio ES WDR45 (*300526)

NM_007075.3(WDR45):c.411_416delGTTTGA;

NP_009006.2(WDR45):p.(Glu137_Phe138del)

Neurodegeneration with brain iron accumulation 5 (#300894)Low variant fraction

Abbreviations: ES, exome sequencing; GS, genome sequencing; SNP, single nucleotide polymorphism.

Results of solved cases—published genes NM_014191.3(SCN8A):c.2549G>A; NP_055006.1(SCN8A):p.(Arg850Gln) NM_017780.3(CHD7):c.5405‐7G>A; NP_060250.2(CHD7):p.(=) NM_007075.3(WDR45):c.411_416delGTTTGA; NP_009006.2(WDR45):p.(Glu137_Phe138del) Abbreviations: ES, exome sequencing; GS, genome sequencing; SNP, single nucleotide polymorphism. Another three cases reached diagnoses in established genes via means other than the reannotation of existing singleton ES data. Two diagnoses were made via high resolution SNP‐based chromosomal microarray (cases 1 and 2 in Table 2). The remaining case was solved through trio GS (case 3 in Table 2) as part of a research study of brain malformations and leukodystrophies. Importantly, of the 37 unsolved cases that were not enrolled in the gene discovery research project, upon review in April 2019 three cases are no longer considered to be likely due to an underlying monogenic condition owing to the resolution of clinical features/symptoms with the passage of time.

Literature review

The number of articles identified by searches of PubMed, Embase, and Medline was 689, 754, and 1177, respectively. Complete reference details for all 2620 articles were imported into Endnote (Clarivate Analytics, Boston, MA) and duplicate records removed, leaving 1586 articles. Of these, 616 articles published after 1 January 2017 were assessed for eligibility. The full‐text of 27 articles were reviewed and the findings are summarized in Table 3. The strategy of reannotation which, for example, incorporates updated population frequency data and in silico pathogenicity scores, has been employed by a majority of studies identified by our literature review (72%) (Al‐Nabhani et al., 2018; Baker et al., 2019; Epilepsy Genetics, 2019; Bowling et al., 2017; Costain et al., 2018; Ewans et al., 2018; Hiatt et al., 2018; Jalkh et al., 2019; Li et al., 2019; Liu et al., 2019; Machini et al., 2019; Nambot et al., 2018; Ngo et al., 2020; Salmon et al., 2019; Schmitz‐Abe et al., 2019; Shashi et al., 2019; Trinh et al., 2019; Wenger et al., 2017; Wright et al., 2018; Xiao et al., 2018). The median new diagnosis rate via this approach across studies is 13% (0.46‐50; weighted average of 9%). Other reanalysis approaches have included: expansion from singleton to family ES studies (Eldomery et al., 2017; Epilepsy Genetics, 2018); revisiting of ES data only if a pathogenic variant is identified on subsequent GS (Alfares et al., 2018); repeat ES due to loss of original ES data (Shamseldin et al., 2017); GS following or in parallel with ES reanalysis (Shashi et al., 2019); and realignment to a new human genome reference build (Need et al., 2017). The median new diagnosis rate for reanalysis studies adopting any of the above approaches is 15% (0.08–83.33; weighted average of 7%), with a median reanalysis timeframe of 22 months (0–5 years).
TABLE 3

Comparison of reanalysis studies in the literature

PublicationDiagnostic platformCohort of undiagnosed cases with non‐diagnostic ES/GSApproach to reanalysis% New diagnoses (number of diagnoses in published genes OR detection of CNV or SV/number of unsolved cases reanalyzed after initial negative report)Reasons for new diagnosesReanalysis timeframeRecommendation
Exome sequencing (ES)
Wenger et al. (2017)Singleton ES40 probands (mostly paediatric), heterogeneous conditionsReannotation of ES data10 (4/40)New gene publication, relevant literature located at the time of reanalysis20 months (average)Reanalysis at 2‐ to 3‐year interval could result in a 10% diagnostic yield
Shamseldin et al. (2017)Singleton ES33 probands (parental consanguinity in all cases), presumed heterogeneous conditionsRepeat ES as data from original ES not available for reanalysis48.48 (16/33)Improved variant filtration via positional mappingNot reportedIncorporation of positional mapping in the analysis of ES whenever applicable
Need et al. (2017)Family ES6 families, heterogeneous conditionsRealignment to new human genome reference build, increased coverage83.33 (5/6)New gene publication, realignment to new human genome reference build~4 yearsMultifaceted approach to reanalyzing ES data should be a standard part of clinical diagnostic paradigms
Eldomery et al. (2017)ES (68 family studies, 6 singleton)74 families, heterogeneous conditionsExpansion to family ES studies, improved data filtering (SNV prioritization, de novo SNVs), CNV detection from ES data36.49 (27/74)New gene publication, gene discovery publication, putative parental mosaicism, biallelic or hemizygous variants identified via family ES studies, CNV detection from ES data (loss of heterozygosity, uniparental disomy, small CNV), translational research (GeneMatcher)<3 yearsReanalysis of data coupled with the incorporation of additional family member ES data can improve the molecular diagnostic rate
Epilepsy Genetics Initiative (2018)ES (2 family studies, 1 singleton)3 probands, epilepsy conditionsExpansion to family ES studies, reannotation of ES variant data0.08 (3/3747)Newly published alternate exon (updated consensus coding sequence database; CCDS)~3 to 4 years for family ES cases, singleton case not reportedIterative interrogation of ES data, with re‐evaluation of other well‐defined alternative exons in known epilepsy genes
Nambot et al. (2018)Singleton ES156 probands, neurodevelopmental disorder with or without congenital anomaliesReannotation of ES data, CNV detection from ES data15.38 (24/156)New gene publication, gene discovery publication, reclassification of originally reported variant, CNV detection from ES data, translational research (Matchmaker Exchange)<3 yearsProspective reanalysis of ES data in patients with no diagnosis, with consideration for trio ES (before GS) for cases that remain unsolved despite recurrent reanalyses
Xiao et al. (2018)Singleton ES or targeted sequencing19 probands (proportion of singleton ES not reported), neurodevelopmental disorder with or without congenital anomaliesReannotation of ES data, CNV detection from ES data26.32 (5/19)New gene publication, phenotype expansion publication, CNV detection from ES data8–18 monthsRe‐evaluation at 1‐ to 2‐year interval
Wright et al. (2018)Family ES861 families, neurodevelopmental disorders with or without congenital anomalies, abnormal growth parameters, dysmorphic features, unusual behavioral phenotypesReannotation of ES data, CNV detection from ES data21.14 (182/861)New gene publication, improved analysis pipelines (updated annotations and variant filtering thresholds), additional analytical methods (to detect chromosomal aneuploidy, CNV detection from ES data, mosaicism, non‐essential splice variants, uniparental disomy)~3 yearsIterative reinterpretation of already reported clinical sequencing data should become routine
Ewans et al. (2018)ES (28 family studies, 9 singleton)37 families, heterogeneous conditionsReannotation of ES data15.38 (4/26)New gene publication, improved analysis pipelines, updated patient phenotype information12 monthsReanalysis after 12 months or when instigated by referrers
Stark et al. (2019)Singleton ES29 probands, infants with suspected monogenic disordersRe‐evaluation of existing ES data13.79 (4/29)New gene publication6–18 monthsReanalysis at 18 months is a cost‐effective model for the storage and re‐examination of genomic data in clinical service delivery
Al‐Nabhani et al. (2018)Singleton ES50 probands, intellectual disabilityReannotation of ES data12 (6/50)New gene publication, improved analysis pipelines, updated patient phenotype information22 months (average)Reanalysis of negative exomes in this study of intellectual disability cases solved at least 12% of cases
Salmon et al. (2019 Singleton ES84 probands, heterogeneous conditionsReannotation of ES data15.48 (13/84)New gene publication, updated patient phenotype informationNot reportedReanalysis of exome data can increase the diagnostic yield and reduce the need for additional costly tests such as genome sequencing
Baker et al. (2019)ES (230 family studies, 10 singleton)240 probands, heterogeneous conditionsReannotation of ES data15.83 (38/240)New gene publication, phenotype expansion publication, candidate gene publication, reclassification of originally reported variant1.5 years (reported median)Automated reanalysis methods can facilitate efficient re‐evaluation of non‐diagnostic samples using up‐to‐date literature
Jalkh et al. (2019)Singleton ES101 probands, heterogeneous conditionsReannotation of ES data, variant filtration based on local ES control dataset12.87 (13/101)New gene publication, updated patient phenotype informationNot reportedES reanalysis should take into consideration updated bioinformatics tools, novel gene discoveries, and new clinical information
Li et al. (2019)Family ES76 families, epilepsy with intellectual disabilityReannotation of ES data10.53 (8/76)New gene publication, relevant literature located at the time of reanalysis, updated patient phenotype information, reclassification of synonymous variant affecting splicing0–12+ monthsSuitable time points for reanalysis might be 6–12 months after the initial report
Epilepsy Genetics Initiative (2019)ES (singleton and family studies)137 probands, epilepsy conditions (excludes 2 probands previously reported in Epilepsy Genetics Initiative, 2018)Reannotation of ES data4.38 (6/137)New gene publication, gene discovery publication, newly published alternate exon (updated consensus coding sequence database; CCDS), new OMIM entry, translational research (GeneMatcher)2.3 years (average)Periodic reinterrogation of unresolved exomes is critical to improving the diagnostic rate
Schmitz‐Abe et al. (2019 ES (singleton and family studies)75 probands, heterogeneous conditionsReannotation of ES data, improved data filtering (SNV calling and prioritization)8 (6/75)New gene publication, improved variant calling using custom‐built pipeline (VExP)1.9 ± 1.4 years (average)Important to reanalyze negative ES data periodically, preferably annually
Liu et al. (2019)Singleton ES188 probands (cohort 1) and 1496 probands (cohort 2), presumed heterogeneous conditionsReannotation of ES data31.91 (60/188 cohort 1) and 15.37 (230/1496 cohort 2)New gene publication, reclassification of originally reported variant, variant‐specific atypical phenotypic presentations, gene‐specific multiple disease inheritance patterns and mechanisms, newly discovered isoforms encompassing previously unknown exons, complex patient phenotypes obscured by multilocus molecular diagnoses5 years (sporadic cases performed earlier)Periodic, cost‐effective reanalysis may benefit patients and their families and physicians
Bruel et al. (2019)Singleton ES313 probands, ID/epileptic encephalopathy with or without congenital anomaliesRe‐evaluation of existing ES data, extending variant interpretation to genes not associated with human disease in OMIM15.34 (48/313)Novel gene discovery, relevant literature located at the time of reanalysis, phenotype expansion publication, translational research (GeneMatcher)Nil (reanalysis performed immediately after original ES result obtained)Limitations of singleton ES reanalysis could be overcome utilizing trio ES as a second step
Trinh et al. (2019)ES (singleton and family studies)3015 probands (heterogeneous conditions)Reannotation of ES data0.46 (14/3015)Focus on 14 genes recently nominated by the DDD study (new gene publications for 13 of 14 genes)Not reportedImportance of re‐evaluating ES data in light of new publications
Ngo et al. (2020)ES (singleton and family studies)60 probands (ataxic disorders)Reannotation of ES data, CNV detection from exome data8.33 (5/60)Known pathogenic variant detected, CNV detection from ES data5 yearsKey focus for undiagnosed cases on repeating bioinformatic analysis at regular intervals, and use of more comprehensive genomic tools and complete methods to identify mutation types currently not observed in ES as they become available
Genome sequencing (GS)
Costain et al. (2018)Singleton GS100 patients (paediatric), heterogeneous conditionsReannotation of GS variant calls10.94 (7/64)New gene publication, phenotype expansion publication, additional case reports3 yearsReanalysis every 1–2 years until diagnosis, or sooner if phenotype evolves
Machini et al. (2019)Singleton GS100 patients (50 with cardiomyopathy and 50 healthy)Reannotation of GS variant calls22 (22/100)New gene and variant publications, reclassification of originally reported variant, improved analysis pipelines13 months (mean)Reanalysis on an annual basis, with the frequency and utility of reanalysis to be guided by the presence of new symptoms or availability of new treatments
ES and/or GS
Bowling et al. (2017)ES or GS (singleton and family studies)211 families, neurodevelopmental disordersReannotation of ES/GS data4.74 (10/211)New gene publication, improved analysis pipelines, updated patient phenotype information, translational research (GeneMatcher)Not reportedSystematic reanalysis of genomic data should become standard practice
Hiatt et al. (2018)ES or GS (singleton and family studies)Number of undiagnosed probands not reported, total cohort of 494 individuals with neurodevelopmental disordersReannotation of ES/GS data1.57 (6/383)New gene publication, additional analytical methods (to detect uniparental disomy), translational research (GeneMatcher)Not reportedDatasets first analyzed over two years ago should be prioritised for reanalysis
Alfares et al. (2018)Singleton ES and GS108 patients, presumed heterogeneous conditionsReanalysis of ES data if a pathogenic variant was detected on GS30 (3/10)New gene publication5 months (average)Until the cost of GS approximates that of ES, reanalyzing ES raw data is recommended before performing GS
Shashi et al. (2019)ES and GS (singleton and family studies)38 probands (37 ES, 1 GS), heterogeneous conditionsReannotation of ES data, improved data filtering (in‐house bioinformatics tools), CNV detection from ES data, repeat ES/GS, GS following/in parallel with ES50 (19/38)Deep phenotyping of patients, new gene publication, CNV/SV, translational research (GeneMatcher)Not reportedGS should be utilized only after ES data have been extensively mined and combined with the phenotypic data to maximize its yield

Abbreviations: CNV, copy number variation; SNV, single nucleotide variation; SV, structural variant.

Comparison of reanalysis studies in the literature Abbreviations: CNV, copy number variation; SNV, single nucleotide variation; SV, structural variant.

DISCUSSION

Reanalysis of existing genomic data in unsolved cases increases diagnostic yield in the setting of well‐defined disease cohorts (e.g., epilepsy) as well as unselected cohorts with heterogeneous conditions (Table 3). The first publication to address reanalysis systematically (Wenger et al., 2017) did so by reanalyzing negative exome cases for 40 mostly pediatric patients via the reannotation of ES data using updated bioinformatics pipelines. Re‐evaluation of existing ES data conducted without reannotation using updated software and/or reference databases has achieved an incremental diagnosis rate of 14% via reanalysis at six to 18 months after the original analysis (Stark et al., 2019). This approach was also applied by Bruel et al. (2019), with the authors performing reanalysis immediately after obtaining the original ES result. The particular focus of the Bruel study was to extend variant interpretation to genes not currently associated with disease in OMIM for the purpose of research. As such, the need to wait for the benefit of newly published data (e.g., new gene discovery, functional studies) and updated resources (e.g., gnomAD) was not essential, making immediate re‐evaluation of existing ES data feasible in this setting. Whilst Nambot et al. (2018) reported annual reanalysis over three years, we applied different reanalysis strategies at two timepoints to elucidate the impact of each approach. We elected to reanalyze only genes updated to the Mendeliome in the first instance (Tier 1 reanalysis) in an effort to target newly published genes given that more than three‐quarters of reanalysis studies demonstrate that new diagnoses are achieved via this mechanism (Alfares et al., 2018; Al‐Nabhani et al., 2018; Baker et al., 2019; Epilepsy Genetics, 2019; Bowling et al., 2017; Costain et al., 2018; Eldomery et al., 2017; Ewans et al., 2018; Hiatt et al., 2018; Jalkh et al., 2019; Li et al., 2019; Liu et al., 2019; Machini et al., 2019; Nambot et al., 2018; Need et al., 2017; Salmon et al., 2019; Schmitz‐Abe et al., 2019; Shashi et al., 2019; Stark et al., 2019; Trinh et al., 2019; Wenger et al., 2017; Wright et al., 2018; Xiao et al., 2018). We did not see a benefit from this approach alone, most likely due to the short period of time between initial analysis and our first reanalysis time point (4–13 months). Our second approach looked at all Mendeliome genes (Tier 2 reanalysis) to evaluate if new diagnoses would be delivered by new gene publications, newly reported genotype‐phenotype associations (“expanded phenotype” literature), or in light of new/evolving patient features. It was also hypothesized that this comprehensive reanalysis may reveal diagnoses that were “missed” at original analysis. No new diagnoses were made via this strategy at 9–18 months after the initial analysis. Once again, this may be due to reanalysis being performed too soon after initial analysis given that the median timeframe for reanalysis across all published studies has been 22 months (where reported; Table 3). New diagnoses in this study were identified using approaches other than the reanalysis of singleton ES data. The successful strategies delineate several important lessons: (i) the need for other testing methodologies to detect genomic lesions not tractable by ES; (ii) consideration for repeated/updated ES for improved coverage; (iii) expansion to family sequencing studies to highlight de novo variants in an affected proband; and (iv) utilizing translational research to pursue novel gene discovery. In our study, there were two instances of copy number variation (CNV) that were not tractable by ES. One case was published after the initial ES as part of a discovery publication describing complex gene cluster deletions involving the ATAD3 gene locus (Desai et al., 2017). ATAD3 cluster deletions were not a recognized entity at the time of reporting the original microarray and this CNV was not tractable by ES. The second case had a mosaic intragenic deletion of NIPBL that was detected on a high‐density SNP‐based array (Illumina HumanOmni5 Exome‐4 v1.2) that afforded better probe coverage of the gene compared to the array platform utilized for the original microarray (Illumina Human CytoSNP‐12 v2.1). Both genomic lesions lay outside of the ES data generated. Examples of other variants that fall into this category include intronic variants, deep intronic splice variants, nucleotide repeat expansions, and structural variants such as chromosomal rearrangements. Improvements in genomic data analysis to detect CNVs (Pfundt et al., 2017) and repeat expansions (Tankard et al., 2018) may mitigate the need for some forms of adjunctive testing in the future. There were two cases for which pathogenic missense variants were identified following repeat sequencing with improved coverage of the genes of interest (SCN8A, CHD7). In both cases, the variant was detected on original ES in only five reads. Both cases underwent repeat research‐based ES or GS as part of family sequencing studies in different research projects. The improved coverage helped to bring these variants to clinical attention. The CHD7 variant was also of poor sequencing quality and was filtered out of the variant callset by the standard analysis pipeline used in the original analysis. The SCN8A variant was present in the original exome data and of reasonable sequencing quality and thus should have been detectable as a variant of clinical interest. One patient was diagnosed owing to a de novo in‐frame deletion in WDR45, which is associated with the condition neurodegeneration with brain iron accumulation 5 (NBIA5). This variant was detected on the original ES with a low variant fraction of 0.21. If the variant fraction is particularly low, as in this case, then it is possible to dismiss these calls as being indicative of somatic mosaicism or artifactual. The WDR45 in‐frame deletion was on this occasion brought to clinical attention when highlighted as de novo in the proband via family‐based ES studies. The variant fraction on repeat ES was 0.41. One‐third of the diagnoses in this study have been derived from novel gene discovery, illustrating the significance of pursuing translational research when attempting to solve the unsolved. In particular, international data sharing through platforms such as GeneMatcher (Sobreira et al., 2015) has been a successful strategy employed by several reanalysis studies to date, with collaborations between research groups resulting in numerous diagnoses and novel gene publications (Epilepsy Genetics, 2019; Bowling et al., 2017; Bruel et al., 2019; Eldomery et al., 2017; Hiatt et al., 2018; Nambot et al., 2018). One of the limitations of reanalyzing ES cases is the time commitment required. In this study, reanalysis was particularly labor‐intensive for the referring clinician who was asked to participate in the prioritization of variants for classification as they are best positioned to identify variants of relevance to the phenotype. This familiarity with their patients may introduce clinician bias against the selection of promising variants, with variants dismissed as being not of clinical relevance when the patient may have an atypical presentation of a condition that the clinician had yet to consider. Another limitation of this study is that we did not examine the full impact of family‐based ES studies on the new diagnosis rate as not all cases proceeded to family‐based ES. However, we previously evaluated the diagnostic efficacy of trio versus singleton ES and found that the incremental diagnostic yield of trio ES is low in our local context (Tan et al., 2019). There are several reasons why a case may remain negative after ES. The variant‐disease association may not yet be established in which case serial reanalysis of existing ES data may disclose a diagnosis in the fullness of time. This approach needs to be balanced against the time and cost associated with reanalysis, a labor‐intensive exercise for laboratory scientists and clinicians. The workload required for iterative reanalysis of unsolved ES cases was recently modeled by Baker and colleagues, with an estimated variant analysis burden of five novel annotations per singleton ES case per month (Baker et al., 2019). Whilst their study provides a benchmark to assist clinical laboratories in determining the frequency with which to perform reanalysis, further studies are required to assess optimal strategies for cost‐effectiveness and scalability of routine reanalysis. A case will also remain unsolved if the gene‐disease association has not yet been well established. Recent statistics suggest that the rate of gene discovery may be slowing (Boycott et al., 2017), and thus the time taken for new gene publications to translate to new diagnoses may mean longer reanalysis timeframes are required before a diagnosis is reached. Another reason may be that the disease is not Mendelian in origin, but rather due to a mitochondrial DNA defect, oligogenic or more complex genetic disorder, or the result of an epigenetic or somatic variant. Finally, as seen in this study, a case will remain negative after ES if the underlying cause is not genetic, as with exposures to teratogenic, environmental or infectious agents. Given that our study did not achieve any additional diagnoses via singleton ES reanalysis, we are unable to draw conclusions from this data regarding the optimal timing of reanalysis. However, the review of literature suggests considering the reanalysis of singleton ES data for unsolved cases after 18 months from the time of the original report. As all additional diagnoses in this study were derived from strategies other than reanalysis of singleton ES data, this illustrates the need to implement a multi‐faceted strategy for cases remaining unsolved after singleton ES, any of which may be employed ahead of reanalysis.

CONFLICT OF INTEREST

The authors have no conflicts of interest to declare.

AUTHOR CONTRIBUTIONS

All authors have made substantial contributions to conception and design, or acquisition of data, or analysis and interpretation of data; NBT, ZS, MBD, AY, MFH, DJA, NJB SMW, SL, and TYT have been involved in drafting the manuscript or revising it critically for important intellectual content; and all authors have given final approval of the version to be published. Supplementary Material Click here for additional data file.
  42 in total

1.  A head-to-head evaluation of the diagnostic efficacy and costs of trio versus singleton exome sequencing analysis.

Authors:  Tiong Yang Tan; Sebastian Lunke; Belinda Chong; Dean Phelan; Miriam Fanjul-Fernandez; Justine E Marum; Vanessa Siva Kumar; Zornitza Stark; Alison Yeung; Natasha J Brown; Chloe Stutterd; Martin B Delatycki; Simon Sadedin; Melissa Martyn; Ilias Goranitis; Natalie Thorne; Clara L Gaff; Susan M White
Journal:  Eur J Hum Genet       Date:  2019-07-18       Impact factor: 4.246

2.  Analyzing and Reanalyzing the Genome: Findings from the MedSeq Project.

Authors:  Kalotina Machini; Ozge Ceyhan-Birsoy; Danielle R Azzariti; Himanshu Sharma; Peter Rossetti; Lisa Mahanta; Laura Hutchinson; Heather McLaughlin; Robert C Green; Matthew Lebo; Heidi L Rehm
Journal:  Am J Hum Genet       Date:  2019-06-27       Impact factor: 11.025

3.  Unique bioinformatic approach and comprehensive reanalysis improve diagnostic yield of clinical exomes.

Authors:  Klaus Schmitz-Abe; Qifei Li; Samantha M Rosen; Neeharika Nori; Jill A Madden; Casie A Genetti; Monica H Wojcik; Sadhana Ponnaluri; Cynthia S Gubbels; Jonathan D Picker; Anne H O'Donnell-Luria; Timothy W Yu; Olaf Bodamer; Catherine A Brownstein; Alan H Beggs; Pankaj B Agrawal
Journal:  Eur J Hum Genet       Date:  2019-04-12       Impact factor: 4.246

4.  Automated Clinical Exome Reanalysis Reveals Novel Diagnoses.

Authors:  Samuel W Baker; Jill R Murrell; Addie I Nesbitt; Kieran B Pechter; Jorune Balciuniene; Xiaonan Zhao; Zhenming Yu; Elizabeth H Denenberg; Elizabeth T DeChene; Alisha B Wilkens; Elizabeth J Bhoj; Qiaoning Guan; Matthew C Dulik; Laura K Conlin; Ahmad N Abou Tayoun; Minjie Luo; Chao Wu; Kajia Cao; Mahdi Sarmady; Emma C Bedoukian; Jennifer Tarpinian; Livija Medne; Cara M Skraban; Matthew A Deardorff; Ian D Krantz; Bryan L Krock; Avni B Santani
Journal:  J Mol Diagn       Date:  2019-01       Impact factor: 5.568

5.  GeneMatcher: a matching tool for connecting investigators with an interest in the same gene.

Authors:  Nara Sobreira; François Schiettecatte; David Valle; Ada Hamosh
Journal:  Hum Mutat       Date:  2015-08-13       Impact factor: 4.878

6.  Points to consider in the reevaluation and reanalysis of genomic test results: a statement of the American College of Medical Genetics and Genomics (ACMG).

Authors:  Joshua L Deignan; Wendy K Chung; Hutton M Kearney; Kristin G Monaghan; Catherine W Rehder; Elizabeth C Chao
Journal:  Genet Med       Date:  2019-04-24       Impact factor: 8.822

7.  Increasing the sensitivity of clinical exome sequencing through improved filtration strategy.

Authors:  Hanan E Shamseldin; Sateesh Maddirevula; Eissa Faqeih; Niema Ibrahim; Mais Hashem; Ranad Shaheen; Fowzan S Alkuraya
Journal:  Genet Med       Date:  2016-10-06       Impact factor: 8.822

8.  Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

Authors:  Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm
Journal:  Genet Med       Date:  2015-03-05       Impact factor: 8.822

9.  Novel pathogenic variants and multiple molecular diagnoses in neurodevelopmental disorders.

Authors:  Joanne Trinh; Krishna Kumar Kandaswamy; Martin Werber; Maximilian E R Weiss; Gabriela Oprea; Shivendra Kishore; Katja Lohmann; Arndt Rolfs
Journal:  J Neurodev Disord       Date:  2019-06-25       Impact factor: 4.025

10.  Systematic reanalysis of clinical exome data yields additional diagnoses: implications for providers.

Authors:  Aaron M Wenger; Harendra Guturu; Jonathan A Bernstein; Gill Bejerano
Journal:  Genet Med       Date:  2016-07-21       Impact factor: 8.822

View more
  11 in total

1.  Common Variable Immunodeficiency and Neurodevelopmental Delay Due to a 13Mb Deletion on Chromosome 4 Including the NFKB1 Gene: A Case Report.

Authors:  Clara Franco-Jarava; Irene Valenzuela; Jacques G Riviere; Marina Garcia-Prat; Mónica Martínez-Gallo; Romina Dieli-Crimi; Neus Castells; Laura Batlle-Masó; Pere Soler-Palacin; Roger Colobran
Journal:  Front Immunol       Date:  2022-06-17       Impact factor: 8.786

2.  Reanalysis of exome negative patients with rare disease: a pragmatic workflow for diagnostic applications.

Authors:  Gaby Schobers; Jolanda H Schieving; Michèl A A P Willemsen; Lisenka E L M Vissers; Helger G Yntema; Maartje Pennings; Rolph Pfundt; Ronny Derks; Tom Hofste; Ilse de Wijs; Nienke Wieskamp; Simone van den Heuvel; Jordi Corominas Galbany; Christian Gilissen; Marcel Nelen; Han G Brunner; Tjitske Kleefstra; Erik-Jan Kamsteeg
Journal:  Genome Med       Date:  2022-06-17       Impact factor: 15.266

3.  Lessons learnt from multifaceted diagnostic approaches to the first 150 families in Victoria's Undiagnosed Diseases Program.

Authors:  Thomas Cloney; Lyndon Gallacher; Lynn S Pais; Natalie B Tan; Alison Yeung; Zornitza Stark; Natasha J Brown; George McGillivray; Martin B Delatycki; Michelle G de Silva; Lilian Downie; Chloe A Stutterd; Justine Elliott; Alison G Compton; Alysia Lovgren; Ralph Oertel; David Francis; Katrina M Bell; Simon Sadedin; Sze Chern Lim; Guy Helman; Cas Simons; Daniel G Macarthur; David R Thorburn; Anne H O'Donnell-Luria; John Christodoulou; Susan M White; Tiong Yang Tan
Journal:  J Med Genet       Date:  2021-11-05       Impact factor: 5.941

4.  Paediatric genomic testing: Navigating medicare rebatable genomic testing.

Authors:  Rani Sachdev; Mike Field; Gareth S Baynam; John Beilby; Maria Berarducci; Yemima Berman; Tiffany Boughtwood; Marie B Cusack; Vanessa Fitzgerald; Jeffery Fletcher; Mary-Louise Freckmann; Natalie Grainger; Edwin Kirk; Ben Lundie; Sebastian Lunke; Lesley McGregor; David Mowat; Gayathri Parasivam; Vanessa Tyrell; Mathew Wallis; Susan M White; Alan S L Ma
Journal:  J Paediatr Child Health       Date:  2021-02-10       Impact factor: 1.954

Review 5.  Methods to Improve Molecular Diagnosis in Genomic Cold Cases in Pediatric Neurology.

Authors:  Magda K Kadlubowska; Isabelle Schrauwen
Journal:  Genes (Basel)       Date:  2022-02-11       Impact factor: 4.096

6.  Genetic Testing of Inherited Retinal Disease in Australian Private Tertiary Ophthalmology Practice.

Authors:  Sena A Gocuk; Yuanzhang Jiao; Alexis Ceecee Britten-Jones; Nathan M Kerr; Lyndell Lim; Simon Skalicky; Richard Stawell; Lauren N Ayton; Heather G Mack
Journal:  Clin Ophthalmol       Date:  2022-04-13

Review 7.  Evaluating systematic reanalysis of clinical genomic data in rare disease from single center experience and literature review.

Authors:  Natalie B Tan; Rachel Stapleton; Zornitza Stark; Martin B Delatycki; Alison Yeung; Matthew F Hunter; David J Amor; Natasha J Brown; Chloe A Stutterd; George McGillivray; Patrick Yap; Matthew Regan; Belinda Chong; Miriam Fanjul Fernandez; Justine Marum; Dean Phelan; Lynn S Pais; Susan M White; Sebastian Lunke; Tiong Y Tan
Journal:  Mol Genet Genomic Med       Date:  2020-09-23       Impact factor: 2.183

Review 8.  Clinical Exome Reanalysis: Current Practice and Beyond.

Authors:  Jianling Ji; Marco L Leung; Samuel Baker; Joshua L Deignan; Avni Santani
Journal:  Mol Diagn Ther       Date:  2021-07-20       Impact factor: 4.476

9.  CNV Detection from Exome Sequencing Data in Routine Diagnostics of Rare Genetic Disorders: Opportunities and Limitations.

Authors:  Beryl Royer-Bertrand; Katarina Cisarova; Florence Niel-Butschi; Laureane Mittaz-Crettol; Heidi Fodstad; Andrea Superti-Furga
Journal:  Genes (Basel)       Date:  2021-09-16       Impact factor: 4.096

10.  Paediatric genomic testing: Navigating genomic reports for the general paediatrician.

Authors:  Margit Shah; Arthavan Selvanathan; Gareth Baynam; Yemima Berman; Tiffany Boughtwood; Mary-Louise Freckmann; Gayathri Parasivam; Susan M White; Natalie Grainger; Edwin P Kirk; Alan Sl Ma; Rani Sachdev
Journal:  J Paediatr Child Health       Date:  2021-08-23       Impact factor: 1.929

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.