Literature DB >> 32794656

Frequency and spectrum of actionable pathogenic secondary findings in Taiwanese exomes.

Chieh-Wen Kuo1, Wuh-Liang Hwu2,3, Yin-Hsiu Chien2,3, Ching Hsu4, Miao-Zi Hung3, I-Lin Lin3, Feipei Lai4, Ni-Chung Lee2,3.   

Abstract

BACKGROUND: Exome sequencing has recently become more readily available, and more information about incidental findings has been disclosed. However, data from East Asia are scarce. We studied the application of exome sequencing to the identification of pathogenic/likely pathogenic variants in the ACMG 59 gene list and the frequency of these variants in the Taiwanese population.
METHODS: This study screened 161 Taiwanese exomes for variants from the ACMG 59 gene list. The identified variants were reviewed based on information from different databases and the available literature and classified according to the ACMG standard guidelines.
RESULTS: We identified seven pathogenic/likely pathogenic variants in eight individuals, with five participants with autosomal recessive variants in one allele and three participants with autosomal dominant variants. Approximately 1.86% (3/161) of the Taiwanese individuals had a reportable pathogenic/likely pathogenic variant as determined by whole-exome sequencing (WES), which was comparable to the proportions published previously in other countries. We further investigated the high carrier rate of rare variants in the ATP7B gene, which might indicate a founder effect in our population.
CONCLUSION: This study was the first to provide Taiwanese population data of incidental findings and emphasized a high carrier rate of candidate pathogenic/likely pathogenic variants in the ATP7B gene.
© 2020 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals LLC.

Entities:  

Keywords:  Taiwanese; incidental finding; whole-exome sequencing

Mesh:

Substances:

Year:  2020        PMID: 32794656      PMCID: PMC7549563          DOI: 10.1002/mgg3.1455

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Whole‐exome sequencing (WES) has become more feasible; therefore, more data are available to clinicians and may provide clues to previously obscure diseases. However, in addition to the genetic information being disclosed, another issue has developed, namely, the incidental discovery of secondary findings that are unrelated to the primary reason for ordering sequencing but that are of medical importance and could be used as a basis to improve a patient's health. In 2013, the American College of Medical Genetics (ACMG) recommended that a list of 56 genes should be reported by a laboratory to the ordering clinician, regardless of the indication for ordering sequencing, and that for most genes, only variants reported previously or predicted to be pathogenic should be reported (Green et al., 2013). The minimal 56‐gene list was revised to contain 59 genes in 2016 (Kalia et al., 2017). To identify pathogenic variants, which is also a challenging issue, the ACMG, the Association for Molecular Pathology (AMP) and the College of American Pathologists (CAP) jointly recommended the classification of variants into five categories based on set criteria: “pathogenic,” “likely pathogenic,” “uncertain significance,” “likely benign,” and “benign.” These criteria included typical types of evidence, such as population data, computational data, functional data, and segregation data (Richards et al., 2015). Recently, to improve the ACMG interpretation framework, the removal of criteria PP5 and BP6 (Biesecker & Harrison, 2018) and the development of a four‐step framework for criteria PS3 and BS3 (Brnich et al., 2020) were suggested. Studies have proposed that the prevalence of pathogenic variants in actionable genes varies among different ethnic backgrounds. One study evaluated actionable pathogenic single‐nucleotide variants in 500 European‐ and 500 African‐descent participants, which showed frequencies of 3.4% and 1.2%, respectively (Dorschner et al., 2013). Another population‐based study including 196 Korean exomes revealed 11 pathogenic or likely pathogenic variants in 13 individuals (Jang, Lee, Kim, & Ki, 2015), while another Korean study involving 1303 exomes revealed 13 pathogenic and 13 likely pathogenic variants on the ACMG 59 gene list with a carrier rate of 2.46% (Kwak et al., 2017). A Japanese study of 2049 individuals undergoing whole‐genome sequencing reported 143 reported pathogenic variants for the 57 autosomal ACMG recommended genes and 21% of the individuals with at least one reported pathogenic allele according to public databases of pathogenic variations (Yamaguchi‐Kabata et al., 2018). Subsequently, data from 1005 whole exomes and genomes in Qatar disclosed a frequency of 0.59% actionable pathogenic or likely pathogenic variants in the population, which was lower than the frequencies previously reported in European and African populations (Jain, Gandhi, Koshy, & Scaria, 2018). Although there is increasing attention being paid to reporting pathogenic variants in actionable genes, no study has been published to evaluate the prevalence rate of these variants in the Taiwanese population. Hence, this study analyzed WES data from 161 Taiwanese individuals and discussed the reporting of pathogenic/likely pathogenic variants on the ACMG 59 gene list in the Taiwanese population.

MATERIALS AND METHODS

Ethical compliance

This study was approved by the institutional review board of National Taiwan University Hospital (IRB NTUH 201703073RINB and 201505135RINA).

Patient enrollment

From 2017/6 to 2019/6, 166 unrelated patients suspected of having genetic disease who underwent exome sequencing were retrospectively analyzed. Of them, 80 underwent rapid trio‐exome sequencing (TruSeq Exome Kit), and 86 underwent single‐exome sequencing (Agilent V6) by a third‐party company; the results were analyzed by the Biomedical Genetic Laboratory of National Taiwan University Hospital. Five patients were excluded since the variants involved in the suspected diseases were included in the ACMG 59 gene list. Therefore, a total of 161 cases without parental data were further analyzed for incidental findings. Three milliliters of whole blood was collected in an EDTA tube for DNA extraction using a Puregene DNA extraction system (Qiagen) after obtaining informed written consent. According to the informed consent, secondary findings were not disclosed to the participants.

Exome sequencing

Exome capture was performed using either the TruSeq Exome Capture Kit (Illumina) or the SureSelect Human All Exon V6 Kit (Agilent). Sequencing was performed using the NextSeq500 (Illumina) or HiSeq4000 (Agilent) kit. A 75‐bp paired‐end run was performed. A mean raw coverage over 100‐fold was obtained for each sample. Sequence alignment to the human reference genome (GRCh37) was performed using the Burrows–Wheeler aligner (BWA), and variant calling was performed using the Genome Analysis Tool Kit (GATK V3.5, Broad Institute) (Wang, Li, & Hakonarson, 2010).

Variant prioritization and interpretation

Variants were first annotated by wANNOVAR (http://wannovar.wglab.org/), followed by our in‐house pipeline (Richards et al., 2015; Wu et al., 2019). Variant pathogenicity from ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/), the inheritance pattern from OMIM (https://omim.org/), and the allele frequency from the Taiwan biobank (https://taiwanview.twbiobank.org.tw/) were then added to the annotated file. The pathogenicity of variants was classified according to the American College of Medical Genetics and Genomics and the Association for Molecular Pathology guidelines (Biesecker & Harrison, 2018; Brnich et al., 2020; Retterer et al., 2016). Moreover, variants were reviewed based on information from different databases, such as the Human Gene Mutation Database (HGMD), ClinVar, Clinvitae, EmVClass, InterVar (Li & Wang, 2017), the M‐cap score (Jagadeesh et al., 2016), and VarSome (Kopanos et al., 2019), and for variants located in splicing regions, the Human Splicing Finder (HSF) (Desmet et al., 2009) and MaxEnt Scan (Yeo & Burge, 2004) were applied to evaluate the impact of alternating splicing sites.

Criteria for actionable gene list variants

As algorithm shown in Figure 1, the incidental findings listed as disease‐causing in the HGMD or pathogenic/likely pathogenic in ClinVar were included in the reported group. Additionally, the other variants were included in the candidate group if they met the following criteria: 1. a maximal minor allele frequency <0.05; 2. a variant calling quality >300; and 3. a location at an exon/splice site.
Figure 1

Algorithm of the selection of pathogenic or likely pathogenic variants. The exomes were first screened for variants from the ACMG 59 gene list. Then, the identified variants were included into the reported group or candidate group if they met the criteria. After application of all available lines of evidence and the ACMG standard guidelines, the identified variants were classified into three categories: pathogenic/likely pathogenic, uncertain significance, and benign/likely benign

Algorithm of the selection of pathogenic or likely pathogenic variants. The exomes were first screened for variants from the ACMG 59 gene list. Then, the identified variants were included into the reported group or candidate group if they met the criteria. After application of all available lines of evidence and the ACMG standard guidelines, the identified variants were classified into three categories: pathogenic/likely pathogenic, uncertain significance, and benign/likely benign Afterward, the variants in both groups were reviewed according to all available lines of evidence, such as the primary literature cited in the HGMD and in PubMed. The ACMG standard guidelines (Richards et al., 2015) were applied to classify the variants into three categories: pathogenic/likely pathogenic, uncertain significance, and benign/likely benign.

RESULTS

Prevalence of variants from actionable gene list in the Taiwanese population

The data set analyzed in this research was composed of 161 whole exomes. As shown in Table 1 and Figure 1, a total of 3122 distinct variants of the 59 ACMG reportable genes were identified in the 161 exomes. Of these, 76 potentially pathogenic variants were classified as disease‐causing in the HGMD or pathogenic/likely pathogenic in ClinVar (reported group). In addition, 275 variants were identified based on the above criteria (candidate group).
Table 1

Classification of identified variants

ClassificationVariants (N)
Total variants for 59 ACMG reportable genes3122
Reported group76
Candidate group275
Pathogenic/likely pathogenic variantsReported group4
Candidate group3
Sum7
Uncertain significance variantsReported group37
Candidate group165
Sum202
Benign/likely benign variantsReported group35
Candidate group107
Sum142
Classification of identified variants In total, all 351 variants were reviewed according to all available lines of evidence, such as the primary literature cited in the HGMD or PubMed and the ACMG standard guidelines. The analysis revealed seven potentially pathogenic variants with four in the reported group and three in the candidate group, and ATP7B c.2804C>T was identified in two individuals. Detailed information on these seven potentially pathogenic variants is listed in Table 2.
Table 2

Pathogenic or likely pathogenic variants based on the ACMG guidelines

Variant informationDatabaseFrequency
GeneOMIM numberChr: posVariantAA changedbSNPACMG interpretation resultCriteriaDisease‐associatedEmVClassClinvitaeM‐CAP scoreClinVarHGMDVarSomeInterVarMaximum minor Allele frequencyFrequency in Taiwan
MSH6 1204352:48026566NM_000179.2:c.1444C>TR482Xrs63750909PPVS1 PM1 PM2 PP3Lynch syndromeNAPPPPDMPP0.0006NA
TSC2 61325416:2100489NM_000548.5:c.225+2T>CNALPPVS1 PM2 PP3Tuberous sclerosis syndromeNANANALPNAPNANANA
SCN5A 6038303:38645439NM_000335.5:c.1654G>CG552Rrs3918389LPPS1 PM1 PM2 PP3Brugada syndromeNANAPPNANALPLP0.0002NA
ATP7B 27790013:52511739NM_000053.4:c.3775_3776insAAAGG1259Efs*14NAPPVS1 PM1 PM2 PP3Wilson diseaseNANANANANAPNANANA
ATP7B 27790013:52523835NM_000053.4:c.2828G>AG943Drs779323689LPPS1 PM1 PM2 PP3Wilson diseaseNAP/LPPPP/LPDMLPLP0.00190.00066
ATP7B 27790013:52523859NM_000053.4:c.2804C>T a T935Mrs750019452LPPS1 PM1 PM2 PP3Wilson diseaseNAP/LPPPP/LPDMLPLP0.00240.00198
ATP7B 27790013:52532469NM_000053.4:c.2333G>TR778Lrs28942074PPS1 PS3 PM1 PM2 PP3Wilson diseasePPPPPDMPLP0.00230.00165

Abbreviations: DM, disease‐causing mutation; LB, likely benign; LP, likely pathogenic; NA, not available; P, pathogenic; PP, possibly pathogenic; VUS, variant of uncertain significance.

ATP7B c.2804C>T was identified in two individuals.

Pathogenic or likely pathogenic variants based on the ACMG guidelines Abbreviations: DM, disease‐causing mutation; LB, likely benign; LP, likely pathogenic; NA, not available; P, pathogenic; PP, possibly pathogenic; VUS, variant of uncertain significance. ATP7B c.2804C>T was identified in two individuals. In total, eight participants, 4.97% of the participants analyzed, were identified as having pathogenic or likely pathogenic variants listed as incidental findings in the ACMG 59 gene list (Table 3). Of these participants, five (3.11% of the participants analyzed) had autosomal recessive variants in one allele, and three (1.86% of the participants analyzed) had autosomal dominant variants. Only pathogenic (known or expected) variants for dominant disease or biallelic variants for recessive disease met the ACMG criteria for reporting. Further investigation revealed that all the autosomal recessive variants were in the ATP7B gene, which highlighted the high carrier rate of Wilson disease in our population.
Table 3

Characteristics of participants with pathogenic/likely pathogenic variants

Participant (n)Percentage of participants (%)
Participants with pathogenic/likely pathogenic variants84.97
Autosomal dominant31.86
Autosomal recessive with two alleles found00.00
Autosomal recessive with one allele found53.11
X‐linked00.00
Characteristics of participants with pathogenic/likely pathogenic variants

Stop‐gain and frameshift variants

This study additionally evaluated stop‐gain and frameshift variants in the reported and candidate groups. A total of one stop‐gain and two frameshift variants were identified using the initial filtering criteria, which accounted for a lower percentage than missense variants. The stop‐gain variant in the reported group was MSH6 c.1444C>T (p.R482X), which was predicted to be pathogenic. As shown in Table 2, six of seven databases reported this variant as pathogenic/likely pathogenic. The maximum minor allele frequency was 0.006. The MSH6 c.1444C>T variant has been reported in several individuals with Lynch syndrome‐associated cancers (Baglietto et al., 2010; Hendriks et al., 2004; Okkels et al., 2012; Sjursen et al., 2010). Of the two frameshift variants, ATP7B NM_000053.4:c.3775_3776insAAAG p.(G1259Efs*14) in one participant was predicted to be likely pathogenic (Table 2). In contrast, one frameshift variant, MSH6 NM_000179.3:c.4068_4071dupGATT p.(K1358Dfs*2) (rs55740729), was identified in nine participants. The maximum minor allele frequency of this variant is 3.90%, which is not consistent with the disease presentation. The HGMD also reports this as a functional polymorphism. Therefore, by applying the PVS1, PP3, BS1, and BS2 criteria of the ACMG guidelines, this study classified this variant as a variant of uncertain significance, although several submitters classified it as benign/likely benign in ClinVar (Variation ID: 89518).

DISCUSSION

This study is the first to search for actionable pathogenic variants in the Taiwanese population. In this study, approximately 1.86% of the Taiwanese individuals had reportable pathogenic or likely pathogenic variants, that is, two AR alleles or one AD allele, as determined by WES. According to ACMG, pathogenic (known or expected) variants for dominant disease or biallelic variants for recessive disease should be reported. In the published literature regarding incidental findings in WES and whole‐genome sequencing (WGS), the frequency of pathogenic/likely pathogenic variants has been variable, but approximately half are related to cardiovascular diseases (Amendola et al., 2015; Jain et al., 2018; Jang et al., 2015; Kwak et al., 2017; Olfson et al., 2015; Tang et al., 2018). By using the ACMG 59 gene list, a study involving 954 East Asian genomes found the frequency of pathogenic or likely pathogenic variants to be 2.5% (Tang et al., 2018). By using the ACMG 56 gene list, a study in Korea found a carrier frequency of actionable variants of 2.46% in 1303 individuals (Kwak et al., 2017). Another population‐based study including 196 Korean exomes identified 11 pathogenic or likely pathogenic variants in 13 individuals (Jang et al., 2015). In addition, a study involving 6503 individuals reported frequencies of actionable variants of 2% and 1.1% in European and African groups, respectively (Amendola et al., 2015). This study only enrolled 161 individuals, which is a smaller sample size than the above studies, which could have had an impact on the results. Additionally, there could be some rare polymorphisms that were identified as pathogenic/likely pathogenic variants in this study. This study reported four of seven distinct pathogenic or likely pathogenic variants in the ATP7B gene, and the carrier rate was 3.11%. Previous studies showed variable prevalence rates across different ethnicities. One study in France estimated the prevalence rate of 1.5 cases per 100,000 (Poujois et al., 2018). A study in Taiwan found a prevalence rate of 1.81 cases per 100,000 (Tai et al., 2018); therefore, the heterozygous carrier rate was 0.85% based on Hardy–Weinberg equilibrium. In our study, we identified five out of 161 individuals who were carriers of an ATP7B pathogenic variant. Thus, the carrier frequency of WD‐related mutations was one in 32 (3.11%). This frequency is lower than that in France (3.2%) (Collet et al., 2018) but higher than those in Hong Kong (1.36%) (Mak et al., 2008), Korea (2%) (Park, Ki, Lee, & Kim, 2019), and the USA (1.1%) (Gao, Brackley, & Mann, 2019) according to molecular studies. This finding is consistent with Gao et al.’s observation that the East Asian population has the highest prevalence of Wilson disease, and the genetic prevalence of Wilson disease is greater than the epidemiological estimates (Gao et al., 2019). This finding is further supported by the fact that most of the pathogenic/likely pathogenic variants of ATP7B are found in the East Asian population in the Genome Aggregation Database (GnomAD), in which all eight individuals with variant c.2828G>A, 46 of 49 patients with variant c.2804C>T and all 37 patients with variant c.2333G>T were from the East Asian population. Considering those published studies and databases, the prevalence rate of Wilson disease and carrier rate of ATP7B gene mutations vary across different ethnicities. In particular, the prevalence rate and carrier rate are higher in East Asian countries, such as Taiwan, Korea, and China, and, surprisingly, in France. Therefore, given the history of the surrounding region, the higher carrier rate of mutated ATP7B gene variants could be due to the founder effect. Additionally, there could be some rare polymorphisms in the ATP7B gene. During the investigation, we found correctly classifying ATP7B variants to be challenging. Many rare variants still lack functional analysis, and some variants might be rare polymorphisms, especially in East Asia. Classifying those variants and deciding whether further reporting was warranted remain debatable issues among clinicians, and those decisions might influence future medical decisions for individuals. Thus, further studies on these rare variants in the ATP7B gene will be needed to answer these questions and guide laboratories with regard to the need to report those variants. There were some limitations of this study. First, this study only included 161 Taiwanese participants, which may not fully represent the population. In the future, a multicenter study involving a larger number of participants will be necessary to evaluate the epidemiology of these pathogenic or likely pathogenic variants, especially those in the ATP7B gene. In conclusion, we provided the first Taiwanese population data of incidental findings and highlighted a high carrier rate of candidate pathogenic/likely pathogenic variants in the ATP7B gene. This helps delineate the challenge of variant classification and benefits further interpretation.

CONFLICT OF INTEREST

The authors declare no conflicts of interest.

AUTHOR CONTRIBUTIONS

Wuh‐Liang Hwu, Yin‐Hsiu Chien, Feipei Lai, and Ni‐Chung Lee conceived the study. Ching Hsu and I‐Lin Lin performed the bioinformatics analysis. Chieh‐Wen Kuo and Miao‐Zi Hung carried out the variant analysis, interpretation, and literature reviews. Chieh‐Wen Kuo and Ni‐Chung Lee prepared, edited, and coordinated the manuscript. All authors approved the manuscript.
  32 in total

1.  Actionable secondary findings from whole-genome sequencing of 954 East Asians.

Authors:  Clara Sze-Man Tang; Saloni Dattani; Man-Ting So; Stacey S Cherny; Paul K H Tam; Pak C Sham; Maria-Mercè Garcia-Barcelo
Journal:  Hum Genet       Date:  2017-11-11       Impact factor: 4.132

2.  Evaluation of reported pathogenic variants and their frequencies in a Japanese population based on a whole-genome reference panel of 2049 individuals.

Authors:  Yumi Yamaguchi-Kabata; Jun Yasuda; Osamu Tanabe; Yoichi Suzuki; Hiroshi Kawame; Nobuo Fuse; Masao Nagasaki; Yosuke Kawai; Kaname Kojima; Fumiki Katsuoka; Sakae Saito; Inaho Danjoh; Ikuko N Motoike; Riu Yamashita; Seizo Koshiba; Daisuke Saigusa; Gen Tamiya; Shigeo Kure; Nobuo Yaegashi; Yoshio Kawaguchi; Fuji Nagami; Shinichi Kuriyama; Junichi Sugawara; Naoko Minegishi; Atsushi Hozawa; Soichi Ogishima; Hideyasu Kiyomoto; Takako Takai-Igarashi; Kengo Kinoshita; Masayuki Yamamoto
Journal:  J Hum Genet       Date:  2017-12-01       Impact factor: 3.172

3.  InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines.

Authors:  Quan Li; Kai Wang
Journal:  Am J Hum Genet       Date:  2017-01-26       Impact factor: 11.025

4.  Characteristics and prevalence of Wilson's disease: A 2013 observational population-based study in France.

Authors:  Aurélia Poujois; France Woimant; Solène Samson; Pascal Chaine; Nadège Girardot-Tinant; Philippe Tuppin
Journal:  Clin Res Hepatol Gastroenterol       Date:  2017-06-23       Impact factor: 2.947

5.  Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals.

Authors:  Gene Yeo; Christopher B Burge
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

6.  Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

Authors:  Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm
Journal:  Genet Med       Date:  2015-03-05       Impact factor: 8.822

7.  Findings of a 1303 Korean whole-exome sequencing study.

Authors:  Soo Heon Kwak; Jeesoo Chae; Seongmin Choi; Min Jung Kim; Murim Choi; Jong-Hee Chae; Eun-Hae Cho; Tai Ju Hwang; Se Song Jang; Jong-Il Kim; Kyong Soo Park; Yung-Jue Bang
Journal:  Exp Mol Med       Date:  2017-07-14       Impact factor: 8.718

8.  ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing.

Authors:  Robert C Green; Jonathan S Berg; Wayne W Grody; Sarah S Kalia; Bruce R Korf; Christa L Martin; Amy L McGuire; Robert L Nussbaum; Julianne M O'Daniel; Kelly E Ormond; Heidi L Rehm; Michael S Watson; Marc S Williams; Leslie G Biesecker
Journal:  Genet Med       Date:  2013-06-20       Impact factor: 8.822

9.  Human Splicing Finder: an online bioinformatics tool to predict splicing signals.

Authors:  François-Olivier Desmet; Dalil Hamroun; Marine Lalande; Gwenaëlle Collod-Béroud; Mireille Claustres; Christophe Béroud
Journal:  Nucleic Acids Res       Date:  2009-04-01       Impact factor: 16.971

10.  Recommendations for application of the functional evidence PS3/BS3 criterion using the ACMG/AMP sequence variant interpretation framework.

Authors:  Sarah E Brnich; Ahmad N Abou Tayoun; Fergus J Couch; Garry R Cutting; Marc S Greenblatt; Christopher D Heinen; Dona M Kanavy; Xi Luo; Shannon M McNulty; Lea M Starita; Sean V Tavtigian; Matt W Wright; Steven M Harrison; Leslie G Biesecker; Jonathan S Berg
Journal:  Genome Med       Date:  2019-12-31       Impact factor: 11.117

View more
  1 in total

1.  Frequency and spectrum of actionable pathogenic secondary findings in Taiwanese exomes.

Authors:  Chieh-Wen Kuo; Wuh-Liang Hwu; Yin-Hsiu Chien; Ching Hsu; Miao-Zi Hung; I-Lin Lin; Feipei Lai; Ni-Chung Lee
Journal:  Mol Genet Genomic Med       Date:  2020-08-14       Impact factor: 2.183

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.