Literature DB >> 27578510

A Population-Based Genomic Study of Inherited Metabolic Diseases Detected Through Newborn Screening.

Kyoung Jin Park1, Seungman Park2, Eunhee Lee2, Jong Ho Park1, June Hee Park3, Hyung Doo Park4, Soo Youn Lee4, Jong Won Kim1,5.   

Abstract

BACKGROUND: A newborn screening (NBS) program has been utilized to detect asymptomatic newborns with inherited metabolic diseases (IMDs). There have been some bottlenecks such as false-positives and imprecision in the current NBS tests. To overcome these issues, we developed a multigene panel for IMD testing and investigated the utility of our integrated screening model in a routine NBS environment. We also evaluated the genetic epidemiologic characteristics of IMDs in a Korean population.
METHODS: In total, 269 dried blood spots with positive results from current NBS tests were collected from 120,700 consecutive newborns. We screened 97 genes related to NBS in Korea and detected IMDs, using an integrated screening model based on biochemical tests and next-generation sequencing (NGS) called NewbornSeq. Haplotype analysis was conducted to detect founder effects.
RESULTS: The overall positive rate of IMDs was 20%. We identified 10 additional newborns with preventable IMDs that would not have been detected prior to the implementation of our NGS-based platform NewbornSeq. The incidence of IMDs was approximately 1 in 2,235 births. Haplotype analysis demonstrated founder effects in p.Y138X in DUOXA2, p.R885Q in DUOX2, p.Y439C in PCCB, p.R285Pfs*2 in SLC25A13, and p.R224Q in GALT.
CONCLUSIONS: Through a population-based study in the NBS environment, we highlight the screening and epidemiological implications of NGS. The integrated screening model will effectively contribute to public health by enabling faster and more accurate IMD detection through NBS. This study suggested founder mutations as an explanation for recurrent IMD-causing mutations in the Korean population.

Entities:  

Keywords:  Epidemiology; Founder mutation; Incidence; Inherited metabolic disease; Newborn screening; Next-generation sequencing

Mesh:

Substances:

Year:  2016        PMID: 27578510      PMCID: PMC5011110          DOI: 10.3343/alm.2016.36.6.561

Source DB:  PubMed          Journal:  Ann Lab Med        ISSN: 2234-3806            Impact factor:   3.464


INTRODUCTION

Inherited metabolic diseases (IMDs) are a heterogeneous group of rare diseases with a collective incidence of 1 in 500 to 4,000 live births, representing a substantial public health burden [12345]. Therefore, a newborn screening (NBS) program was introduced to detect presymptomatic newborns with IMDs. Over the past decade, tandem mass spectrometry (MS/MS) has been a major technological breakthrough for the NBS program by providing a way to detect multiple metabolites simultaneously [34567]. Although the use of MS/MS has enabled cost-effective, rapid IMD identification, there have been some bottlenecks such as false-positives and imprecision [4568]. As a second-tier method, enzymatic assays are laborious, time-consuming, and semiquantitative. Sequential Sanger sequencing is hampered by the genetic heterogeneity of IMDs, which results in delayed diagnosis [89]. These limitations of the current NBS tests have raised a necessity of rapidly diagnosing IMDs by next-generation sequencing (NGS) [89101112]. Recent studies have demonstrated that NGS is useful for the molecular diagnosis of some IMDs, including hyperphenylalaninemia, lysosomal storage diseases, and mitochondrial diseases [131415]. Furthermore, several previous studies revealed the analytical validity and clinical utility of NGS for newborns from the United States [1012]. For example, a NGS panel called NBDx, targeting 126 genes for NBS, was developed, and it demonstrated acceptable analytical performance [10]. Additionally, a previous study revealed that NGS leads to improved outcomes in the neonatal intensive care unit, confirming its clinical utility [12]. Currently, NGS is on the verge of being adopted for NBS. To introduce a multigene panel into an NBS program, some important factors should be considered. First, the analytical validity and clinical utility of NGS should be evaluated in a routine NBS environment as previously noted [1012]. Second, the current biochemical NBS tests cannot be replaced by the exclusive use of NGS. Third, the multigene panel for NBS should be designed with specific considerations of genetic epidemiologic characteristics of the target diseases and population. Until recently, epidemiologic studies of IMDs with NGS in a NBS setting have not been reported. The epidemiology of IMDs screened by NBS programs varies widely among different ethnic populations [1]. There are considerable differences in the incidence and spectrum of IMDs between Asians and other ethnicities [123461617]. The collective incidence of IMDs has been reported to be 1 in 2,800 in Korea, 1 in 6,219 in Taiwan, 1 in 6,300 (except hyperphenylalaninemia) in Australia, 1 in 2,000 in Italy, 1 in 4,100 in Germany, and 1 in 500 in a pan-ethnic population [124518]. In addition to ethnic backgrounds, the application of methods like MS/MS have a strong impact on the results of IMD epidemiologic studies. One study reported that there was increase in the incidence of IMDs after the introduction of MS/MS into NBS [2]. Another study revealed that the incidence of medium-chain-acyl-coenzyme A dehydrogenase (MCAD) deficiency was specifically increased after the implementation of MS/MS [4]. Previous genetic epidemiologic studies of IMDs have focused on evaluating a limited number of diseases using conventional molecular methods [1920212223242526]. Little is known about the incidence and spectrum of IMDs as estimated by the application of NGS as a screening method. The aim of this study was to develop a multigene panel for detecting IMDs during NBS in Korea, and to evaluate the utility of an integrated screening model based on traditional biochemical tests and a multigene panel in a routine NBS system. In addition, we aimed to investigate genetic epidemiologic characteristics of IMDs in the Korean newborn population using NGS. We determined the overall incidence and mutation spectrum of IMDs based on the integrated screening model and tested for founder effects of recurrent mutations.

METHODS

1. Study participants and study design

We developed a multigene panel called NewbornSeq that integrates DNA isolation, targeted sequencing, variant annotation, and data interpretation. We evaluated the sensitivity and specificity of NewbornSeq, using 37 controls (27 positive controls and 10 negative controls). The positive controls were from confirmed IMD patients, as determined by biochemical tests and Sanger sequencing from the Samsung Medical Center, Seoul, Korea. Negative controls consisted of one healthy volunteer sample and nine samples of patients with other diseases which were not screened for in the current NBS program. Between-run reproducibility was measured by detecting mutations in technical duplicates. Turnaround time (TAT) was compared between NewbornSeq and the current NBS tests. A population study was conducted in 120,700 newborns during routine NBS performed at Green Cross Laboratories in Korea from May 2013 to July 2014. The population in this study represented about 22% of total births (120,700/540,200) in Korea during that period [18]. A total of 269 cases with positive results from current NBS tests were also screened by NewbornSeq. The population samples were applied to an integrated screening model based on biochemical NBS tests and NewbornSeq (Fig. 1). We investigated whether there was any association between abnormal metabolite levels and gene mutations. According to the association results, patients were divided into three groups: (1) the APC group (Association-Positive Cases, with mutations in genes relevant to abnormal metabolite levels), (2) the PCD group (Positive Cases with Discrepancy, with mutations in genes irrelevant to abnormal metabolites), and (3) the PPC group (Presumptive Positive Cases, with metabolite abnormalities only) (Fig. 1). We investigated the mutation incidence and spectrum of the IMDs in the APC group. Researchers were blinded to all information regarding the identification of the newborns and controls. This study was approved by the Institutional Review Board of the Samsung Medical Center; informed consent was exempt because this study was performed by using stored biospecimens.
Fig. 1

Workflow for diagnosing inherited metabolic diseases. The study population represented about 22% of births (120,700/540,200) in Korea during the designated period. Using the integrated screening model, results were interpreted and divided into three groups: Association-Positive Cases (Cases with mutations in genes relevant to metabolites), Positive Cases with Discrepancy (Cases with mutations in genes irrelevant to metabolites), and Presumptive Positive Cases (Cases with only metabolite abnormalities). The numbers in brackets indicate the number of samples.

Abbreviations: 17α-OHP, 17α-hydroxyprogesterone; TSH, thyroid-stimulating hormone; FT4, free thyroxine; MS/MS, tandem mass spectrometry.

2. Current newborn screening pipeline

Dried blood spots (DBS) were collected from a heel stick on day 3-5 after birth. All NBS tests were performed as a part of the routine NBS program in Korea (Supplemental Method 1). The cases with metabolite levels higher than the cutoff were retested. "Presumptive positive" cases were defined as the individuals with abnormal levels of a metabolite detected from two separate samples.

3. NewbornSeq pipeline

1) DNA preparation and targeted sequencing

Genomic DNA from 27 positive controls and 269 newborns was extracted from EDTA-anticoagulated whole blood (WB) and DBS, respectively (Supplemental Method 1). Additional genomic DNA from 10 negative controls was extracted from DBS. Among them, genomic DNA from one healthy control was extracted from WB and DBS to validate the procedure of isolating DNA from DBS. A customized multiplex PCR amplification strategy was applied to analyze the 97 genes in the current Korean NBS panel, by using Ion AmpliSeq Designer software (Life Technologies, Carlsbad, CA, USA; Supplemental Table S1). All exons and intron sequences of 20 bp around each exon were targeted. The genomic regions with known mutations in the regulatory sequences were included, ultimately resulting in a total of 287 kb for analysis. Ninety-seven percent of targeted bases were covered under this protocol. Targeted sequencing was performed by using the Ion PGM platform (Life Technologies) following the manufacturer's instructions (Supplemental Method 1).

2) Bioinformatic analysis, mutation prioritization, and Sanger sequencing

Data were analyzed by using Torrent Suite software (version 4.0.3; Life Technologies). Variant calling was performed by using the "Germ Line-PGM-High Stringency" setting (Supplemental Table S2). The variants were functionally annotated by using the ANNOVAR tool [2728]. To prioritize pathogenic variants, we sequentially applied the following criteria: selection of allele frequency <0.01 in the 1000 Genome Project (1000GP, http://browser.1000genomes.org/index.html), the Exome Sequencing Project (ESP6500, http://evs.gs.washington.edu/EVS/), and the Exome Aggregation Consortium (ExAC, http://exac.broadinstitute.org/); selection of variants with multiple lines of evidence supporting "deleterious" or "damaging" effects, using Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping v2 (Polyphen-2), likelihood ratio test (LRT), MutationTaster, MutationAssesor, or FATHMM; selection of variants with a Genomic Evolutionary Rate Profiling (GERP) score higher than 2; removal of common polymorphisms reported in dbSNP v.138; selection of protein-impacting mutations such as nonsense mutations, mutations in GT-AG dinucleotides of the canonical splice sites, and frameshift mutations (Fig. 2). To avoid the false exclusion of pathogenic mutations, we manually reviewed the variants. Finally, the prioritized variants were classified as known pathogenic (KP) mutations and expected pathogenic (EP) mutations. "Disease-causing mutations" (DM) in the Human Genome Mutation Database (HGMD) or "pathogenic" mutations in ClinVar were categorized as KP mutations, while other variants were considered EP mutations [2930]. Novel EP variants were compared to the reference sequence of whole-exome sequencing data (Korean Reference Genome DB, KRGDB, http://152.99.75.168/KRGDB/menuPages/firstInfo.jsp) from 622 healthy Korean individuals. In parallel, we applied the criteria for pathogenicity classification according to the American College of Medical Genetics and Genomics (ACMG) guideline [31], and the prioritized variants were classified as pathogenic variants, likely pathogenic variants, and variants of unknown significance (VUS) (Fig. 2). All prioritized mutations from the APC group and compound heterozygous or homozygous mutations from the PCD group were validated with independent Sanger sequencing (Supplemental Method 1).
Fig. 2

Putative variant prioritization and pathogenicity classification. Pathogenic variants were prioritized based on conventional methods and the American College of Medical Genetics and Genomics criteria in (A) total samples (n=269), (B) association-positive cases (n=125), and (C) in positive cases with discrepancy (n=85). The numbers in brackets indicate the number of different types of variants.

Abbreviation: ACMG, American College of Medical Genetics and Genomics.

4. Haplotype analysis

Haplotype analysis was performed to determine if there were founder effects in recurrent mutations identified from both the APC and PCD groups. The selection criteria of samples and SNPs for genotyping are shown in Supplemental Method 2. A total of 123 SNPs and seven candidate mutations were genotyped on the Sequenom MassARRAY SNP genotyping platform (Sequenom Inc., San Diego, CA, USA) and by Sanger sequencing, respectively (Supplemental Table S3). Haplotypes were constructed by using the software PHASE v2.1.1 (http://stephenslab.uchicago.edu/phase/download.html). Haplotype frequencies in mutation-positive cases were compared with those of 90 control individuals from the Korean HapMap [32].

5. Statistical analyses

Kruskal-Wallis test was used to compare metabolite levels among APC, PCD, and PPC. The statistical analyses were performed with MedCalc version 11.5.1.0 (Mariakerke, Belgium). P values less than 0.05 were considered statistically significant.

RESULTS

1. Performance of the NewbornSeq pipeline

Sequencing quality and coverage statistics using control samples are summarized in Supplemental Table S4. Taking the median, a total of 99% and 93% of bases were covered by at least 1-fold and 20-fold of coverage, respectively. The median percentage of on-target reads was 93% across the samples. There was no difference between the use of DBS and WB as sample type. The conventional prioritization method reduced the number of variants per sample from a median of 247 to 4 in the 27 positive control samples (reduction rate of 98%). When the ACMG criteria were applied, the number of variants was reduced to a median of three per sample (reduction rate of 99%). NewbornSeq showed 100% sensitivity and specificity for 97 pathogenic variant alleles (54 causative alleles and 43 incidental alleles) in 27 positive control samples. However, only 96% (93/97) of pathogenic variants were reproducible; four pathogenic variants were not replicated in technical duplicates owing to low coverage less than 20 folds (Supplemental Table S5). The TAT was a median of 17 days by Sanger sequencing-based second-tier tests, which was reduced to within five days by the application of NewbornSeq (Supplemental Table S5). A total of 1,958 variants were called in 269 newborns. We further reduced the number of variants to 244 (0-3 variants/sample) using the conventional criteria. According to the association between the metabolite abnormalities and mutated genes, 59 cases (22%), 125 cases (46%), and 85 cases (32%) were included in the APC, PCD, and PPC groups, respectively (Fig. 1). Sixty-six alleles among 70 mutant alleles from the APC group were confirmed by Sanger sequencing (validation rate of 94%, Supplemental Table S6). When comparing metabolite levels among the groups, both thyroid-stimulating hormone (TSH) and free thyroxine (FT4) levels among the APC, PCD, and PPC groups were significantly different (P values for TSH and FT4 were 0.0044 and 0.0299, respectively; Supplemental Table S7).

2. Diagnosis of inherited metabolic diseases through the integrated screening model

In the APC group, 54 cases were validated among 59 cases with mutations in genes relevant to metabolite abnormalities, including congenital hypothyroidism (CH, n=34), galactosemia (n=11), type II citrullinemia (CTLN2, n=3), phenylketonuria (PKU, n=1), methylmalonic aciduria (MMA, n=2), and 3-methylcrotonyl-CoA carboxylase deficiency (3-MCC deficiency, n=3) (Table 1). Three cases (IMD_144, IMD_152, and IMD_ 153) had concurrent heterozygous mutations in different genes within the same metabolic pathway (Table 2). The overall positive rate of IMDs was estimated to be 20% (54/269) (Supplemental Table S8).
Table 1

Mutation incidence and frequency of inherited metabolic diseases detected using an integrated screening model

Disease/GeneMode of inheritanceN of validated casesBirth prevalence
Biallelic mutationsAny mutationsBiallelic mutationsAny mutationsCompatible to mode of inheritance
Congenital hypothyroidism
TSHRAD/AR1101in 120,7001in 12,0701in 12,070
PAX8AD03NA1in 40,2331in 40,233
DUOX2AD/AR212 (14)*1in 60,3501in 8,6211in 8,621
DUOXA2AR27 (8)1in 60,3501in 15,0881in 60,350
TPOAR01NA1in 120,504NA
SLC5A5AR111in 120,7001in 120,7001in 120,700
 Subtotal634 (37)1in 20,1171in 3,5501in 4,023
Galactosemia
 GALEAR161in 120,5041in 20,0841in 120,504
GALTAR03NA1in 40,168NA
GALK1AR02NA1in 60,252NA
 Subtotal1111in 120,5041in 10,9551in 120,504
Citrullinemia type II
SLC25A13AR131in 93,1651in 31,0551in 93,165
Phenylketonuria
PAHAR01NA1in 93,165NA
Methylmalonic aciduria
MUTAR121in 93,1651in 46,5831in 93,165
3-methylcrotonyl-CoA carboxylase deficiency
MCCC1AR03NA1in 31,055NA
TotalAD/AR954 (57)1in 13,4111in 2,2351in 4,828

*One case with concurrent TSHR and DUOX2 mutations; †One case with concurrent DUOX2 and DUOXA2 mutations, and the other case with concurrent DUOXA2 and PAX8 mutations.

Abbreviations: AD, autosomal dominant; AR, autosomal recessive; NA, not applicable.

Table 2

Diagnosis of inherited metabolic diseases using an integrated screening model

Sample IDMetabolitecut-off*NBS tests*GeneNT alterationAA alterationConventional criteriaACMG categoryZygosityDiseaseFrequency in APC
IMD_26C3510MUTc.2179C > Tp.R727XKPPComHetMMA1/54
MUTc.322C > Tp.R108CKPLPMMA1/54
IMD_30C5OH0.61.102MCCC1c.475T > Cp.C159RKPLPHet3-MCC deficiency1/54
IMD_31Cit55348SLC25A13c.851delGTATp.M285Pfs*2KPPHetCTLN23/54
IMD_32Cit55430SLC25A13c.851delGTATp.M285Pfs*2KPPHetCTLN23/54
IMD_39FT40.80.4DUOX2c.1232G > Ap.R411KEPVUSNACH1/54
IMD_42FT40.80.6TSHRc.1454C > Ap.A485DEPVUSNACH2/54
IMD_44TSH1223.3DUOX2c.1588A > Tp.K530XKPPHetCH2/54
IMD_47TSH1213.1DUOX2c.2654G > Ap.R885QKPLPHetCH3/54
IMD_48TSH1225.5DUOXA2c.413dupAp.Y138XKPLPHetCH4/54
IMD_50TSH1234.6TSHRc.403A > Tp.N135YEPVUSNACH1/54
TSHRc.1349G > Ap.R450HKPLPHetCH4/54
IMD_52TSH1216.5PAX8c.300dupTACCp.M102fsEPLPHetCH1/54
IMD_54TSH1213.2TSHRc.611C > Tp.A204VKPLPHetCH2/54
IMD_56TSH1212.1DUOXA2c.535T > Cp.Y179HEPVUSNACH1/54
IMD_57TSH1212.1PAX8c.192G > Cp.R64SEPVUSNACH1/54
IMD_66TSH1221.9DUOX2c.4010G > Tp.G1337VEPVUSHetCH1/54
DUOX2c.1588A > Tp.K530XKPPHetCH2/54
IMD_68TSH1217DUOX2c.1462G > Ap.G488RKPLPHetCH4/54
IMD_79Gal1321.7GALEc.1002G > Ap.W334XEPPHetGalactosemia1/54
IMD_80Gal1319GALTc.50+1G > ANAKPPHetGalactosemia1/54
IMD_81Gal1332.2GALEc.47G > Ap.S16NEPVUSNAGalactosemia1/54
IMD_83Gal1319.8GALEc.905G > Ap.G302DKPLPHetGalactosemia2/54
IMD_87Gal1316.5GALTc.998G > Ap.R333QKPLPHetGalactosemia1/54
IMD_89Gal1340.4GALTc.1034C > Ap.A345DKPLPHetGalactosemia1/54
IMD_92TSH1228.5TSHRc.1349G > Ap.R450HKPLPHetCH4/54
IMD_100C5OH0.62.571MCCC1c.288+2T > ANAKPPHet3-MCC deficiency2/54
IMD_101TSH1214.6DUOX2c.2635G>Ap.E879KKPLPHetCH1/54
IMD_106C5OH0.61.016MCCC1c.288+2T > ANAKPPHet3-MCC deficiency2/54
IMD_112Gal1317GALEc.264delTp.F88fsEPLPHetGalactosemia1/54
IMD_113FT40.80.3DUOXA2c.413dupAp.Y138XKPLPHetCH4/54
IMD_124Gal1317.6GALEc.905G > Ap.G302DKPLPHetGalactosemia2/54
IMD_125TSH1212.9TSHRc.1349G > Ap.R450HKPLPHetCH4/54
IMD_139Gal1327GALEc.38A > Gp.Y13CEPVUSNAGalactosemia1/54
GALEc.10A > Gp.K4EEPVUSNAGalactosemia1/54
IMD_142Phe130142.216PAHc.1065+1G > ANAKPPHetPKU1/54
IMD_144TSH1223.1TSHRc.611C > Tp.A204VKPLPHetCH2/54
DUOX2c.2654G > Ap.R885QKPLPHetCH3/54
IMD_149TSH1214.2DUOX2c.3616G > Ap.A1206TEPVUSNACH1/54
DUOX2c.1462G > Ap.G488RKPLPHetCH4/54
IMD_152TSH1212.3DUOX2c.3329G > Ap.R1110QKPLPHetCH1/54
DUOXA2c.738C > Gp.Y246XKPPHetCH3/54
IMD_153TSH1213.6DUOXA2c.738C > Gp.Y246XKPPHetCH3/54
PAX8c.739G > Ap.E247KEPVUSNACH1/54
IMD_159Cit55128.9SLC25A13c.1180+1G > ANAKPPComHetCTLN21/54
SLC25A13c.851delGTATp.M285Pfs*2KPPCTLN23/54
IMD_164C359.126MUTc.1228A > Gp.I410VEPVUSNAMMA1/54
IMD_186TSH/GAL12.0/13.031.3/13.5DUOXA2c.413dupAp.Y138XKPLPHomCH4/54
IMD_189TSH/FT412.0/0.855.7/0.4DUOX2c.1462G > Ap.G488RKPLPHetCH4/54
IMD_191TSH/17α-OHP/FT412/12/0.854.9/18.1/0.2SLC5A5c.1060A > Cp.T354PKPLPComHetCH1/54
SLC5A5c.1605delp.G535fsEPLPCH1/54
IMD_196TSH1294.1DUOX2c.1462G > Ap.G488RKPLPHetCH4/54
IMD_197TSH1214.8TSHRc.1349G > Ap.R450HKPLPHetCH4/54
IMD_203TSH1212.4TSHRc.1556G > Ap.R519HEPVUSNACH1/54
IMD_206TSH1254.8DUOXA2c.280C > Tp.R94CEPVUSNACH1/54
DUOXA2c.413dupAp.Y138XKPLPHetCH4/54
IMD_209TSH1215.1DUOX2c.1319G > Ap.S440NEPVUSNACH1/54
IMD_210TSH00120054.8TSHRc.1449C > Ap.N483KEPVUSNACH1/54
IMD_211TSH1213DUOX2c.227C > Tp.P76LEPVUSNACH1/54
IMD_221TSH1225.7TSHRc.1454C > Ap.A485DEPVUSNACH2/54
IMD_234Gal1327.7GALK1c.1159G > Ap.A387TEPVUSNAGalactosemia2/54
IMD_235Gal1319.6GALK1c.1159G > Ap.A387TEPVUSNAGalactosemia2/54
IMD_237FT40.80.2DUOX2c.2654G > Ap.R885QKPLPHetCH3/54
IMD_238FT40.80.4TPOc.1061G > Tp.W354LEPVUSNACH1/54
IMD_26417α-OHP/FT412.0/0.717.9/0.4DUOXA2c.738C > Gp.Y246XKPPHetCH3/54

Reference sequences of MUT, MCCC1, SLC25A13, DUOX2, TSHR, DUOXA2, PAX8, GALE, GALT, GALT, PAH, SLC5A5, GALK1, and TPO were NM_000255, NM_001293273, NM_001160210, NM_014080, NM_000369, NM_207581, NM_003466, NM_001127621, NM_001258332, NM_000155, NM_000277, NM_000453, NM_000154, and NM_175722, respectively.

*The metabolite units of C3, C5OH, Phe, Cit, Gal, TSH, FT4 and 17α-OHP were µmol/L, µmol/L, µmol/L, µmol/L, µmol/L, mU/L, ng/dL, ng/mL, respectively. Recurrent mutations are in bold.

Abbreviations: KP, known pathogenic mutation based on the Human Genome Mutation Database (DM) or ClinVar (pathogenic) databases; EP, expected pathogenic mutation based on population frequency, in silico prediction, and mutation type (loss of function mutations); P, pathogenic; LP, likely pathogenic; VUS, variant of unknown significance; NA, not applicable; Het, heterozygous; ComHet, compound heterozygous; Hom, homozygous; Cit, citrulline; GAL, galactose; TSH, thyroid stimulating hormone; FT4, free T4; MMA, Methylmalonic aciduria; 3-MCC deficiency, 3-methylcrotonyl-CoA carboxylase deficiency; PKU, Phenylketonuria; CTLN2, Type II citrullinemia; CH, Congenital hypothyroidism.

We validated 13 cases with biallelic mutations for IMDs in the PCD group. Among them, there were 10 cases with treatable diseases, including ornithine carbamoyltransferase deficiency (OTC deficiency, n=2), type II glutaric aciduria (n=1), lysinuric protein intolerance (n=3), PKU (n=1), CH (n=2), and propionic aciduria (n=1) (Table 3). Multiple lines of evidence supporting deleterious effects of the 18 different mutant alleles are summarized in Supplemental Table S9. Details of the mutations and metabolite abnormalities identified in the PCD group are described in Supplemental Table S10.
Table 3

Unexpected detection of cases with biallelic mutations in genes irrelevant to metabolite abnormalities

Sample IDMetabolites (Level; RR or cut-off)GeneNT alterationAA alterationConventional criteriaACMG categoryStatus*ZygosityDisease name
AbnormalRelevant
IMD_35C0 (4.06; cut-off 7)Cit (10.9; RR 2-55), Arg (16.8; RR 0–67) Gln (103; RR 0–300)OTCc.298+5G>CNAKPLPKnownHomOTC deficiency
IMD_36C0 (6.599, cut-off 7)Glu (258; RR 0–805), C4 (0.23; RR 0–1.2), C6 (0.024; RR 0–0.5), C8 (0.007, RR 0–0.35), C10 (0.023 RR 0–0.5), C12 (0.033; RR 0–0.6), C18 (0.416; RR 0–2.13)ETFBc.155insTp.P52fsEPLPNovelHomGA Type II
IMD_132Gal (14.8; RR: less than 13)Arg (1.162; RR 0–67.3), Orn (47.565; RR 0–175)SLC7A7c.498T>Gp.I166MEPVUSNovelHomLPI
IMD_162C5 (1.909; RR: less than 0.81)Arg (3.353; RR 0–67.31), Orn (34.551; RR 0–175)SLC7A7c.498T>Gp.I166MEPVUSNovelHomLPI
IMD_205TSH (12.7; RR: less than 12)Phe (29.4; RR 0–130), Tyr (34.268; RR 0–299), Phe/Tyr (0.858; RR 0–2.5)PAHc.721C>Tp.R241CKPLPKnownComHetPKU
PAHc.442-1G>ANAKPPKnown
IMD_214Gal (21.4; RR: less than 13)TSH (3.2; RR; less than 12), FT4 (1.8; RR; less than 0.8)DUOX2c.3239T>Cp.I1080TKPLPKnownComHetCH
DUOX2c.2678A>Gp.N893SEPVUSNovel
IMD_216Gal (13.5; RR: less than 13)TSH (2.5; RR; less than 12), FT4 (2.3; RR; less than 0.8)DUOX2c.617G>Tp.G206VKPVUSKnownComHetCH
c.4232G>Ap.C1411YKPVUSKnown
IMD_234Gal (27.7; RR: less than 13)Cit (11.4; RR 2-55), Arg (14.7; RR 0–67), Gln (32; RR 0–300)OTCc.298+5G>CNAKPLPKnownHomOTC deficiency
IMD_237FT4 (0.2; RR less than 0.8)C3 (0.4; RR 0.2-5)PCCBc.1283C>Tp.T428IKPLPKnownComHetPA
PCCBc.1316A>Gp.Y439CKPLPKnown
IMD_243FT4 (0.6; RR less than 0.8)Arg (9.5; RR 0-67.3), Orn (36; RR 0-175)SLC7A7c.498T>Gp.I166MEPVUSNovelHomLPI

Reference sequences of OTC, ETFB, HAL, SLC7A7, PAH, DUOX2, and PCCB were NM_000531.5, NM_001014763, NM_001258333, NM_001126105, NM_000277, NM_014080, and NM_000532, respectively. The metabolites units of TSH and FT4 were mU/L, ng/dL, ng/mL, respectively. The unit of the other metabolites was µmol/L.

*The mutation status was assessed based on the Human Genome Mutation Database (DM) or ClinVar (pathogenic) databases.

Abbreviations: AA, amino acid; NT, nucleotide; KP, known pathogenic; EP, expected pathogenic based on population frequency, in silico prediction, and mutation type (loss of function mutations); P, pathogenic; LP, likely pathogenic; VUS, variant of unknown significance; RR, reference range; Gal, galactose; TSH, thyroid-stimulating hormone; FT4, free thyroxine; Cit, citrulline; Arg, arginine; Gln, glutamine; Glu, glutamate; Orn, ornithine; Phe, phenylalanine; Tyr, tyrosine; NA, not applicable; Het, heterozygous; ComHet, compound heterozygous; Hom, homozygous; OTC deficiency, Ornithine carbamoyltransferase deficiency; LPI, Lysinuric protein intolerance; CH, Congenital hypothyroidism; PA, Propionic academia, GA type II, Glutaric acidemia type II; PKU, Phenylketonuria.

3. Mutation incidence of inherited metabolic diseases

We estimated the overall incidence of IMDs based on the current NBS tests to be 1 in 449 in the Korean population. The overall mutation incidence of IMDs calculated through an integrated screening model in the APC group was estimated to be one in 2,235 in the Korean population (Table 1). The highest incidences seen for CH and galactosemia were due to DUOX2 mutations and GALE mutations, respectively (Table 1).

4. Frequency and spectrum of pathogenic mutations

A total of 45 different mutations, including 21 known mutations and 24 expected pathogenic variants, were identified in 54 validated APCs (Table 2). In silico analyses results of validated variants are summarized in Supplemental Table S11. Recurrent mutations from the APC group were found in CTLN2 [p.R285 Pfs*2 in SLC25A13 (n=3)], CH [p.R885Q in DUOX2 (n=3), p.K530X in DUOX2 (n=2), p.G488R in DUOX2 (n=2), p.Y138X in DUOXA2 (n=4), p.R450H in TSHR (n=2), p.Y246X in DUOXA2 (n=2), p.A485D in TSHR (n=2), p.A204V in TSHR (n=2)], galactosemia [p.G302D in GALE (n=2), and 3-MCC deficiency [c.288+2T>A in MCCC1 (n=2)] (Table 2).

5. Founder effects

Seven different recurrent mutations, including SLC25A13 (p.R285Pfs*2), DUOXA2 (p.Y138X), GALE (p.G302D), SLC7A7 (p.I166M), PCCB (p.Y439C), and GALT (p.R224Q) were selected to construct haplotypes. Two samples with DUOXA2 mutations and three samples with DUOX2 mutations were added. Haplotype analysis yielded 392.2 kb, 392.6 kb, 775.3 kb, 399.2 kb, and 88.9 kb segments across the following mutations in DUOXA2, DUOX2, PCCB, SLC25A13, and GALT, respectively. All haplotypes were exclusively observed in mutation-containing cases (Table 4).
Table 4

Comparison of mutation-containing haplotypes between cases and controls

GenesHaplotype *SNPs in haplotypePhysical distance (bp)% in cases% in controls
DUOXA2TCCCGCCCCTATMAGTTTATCCTCCrs397358, rs1473003, rs12913288, rs11635836, rs4775709, rs2467844, rs28662287, rs8024922, rs199138, rs269866, rs269862, rs269856, p.Y138X, rs16977681, rs175088, rs2271435, rs1648314, rs1648306, rs1648298, rs1706828, rs12439643, rs10519018, rs1706767, rs17533116, rs11636114392,60066.7% (8/12)0.00% (0/90)
DUOX2CCTTCTCCCTMATAGTTTATTCTCGrs397358, rs1473003, rs12913288, rs11635836, rs4775709, rs2467844, rs28662287, rs8024922, rs199138, rs269866, p.R885Q, rs269862, rs269856, rs16977681, rs175088, rs2271435, rs1648314, rs1648306, rs1648298, rs1706828, rs12439643, rs10519018, rs1706767, rs17533116, rs11636114392,60050.0% (3/6)0.00% (0/90)
PCCBAATAATGTCGTMGATTCrs16843560, rs4678435, rs3772390, rs9845457, rs561307, rs16843829, rs2290131, rs576771, rs1279840, rs9856769, rs518972, p.Y439C, rs696520, rs7620314, rs900048, rs4521165, rs7616204775,000100.0% (3/3)0.00% (0/90)
SLC25A13TGGCAMCCCACrs184381, rs10267710, rs6465486, rs3779486, rs2301629, p.R285Pfs*2, rs12666465, rs6465496, rs35974282, rs4729249, rs12669236399,200100.0% (4/4)0.00% (0/90)
GALTGCCMCCTrs10972175, rs11791806, rs10814130, p.R224Q, rs3808868, rs1104748, rs281236537,700100.0% (4/4)0.00% (0/90)

The haplotype frequencies in mutation-positive cases were compared with those in 90 control individuals from the Korean HapMap.

*M represents recurrent mutations (p.Y138X in DUOXA2, p.R885Q in DUOX2, p.Y439C in PCCB, p.R285Pfs*2 in SLC25A13, p.R224Q in GALT).

Abbreviation: SNP, single nucleotide polymorphism.

DISCUSSION

The introduction of NGS is likely to change NBS practices. However, current NBS tests will not be replaced by genomic screening because some diseases, including CH, are not usually genetic conditions despite the fact that some related mutations have been reported. In this study, we noted that TSH levels were higher in PPC group cases than in APC group cases. This indicated the possibility of the presence of true CH in the PPC group. On the other hand, current NBS tests have some shortcomings. Disease risk can be modified by the environment over time, so current biochemical tests could yield false negatives or false positives. To complement the current NBS tests without replacing them, we designed NewbornSeq. NewbornSeq showed superior performance in characteristics important for its application in the NBS program: rapid TAT, small amounts of DNA required, minimally invasive sample type, sensitivity, and specificity. In this study, we detected IMDs by applying an integrated screening model based on biochemical tests and NewbornSeq. The integrated screening model provided causative mutations in 20% of newborns with positive results from the biochemical tests in a NBS environment. In addition, it is noteworthy that the shortcomings of the current NBS tests, such as overdiagnosis and overtreatment, can be reduced by using the integrated screening model. For instance, galactosemia and a benign variant (known as Duarte galactosemia) cannot be differentiated by using current biochemical NBS tests. Under the current NBS system, a lactose-free diet might be provided to newborns with benign variants. A differential diagnosis between the pathogenic diseases and their benign variants using the integrated screening model could help avoid unnecessary treatment. In this study, we successfully excluded five Duarte galactosemia cases among 43 cases with increased galactose levels, by applying the integrated screening model. We also identified ten cases with biallelic mutations for preventable IMDs from the PCD group (i.e., secondary findings). These cases would not be detected prior to the implementation of NewbornSeq, suggesting the presence of false negatives in the current NBS pipeline. This might be because some modifiers including prematurity, total parenteral nutrition, or maternal disease may influence the level of metabolites and the age of onset. Although these additional cases were not the primary target of the integrated screening model, they represent an important public health issue. Future studies will determine whether these cases benefited from the early treatment they received. It should be noted that monoallelic mutations were frequently accompanied by metabolite abnormalities in the APC group. Frequent heterozygote mutations might be attributable to false-negatives in the current NewbornSeq pipeline because: i) missing variants due to low depth of coverage, ii) unidentified mutations in regulatory regions, iii) unidentified mutations in amplification-resistant gene regions, iv) allele dropout due to SNPs in PCR primer-binding sites, or v) structural variations (SV). Actually, we showed the possibility of false-negative results in four pathogenic alleles in control samples due to low depth of coverage. False-negatives can also be attributed to monoallelic mutations described in some of IMDs such as Wilson's disease or non-Mendelian mechanisms, such as synergistic heterozygosity [33343536]. To the best of our knowledge, this is the first genetic IMD epidemiologic study in the NBS setting using NGS. The representativeness of the population in this study prompted us to investigate the genetic epidemiology of IMDs in Korea. The incidence of IMDs based on the integrated screening model was 1 in 2,235 newborns using the APC group (Table 1). Using the reported data on the false positive rate (5-10 false positives/1 true positive) of MS/MS, the incidence of IMDs was calculated to be 1 in 2,245 from the biochemical incidence (1:449) [6]. The mutation incidence (1 in 2,235) based on the integrated screening model is quite similar to the disease incidence calculated from the biochemical incidence. This indicates that the integrated screening model provides a reliable and robust estimation of the incidence rate of IMDs, although data regarding clinical phenotypes were not used. We identified founder effects in p.Y138X in DUOXA2, p.R885Q in DUOX2, p.Y439C in PCCB, and p.R224Q in GALT, except the mutation of p.R285Pfs*2 in SLC25A13, which was already reported as a founder mutation in Asians [24]. This study suggested that founder mutations could explain most of the recurrent IMD-related mutations in Koreans. Considering the history of the migration of the Mongoloids, further studies are needed to determine the time of origin and distribution pattern of these founder mutations in East Asian populations, including Chinese and Japanese populations. This study is the first proof-of-concept study for introducing an integrated screening model into the actual NBS system. Importantly, this study raised some issues that should be considered regarding the introduction of NGS in a routine NBS system. First, there is the need for the clinical interpretation of unintended mutations, which would be frequently identified in genetic screening. This study suggested that the long-term follow-up of newborns with secondary findings or monoallelic mutations is necessary. Furthermore, future studies are recommended to determine whether the cases would benefit from the early treatment they received and to investigate whether the pathogenic variants were significantly associated with disease. Second, there is the need for a system that integrates biological knowledge with clinical information. Functional studies on novel mutations are time-consuming and impractical in the NBS setting. In future studies, a more sophisticated system should be introduced to integrate functional data, variant penetrance, and clinical data. Third, there is the need for another analytical platform to detect unidentified mutations, such as SV and regulatory mutations. In this study, although we found one instance of congenital adrenal hyperplasia (CAH) by the use of the integrated screening model, we could not validate the mutations of CYP21A2. The absence of CAH might be due to false-positives from the biochemical tests, false-negatives from NewbornSeq, or lack of proper validation methods; it is difficult to detect mutations of the CYP21A2 gene owing to its high pseudogene homology and frequently observed SV. Future studies are required to develop an additional platform to analyze SVs and genes with pseudogenes for highly suspicious cases. In summary, we highlighted the epidemiologic and screening implications of NGS through the first population-based study in an NBS environment. This study has led to concerns about the opportunities and challenges for the implementation of NGS in NBS because it detected additional IMD cases that were not detected with the current NBS tests. The integrated screening model will be an effective public health strategy because it will enable faster and more accurate IMD detection. The future use of the integrated screening model as a first-tier approach will likely be more beneficial than the current NBS tests.
  34 in total

1.  Molecular diagnosis of infantile mitochondrial disease with targeted next-generation sequencing.

Authors:  Sarah E Calvo; Alison G Compton; Steven G Hershman; Sze Chern Lim; Daniel S Lieber; Elena J Tucker; Adrienne Laskowski; Caterina Garone; Shangtao Liu; David B Jaffe; John Christodoulou; Janice M Fletcher; Damien L Bruno; Jack Goldblatt; Salvatore Dimauro; David R Thorburn; Vamsi K Mootha
Journal:  Sci Transl Med       Date:  2012-01-25       Impact factor: 17.956

2.  Identification and functional analysis of novel dual oxidase 2 (DUOX2) mutations in children with congenital or subclinical hypothyroidism.

Authors:  Giuseppina De Marco; Patrizia Agretti; Lucia Montanelli; Caterina Di Cosmo; Brunella Bagattini; Melissa De Servi; Eleonora Ferrarini; Antonio Dimida; Andrea Claudia Freitas Ferreira; Angelo Molinaro; Claudia Ceccarelli; Federica Brozzi; Aldo Pinchera; Paolo Vitti; Massimo Tonacchera
Journal:  J Clin Endocrinol Metab       Date:  2011-05-11       Impact factor: 5.958

3.  A founder mutation in the GK1 gene is responsible for galactokinase deficiency in Roma (Gypsies).

Authors:  L Kalaydjieva; A Perez-Lezaun; D Angelicheva; S Onengut; D Dye; N U Bosshard; A Jordanova; A Savov; P Yanakiev; I Kremensky; B Radeva; J Hallmayer; A Markov; V Nedkova; I Tournev; L Aneva; R Gitzelmann
Journal:  Am J Hum Genet       Date:  1999-11       Impact factor: 11.025

4.  Tandem mass spectrometric analysis for amino, organic, and fatty acid disorders in newborn dried blood spots: a two-year summary from the New England Newborn Screening Program.

Authors:  T H Zytkovicz; E F Fitzgerald; D Marsden; C A Larson; V E Shih; D M Johnson; A W Strauss; A M Comeau; R B Eaton; G F Grady
Journal:  Clin Chem       Date:  2001-11       Impact factor: 8.327

5.  Rapid whole-genome sequencing for genetic disease diagnosis in neonatal intensive care units.

Authors:  Carol Jean Saunders; Neil Andrew Miller; Sarah Elizabeth Soden; Darrell Lee Dinwiddie; Aaron Noll; Noor Abu Alnadi; Nevene Andraws; Melanie LeAnn Patterson; Lisa Ann Krivohlavek; Joel Fellis; Sean Humphray; Peter Saffrey; Zoya Kingsbury; Jacqueline Claire Weir; Jason Betley; Russell James Grocock; Elliott Harrison Margulies; Emily Gwendolyn Farrow; Michael Artman; Nicole Pauline Safina; Joshua Erin Petrikin; Kevin Peter Hall; Stephen Francis Kingsmore
Journal:  Sci Transl Med       Date:  2012-10-03       Impact factor: 17.956

6.  Screening of newborns and high-risk group of children for inborn metabolic disorders using tandem mass spectrometry in South Korea: a three-year report.

Authors:  Hye-Ran Yoon; Kyung Ryul Lee; Seungwoo Kang; Dong Hwan Lee; Han-Wook Yoo; Won-Ki Min; Dong Hee Cho; Son Moon Shin; Jongwon Kim; Junghan Song; Ho Joo Yoon; Sonsang Seo; Si Houn Hahn
Journal:  Clin Chim Acta       Date:  2005-04       Impact factor: 3.786

Review 7.  Metabolism as a complex genetic trait, a systems biology approach: implications for inborn errors of metabolism and clinical diseases.

Authors:  Jerry Vockley
Journal:  J Inherit Metab Dis       Date:  2008-10-05       Impact factor: 4.982

8.  dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.

Authors:  Xiaoming Liu; Xueqiu Jian; Eric Boerwinkle
Journal:  Hum Mutat       Date:  2013-07-10       Impact factor: 4.878

9.  ClinVar: public archive of relationships among sequence variation and human phenotype.

Authors:  Melissa J Landrum; Jennifer M Lee; George R Riley; Wonhee Jang; Wendy S Rubinstein; Deanna M Church; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2013-11-14       Impact factor: 16.971

10.  Mutation spectrum in Taiwanese patients with phenylalanine hydroxylase deficiency and a founder effect for the R241C mutation.

Authors:  Yin-Hsiu Chien; Shu-Chuan Chiang; Aichu Huang; Shi-Ping Chou; Szu-San Tseng; Yuan-Te Huang; Wuh-Liang Hwu
Journal:  Hum Mutat       Date:  2004-02       Impact factor: 4.878

View more
  7 in total

1.  Mutation Spectrum of STAR and a Founder Effect of the p.Q258* in Korean Patients with Congenital Lipoid Adrenal Hyperplasia.

Authors:  Eungu Kang; Yoon-Myung Kim; Gu-Hwan Kim; Beom Hee Lee; Han-Wook Yoo; Jin-Ho Choi
Journal:  Mol Med       Date:  2017-05-02       Impact factor: 6.354

2.  Genetic screening techniques and diseases for neonatal genetic diseases.

Authors:  Lianshu Han
Journal:  Zhejiang Da Xue Xue Bao Yi Xue Ban       Date:  2021-08-25

3.  Genomic newborn screening: public health policy considerations and recommendations.

Authors:  Jan M Friedman; Martina C Cornel; Aaron J Goldenberg; Karla J Lister; Karine Sénécal; Danya F Vears
Journal:  BMC Med Genomics       Date:  2017-02-21       Impact factor: 3.063

Review 4.  Uses of Next-Generation Sequencing Technologies for the Diagnosis of Primary Immunodeficiencies.

Authors:  Michael Seleman; Rodrigo Hoyos-Bachiloglu; Raif S Geha; Janet Chou
Journal:  Front Immunol       Date:  2017-07-24       Impact factor: 7.561

5.  Systematic literature review and meta-analysis on the epidemiology of methylmalonic acidemia (MMA) with a focus on MMA caused by methylmalonyl-CoA mutase (mut) deficiency.

Authors:  Tímea Almási; Lin T Guey; Christine Lukacs; Kata Csetneki; Zoltán Vokó; Tamás Zelei
Journal:  Orphanet J Rare Dis       Date:  2019-04-25       Impact factor: 4.123

6.  Genomic and biochemical analysis of repeatedly observed variants in DBT in individuals with maple syrup urine disease of Central American ancestry.

Authors:  Charles J Billington; Kimberly A Chapman; Eyby Leon; Beatrix W Meltzer; Seth I Berger; Matthew Olson; Robert A Figler; Steve A Hoang; Cui Wanxing; Brian R Wamhoff; M Sol Collado; Kristina Cusmano-Ozog
Journal:  Am J Med Genet A       Date:  2022-07-07       Impact factor: 2.578

Review 7.  The Use of Whole Genome and Exome Sequencing for Newborn Screening: Challenges and Opportunities for Population Health.

Authors:  Audrey C Woerner; Renata C Gallagher; Jerry Vockley; Aashish N Adhikari
Journal:  Front Pediatr       Date:  2021-07-19       Impact factor: 3.418

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.