Literature DB >> 26503515

Molecular characterisation of phenylketonuria in a Chinese mainland population using next-generation sequencing.

Nana Li1,2, Haitao Jia3, Zhen Liu1,2, Jing Tao1,2, Song Chen3, Xiaohong Li1,4, Ying Deng1,2, Xi Jin1,2, Jiaping Song3, Liangtao Zhang3, Yu Liang3, Wei Wang3, Jun Zhu1,2,4.   

Abstract

Phenylketonuria (PKU) is an inherited autosomal recessive disorder of phenylalanine metabolism, mainly caused by a deficiency of phenylalanine hydroxylase (PAH). The incidence of various PAH mutations differs among race and ethnicity. Here we report a spectrum of PAH mutations complied from 796 PKU patients from mainland China. The all 13 exons and adjacent intronic regions of the PAH gene were determined by next-generation sequencing. We identified 194 different mutations, of which 41 are not reported before. Several mutations reoccurred with high frequency including p.R243Q, p.EX6-96A > G, p.V399V, p.R241C, p.R111*, p.Y356*, p.R413P, and IVS4-1G > A. 76.33% of mutations were localized in exons 3, 6, 7, 11, 12. We further compared the frequency of each mutation between populations in northern and southern China, and found significant differences in 19 mutations. Furthermore, we identified 101 mutations that are not reported before in Chinese population, our study thus broadens the mutational spectrum of Chinese PKU patients. Additionally, 41 novel mutations will expand and improve PAH mutation database. Finally, our study offers proof that NGS is effective, reduces screening times and costs, and facilitates the provision of appropriate genetic counseling for PKU patients.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26503515      PMCID: PMC4621502          DOI: 10.1038/srep15769

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Phenylketonuria (PKU, OMIM #261600) is an inherited disorder characterized by increased level of phenylalanine in the blood. PKU is frequently caused by functional deficiency of phenylalanine hydroxylase (PAH), an enzyme that converts phenylalanine to other compounds in the body. More than 900 mutations have been identified in PAH, and recorded in the locus-specific database (LSD) known as PAHvdb (http://www.biopku.org/pah/). The occurrence of PKU varies among ethic and geographic regions, reaching approximately 1 in 15,000 newborns1. In mainland China, the average incidence is 1in 11,6142, and in Taiwan 1in 55,0573. The frequencies and distributions of PAH mutations also differs among different populations4567891011121314151617181920212223242526272829303132. The wide variability of common mutations between ethnic groups and geographical areas makes PAH deficiency a genetic disease of great allelic heterogeneity. Comprehensive catalog of PAH mutation spectrum will be of value to evaluate genotype-phenotype correlations, to provide genetic consultations to patients’ families as well as prenatal diagnosis, and to refine diagnoses in and anticipate the dietary requirements of affected patients333435. A large-scale, unbiased comprehensive survey of PAH mutations in Chinese population was not available. Most previous analyses concerning the Chinese population were limited to a few common mutations, or were confined to a certain region of PAH, resulting in a selective bias312363738394041. One report carried out survey of entire PAH in a small-scale survey including 212 patients in mainland China12. In the present study, we report a spectrum of PAH mutations complied from a large cohort of 796 PKU patients in mainland China. We determined the sequence of entire PAH gene using next-generation sequencing (NGS). Among 194 mutations identified, 41 were not reported in literature, and 101 not reported in Chinese population. We believe that these results will facilitate the development of appropriate genetic counseling for PKU patients in China.

Results

Mutation spectrum

In the included cohort of Chinese patients, potential disease-causing mutations were identified on 1516 of 1592 independent alleles, corresponding to a mutation detection rate of 95.23%.A total of 720 patients were completely genotyped, whereas in the remaining 76 individuals only one causative mutation was identified; the other mutation site could not be identified by this platform. Among the fully genotyped patients, two mutations were detected in 683 of the patients, who had either compound heterozygous (n = 622) or homozygous (n = 61) genotypes, three mutations were found in 35 of the patients, and four mutations were revealed in 2 of the patients. The gene analysis results were summarised in Table 1.
Table 1

The spectrum of PAH mutations in Chinese Mainland PKU patients.

Trivial name (protein effect)Syetematic name (DNA level)LocationCharacters of mutationNo. of allelesRF(%)No. of allelesRF(%)No. of allelesRF(%)p value
Chinese (n = 796)Northern China (n = 557)Southern China (n = 239)
p.R243Qc.728G > AE7Missense27917.5320218.137716.110.3300
p.EX6-96A > Gc.611A > GE6Splice1227.66847.54387.950.7780
p.V399Vc.1197A > TE11Silence/Splice935.84746.64193.970.0370**
p.R241Cc.721C > TE7Missense865.40494.40377.740.0070**
p.R111*c.331C > TE3Nonsense764.77605.39163.350.0800
p.Y356*c.1068C > AE11Nonsense714.46575.12142.930.0530
p.R413Pc.1238G > CE12Missense694.33605.3991.880.0020**
IVS4-1G > Ac.442-1G > AI4Splice603.77363.23245.020.0860
p.R53Hc.158G > AE2Missense402.51332.9671.460.0800
p.A434Dc.1301C > AE12Missense291.82191.71102.090.5970
p.R408Qc.1223G > AE12Missense261.63131.17132.720.0250**
IVS7 + 2T > Ac.842 + 2T > AI7Splice241.51221.9720.420.0190**
p.G247Vc.740G > TE7Missense231.44201.8030.630.0730
p.S70delc.206_208delCTTE3Deletion201.26131.1771.460.6250
p.R261Qc.782G > AE7Missense201.26121.0781.670.3270
p.Y166*c.498C > GE5Nonsense171.0780.7291.880.0380**
p.R241Pfs*100c.722delGE7Deletion140.8830.2730.630.2850
p.F161Sc.482T > CE5Missense130.82110.9920.420.2480
p.R176*c.526C > TE6Nonsense130.8290.8140.840.9530
p.G247Rc.739G > CE7Missense130.82100.9030.630.5830
p.S349Ac.1045T > GE10Missense120.75100.9020.420.3110
p.I65Tc.194T > CE3Missense120.7590.8130.630.7030
p.R252Qc.755G > AE7Missense120.7560.5461.260.1300
p.G257Vc.770G > TE7Missense110.6940.3671.460.0150**
p.Q419Rc.1256A > GE12Missense90.5790.8100.000.0490**
p.Y325*c.975C > GE10Nonsense90.5750.4540.840.3440
p.A345Tc.1033G > AE10Missense80.5050.4530.630.6440
p.R408Wc.1222C > TE12Missense80.5060.5420.420.7560
IVS12 + 6T > Ac.1315 + 6T > AI12Splice80.5050.4530.630.6440
p.E56Dc.168G > TE2Missense80.5080.7200.000.0630
p.S70delc.208_210delTCTE3Deletion80.5050.4530.630.6440
IVS4 + 3G > Cc.441 + 3G > CI4Splice80.5070.6310.210.2780
p.A156Pc.466G > CE5Missense80.5040.3640.840.2170
p.R158Qc.473G > AE5Missense80.5070.6310.210.2780
p.M276Kc.827T > AE7Missense80.5070.6310.210.2780
p.E280Kc.838G > AE7Missense80.5070.6310.210.2780
p.V388Mc.1162G > AE11Missense70.4420.1851.050.0170**
p.R400Tc.1199G > CE11Missense70.4440.3630.630.4580
p.R158Wc.472C > TE5Missense70.4450.4520.420.9330
p.L255Sc.764T > CE7Missense70.4460.5410.210.3630
p.I324Nc.971T > AE10Missense70.4470.6300.000.0820
p.F392Ic.1174T > AE11Missense60.3830.2730.630.2850
p.R400Kc.1199G > AE11Missense60.3860.5400.000.1080
p.T418Pc.1252A > CE12Missense60.3830.2730.630.2850
p.Q232*c.694C > TE6Missense60.3850.4510.210.4740
IVS6-1G > Ac.707-1G > AI6Splice60.3840.3620.420.8590
p.R241Hc.722G > AE7Missense60.3840.36102.090.0010**
p.W326*c.977G > AE10Nonsense60.3840.3620.420.8590
IVS10-11G > Ac.1066-11G > AI10Splice40.2530.2710.210.8260
p.H107Rc.320A > GE3Missense40.2520.1820.420.3830
p.R252Wc.754C > TE7Missense40.2510.0930.630.0490**
p.R261*c.781C > TE7Nonsense40.2530.2710.210.8260
IVS7-14_-11delCTTTc.843-14_-11delCTTTI7Deletion40.2530.2710.210.8260
IVS8-7A > Gc.913-7A > GI8Splice40.2530.2710.210.8260
p.A342Hfs*59#c.1023delGE10Deletion30.1920.1810.210.9000
p.F39del #c.113_115delTCTE2Deletion30.1920.1810.210.9000
IVS11-1G > C#c.1200-1G > CI11Splice30.1900.0030.630.0080
p.I421Tc.1262T > CE12Missense30.1930.2700.000.2560
IVS12 + 4A > Gc.1315 + 4A > GI12Splice30.1910.0920.420.1660
IVS12-2A > Cc.1316-2A > CI12Splice30.1910.0920.420.1660
p.A47Ec.140C > AE2Missense30.1900.0030.630.0080**
IVS2 + 5G > Cc.168 + 5G > CI2Splice30.1930.2700.000.2560
p.L98Vc.292T > GE3Missense30.1920.1810.210.9000
p.V230Ic.688G > AE6Missense30.1930.2700.000.2560
p.F233Lc.699C > GE6Missense30.1920.1810.210.9000
p.T278Ic.833C > TE7Missense30.1930.2700.000.2560
p.P281Lc.842C > TE7Missense30.1920.1810.210.9000
p.F331Sc.992T > CE10Missense30.1910.0920.420.1660
p.A342Tc.1024G > AE10Missense20.1320.1800.000.3540
p.S350Yc.1049C > AE10Missense20.1320.1800.000.3540
IVS10-1G > Ac.1066-1G > AI10Splice20.1320.1800.000.3540
p.S359Lc.1076C > TE11Missense20.1320.1800.000.3540
p.K363Nfs*37c.1089delGE11Deletion20.1320.1800.000.3540
p.T372R#c.1115C > GE11Missense20.1320.1800.000.3540
p.Y387Dc.1159T > GE11Missense20.1320.1800.000.3540
p.A403Vc.1208C > TE12Missense20.1310.0910.210.5370
p.I406Sfs*17c. 1215_1219delAATACE12Deletion20.1310.0910.210.5370
p.D415Nc.1243G > AE12Missense20.1310.0910.210.5370
p.E43*#c.127G > TE2Nonsense20.1320.1800.000.3540
p.R71Hc.212G > AE3Missense20.1300.0020.420.0310**
p.I95del #c.279_281delCATE3Deletion20.1320.1800.000.3540
p.G103D#c.308G > AE3Missense20.1300.0020.420.0310**
IVS3-2A > Tc.353-2A > TI3Splice20.1310.0910.210.5370
p.S16*c.44_45delTCE1Deletion20.1320.1800.000.3540
p.P147Lc.440C > TE4Missense20.1310.0910.210.5370
p.G148Vc.443G > TE5Missense20.1320.1800.000.3540
p.Y154Hc.460T > CE5Missense20.1320.1800.000.3540
p.A165Dc.494C > AE5Missense20.1320.1800.000.3540
p.R169G#c.505C > GE5Missense20.1320.1800.000.3540
IVS5 + 1G > Ac.509 + 1G > AI5Splice20.1310.0910.210.5370
p.Y206Cc.617A > GE6Missense20.1320.1800.000.3540
p.C217Yc.650G > AE6Missense20.1320.1800.000.3540
p.H220Pc.659A > CE6Missense20.1320.1800.000.3540
p.T238Ac.712A > GE7Missense20.1320.1800.000.3540
p.G239Dc.716G > AE7Missense20.1300.0020.420.0310**
p.T278Ac.832A > GE7Missense20.1300.0020.420.0310**
p.S303Pfs*38c.907delTE8Deletion20.1310.0910.210.5370
IVS8-2A > Gc.913-2A > GI8Splice20.1300.0020.420.0310**
p.L308Fc.922C > TE9Missense20.1310.0910.210.5370
p.G312Vc.935G > TE9Missense20.1300.0020.420.0310**
p.P314Tc.940C > AE9Missense20.1320.1800.000.3540
p.K335E#c.1003A > GE10Missense10.0610.0900.000.5120
p.Y343*#c.1029T > AE10Nonsense10.0610.0900.000.5120
p.G344Sc.1030G > AE10Missense10.0610.0900.000.5120
p.G344Dc.1031G > AE10Missense10.0610.0900.000.5120
IVS10 + 1G > Tc.1065 + 1G > TI10Splice10.0610.0900.000.5120
IVS10-12delT#c.1066-12delTI10Deletion10.0600.0010.210.1270
IVS10-1G > C#c.1066-1G > CI10Splice10.0600.0010.210.1270
IVS10-1G > T#c.1066-1G > TI10Splice10.0600.0010.210.1270
IVS10-3C > Tc.1066-3C > TI10Splice10.0600.0010.210.1270
p.Y356D#c.1066T > GE11Missense10.0600.0010.210.1270
p.C357Yc.1070G > AE11Missense10.0600.0010.210.1270
p.P362Lc.1085C > TE11Missense10.0610.0900.000.5120
p.L367Vc.1099C > GE11Missense10.0600.0010.210.1270
p.T372Sc.1114A > TE11Missense10.0600.0010.210.1270
p.Q375Ec.1123C > GE11Missense10.0610.0900.000.5120
p.F39delc.116_118delTCTE11Deletion10.0600.0010.210.1270
IVS11 + 1G > Cc.1199 + 1G > CI11Splice10.0610.0900.000.5120
p.I406V#c.1216A > GE12Missense10.0610.0900.000.5120
p.L41Fc.121C > TE2Missense10.0610.0900.000.5120
p.D415Yc.1243G > TE12Missense10.0610.0900.000.5120
p.Q429K#c.1285C > AE12Missense10.0610.0900.000.5120
p.L430Pc.1289T > CE12Missense10.0600.0010.210.1270
p.V5Sfs*34#c.13delGE1Deletion10.0600.0010.210.1270
p.L48Sc.143T > CE2Missense10.0610.0900.000.5120
p.L52Sc.155T > CE2Missense10.0610.0900.000.5120
p.N58*#c.169_170delGAE3Deletion10.0610.0900.000.5120
p.L62Pfs*7#c.184_187insCTGAE3Insertion10.0610.0900.000.5120
p.I65Vc.193A > GE3Missense10.0610.0900.000.5120
p.I65Sc.194T > GE3Missense10.0610.0900.000.5120
p.P69Sc.205C > TE3Missense10.0600.0010.210.1270
p.D75Gc.224A > GE3Missense10.0600.0010.210.1270
p.D75Vc.224A > TE3Missense10.0610.0900.000.5120
p.E78Q#c.232G > CE3Missense10.0600.0010.210.1270
p.E78V#c.233A > TE3Missense10.0600.0010.210.1270
p.D101N#c.301G > AE3Missense10.0610.0900.000.5120
p.R13Qfs*5c.30_31insCE1Insertion10.0610.0900.000.5120
IVS3-2A > G#c.353-2A > GI3Splice10.0600.0010.210.1270
p.F121V#c.361T > GE4Missense10.0610.0900.000.5120
p.E127K#c.379G > AE4Missense10.0600.0010.210.1270
p.E127G#c.380A > GE4Missense10.0600.0010.210.1270
IVS4 + 1G > Ac.441 + 1G > AI4Splice10.0610.0900.000.5120
IVS4 + 2T > Ac.441 + 2T > AI4Splice10.0610.0900.000.5120
p.G148R#c.442G > CE5Missense10.0600.0010.210.1270
p.R155Hc.464G > AE5Missense10.0600.0010.210.1270
p.S16*c.47_48delCTE1Deletion10.0610.0900.000.5120
p.R157Kc.470G > AE5Missense10.0610.0900.000.5120
p.Q160*c.478C > TE5Nonsense10.0600.0010.210.1270
p.R169S#c.505C > AE5Missense10.0610.0900.000.5120
p.R169Cc.505C > TE5Missense10.0610.0900.000.5120
p.H170P#c.509A > CE5Missense10.0610.0900.000.5120
p.H170Rc.509A > GE5Missense10.0610.0900.000.5120
IVS5-2A > Gc.510-2A > GI5Splice10.0600.0010.210.1270
p.H170Qc.510T > AE6Missense10.0610.0900.000.5120
p.G171Ac.512G > CE6Missense10.0610.0900.000.5120
p.Q172Hc.516G > TE6Missense10.0600.0010.210.1270
p.P175Sc.523C > TE6Missense10.0610.0900.000.5120
p.E178K#c.532G > AE6Missense10.0610.0900.000.5120
p.G188Dc.563G > AE6Missense10.0610.0900.000.5120
p.Q20Pc.59A > CE1Missense10.0600.0010.210.1270
IVS1 + 5_ + 6delGC#c.60 + 5_ + 6delGCI1Deletion10.0600.0010.210.1270
p.Q20Hc.60G > CE1Missense10.0600.0010.210.1270
IVS1-3T > C#c.61-3T > CI1Splice10.0610.0900.000.5120
p.Y206*c.618C > AE6Nonsense10.0600.0010.210.1270
p.N223I#c.668A > TE6Missense10.0610.0900.000.5120
p.L227Qc.680T > AE6Missense10.0610.0900.000.5120
p.S231Pc.691T > CE6Missense10.0610.0900.000.5120
p.G239Ac.716G > CE7Missense10.0610.0900.000.5120
p.L242Fc.724C > TE7Missense10.0610.0900.000.5120
p.R243*c.727C > TE7Nonsense10.0610.0900.000.5120
p.L255Vc.763T > GE7Missense10.0610.0900.000.5120
p.F260I#c.778T > AE7Missense10.0610.0900.000.5120
p.C265*c.795C > AE7Nonsense10.0600.0010.210.1270
p.Q267*c.799C > TE7Nonsense10.0610.0900.000.5120
p.Q267Lc.800A > TE7Missense10.0610.0900.000.5120
p.H271Rc.812A > GE7Missense10.0610.0900.000.5120
p.H271Lc.812A > TE7Missense10.0610.0900.000.5120
p.M276Rc.827T > GE7Missense10.0610.0900.000.5120
p.E280Nfs*61#c.833delCE7Deletion10.0600.0010.210.1270
p.E280Gc.839A > GE7Missense10.0600.0010.210.1270
IVS7 + 1G > Ac.842 + 1G > AI7Splice10.0600.0010.210.1270
p.D282Gc.845A > GE8Missense10.0610.0900.000.5120
p.H285Yc.853C > TE8Missense10.0600.0010.210.1270
p.V291M#c.871G > AE8Missense10.0600.0010.210.1270
p.V291L#c.871G > TE8Missense10.0600.0010.210.1270
p.P292S#c.874C > TE8Missense10.0610.0900.000.5120
p.P292Lc.875C > TE8Missense10.0600.0010.210.1270
p.D296H#c.886G > CE8Missense10.0610.0900.000.5120
p.Q304K#c.910C > AE8Missense10.0610.0900.000.5120
p.G307Dc.920G > AE9Missense10.0610.0900.000.5120
p.S310Fc.929C > TE9Missense10.0610.0900.000.5120
p.A322Tc.964G > AE9Missense10.0610.0900.000.5120
p.A322Dc.965C > AE9Missense10.0600.0010.210.1270
p.T328N#c.983C > AE10Missense10.0600.0010.210.1270

Abbreviation: PAH, phenylalanine hydroxylase; RF, relative frequency; #novel mutations; **p <0.05, p value is for the comparision of RF between Northern China and Southern China groups.

A total of 194 different types of mutations were identified, including 134 missense mutations (69.07%), 25 splice-site mutations (12.89%), 18 deletions (9.28%), 14 nonsense mutations (7.22%), 2 insertions (1.03%), and 1 silence/splice (0.52%). 76.33% of the total mutations are found in exons. Most mutations was localized in exon7 (33.44%), exon11 (13.18%), exon6 (10.48%), exon12 (10.29%), exon3 (8.94%). Interestingly, no mutations were identified in exon13. In terms of mutation frequency, p.R243Q was the most prevalent mutation (frequency 17.53%). Other mutations with relatively high frequencies were p.EX6-96A > G, p.V399V, p.R241C, p.R111*, p.Y356*, p.R413P, and IVS4-1G > A (7.66%, 5.84%, 5.40%, 4.77%, 4.46%, 4.33%, 3.77%, respectively). Eight mutations, including p.R53H, p.A434D, p.R408Q, IVS7 + 2T > A, p.G247V, p.S70del, p.R261Q, and p.Y166* were found with relatively lower frequencies (ranging from 1% to 3%). The remaining 178 mutations were found at relative frequencies of less than 1%.

Comparison between northern and southern Chinese populations

The frequencies of each PAH mutations were compared among geographical regions in China. We used sixteen mutations with frequency more than 1% for this comparison. Six mutations showed obvious local mutation clustering: p.V399V, p.R413P and IVS7 + 2T > A were found to be clustered in northern Chinese populations, and p.R241C, p.R408Q and p.Y166* were clustered in southern Chinese populations. Among the remaining 178 mutations that exhibited a relative frequency of <1%, thirteen exhibited significant differential distribution in northern and southern China.

Novel sequence variants

Forty-one novel nucleotide lesions that have not been registered in the BIOPKU database (http://www.biopku.org/pah/) were identified in this study: p.V5Sfs*34, IVS1 + 5_ + 6delGC, IVS1-3T > C, p.F39del, p.E43*, p.N58*, p.L62Pfs*7, p.E78Q, p.E78V, p.I95del, p.D101N, p.G103D, IVS3-2A > G, p.F121V, p.E127K, p.E127G, p.G148R, p.R169S, p.R169G, p.H170P, p.E178K, p.N223I, p.F260I, p.E280Nfs*61, p.V291M, p.V291L, p.P292S, p.D296H, p.Q304K, p.T328N, p.K335E, p.A342Hfs*59, p.Y343*, IVS10-12delT, IVS10-1G > C, IVS10-1G > T, p.Y356D, IVS11-1G > C, p.T372R, p.I406V, p.Q429K. Among these mutations, the majority were missense mutations (n = 25), followed by small deletions (n = 8), nonsense (n = 2), splice (n = 5) and insertions (n = 1). Thirty-four of the novel mutations were detected in coding regions, and the remaining were located in introns. The prediction results of novel mutations are listed in Table 2. A total of 25 novel mutations were predicted; of these, 20 mutations were predicted to be probably damaging, 5 mutations were tolerated, and the remainder of the 16 mutations could not be predicted using this tool. The frequencies of the novel mutations were relatively low, which indicates that they are rare mutations.
Table 2

Pathologic analysis of 25 novel mutations of the PAH gene.

Trivial name (protein effect)Syetematic name (DNA level)Characters of mutationSIFT(score)Trivial name (protein effect)Syetematic name (DNA level)Characters of mutationSIFT(score)
p.E78Qc.232G > CMissenseDamaging(0.007)p.F260Ic.778T > AMissenseDamaging(0.000)
p.E78Vc.233A > TMissenseDamaging(0.002)p.V291Mc.871G > AMissenseTolerated(0.209)
p.D101Nc.301G > AMissenseTolerated(0.322)p.V291Lc.871G > TMissenseDamaging(0.004)
p.G103Dc.308G > AMissenseTolerated(0.240)p.P292Sc.874C > TMissenseDamaging(0.000)
p.F121Vc.361T > GMissenseDamaging(0.000)p.D296Hc.886G > CMissenseDamaging(0.000)
p.E127Kc.379G > AMissenseDamaging(0.002)p.Q304Kc.910C > AMissenseDamaging(0.001)
p.E127Gc.380A > GMissenseDamaging(0.001)p.T328Nc.983C > AMissenseDamaging(0.001)
p.G148Rc.442G > CMissenseDamaging(0.000)p.K335Ec.1003A > GMissenseDamaging(0.026)
p.R169Gc.505C > GMissenseDamaging(0.000)p.Q429Kc.1285C > AMissenseTolerated(1.000)
p.R169Sc.505C > AMissenseDamaging(0.001)p.Y356Dc.1066T > GMissenseDamaging(0.000)
p.H170Pc.509A > CMissenseDamaging(0.001)p.T372Rc.1115C > GMissenseDamaging(0.000)
p.E178Kc.532G > AMissenseTolerated(0.054)p.I406Vc.1216A > GMissenseDamaging(0.03)
p.N223Ic.668A > TMissenseDamaging(0.003)    

Discussion

A comprehensive survey of the mutation spectrum of the protein of interest in a given population not only can provide insight into the structural and functional aspects of the protein as well as genotype-phenotype correlations1415202223262842, but also facilitate genetic counseling in patients’ families. In this study, we described the molecular basis of PKU in a mainland Chinese population by analysing mutations in the PAH gene using NGS. Among a cohort of 796 patients, mutations were detected on 1516 of the 1592 independent alleles, representing a mutation detection rate of 95.23% (Table 1). A total of 194 distinct mutations were found, demonstrating the high genetic heterogeneity that is inherent in PKU. The number of different mutations in a given population is usually high, and is typically comprised of a few prevalent mutations and a large number of private mutations43. In comparing our study with previous reports, 83 of the mutations that we identified have been previously reported; however, the remaining 101 mutations (including 41 novel mutations) were reported for the first time in a Chinese mainland population123637383941. As shown in Table 3, eight mutations including p.R243Q, p.EX6-96A > G, p.V399V, p.R241C, p.R111*, p.Y356*, p.R413P and IVS4-1G > A are common mutations in the mainland Chinese population, although the rank order of these mutations was different. Among them, R241C, p.EX6-96A > G, p.V399V, R413P, R243Q and R111* are also considered to be prevalent in the Chinese Taiwanese population40. The epidemiology of phenylketonuria in China is complicated. The prevalence of PKU in northern China (1/11,000) is close to what has been documented in Caucasian populations, but the prevalence in southern China is much lower44. A comparison between northern and southern China indicated marked differences in the relative frequencies of mutations. Among eight common mutations, V399V, R241C, and R413P gave significant p values between two regions, with respective p values of 0.037, 0.007, and 0.002. The result that p.R413P clustered in northern China is consistent with what has been reported by Gu. et al.12. Furthermore, p.V399V has been previously detected primarily in populations of Xinjiang and northern China3637, whereas p.R241C has been primarily detected in populations in southern China and in Taiwan patients345. The majority of the population in Taiwan has descended from southeast China. Based on our data, we hypothesize that the uniform distribution of V399V, R241C, and R413P is a result of migration and the founder effect.
Table 3

The distribution of common PAH gene mutations within China.

Trivial name (protein effect)Chinese mainland population
Taiwanese population
Present study
p valuePrevious study12
p valueTaiwanese study40
RF(%)
RF(%)
RF(%)
Chinese (n = 796)Northern China (n = 557)Southern China (n = 239)Chinese (n = 212)Northern China (n = 100)Southern China (n = 112)
p.R243Q17.5318.1316.110.330025.4325.5023.210.66164.60
p.EX6-96A > G7.667.547.950.77808.896.5010.270.14639.20
p.V399V5.846.643.970.0370**3.212.503.570.49993.50
p.R241C5.404.407.740.0070**3.703.004.020.545323.20
p.R111*4.775.393.350.08003.704.003.130.65363.50
p.Y356*4.465.122.930.05305.195.554.460.65600.70
p.R413P4.335.391.880.0020**5.438.002.680.0155**4.90
IVS4-1G > A3.773.235.020.08606.426.006.250.87420.70

Abbreviation: PAH, phenylalanine hydroxylase; RF, relative frequency; **p < 0.05.

It is well known that different ethnic groups have their own distinctive and diverse PAH mutant allele series that include either one or a few prevalent founder alleles46. When comparing PAH mutational data between different ethnic groups, correlations between the mutations in and the genetic histories of the investigated populations were found. Marked differences were identified when comparing PAH mutations with ≥ 3% frequency (totaling 34) between Asian and European countries (Table 4). Five mutations, including p.R243Q, p.EX6-96A > G, p.R241C, p.R413P and IVS4-1G > A, were found to be common mutations in East Asian countries such as China, Japan14 and Korea8, accounting for 53.76%, 69.70%, and 62.10% of the total mutations respectively. Three mutations, including p.R111*, p.Y356* and p.T278I, were frequently detected in China and Japan14, China and Korea8, Japan14 and Korea8 respectively. The remaining mutations, including p.V399V, p.A259T, p.R252W, p.Y325* and p.V388M, were found to be common in only one country. In sharp contrast, these mutations except for p.R252W, the above mentioned mutations were either rarely detected or undetected in West Asia and Europe countries. For example in Iran17 and Turkey18, three common mutations including p.R261Q, p.P281L, and IVS10-11G, A were shared, However these mutations were either rare or did not occur in East Asia. Instead, they were prevalent in select Europe countries. In Europe, p.R408W was found to be the common mutation, ranking first in the Czech Republic28 (East Europe) and Germany5 (West Europe), and second in Danemark4 (North Europe) and Serbia32(South Europe). These results suggest that p.R408W was the most prevalent founder allele in the European population46. In contrast, p.R408W was either rarely detected or undetected in populations of East Asia, whereas it was the most common mutation in Turkish populations18. The remaining mutations were only found to be prevalent in subsets of the four countries. For example, p.L48S was the most prevalent mutation in Serbian populations32, and it was also common in Turkish populations18. The p.R158Q, p.A403V, p.Y414C, and IVS12 + 1G > A mutations were relatively common in only two countries. Based on the above comparison, we identified that there were several overlaps of mutant allele distributions between West Asia and Europe and that the mutations that were common in East Asia were different from these.
Table 4

Relative frequencies (%) of common PAH gene mutations detected in Asian and European populations.

MutationAsia
Europe
East Asia
West Asia
East EuropeWest EuropeSouth EuropeNorth Europe
Chinese n = 796Japan14 n = 203Korea8 n = 79Turkish18 n = 569Iran17 n = 124Czech28 n = 665German5 n = 226Serbian32 n = 179Dane4 n = 135
p.R243Q17.535.4012.00NA1.200.000.000.000.00
p.EX6-96A > G7.664.7010.10NA0.000.000.000.000.00
p.V399V5.841.200.00NA0.000.000.000.000.00
p.R241C5.4010.105.70NA0.000.150.000.000.00
p.R111*4.777.900.60NA0.000.000.002.600.30
p.Y356*4.461.705.70NA0.000.000.910.000.30
p.R413P4.3324.103.20NA0.000.000.000.900.00
IVS4-1G > A3.778.9010.10NA0.000.000.000.000.00
p.A259T0.001.005.70NA0.000.000.000.000.00
p.R252W0.254.400.00NA4.803.830.460.000.00
p.T278I0.194.203.20NA0.000.000.000.000.00
p.Y325*0.570.003.20NA0.000.000.000.000.00
p.V388M0.440.503.2NA0.000.150.000.000.30
p.L48S0.060.000.007.000.801.880.0231.000.00
IVS2 + 5G > C0.190.000.00NA6.000.080.460.000.00
IVS4 + 5G > T0.000.000.003.000.800.530.910.000.00
p.R243*0.060.000.00NA6.902.712.051.700.00
p.R261Q1.260.000.608.7012.101.583.883.401.60
p.R261*0.250.700.60NA4.800.080.001.700.00
p.P281L0.190.001.908.408.502.033.656.001.30
p.A300S0.000.000.005.000.400.980.460.000.00
IVS10-11G > A0.250.000.0024.6024.603.612.971.705.20
p.K363Nfs*370.130.000.00NA7.300.000.000.000.00
p.E390G0.000.000.004.100.000.230.915.200.00
IVS11 + 1G > C0.000.000.00NA8.500.080.460.000.00
p.R408W0.500.000.006.400.4042.1124.6616.4018.20
p.I65T0.750.000.00NA0.001.280.000.000.70
p.R158Q0.500.000.60NA0.004.144.573.402.90
p.R270K0.000.000.00NA0.000.450.000.000.00
p.E280K0.500.000.00NA0.800.000.000.002.90
p.I306V0.000.000.00NA0.001.730.465.200.00
p.A403V0.130.200.00NA0.005.115.711.700.00
p.Y414C0.000.000.00NA0.001.584.571.7010.10
IVS12 + 1G > A0.000.000.00NA0.002.4810.272.6037.30

NA: not available.

In the present study, the high mutation detection rate of 95.23% was similar to previous studies in which sequencing analysis of the PAH gene was conducted1215, but it was relatively lower than the results from studies that employed exon analysis combined with multiplex ligation-dependent probe amplification (MLPA)14202428. This is because NGS is able to detect small deletions and insertions, whereas it is not able to detect large deletions or duplications. Despite scanning the entirety of the PAH coding region and its exon–intron boundaries, no mutations were detected in 76 alleles. The most likely explanation behind this is that the mutations are located in regions that were not detectable in this study (for example, in the promoter regions, the 5′ and 3′ UTRs, in non-coding RNA binding sites, or in the intronic sequences far away from exon–intron boundaries). Alternatively, the mutations may have been large deletions or duplications. Using NGS as a routine genetic diagnostic tool enables thousands of DNA sequences to be simultaneously obtained in notably reduced turnaround times and at a significantly reduced cost. Furthermore, this technique provides high sensitivity, specificity and coverage (including all coding regions of the involved exons and adjacent intronic regions). However, the biggest limitation to using NGS is the need to analyse and interpret complicated data. We believe our study will provide guidance for future medical practice such as prenatal screening and early diagnosis of PKU. Diagnostic methods can be developed based on the known characteristics of a population. Currently, PKU screening is performed in newborn babies as a part of the tertiary prevention in birth defects preventive network in China. Developing a new method for screening might enhance primary prevention. Based on the mutational spectrum presented in this study, our hope is that carrier screening can be conducted preceding gestation, which would offer timely guidance with respect to prenatal diagnosis for couples who are both carriers.

Methods

Subjects

A total of 796 unrelated patients from 29 separate newborn screening centres of China were enrolled. These patients were diagnosed at birth either through a neonatal screening program or based on clinical presentation. Demographic data, including age, consanguinity, family history, and geographical origin, and biochemical testing data, including plasma phenylalanine (Phe) levels, dihydropteridine reductase activity, urinary biopterin and neopterin ratio, and tetrahydrobiopterin loading, were collected. The ages of the included patients ranged from 6- months to 5-years old. In families with more than one patient, only one member of each sibling pair was included in the study of mutation frequency. The numbers of patients in northern China and southern China that were divided by the Qinling Mountains and the Huaihe River were 557 and 239, respectively. Both parents of the included patients were native. These patients were classified into one of three separate phenotype categories according to their pretreatment plasma Phe levels, including mild hyperphenylalaninaemia (Phe 120–600 umol/L), mild PKU(Phe 600-1200 μmol/L), and classic PKU (Phe >1200 μmol/L)47. Among the 796 patients, 145 (18.22%) were mild hyperphenylalaninemia(MHP), 215 (27.01%) were mild PKU(mPKU), and 418 (52.51%) were classic PKU(cPKU), the remaining 18 patients (2.26%) could not be classified because the pretreatment Phe levels were not available(Table 5). The entire list of patients’ phenotype and blood Phe levels can be found as Supplementary Table S1. The parental permissions and informed consents were obtained from the parents of all patients. The research was approved by the Ethics Committee of West China Second University Hospital, Sichuan University (No: 2014018) and followed the tenets of the Declaration of Helsinki.
Table 5

Patients’ phenotype and blood Phe levels.

patients’ phenotypepatients’ numberPhe levels (μmol/L)
Mean ± SD
MHP145374.77 ± 141.29
mild PKU215919.22 ± 184.70
classic PKU4181872.55 ± 525.88
unclassified18not available

DNA extraction

Blood samples (4 ml) were collected from each patient and their parents by venipuncture in EDTA. Genomic DNA was extracted from peripheral blood leukocytes using a QIAamp® DNA Blood Mini Kit (Qiagen, Cat.No.51106, Germany) according to the recommended protocol.

Next-generation sequencing

A series of unique primers were designed to amplify all 13 exons and their surrounding introns, which covered 200bp upstream and 200bp downstream of the exons of the PAH gene. A pair of primers that were designed to amplify approximately 150 bp of the human β-globin gene (HBB) (accession number AY260740) was used as an internal quality control to identify false negatives caused by inadequate DNA or failed PCR. To improve the throughput of the assay, the PCR primers were designed to not only amplify the target DNA but also to provide a unique primer index for each of the 96 samples in each of the plates (multiple index PCR). This strategy resulted in 96 sets of 10-bp-long nucleic acid tags that were individually included at the 5′ ends of each of the PAH and HBB primers. The second index that was used for each sample was the 8-bp-long nucleic acid tag from the library adapter sequence, which identified the specific 96-well plate that each sample was included in. This index was attached to the amplicons of the samples through an adapter preparation process. Using this “double index system,” hundreds of samples can be mixed together and detected in one sequencing chip at the same time. PCR was performed on a GeneAmp PCR system 9700 (Applied Biosystems, Foster City, CA), with a cycling protocol that consisted of denaturation at 94 °C for 30 s, 56° C for 30 s, and 72 °C for 1 min. After 35 cycles, gel electrophoresis was used to verify the quality of the amplified DNA, only eligible DNAs was included in the library preparation. The PCR amplification products were prepared for DNA pooling, and the diverse adapter library was added onto the amplification products. After a concentration of DNA was obtained that could satisfy the requirements of the library preparation method, gene mutations were sequenced using an Illumina Hiseq 2000 (Illumina Inc, San Diego, CA, USA) sequencing instrument. After sequencing the samples, the raw sequence data were analysed using in-house software. First, all sequence data were traced back to the specimens from which they arose according to the sequences of the primer and adapter indices. Second, the amplicon sequences of each of the samples were aligned with standard reference PAH sequences from the database PAHvdb (http://www.biopku.org/pah/); SNPs were found in target areas and relevant information was noted. All of the PAH gene sequencing reactions and analyses were performed in the Centre of BGI Health clinical laboratory, Shenzhen, China.

Validation tests of Sanger sequencing

When a given patient’s mutation locus was detected, it was amplified by polymerase chain reaction (PCR) from a parent’s sample and then sequenced bidirectionally in an ABI-3730 DNA analyser. This not only validated the locus but also confirmed the carrier status of the parents. The PCR cycling protocol consisted of an initial denaturation at 95 °C for 3 min, followed by 35 cycles, of 95 °C for 40 s, 55 °C for 30 s, and 72 °C for 30 s, with a final extension at 72 °C for 10 min.

Pathologic analysis of novel mutations

A SIFT prediction was performed (http://sift.jcvi.org/) using the “SIFT Human SNPs” tool to obtain predictions for nonsynonymous SNPs. The annotation version was Homo sapiens GRCh37 Ensembl 63. A list of the chromosomal positions and alleles corresponding to the 41 novel mutations were uploaded into the import the web-site.

Statistical analysis

Statistical analysis was performed using statistical package for social science software (SPSS version 16.0). Mutational frequencies were calculated by the counting method. An x2 analysis was performed to test for differences between two geographic populations. A p value <0.05 was considered statistically significant.

Additional Information

How to cite this article: Li, N. et al. Molecular characterisation of phenylketonuria in a Chinese mainland population using next-generation sequencing. Sci. Rep. 5, 15769; doi: 10.1038/srep15769 (2015).
  45 in total

1.  Phenylketonuria mutations in Northern China.

Authors:  Fang Song; Yu-jin Qu; Ting Zhang; Yu-wei Jin; Hong Wang; Xiao-ying Zheng
Journal:  Mol Genet Metab       Date:  2005-10-26       Impact factor: 4.797

2.  Phenylalanine hydroxylase gene mutations in phenylketonuria patients from India: identification of novel mutations that affect PAH RNA.

Authors:  Murali D Bashyam; Ajay K Chaudhary; E Chandrakanth Reddy; A Radha Rama Devi; G R Savithri; R Ratheesh; Leena Bashyam; E Mahesh; Dity Sen; Ratna Puri; Ishwar C Verma; Inder C Verma; Sheela Nampoothiri; Sunitha Vaidyanathan; Mataguru D Chandrashekar; Prameela Kantheti
Journal:  Mol Genet Metab       Date:  2010-02-02       Impact factor: 4.797

3.  Phenylalanine hydroxylase deficiency: molecular epidemiology and predictable BH4-responsiveness in South Portugal PKU patients.

Authors:  Isabel Rivera; Dina Mendes; Ângela Afonso; Madalena Barroso; Ruben Ramos; Patrícia Janeiro; Anabela Oliveira; Ana Gaspar; Isabel Tavares de Almeida
Journal:  Mol Genet Metab       Date:  2011-07-31       Impact factor: 4.797

4.  Mutational spectrum of phenylketonuria in the Chinese Han population: a novel insight into the geographic distribution of the common mutations.

Authors:  Tianwen Zhu; Shengying Qin; Jun Ye; Wenjuan Qiu; Lianshu Han; Yafen Zhang; Xuefan Gu
Journal:  Pediatr Res       Date:  2010-03       Impact factor: 3.756

5.  The molecular basis of phenylketonuria in Lithuania.

Authors:  J Kasnauskiene; S Giannattasio; P Lattanzio; L Cimbalistiene; V Kucinskas
Journal:  Hum Mutat       Date:  2003-04       Impact factor: 4.878

Review 6.  Phenylketonuria mutations in Europe.

Authors:  Johannes Zschocke
Journal:  Hum Mutat       Date:  2003-04       Impact factor: 4.878

7.  Mutations in the phenylalanine hydroxylase gene identified in 95 patients with phenylketonuria using novel systems of mutation scanning and specific genotyping based upon thermal melt profiles.

Authors:  Steven F Dobrowolski; Clinton Ellingson; Thomas Coyne; Jesse Grey; Ranae Martin; Edwin W Naylor; Richard Koch; Harvey L Levy
Journal:  Mol Genet Metab       Date:  2007-05-14       Impact factor: 4.797

8.  Molecular epidemiology and genotype-phenotype correlation in phenylketonuria patients from South Spain.

Authors:  María A Bueno; Domingo González-Lamuño; Carmen Delgado-Pecellín; Luís Aldámiz-Echevarría; Belén Pérez; Lourdes R Desviat; María L Couce
Journal:  J Hum Genet       Date:  2013-03-21       Impact factor: 3.172

9.  Mutations of the phenylalanine hydroxylase gene in patients with phenylketonuria in Shanxi, China.

Authors:  Yong-An Zhou; Yun-Xia Ma; Quan-Bin Zhang; Wei-Hua Gao; Jian-Ping Liu; Jian-Ping Yang; Gai-Xiu Zhang; Xiao-Gang Zhang; Liang Yu
Journal:  Genet Mol Biol       Date:  2012-10-16       Impact factor: 1.771

10.  Mutation spectrum in Taiwanese patients with phenylalanine hydroxylase deficiency and a founder effect for the R241C mutation.

Authors:  Yin-Hsiu Chien; Shu-Chuan Chiang; Aichu Huang; Shi-Ping Chou; Szu-San Tseng; Yuan-Te Huang; Wuh-Liang Hwu
Journal:  Hum Mutat       Date:  2004-02       Impact factor: 4.878

View more
  20 in total

1.  Mutation spectrum of PAH gene in phenylketonuria patients in Northwest China: identification of twenty novel variants.

Authors:  Yousheng Yan; Chuan Zhang; Xiaohua Jin; Qinhua Zhang; Lei Zheng; Xuan Feng; Shengju Hao; Huafang Gao; Xu Ma
Journal:  Metab Brain Dis       Date:  2019-02-12       Impact factor: 3.584

2.  The Genetic Landscape and Epidemiology of Phenylketonuria.

Authors:  Alicia Hillert; Yair Anikster; Amaya Belanger-Quintana; Alberto Burlina; Barbara K Burton; Carla Carducci; Ana E Chiesa; John Christodoulou; Maja Đorđević; Lourdes R Desviat; Aviva Eliyahu; Roeland A F Evers; Lena Fajkusova; François Feillet; Pedro E Bonfim-Freitas; Maria Giżewska; Polina Gundorova; Daniela Karall; Katya Kneller; Sergey I Kutsev; Vincenzo Leuzzi; Harvey L Levy; Uta Lichter-Konecki; Ania C Muntau; Fares Namour; Mariusz Oltarzewski; Andrea Paras; Belen Perez; Emil Polak; Alexander V Polyakov; Francesco Porta; Marianne Rohrbach; Sabine Scholl-Bürgi; Norma Spécola; Maja Stojiljković; Nan Shen; Luiz C Santana-da Silva; Anastasia Skouma; Francjan van Spronsen; Vera Stoppioni; Beat Thöny; Friedrich K Trefz; Jerry Vockley; Youngguo Yu; Johannes Zschocke; Georg F Hoffmann; Sven F Garbade; Nenad Blau
Journal:  Am J Hum Genet       Date:  2020-07-14       Impact factor: 11.025

3.  The study of the full spectrum of variants leading to hyperphenylalaninemia have revealed 10 new variants in the PAH gene.

Authors:  I Kuznetcova; P Gundorova; O Ryzhkova; A Polyakov
Journal:  Metab Brain Dis       Date:  2019-07-22       Impact factor: 3.584

4.  Follow-up of two newborns with c.158G>A (p.Arg53His) mutation in gene and assessment of the site function.

Authors:  Jie Wang; Bo Zhu; Lichun Zhang; Yitong Zhao; Xiaohua Wang; Yueqi Jia
Journal:  Zhejiang Da Xue Xue Bao Yi Xue Ban       Date:  2021-08-25

5.  Genetic study of the PAH locus in the Iranian population: familial gene mutations and minihaplotypes.

Authors:  Masoumeh Razipour; Elaheh Alavinejad; Seyede Zahra Sajedi; Saeed Talebi; Mona Entezam; Neda Mohajer; Golnaz-Ensieh Kazemi-Sefat; Jalal Gharesouran; Aria Setoodeh; Seyyed Mojtaba Mohaddes Ardebili; Mohammad Keramatipour
Journal:  Metab Brain Dis       Date:  2017-07-04       Impact factor: 3.584

6.  Targeted Next Generation Sequencing in Patients with Inborn Errors of Metabolism.

Authors:  Dèlia Yubero; Núria Brandi; Aida Ormazabal; Àngels Garcia-Cazorla; Belén Pérez-Dueñas; Jaime Campistol; Antonia Ribes; Francesc Palau; Rafael Artuch; Judith Armstrong
Journal:  PLoS One       Date:  2016-05-31       Impact factor: 3.240

7.  The first Mongolian cases of phenylketonuria in selective screening of inborn errors of metabolism.

Authors:  Jamiyan Purevsuren; Baasandai Bolormaa; Chogdon Narantsetseg; Renchindorj Batsolongo; Ochirbat Enkhchimeg; Munkhuu Bayalag; Yuki Hasegawa; Haruo Shintaku
Journal:  Mol Genet Metab Rep       Date:  2016-10-28

8.  Investigation of Five Common Mutations on Phenylalanine Hydroxylase Gene of Phenylketonuria Patients from Two Provinces in North of Iran.

Authors:  Daniel Zamanfar; Hossein Jalali; Mohammad Reza Mahdavi; Morteza Maadanisani; Hossein Zaeri; Eynollah Asadpoor
Journal:  Int J Prev Med       Date:  2017-11-07

9.  Massive parallel sequencing as a new diagnostic approach for phenylketonuria and tetrahydrobiopterin-deficiency in Thailand.

Authors:  Pongsathorn Chaiyasap; Chupong Ittiwut; Chalurmpon Srichomthong; Apiruk Sangsin; Kanya Suphapeetiporn; Vorasuk Shotelersuk
Journal:  BMC Med Genet       Date:  2017-09-16       Impact factor: 2.103

10.  Molecular diagnosis of phenylketonuria in 157 Chinese families and the results of prenatal diagnosis in these families.

Authors:  Yang Xiao; Qiang Gu; Hai-Rong Wu; Song-Tao Wang; Pei Pei; Xue-Fei Zheng; Hong Pan; Yi-Nan Ma
Journal:  Chin Med J (Engl)       Date:  2021-05-19       Impact factor: 2.628

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.