
Prediction of functional consequences of the five newly discovered G6PD variations in Taiwan.

Yen-Hui Chiu1,2, Yu-Ning Liu1, Hsiao-Jan Chen3, Ying-Chen Chang4, Shu-Min Kao3, Mei-Ying Liu3, Ying-Yen Weng4, Kwang-Jen Hsiao1,5, Tze-Tze Liu1,4.   

Abstract

Glucose-6-phosphate dehydrogenase deficiency (G6PD deficiency; OMIM #300908) is the most common inborn error disorder worldwide. Because G6PD is the key enzyme for removing oxidative stress in erythrocytes, early diagnosis is vital to prevent chronic hemolytic anemia as well as drug-, food- or infection-induced hemolytic anemia. Characterization of the mutations is also important for subsequent genetic counseling, especially for female carriers with ambiguous enzyme activities and males with mild mutations. When multiplex SNaPshot assay and Sanger sequencing were performed on 500 G6PD-deficient males, five newly discovered variations, namely c.187G > A (p.E63K), c.585G > C (p.Q195H), c.586A > T (p.I196F), c.743G > A (p.G248D), and c.1330G > A (p.V444I), were detected in six patients. These variants were previously named the Pingtung, Tainan, Changhua, Chiayi, and Tainan-2 variants, respectively. The in silico analysis, together with the structural prediction of the resulting mutant G6PD proteins, indicated that these five newly discovered variants might be disease-causing mutations.

Keywords:  G6PD deficiency; In silico analysis; Mutation analysis; Structural prediction

Year:  2019        PMID: 31294066      PMCID: PMC6595892          DOI: 10.1016/j.dib.2019.104129

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Data

This dataset presents the in silico and structural analyses of the five newly discovered variations, namely c.187G > A (p.E63K), c.585G > C (p.Q195H), c.586A > T (p.I196F), c.743G > A (p.G248D), and c.1330G > A (p.V444I) (Fig. 1), detected in six Taiwanese G6PD-deficient patients by Sanger sequencing (Table 1).
Fig. 1

Detection of five new G6PD variations by Sanger sequencing. The G6PD gene sequences show the wild-type sequence alongside the variants found in different individuals: (A) c.187G > A in patient A397, (B) c.585G > C in patient A367, (C) c.586A > T in patient A129, (D) c.743G > A in patient A244 and (E) c.1330G > A in patients A281 and A453. The red arrows indicate the missense substitutions, each observed in a hemizygous state.

Table 1

G6PD activity in newborn screening and following referral for patients carrying newly discovered G6PD variations.

| Patient Number | A129 | A244 | A281 | A367 | A397 | A453 |
| --- | --- | --- | --- | --- | --- | --- |
| Sex | Male | Male | Male | Male | Male | Male |
| Place of Birth | Changhua | Chiayi | Tainan | Tainan | Pingtung | Tainan |
| Age at newborn screening (day) | 2 | 2 | 2 | 2 | 3 | 3 |
| G6PD activity in newborn screening (U/gHb)ᵃ | 0.2 | 5.5 | 5.3 | 1.7 | 5.7 | 5.1 |
| Age when confirmed (day) | 34 | 9 | 22 | 15 | 14 | 11 |
| Confirmed G6PD activity (U/gHb)ᵇ | 0.1 | 6.1 | 5.5 | 0.2 | 8.6 | 6.5 |
| Variation found | c.586A > T | c.743G > A | c.1330G > A | c.585G > C | c.187G > A | c.1330G > A |

ᵃ Clinical referral was recommended for those with enzyme activity ≦6.0 U/gHb.

ᵇ The confirmed diagnosis was performed through a quantitative enzyme activity assay using fresh whole blood. G6PD deficiency would be suggested for those with G6PD activity ≦10.0 U/gHb.
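The two cut-offs in the Table 1 footnotes can be applied to the reported activities as a simple check. The following is a minimal Python sketch, not the authors' screening software: the helper name `classify_g6pd` and the dictionary layout are illustrative assumptions, while the activity values and the thresholds (≦6.0 U/gHb for referral, ≦10.0 U/gHb for suggested deficiency) are copied from Table 1.

```python
# Cut-offs quoted in the Table 1 footnotes (U/gHb).
REFERRAL_CUTOFF = 6.0      # newborn screening: refer if activity <= 6.0
DEFICIENCY_CUTOFF = 10.0   # confirmatory assay: deficiency suggested if <= 10.0

# (screening activity, confirmed activity) per patient, copied from Table 1.
PATIENTS = {
    "A129": (0.2, 0.1),
    "A244": (5.5, 6.1),
    "A281": (5.3, 5.5),
    "A367": (1.7, 0.2),
    "A397": (5.7, 8.6),
    "A453": (5.1, 6.5),
}


def classify_g6pd(screening, confirmed):
    """Hypothetical helper: apply both cut-offs to one patient's activities."""
    return {
        "referred": screening <= REFERRAL_CUTOFF,
        "deficiency_suggested": confirmed <= DEFICIENCY_CUTOFF,
    }


for patient, (screening, confirmed) in PATIENTS.items():
    print(patient, classify_g6pd(screening, confirmed))
```

With the Table 1 values, every patient falls at or below both cut-offs.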

The sequence comparison of these variants in the G6PD proteins of different species [2], including Homo sapiens, Mus musculus, Danio rerio (zebrafish), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans, was presented in Fig. 2. The in silico analyses using the SIFT [3], PolyPhen-2 [3], MutationTaster [4] and Human Splicing Finder [5] software, together with the conservation between species and the allele frequency in the Taiwanese population [6], were summarized in Table 2. Furthermore, the amino acid alterations were presented within the functional domains [7] (Fig. 3) and in a partial 3D model of G6PD [8] (Fig. 4). The structures of the resulting mutant G6PD proteins were analyzed by HOPE (Have yOur Protein Explained) [9] (Table 3).
Fig. 2

The similarity alignment of G6PD proteins across different species. The red characters show the corresponding positions of the five substitutions between species, whereas the conserved residues are outlined in green boxes. The species abbreviations are: D. melanogaster, Drosophila melanogaster; C. elegans, Caenorhabditis elegans.

Table 2

The severity prediction for five newly discovered G6PD missense variations.

| Nucleotide substitution | Amino acid substitution | SIFT | PolyPhen-2 | Mutation Taster | Splicing finder | Conservationᵃ | Allele Frequencyᵇ | Predicted Classᶜ |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| c.187G > A | p.E63K | Tolerated | Benign | Disease causing | Potential alteration | Moderately | <2/1417ᵈ | III-IV |
| c.585G > C | p.Q195H | Damaging | Probably damaging | Disease causing | Potential alteration | Highly | <1/1000 | II |
| c.586A > T | p.I196F | Damaging | Probably damaging | Disease causing | Potential alteration | Highly | <1/1000 | II |
| c.743G > A | p.G248D | Damaging | Probably damaging | Disease causing | Probably no impact | Highly | <1/1000 | III |
| c.1330G > A | p.V444I | Tolerated | Possibly damaging | Disease causing | Potential alteration | Highly | <1/1000 | III |

ᵃ Sequence comparison between Homo sapiens, Mus musculus, Danio rerio (zebrafish), Drosophila melanogaster (fruit fly), Caenorhabditis elegans and Saccharomyces cerevisiae, as shown in Fig. 2.

ᵇ Allele frequency in Taiwanese population (https://taiwanview.twbiobank.org.tw/browse38, accessed on 25 April 2019) [6].

ᶜ Classification of G6PD variants in the study according to the WHO definition [7].

ᵈ Two alleles in 1417 people with indeterminate sex.
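To make the agreement between the protein-level predictors in Table 2 easier to compare, the calls can be tallied per variant. The sketch below is only an illustration and is not a scoring scheme from the article: the grouping of labels into a `DELETERIOUS` set (including counting "Possibly damaging" as deleterious) and the helper `count_deleterious` are assumptions, and the splicing column is left out because it addresses a different question.

```python
# Calls copied from Table 2, in the order (SIFT, PolyPhen-2, Mutation Taster).
PREDICTIONS = {
    "p.E63K": ("Tolerated", "Benign", "Disease causing"),
    "p.Q195H": ("Damaging", "Probably damaging", "Disease causing"),
    "p.I196F": ("Damaging", "Probably damaging", "Disease causing"),
    "p.G248D": ("Damaging", "Probably damaging", "Disease causing"),
    "p.V444I": ("Tolerated", "Possibly damaging", "Disease causing"),
}

# Assumption: these labels are treated as "deleterious" calls for the tally.
DELETERIOUS = {
    "Damaging",
    "Probably damaging",
    "Possibly damaging",
    "Disease causing",
}


def count_deleterious(calls):
    """Hypothetical helper: number of tools whose call is in DELETERIOUS."""
    return sum(call in DELETERIOUS for call in calls)


for variant, calls in PREDICTIONS.items():
    print(f"{variant}: {count_deleterious(calls)}/{len(calls)} tools call it deleterious")
```

Under this grouping, p.Q195H, p.I196F and p.G248D are flagged by all three tools, p.V444I by two, and p.E63K by Mutation Taster alone.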

Fig. 3

Schematic representation of alterations in G6PD coding regions and protein functional domains. (A) The coding region of the G6PD gene containing 13 exons. (B) The G6PD protein of 515 amino acids contains two binding domains, namely NAD(P)-binding domain (blue box, amino acids 25–210) and C-terminal domain (green box, amino acids 212–503), and two binding sites, namely NAD(P) binding site (left red box, amino acids 38–44) and G6P-binding site (middle red box, amino acids 198–206), and one dimer interface (right red box, amino acids 380–425). The five mutations were highlighted in black in the coding region and protein domains.
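The residue ranges in the Fig. 3 caption allow a quick lookup of where each substituted residue falls. The following is a minimal Python sketch of that interval check, assuming the ranges are inclusive; the names `REGIONS`, `VARIANT_RESIDUES` and `regions_for` are illustrative and not from the article.

```python
# Residue ranges (assumed inclusive) as given in the Fig. 3 caption.
REGIONS = {
    "NAD(P)-binding domain": (25, 210),
    "C-terminal domain": (212, 503),
    "NAD(P)-binding site": (38, 44),
    "G6P-binding site": (198, 206),
    "dimer interface": (380, 425),
}

# Residue numbers of the five substitutions reported in this dataset.
VARIANT_RESIDUES = {
    "p.E63K": 63,
    "p.Q195H": 195,
    "p.I196F": 196,
    "p.G248D": 248,
    "p.V444I": 444,
}


def regions_for(residue):
    """Hypothetical helper: list every annotated region containing the residue."""
    return [name for name, (start, end) in REGIONS.items() if start <= residue <= end]


for variant, residue in VARIANT_RESIDUES.items():
    print(variant, regions_for(residue))
```

Under these ranges, p.E63K, p.Q195H and p.I196F fall in the NAD(P)-binding domain and p.G248D and p.V444I in the C-terminal domain. None of the five residues lies inside an annotated binding site or the dimer interface, although residues 195 and 196 sit two to three positions before the start of the G6P-binding site (198–206).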

Fig. 4

Close-up views of the ribbon diagram of human G6PD as generated by Swiss PDB viewer. (A) The 3D model structure of G6PD close to the G6P-binding site, showing the Glu63, Gln195, Ile196 and Val444 residues. (B) A close-up view of the G6PD protein containing the NAD(P)-binding site and the Gly248 residue. The G6P- and NAD(P)-binding sites were highlighted in cyan, while the variant residues were presented in red.

Table 3

Structure prediction of the G6PD variations by HOPE algorithm.

| Mutant | Structure prediction by HOPE algorithmᵃ |
| --- | --- |
| p.E63K | The wild-type residue forms a salt bridge with arginine at position 104. The difference in charge will disturb the ionic interaction made by the original, wild-type residue. |
| p.Q195H | The wild-type residue forms a hydrogen bond with arginine at position 192. Because of the size difference between the wild-type and mutant residues, the new residue is not in the correct position to make the same hydrogen bond as the original wild-type residue did. |
| p.I196F | The mutant residue is bigger than the wild-type residue and is located in a domain that is important for the activity of the protein and in contact with residues in another domain. The mutation can affect this interaction and as such affect protein function. |
| p.G248D | The wild-type residue is a glycine, the most flexible of all residues. This flexibility might be necessary for the protein's function. Mutation of this glycine can abolish this function. |
| p.V444I | The mutant residue is bigger than the wild-type residue and is located in a domain that is important for binding of other molecules. The mutation might affect this interaction and thereby disturb signal transfer from binding domain to the activity domain. |

ᵃ Using software Have yOur Protein Explained (HOPE, http://www.cmbi.ru.nl/hope/) [9].


Experimental design, materials and methods

Mutation identification: Sanger sequencing

Among 500 G6PD-deficient male newborns detected by the G6PD enzyme activity assay [10], nine did not carry any of the 21 common mutations described in Taiwan and Southeast Asia, as determined by multiplex SNaPshot assay [1]. The dried blood spots used for their newborn screening were subsequently subjected to mutational analysis by sequencing. All coding exons and exon-intron boundary sequences of the G6PD gene were amplified and analyzed by forward and reverse Sanger sequencing. Putative mutations were confirmed by sequencing an independent PCR product. The study protocol was reviewed and approved by the Institutional Review Board of Taipei City Hospital, Taiwan.

Sequence alignments between species

Conservation of the peptide sequence around the affected residues was assessed by alignment of orthologous and human G6PD sequences with ClustalW2 [2].

Severity prediction and allele frequency in population

Different online algorithms were used to predict the functional consequences of the five variants. The in silico analyses were performed using the SIFT [3], PolyPhen-2 [3], MutationTaster2 [4], and Human Splicing Finder [5] programs. Furthermore, the allele frequency of each alteration in the Taiwanese population was taken from Taiwan Biobank [6].

Distribution of mutations along the coding region and protein sequence

Distribution of alterations was highlighted in the coding region and the functional domains [7]. The A at the ATG translational initiation codon was numbered as 1 in reference accession number NM_001042351. The amino acid numbers were counted from the N-terminal Met of human G6PD protein.
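The numbering convention above (A of the ATG start codon as nucleotide 1, Met as residue 1) fixes the relationship between a coding-sequence position and its codon. As a minimal Python sketch of that arithmetic, with the function name `codon_position` being an illustrative assumption:

```python
def codon_position(cdna_pos):
    """Return (codon number, base position within the codon) for a 1-based
    coding-sequence position, where the A of ATG is position 1."""
    if cdna_pos < 1:
        raise ValueError("coding positions are 1-based")
    codon_number = (cdna_pos - 1) // 3 + 1
    position_in_codon = (cdna_pos - 1) % 3 + 1
    return codon_number, position_in_codon


# Coding positions and reported protein changes of the five variants.
VARIANTS = {
    187: "p.E63K",
    585: "p.Q195H",
    586: "p.I196F",
    743: "p.G248D",
    1330: "p.V444I",
}

for pos, protein_change in VARIANTS.items():
    codon, base = codon_position(pos)
    print(f"c.{pos}: codon {codon}, base {base} of the codon ({protein_change})")
```

For all five variants the computed codon number matches the residue number in the reported protein change (63, 195, 196, 248 and 444).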

3D structure model of wild-type G6PD protein

The positions of the G6PD variations observed in this study were presented on the X-ray crystal structure of human G6PD protein available at the Protein Data Bank (PDB code 1QKI) [8].

Prediction of structural effects of variations

Because protein structure is important for predicting the effects of variants [11], the effect of each mutation on the G6PD protein structure was assessed using the HOPE (Have yOur Protein Explained) software [9].

Specifications table

Subject area:  Genetics, Genomics and Molecular Biology
More specific subject area:  Inborn errors of metabolism
Type of data:  Tables, Figures
How data was acquired:  DNA sequencing using 3730xl Genetic Analyzer (Thermo Fisher Scientific, Waltham, MA, USA), mutation severity prediction software, structural effect prediction software
Data format:  Analyzed
Experimental factors:  DNA extracted from dried blood spot used in newborn screening
Experimental features:  Bioinformatic tools
Data source location:  Taiwan
Data accessibility:  Provided within this article
Related research article:  Chiu YH, Chen HJ, Chang YC, Liu YN, Kao SM, Liu MY, Weng YY, Hsiao KJ, Liu TT. Applying a multiplexed primer extension method on dried blood spots increased the detection of carriers at risk of glucose-6-phosphate dehydrogenase deficiency in newborn screening program. Clin. Chim. Acta 495 (2019) 271–277. https://doi.org/10.1016/j.cca.2019.04.074 [1].

Value of the Data

This study extends the G6PD mutation spectrum.

The three-dimensional structure illustrates the importance of the amino acid residues related to the function of the G6PD protein.

The in silico analysis served as a tool in determining the functional consequence of the mutations, making it potentially valuable for primary care as well as research processes.


1.  Using SIFT and PolyPhen to predict loss-of-function and gain-of-function mutations.

Authors:  Sarah E Flanagan; Ann-Marie Patch; Sian Ellard
Journal:  Genet Test Mol Biomarkers       Date:  2010-08

2.  Clustal W and Clustal X version 2.0.

Authors:  M A Larkin; G Blackshields; N P Brown; R Chenna; P A McGettigan; H McWilliam; F Valentin; I M Wallace; A Wilm; R Lopez; J D Thompson; T J Gibson; D G Higgins
Journal:  Bioinformatics       Date:  2007-09-10       Impact factor: 6.937

3.  MutationTaster2: mutation prediction for the deep-sequencing age.

Authors:  Jana Marie Schwarz; David N Cooper; Markus Schuelke; Dominik Seelow
Journal:  Nat Methods       Date:  2014-04       Impact factor: 28.547

4.  Human glucose-6-phosphate dehydrogenase: the crystal structure reveals a structural NADP(+) molecule and provides insights into enzyme deficiency.

Authors:  S W Au; S Gover; V M Lam; M J Adams
Journal:  Structure       Date:  2000-03-15       Impact factor: 5.006

5.  Applying a multiplexed primer extension method on dried blood spots increased the detection of carriers at risk of glucose-6-phosphate dehydrogenase deficiency in newborn screening program.

Authors:  Yen-Hui Chiu; Hsiao-Jan Chen; Ying-Chen Chang; Yu-Ning Liu; Shu-Min Kao; Mei-Ying Liu; Ying-Yen Weng; Kwang-Jen Hsiao; Tze-Tze Liu
Journal:  Clin Chim Acta       Date:  2019-04-22       Impact factor: 3.786

6.  Role of protein structure in variant annotation: structural insight of mutations causing 6-pyruvoyl-tetrahydropterin synthase deficiency.

Authors:  Joao R C Muniz; Natalie Wing-Sum Szeto; Rebecca Frise; Wen Hwa Lee; Xian-Song Wang; Beat Thöny; Nastassja Himmelreich; Nenad Blau; Kwang-Jen Hsiao; Tze-Tze Liu; Opher Gileadi; Udo Oppermann; Frank Von Delft; Wyatt W Yue; Nelson Leung-Sang Tang
Journal:  Pathology       Date:  2019-03-08       Impact factor: 5.306

7.  External quality assurance programme for newborn screening of glucose-6-phosphate dehydrogenase deficiency.

Authors:  Szu-Hui Chiang; Mei-Ling Fan; Kwang-Jen Hsiao
Journal:  Ann Acad Med Singap       Date:  2008-12       Impact factor: 2.473

8.  Glucose-6-phosphate dehydrogenase deficiency.

Authors:  M D Cappellini; G Fiorelli
Journal:  Lancet       Date:  2008-01-05       Impact factor: 79.321

9.  Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces.

Authors:  Hanka Venselaar; Tim A H Te Beek; Remko K P Kuipers; Maarten L Hekkelman; Gert Vriend
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

10.  Human Splicing Finder: an online bioinformatics tool to predict splicing signals.

Authors:  François-Olivier Desmet; Dalil Hamroun; Marine Lalande; Gwenaëlle Collod-Béroud; Mireille Claustres; Christophe Béroud
Journal:  Nucleic Acids Res       Date:  2009-04-01       Impact factor: 16.971
