Literature DB >> 33717985

Carrier frequency and predicted genetic prevalence of Pompe disease based on a general population database.

Kyung Sun Park1.   

Abstract

BACKGROUND: The genetic prevalence of Pompe disease was estimated based on the proportion of individuals who have a causative genotype in a general population database. In addition, clinical severity for causative genotypes was assessed based on currently available locus-specific databases (LSDBs), which contain information on both genotype and clinical severity.
METHODS: Genetic variants in the GAA gene in the Genome Aggregation Database (gnomAD) (v2.1.1) were analyzed in combination with LSDBs of ClinVar, ClinGen Evidence Repository, Pompe disease GAA variant database, and the Pompe Registry. Carrier frequency (CF) and predicted genetic prevalence (pGP) were estimated.
RESULTS: Of 7 populations, East Asian and African showed higher proportions of pathogenic or likely pathogenic variants (PLPVs) associated with classic infantile-onset Pompe disease. Total CF and pGP in the overall population were 1.3% (1 in 77) and 1:23,232, respectively. The highest pGP was observed in the East Asian population at 1:12,125, followed by Non-Finnish European (1:13,756), Ashkenazi Jewish (1:22,851), African/African-American (1:26,560), Latino/Admixed American (1:57,620), South Asian (1:93,087), and Finnish (1:1,056,444).
CONCLUSIONS: Pompe disease has a higher pGP (1:23,232) than earlier accepted (1:40,000). The pGP for Pompe disease was expectedly wide by population and consistent with previous reports based on newborn screening programs (approximately 1:10,000-1:30,000).
© 2021 The Author.

Entities:  

Keywords:  Carrier frequency; Clinical severity; Genetic prevalence; Pompe disease; Population database; gnomAD

Year:  2021        PMID: 33717985      PMCID: PMC7933537          DOI: 10.1016/j.ymgmr.2021.100734

Source DB:  PubMed          Journal:  Mol Genet Metab Rep        ISSN: 2214-4269


Introduction

Pompe disease is a monogenic autosomal recessive disorder that has been exceptionally well-studied in terms of pathogenesis, clinical features, prognosis, screening/diagnostic methods, and treatment strategy. Pompe disease or glycogen storage disease type II (MIM #232300) is caused by deficiency of the lysosomal alpha-glucosidase (GAA), resulting in accumulation of glycogen within the lysosomes and subsequent progressive cardiac and skeletal muscle destruction [1]. The GAA gene encodes GAA protein. Different pathogenic variants in the GAA gene affect the level of GAA enzyme deficiency. Clinical severity and clinical manifestation of Pompe disease appear to be dependent on residual enzyme activity. Classic infantile-onset Pompe disease with almost complete deficiency of the GAA enzyme (<1% of normal) is typically fatal without treatment by enzyme replacement therapy (ERT) due to cardiorespiratory insufficiency within the first year of life. Late-onset Pompe disease is characterized by slowly progressive proximal muscle weakness and respiratory dysfunction. Patients with late-onset Pompe disease include individuals with onset before age 1 year (12 months) without cardiomyopathy and all individuals with onset after age 1 year. Because of the great success of ERT for early treatment [2,3], Pompe disease was included in the newborn screening program [4]. To date, GAA is reported to be the only gene associated with Pompe disease in the Online Mendelian Inheritance in Man database. More than 1000 genetic variants in GAA have been submitted to ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/). Among those, more than 300 variants were classified as pathogenic or likely pathogenic variants with different review statuses (accessed August 2020). Recently, the ClinGen lysosomal storage disorders expert panel released specifications to the 2015 American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) standards and guidelines [5] for the GAA gene (https://clinicalgenome.org/affiliation/50009/). Classification of GAA variants based on the severity rating system (6 classes) using in vitro study was introduced [6,7], and an extended GAA variant database has been released. This updated database (Pompe disease GAA variant database) provides information on GAA variants linking clinical severity based on in silico predictions, expression study results, and clinical information of the reported phenotype [8]. In addition, data analysis of GAA variants were recently reported based on the Pompe Registry, which is an international multi-center observation program designed to track the clinical outcomes of patients with Pompe disease; that study was sponsored by Genzyme (Cambridge, MA), and GAA variants were listed as 3 phenotypic subgroups (groups A, B, C) [9]. The introduction of next-generation sequencing (NGS) techniques made it possible to obtain massive amounts of genomic data with reduced cost and time. Recently, the Genome Aggregation Database (gnomAD), which contains a very large amount of genomic information associated with the general population, has been released [10,11]. Genomic information from the general population could provide important clues for predicting the prevalence of particular Mendelian disorders by analyzing the proportion of unaffected carriers. These latest research results for Pompe disease have been aggregated in the present study. Specifically, the worldwide prevalence and clinical severity of Pompe disease according to population group were estimated based on gnomAD and currently available locus-specific databases (LSDBs), which contain information about both genotype and clinical severity.

Materials and methods

Analysis of GAA variant in gnomAD

The GAA gene (search by genomic region: chr17:78,075,355-78,093,679 (GRCh37/hg19)) was analyzed from gnomAD (v2.1.1) database (https://gnomad.broadinstitute.org/), which contains genetic variants from 125,748 exomes sequences and 15,708 genome sequences. The populations comprise 12,487 unrelated individuals in the African/African-American (AFR) group, 17,720 individuals in the Latino/Admixed American (AMR), 5185 individuals in the Ashkenazi Jewish (ASJ), 9977 individuals in the East Asian (EAS), 12,562 individuals in the Finnish (FIN), 64,603 individuals in the non-Finnish European (NFE), 15,308 individuals in the South Asian (SAS), and 3614 individuals in the other group. The genetic variants in gnomAD database (v.2.1.1) were classified following the 2015 American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) standards and guidelines [5] and ClinGen Lysosomal Storage Disorders Expert Panel specifications (https://www.clinicalgenome.org/affiliation/50009). All variants were analyzed based on NM_000152.5 selected by matched annotation from NCBI and EMBL-EBI (MANE) (https://www.ncbi.nlm.nih.gov/refseq/MANE/). The GRCh37/hg19 genomic build was used for all position descriptions. For comparison of variants in all databases, sequence variants in nomenclature and genomic position were assessed using Mutalyzer (https://mutalyzer.nl/). Loss-of-function variants in GAA gene are responsible for development of Pompe disease. Therefore, PVS1 decision tree [12] by ClinGen sequence variant interpretation working group and ClinGen Lysosomal Storage Disorders Expert Panel specifications was applied for PVS1 ACMG/AMP variant criterion. For the PM2 code, minor allele frequency < 0.001 in all continental populations with >2000 alleles in gnomAD. For multiple lines of computational evidence (PP3), REVEL [13] (>0.75 for missense variants), Mutation Taster [14] and PROVEAN [15] (for in-frame insertion or deletion variants), MaxEntScan [16] and dbscSNV [17] (for predicted impact on splicing) were used to estimate the potential effect of variant pathogenicity. The active site and catalytic barrel of GAA for PVS1 decision tree were checked using Pfam (https://pfam.xfam.org/), InterPro (https://www.ebi.ac.uk/interpro/) and UniProt (https://www.uniprot.org/). In addition, ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/, last accessed December 15, 2020), Pompe disease GAA variant database [8] (for simplicity, Pompe DB, http://www.pompevariantdatabase.nl/, last accessed December 15, 2020), and the Pompe Registry [9] were used as locus-specific databases (LSDBs). Variants classified by ClinGen were included as variants reviewed by expert panel (3 gold stars of review status) in ClinVar. A Venn diagram for 3 or 4 sets using interactiVenn [18] provided a comparison of genetic variants in gnomAD with those in ClinVar, ClinGen, Pompe DB, and Pompe Registry (Fig. 1). The Mutationmapper tool (https://www.cbioportal.org/mutation_mapper) was used for simultaneously visualizing genetic lesions [19]. Epitools was used (https://epitools.ausvet.com.au/) for statistical analysis.
Fig. 1

Genetic variants in gnomAD, ClinVar (including ClinGen), Pompe DB, or Pompe registry. (A) All genetic variants in the ClinVar, Pompe DB, or Pompe registry. The number in parentheses gives the number of genetic variants found in ClinGen. (B) Comparison of PLPVs in gnomAD and genetic variants in ClinVar, Pompe DB, or Pompe registry. The red color number in parentheses gives the number of PLPVs classified by ClinGen. All LSDBs were accessed December 15, 2020.

Genetic variants in gnomAD, ClinVar (including ClinGen), Pompe DB, or Pompe registry. (A) All genetic variants in the ClinVar, Pompe DB, or Pompe registry. The number in parentheses gives the number of genetic variants found in ClinGen. (B) Comparison of PLPVs in gnomAD and genetic variants in ClinVar, Pompe DB, or Pompe registry. The red color number in parentheses gives the number of PLPVs classified by ClinGen. All LSDBs were accessed December 15, 2020.

Phenotype of clinical severity

In this study, clinical severity of specific variants was described by combining data from the Pompe DB and Pompe Registry. In the Pompe DB, patients who present symptoms with onset age younger than 12 months and hypertrophic cardiomyopathy were classified as classic infantile Pompe disease. Patients who present symptoms with onset age before 18 years and without cardiomyopathy were classified as childhood Pompe disease, and patients with onset age 18 years or later were classified as adult Pompe disease [8]. In the Pompe registry, patients with symptom onset at younger than 12 months of age with cardiomyopathy were classified as group A, patients with symptom onset at 12 years or younger without cardiomyopathy as group B, and patients with onset age 12 years or older as group C [9]. In this study, if the Pompe DB or Pompe registry reported that particular variants were associated with classic infantile type, those were regarded as possibly ‘classic infantile’ variants.

Carrier frequency and predicted genetic prevalence analysis

The carrier frequency (CF) and predicted genetic prevalence (pGP) for Pompe disease were analyzed based on the presumed pathogenic or likely pathogenic variants (PLPVs) in GAA in each population group. The allele frequency of PLPV (AFV) and CFV for a variant V were calculated considering only heterozygous PLPV [20,21] as follows:where the allele count (number of variant alleles), allele number (number of genotyped alleles = 2 * number of individuals), and homozygous count (number of homozygous individuals) for a variant were provided by gnomAD. The CF and pGP at the gene level (CFG and pGPG, respectively) were calculated as previously described [21]:

Results

Genetic variants in the GAA gene in gnomAD

GnomAD contained a total of 3270 genetic variants in GAA gene across populations. Among those, genetic variants within low complexity region or with flag of “uncertain”, “likely not loss-of-function (LOF)”, “not LOF” by loss-of-function manual curation in gnomAD were not included in this analysis. To date, 1716 variants are included in the ClinVar (including ClinGen), Pompe DB, or Pompe registry (Fig. 1A). Among those, 225 variants were included in all LSDBs. The presumed PLPVs in the GAA gene in gnomAD are described in Table S1. In this study, a total of 154 different variants were classified into PLPVs: 75 variants were (likely) pathogenic variants with a review status of two or more gold stars (two or three) in ClinVar, and the other 79 were (likely) pathogenic variants with a review status of below 2 gold stars (zero or one), variants of uncertain significance, variants with conflicting interpretations of pathogenicity or absent in ClinVar. The ACMG evidence codes for the other 79 variants were described in Table S1. Of the 154 different PLPVs, 125 variants (81.2%) were in both gnomAD and ClinVar, 125 variants (81.2%) were in both gnomAD and Pompe DB, 109 variants (70.8%) were in both gnomAD and Pompe registry, and 105 variants (68.2%) were included in all LSDBs (Fig. 1B). There were representative 9 PLPVs in global population: (i) which had an allele frequency (AF) greater than 0.0001 in global populations or (ii) an AF greater than 0.0001 in a particular population and also found in multiple populations with AF greater than 0.0001 (Fig. 2, Table 1). The AF of c.-32-13 T > G was highest (AF = 0.0034 in global population), followed by those of c.2238G > C (p.Trp746Cys, AF = 0.00031), c.2560C > T (p.Arg854Ter, AF = 0.000209), c.841C > T (p.Arg281Trp, AF = 0.000205), [c.752C > T; c.761C > T] ([p.Ser251Leu; p.Ser254Leu], AF = 0.00019), 1552-3C > G (AF = 0.00013), 1935C > A (p.Asp645Glu, AF = 0.00012), c.525del (p.Glu176ArgfsTer45, AF = 0.00010) and c.2237G > C (p.Trp746Ser, AF = 0.00007). Notably, a particular population group occupied a dominant proportion (more than 70%) in allele counts of each of representative PLPVs: NFE in allele counts of c.-32-13 T > G (71.7% of total, NFE AF = 0.0053), c.525del (85.2%, NFE AF = 0.00019), c.841C > T (89.5%, NFE AF = 0.00041), 1552-3C > G (92.1%, NFE AF = 0.00027), and c.2238G > C (83.9%, NFE AF = 0.00057); EAS in allele counts of [c.752C > T; c.761C > T] (100% of total, EAS AF = 0.0028), 1935C > A (100%, EAS AF = 0.0017); and AFR in allele counts of c.2560C > T (81.0% of total, AFR AF = 0.0019). Of the 7 populations, the EAS group showed the highest sum of AF of PLPVs with classic infantile type (0.00426), followed by AFR (0.003838), NFE (0.001347), ASJ (0.000975), AMR (0.000957), SAS (0.000939), and FIN (0.000314).
Fig. 2

Representative 9 pathogenic or likely pathogenic variants in the overall population. Trefoil, Trefoil (P type) domain (81–130 amino acids); Gal_m.., Gal_mutarotas_2: galactose mutarotase-like (256–318 amino acids); Glyco_hydro_31, Glycosyl hydrolases family 31 (340–824 amino acids).

Table 1

Representative pathogenic or likely pathogenic variants with allele frequency greater than 0.0001 and at least 3 or more alleles by population group.

VariantPompe DBa
Pompe registry b
Major population in gnomAD cAllele frequency in gnomAD (v2.1.1), #order of high frequency in population d
No. of patientsPhenotype with null alleleNo. of patientsGroupGlobal (N = 141,456)AFR (N = 12,487)AMR (N = 17,720)ASJ (N = 5185)EAS (N = 9977)FIN (N = 12,562)NFE (N = 64,603)SAS (N = 15,308)
c.-32-13 T > G733CH, AD689B,CGlobal0.00340, #10.00094, #20.00269, #10.00554, #10.00021, #90.00016, #20.00529, #10.00190, #1
c.525del, p.Glu176ArgfsTer45154CI111A,B,CGlobal0.00010, #80.00017, #500000.00019, #50
c.546 + 5G > T2Unknown0EAS0.000040000.00050, #3000
[c.752C > T; c.761C > T], [p.Ser251Leu; p.Ser254Leu]26Unknown<5BGlobal0.00019, #50000.00276, #1000
c.841C > T, p.Arg281Trp0<5BGlobal0.00021, #40.00016, #600000.00041, #3(0.00003)
c.853C > T, p.Pro285Ser3CH, AD<5CAFR0.000010.00013, #7000000
c.1316 T > A, p.Met439Lys14CI<5CEAS0.000030000.00038, #4000
c.1411_1414del, p.Glu471ProfsTer518CI<5AEAS0.000020000.00025, #7000
c.1552-3C > G2Unknown0Global0.00013, #60(0.00008)0000.00027, #40
c.1634C > T, p.Pro545Leu7CH, AD7B,CFIN0.0000200000.00012, #3(0.000007)0
c.1725C > A, p.Tyr575Ter1Unknown0FIN0.0000200000.00020, #100
c.1843G > A, p.Gly615Arg9CI9A,B,CEAS0.000020000.00022, #8000
c.1856G > A, p.Ser619Asn4CH, AD<5BSAS0.000020000000.00013, #3
c.1935C > A, p.Asp645Glu104CI42A,B,CGlobal0.00012, #70000.00173, #2000
c.1942G > A, p.Gly648Ser24Unknown7A,CSAS0.00005(0.00007)0000(0.000009)0.00034, #2
c.1979G > A, p.Arg660His6CH9A,B,CAFR0.000040.00025, #4(0.00009)000(0.000008)0
c.2173C > T, p.Arg725Trp8CH, AD8A,CASJ0.00004000.00077, #200(0.00002)0
c.2237G > C, p.Trp746Ser1CH<5BGlobal0.00007, #90.00032, #30.00011, #30(0.00005)(0.00003)(0.00005)0
c.2238G > C, p.Trp746Cys37CH, AD38B,CGlobal0.00031, #20(0.00008)00.00035, #500.00057, #20
c.2560C > T, p.Arg854Ter77CI45A,B,CGlobal0.00021, #30.00189, #10.00020, #200(0.00004)(0.00002)(0.00003)
c.2662G > T, p.Glu888Ter26CI14A,B,CEAS0.000020000.00027, #6000
c.2815_2816del, p.Val939LeufsTer786CI5A,B,CEAS0.000010000.00016, #10000

CI, classic infantile; CH, childhood; AD, adult; AFR, African/African-American; AMR, Latino/Admixed American; ASJ, Ashekenazi Jewish; EAS, East Asian; FIN, Finnish; NFE, Non-Finnish European; SAS, South Asian.

In the Pompe DB [8], patients who present symptoms with onset age younger than 12 months and hypertrophic cardiomyopathy were classified as classic infantile Pompe disease, patients who present symptoms with onset age before 18 years and without cardiomyopathy as childhood Pompe disease, and patients with onset age 18 years or later as adult Pompe disease.

In the Pompe registry [9], patients with symptom onset at younger than 12 months of age with cardiomyopathy were classified as group A, patients with symptom onset at 12 years or younger without cardiomyopathy as group B, and patients with onset age 12 years or older as group C.

Global refers to pathogenic or likely pathogenic variants (PLPVs) with allele frequency greater than 0.0001 by global population or representative PLPVs (greater than allele frequency 0.0001) in particular population found also in multiple populations with AF greater than 0.0001.

Allele frequency in parentheses addresses less than 0.0001. N refers to number of individuals included in gnomAD (2.1.1).

Representative 9 pathogenic or likely pathogenic variants in the overall population. Trefoil, Trefoil (P type) domain (81–130 amino acids); Gal_m.., Gal_mutarotas_2: galactose mutarotase-like (256–318 amino acids); Glyco_hydro_31, Glycosyl hydrolases family 31 (340–824 amino acids). Representative pathogenic or likely pathogenic variants with allele frequency greater than 0.0001 and at least 3 or more alleles by population group. CI, classic infantile; CH, childhood; AD, adult; AFR, African/African-American; AMR, Latino/Admixed American; ASJ, Ashekenazi Jewish; EAS, East Asian; FIN, Finnish; NFE, Non-Finnish European; SAS, South Asian. In the Pompe DB [8], patients who present symptoms with onset age younger than 12 months and hypertrophic cardiomyopathy were classified as classic infantile Pompe disease, patients who present symptoms with onset age before 18 years and without cardiomyopathy as childhood Pompe disease, and patients with onset age 18 years or later as adult Pompe disease. In the Pompe registry [9], patients with symptom onset at younger than 12 months of age with cardiomyopathy were classified as group A, patients with symptom onset at 12 years or younger without cardiomyopathy as group B, and patients with onset age 12 years or older as group C. Global refers to pathogenic or likely pathogenic variants (PLPVs) with allele frequency greater than 0.0001 by global population or representative PLPVs (greater than allele frequency 0.0001) in particular population found also in multiple populations with AF greater than 0.0001. Allele frequency in parentheses addresses less than 0.0001. N refers to number of individuals included in gnomAD (2.1.1).

Distribution of carrier frequency (CF) and predicted genetic prevalence (pGP) in each population

Unaffected carriers for GAA (the total CF for GAA) are predicted to comprise 1.3% of the overall population (Fig. 3A). Of the 7 population groups, the EAS showed the highest CF (1.8%), followed by NFE (1.7%), ASJ (1.3%), AFR(1.2%), AMR (0.8%), SAS (0.7%), and FIN (0.2%).
Fig. 3

Distribution of carrier frequency and predicted genetic prevalence in each population. (A) Distribution of carrier frequency in each population. (B) Distribution of predicted genetic prevalence in each population.

Distribution of carrier frequency and predicted genetic prevalence in each population. (A) Distribution of carrier frequency in each population. (B) Distribution of predicted genetic prevalence in each population. Overall, the pGP for Pompe disease was 4.30 individuals per 100,000 births (1:23,232) (Fig. 3B). The pGP of the EAS was 8.25 per 100,000 births (1:12,125), followed by NFE (7.27 per 100,000 births, 1:13,756), ASJ (4.38 per 100,000 births, 1:22,851), AFR (3.76 per 100,000 births, 1:26,560), AMR (1.74 per 100,000 births, 1:57,620), SAS (1.07 per 100,000 births, 1:93,087), and FIN (0.09 per 100,000 births, 1:1,056,444).

Discussion

In this study, the major questions were how GAA variants detected in a generally healthy population reflect the probabilities of developing Pompe disease and how these probabilities change with respect to clinical severity between population groups. To answer these questions, the GAA variants found in gnomAD for each population group were analyzed in combination with data from other LSDBs that include the clinical and genotype information of Pompe patients. Generally, the incidence of Pompe disease is estimated to be 1 in 40,000 live births [[22], [23], [24]] and varies depending on ethnicity and geographic region from 1:14,000 (African American), 1:40,000 (Netherlands and USA), 1:50,000 (Taiwan), 1:145,000 (Australia) to 1:600,000 (Portugal) [[22], [23], [24], [25], [26], [27], [28]]. However, the incidence of Pompe disease reported by newborn screening programs is much higher than those estimated: 1:16,919 in Taiwan [29], 1:8684 in Austria [30], 1:4400 in Hungary [31], and from 1:10,152 to 1:27,581 in USA (Table S2) [[32], [33], [34], [35], [36], [37], [38]]. There was a difference in incidence of Pompe disease between countries, although their major ethnic populations are the same. These differences might be caused by how distribution of ethnic subpopulations in the particular country or geographic region, choice of screening methods, how the cutoff is set for screening, or whether second tier tests such as genetic testing are included in newborn screening programs. Because of those factors, it was difficult to directly compare the incidence of Pompe disease according to population. In the current study, pGP showed wide variations due to population group, ranging from 8.24 per 100,000 births (1:12,125) to 0.09 per 100,000 births (1:1,056,444). Although this study estimated the genetic prevalence (not incidence) of Pompe disease based on the CF in gnomAD (the difference between prevalence and incidence is described in https://www.cureffi.org/2019/06/05/using-genetic-data-to-estimate-disease-prevalence/), these results in this study were somewhat consistent with previous reports on newborn screening programs. The pGP (highest pGP in this study was 1:12,125) of Pompe disease in the EAS group was slightly higher than the real incidence in Taiwan (1:16,919), but the difference was not significant (P = 0.765 by z-test). Previous reports [[39], [40], [41], [42], [43]] collectively show that there are differences in most frequent pathogenic variants even within the same Chinese group according to geographic region. Therefore, either prevalence or pGP might be different depending on which subpopulation was more prevalent in gnomAD: here, there were genomic data of a total of 9977 from the EAS group including 1909 Koreans, 76 Japanese, and 7992 other East Asian in gnomAD (V.2.1.1). Notably, the pGP of Pompe disease in FIN was the lowest in this study. The same findings were reported by previous studies in Pompe patients [44,45]. Although Pompe disease is well studied and LSDBs contains a lot of genetic information, only 13% (225 variants/1726 known variants) overlapped in all the 3 LSDBs. Of those 225 variants, 100 variants were classified as PLPVs with a review status of two and more gold stars in ClinVar (data not shown). Among 154 presumed PLPVs in gnomAD, 105 PLPVs (68%) overlapped in all 3 LSDBs, of those, 67 were PLPVs with a review status of two and more gold stars in ClinVar, and 33 were PLPVs with three gold stars (reviewed by ClinGen). Among presumed PLPVs in gnomAD (v.2.1.1.), c.-32-13 T > G was most frequently found in all population except East Asian. As expected, the c.525del (AF = 0.00019) in NFE, [c.752C > T; c.761C > T] (AF = 0.00276) and c.1935C > A (AF = 0.00173) in EAS, c.2560C > T in AFR (AF = 0.00189) showed high allele frequency. However, there were several unexpected findings for linking population to high AF of PLPVs. The c.525del in AFR (AF = 0.00017) is as frequent as in NFE (AF = 0.00019). In Pompe DB (last accessed December 15, 2020), 154 patients with c.525del has been reported, however, only 1 patient was African American. The AF (0.0002) of c.2560C > T in AMR is higher than expected. To date, total of 77 patients with c.2560C > T has been reported in Pompe DB, 18 Pompe patients with c.2560C > T were reported in Latin America. Although the c.841C > T and c.1552-3C > T showed relatively higher AF, clinical reports about these variants were rare. The c.2237G > C (global AF = 0.00007) and c.2238G > C (global AF = 0.00031) were found in multiple populations with high AF. However, only 1 clinical case (Chinese patient with c.2237G > C) has been reported in Pompe DB. Although the c.2238G > C (NFE AF = 0.00057) was the second most frequently found in NFE in gnomAD, there were only 2 European patients' reports among 37 patients (most of them were Asian) in Pompe DB. Because of these unexpected findings, the AF of selected PLPVs in gnomAD between version 2 and version 3 were compared. There was not much difference in some variants (c.2560C > T in AMR (AF = 0.0003926) and AFR (AF = 0.00198) in gnomAD (v3); c.2237G > C in global (AF = 0.000085)), while others differed between the database versions (c.525del in AFR (AF 0.00012) and NFE (AF 0.00032) in gnomAD (v3); c.2238G > C in NFE (AF = 0.00069) and EAS (AF = 0)). The differences between predicted and real development of Pompe disease might be caused by the difference of haplotypes between populations or ethnic subpopulatoins, therefore, resulting in difference of compound heterozygosity, phenotype (whether Pompe disease develops), and clinical severity. In addition, the difference between the two versions (v2 and v3) of gnomAD might be due to the difference in the number of samples between populations, distribution of subpopulations or test methods. Genetic screening is the process of genetic testing designed to identify those at increased risk of having or developing a disease, with the goal of prevention or early treatment in a specified population [46]. The most common causative variants in Pompe disease reported in previous studies also were found in gnomAD [8,9,39,47]. In addition, a sum of PLPVs AF greater than 0.0001 in 5 population groups (except FIN and SAS groups) represented more than 70% of the total PLPVs AF. These results indicate that genetic screening for Pompe disease may be feasible according to population group. Knowledge of the GAA PLPVs combined with clinical severity in a particular continental population could be valuable for setting up a screening program. When considering screening for Pompe disease (either measuring enzymatic activity or genetic testing), it is important that no classic infantile type be missed. This is particularly important because enzymatic activity of GAA might change depending on combination of PLPVs in each population group [48,49], and this has an effect on the chosen cutoff. Remarkably, the proportions of PLPVs associated with classic infantile type in AFR and EAS groups were higher than those of other populations. The most common PLPVs associated with classic infantile type were c.2560C > T (p.Arg854Ter) in AFR (AF = 0.00189), c.1935C > A (p.Asp645Glu) in EAS groups (AF = 0.00173) and c.525del (p.Glu176ArgfsTer45) in NFE (AF = 0.00019). In this study, structural variations (SV) or copy number variations (CNV) were not included. A deletion variant of exon 18, c.2481 + 102_2646 + 31del (537 bp deletion, or c.2481 + 110_2646 + 39del, chr17:78091650–78,092,187 based on GRCh37/hg19) was reported as a frequent mutation in Dutch patients [50]. In gnomAD SVs (v2.1), one heterozygous CNV deletion (530 bp deletion, chr17:78091657–78,092,187) was reported in European population: global AF of this variant was 0.000046 (1/21,694) and European AF was 0.00013 (1/7,624). This variant was assumed to be the famous exon 18 deletion. However, it was unclear whether this CNV deletion estimated by CNV caller in gnomAD SVs (v2.1) was same as the variant of c.2481 + 102_2646 + 39del because of difference of genomic position and deletion size. A gnomAD v3 includes only genomic data, therefore, SV reference data is expected to be expanded. If large SV reference data is released through gnomAD v3, further research including SV/ CNV as well as single nucleotide variants, insertion/deletion variants in GAA is needed. Unfortunately, the number of people in each population enrolled in gnomAD does not reflect the real-world population. In addition, the variants with 2 or more gold stars of review status accounts for less than 30% of the total variants in ClinVar (accessed 15 Dec 2020). Additionally, most genetic studies on Pompe disease have been analyzed based on specific populations. Since some real causative variants can be classified as variants of uncertain significance and not as PLPVs due to insufficient clinical, genetic, or functional information, the CF and pGP calculated in the present study might be underestimated.

Conclusions

To the best of the author's knowledge, this is the first study that analyzed Pompe disease based on genomic data of the general population and estimated unaffected carriers and genetic prevalence by population. With this novel and alternative approach, Pompe disease (1:23,232) is estimated to be more frequent than formerly accepted (1:40,000). The pGP in the present study support the latest outcome of newborn screening programs (approximately 1:10,000–1:30,000). In addition, the pGP for Pompe disease was wide based on population. The proportions of PLPVs associated with classic infantile type also varied by population and were notably higher in African and East Asian groups. This approach to analyze genomic information of Mendelian disorders in the general population suggests another helpful direction for predictive and preventive medicine.

Author statement

Carrier frequency and predicted genetic prevalence of Pompe disease based on a general population database (MGMREPORTS-D-20-00195). Kyung Sun Park (Department of Laboratory Medicine, School of Medicine, Kyung Hee University, Seoul, Korea): Conceptualization, Methodology, Data analysis, Visualization, Original draft preparation, and Writing- Reviewing and Editing.

Funding

This study was supported by a (NRF) grant funded by the Korean government () (No. 2019R1A2C2002177).

Declaration of Competing Interest

The author has no competing interests.
  50 in total

1.  Update of the pompe disease mutation database with 60 novel GAA sequence variants and additional studies on the functional effect of 34 previously reported variants.

Authors:  Marian Kroos; Marianne Hoogeveen-Westerveld; Helen Michelakakis; Robert Pomponio; Ans Van der Ploeg; Dicky Halley; Arnold Reuser
Journal:  Hum Mutat       Date:  2012-05-29       Impact factor: 4.878

Review 2.  Newborn screening for Pompe disease in Japan: report and literature review of mutations in the GAA gene in Japanese and Asian patients.

Authors:  Ken Momosaki; Jun Kido; Shinichiro Yoshida; Keishin Sugawara; Tatsuki Miyamoto; Takahito Inoue; Toshika Okumiya; Shirou Matsumoto; Fumio Endo; Shinichi Hirose; Kimitoshi Nakamura
Journal:  J Hum Genet       Date:  2019-05-10       Impact factor: 3.172

3.  Newborn Screening for Pompe Disease.

Authors:  Olaf A Bodamer; C Ronald Scott; Roberto Giugliani
Journal:  Pediatrics       Date:  2017-07       Impact factor: 7.124

4.  Carrier frequency analysis of mutations causing autosomal-recessive-inherited retinal diseases in the Israeli population.

Authors:  Mor Hanany; Gilad Allon; Adva Kimchi; Anat Blumenfeld; Hadas Newman; Eran Pras; Ohad Wormser; Ohad S Birk; Libe Gradstein; Eyal Banin; Tamar Ben-Yosef; Dror Sharon
Journal:  Eur J Hum Genet       Date:  2018-04-30       Impact factor: 4.246

5.  Very Early Treatment for Infantile-Onset Pompe Disease Contributes to Better Outcomes.

Authors:  Chia-Feng Yang; Chen Chang Yang; Hsuan-Chieh Liao; Ling-Yi Huang; Chuan-Chi Chiang; Hui-Chen Ho; Chih-Jou Lai; Tzu-Hung Chu; Tsui-Feng Yang; Ting-Rong Hsu; Wen-Jue Soong; Dau-Ming Niu
Journal:  J Pediatr       Date:  2015-12-10       Impact factor: 4.406

6.  Frequency of glycogen storage disease type II in The Netherlands: implications for diagnosis and genetic counselling.

Authors:  M G Ausems; J Verbiest; M P Hermans; M A Kroos; F A Beemer; J H Wokke; L A Sandkuijl; A J Reuser; A T van der Ploeg
Journal:  Eur J Hum Genet       Date:  1999-09       Impact factor: 4.246

7.  Mass Spectrometry but Not Fluorimetry Distinguishes Affected and Pseudodeficiency Patients in Newborn Screening for Pompe Disease.

Authors:  Hsuan-Chieh Liao; Min-Ju Chan; Chia-Feng Yang; Chuan-Chi Chiang; Dau-Ming Niu; Chun-Kai Huang; Michael H Gelb
Journal:  Clin Chem       Date:  2017-04-27       Impact factor: 8.327

8.  Algorithm for Pompe disease newborn screening: results from the Taiwan screening program.

Authors:  Shu-Chuan Chiang; Wuh-Liang Hwu; Ni-Chung Lee; Li-Wen Hsu; Yin-Hsiu Chien
Journal:  Mol Genet Metab       Date:  2012-04-24       Impact factor: 4.797

9.  Clinical and GAA gene mutation analysis in mainland Chinese patients with late-onset Pompe disease: identifying c.2238G > C as the most common mutation.

Authors:  Xiao Liu; Zhaoxia Wang; Weina Jin; He Lv; Wei Zhang; Chengli Que; Yu Huang; Yun Yuan
Journal:  BMC Med Genet       Date:  2014-12-20       Impact factor: 2.103

10.  Newborn Screening for Pompe Disease: Pennsylvania Experience.

Authors:  Can Ficicioglu; Rebecca C Ahrens-Nicklas; Joshua Barch; Sanmati R Cuddapah; Brenda S DiBoscio; James C DiPerna; Patricia L Gordon; Nadene Henderson; Caitlin Menello; Nicole Luongo; Damara Ortiz; Rui Xiao
Journal:  Int J Neonatal Screen       Date:  2020-11-13
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.