| Literature DB >> 34680056 |
Che-Hong Chen1, Benjamin R Kraemer1, Lucia Lee1, Daria Mochly-Rosen1.
Abstract
Human aldehyde dehydrogenase (ALDH) is a multigene family with 19 functional members encoding a class of diverse but important enzymes for detoxification or biotransformation of different endogenous and exogenous aldehyde substrates. Genetic mutations in the ALDH genes can cause the accumulation of toxic aldehydes and abnormal carbonyl metabolism and serious human pathologies. However, the physiological functions and substrate specificity of many ALDH genes are still unknown. Although many genetic variants of the ALDH gene family exist in human populations, their phenotype or clinical consequences have not been determined. Using the most comprehensive global human Genome Aggregation Database, gnomAD, we annotated here 1350 common variants in the 19 ALDH genes. These 1350 common variants represent all known genetic polymorphisms with a variant allele frequency of ≥0.1% (or an expected occurrence of ≥1 carrier per 500 individuals) in any of the seven major ethnic groups recorded by gnomAD. We detailed 13 types of DNA sequence variants, their genomic positions, SNP ID numbers, and allele frequencies among the seven major ethnic groups worldwide for each of the 19 ALDH genes. For the 313 missense variants identified in the gnomAD, we used two software algorithms, Polymorphism Phenotyping (PolyPhen) and Sorting Intolerant From Tolerant (SIFT), to predict the consequences of the variants on the structure and function of the enzyme. Finally, gene constraint analysis was used to predict how well genetic mutations were tolerated by selection forces for each of the ALDH genes in humans. Based on the ratio of observed and expected variant numbers in gnomAD, the three ALDH1A gene members, ALDH1A1, ALDH1A2, and ALDH1A3, appeared to have the lowest tolerance for loss-of-function mutations as compared to the other ALDH genes (# observed/# expected ratio 0.15-0.26). These analyses suggest that the ALDH1A1, ALDH1A2, and ALDH1A3 enzymes may serve a more essential function as compared with the other ALDH enzymes; functional loss mutations are much less common in healthy human populations than expected. This informatic analysis may assist the research community in determining the physiological function of ALDH isozymes and associate common variants with clinical phenotypes.Entities:
Keywords: ALDH; aldehyde; aldehyde dehydrogenase; genetic variants; gnomAD
Mesh:
Substances:
Year: 2021 PMID: 34680056 PMCID: PMC8533364 DOI: 10.3390/biom11101423
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Summary of human ALDH Isozymes. (A) Table outlining all 19 human ALDH isozymes with the number of amino acid residues, UniProtKB accession number, chromosomal location, and cellular location listed. (B) Phylogenetic tree showing the evolution of human ALDH isozymes with the number of amino acid residues shown in parenthesis (modified from Koppaka et al. (2012) [35]). The color denotes the cellular localization and corresponds between the phylogenetic tree and table as follows: pink, mitochondria; green, cytosol; blue, multiple compartments; no color, unknown location.
Summary of 1350 common variants from 19 human ALDH genes. The number of variants of 13 different types of DNA mutations in each gene are listed based to their Variant Effect Predictor (VEP) annotations.
| Gene | No. of Amino Acid | No. of Variants ≥0.1% | Intron | Missense | Synonymous | Splice Region | 3’ UTR | 5’ UTR | Frameshift | Stop Gained | Splice Acceptor | Start Lost | Splice Donor | Coding Sequence | Stop Lost |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ALDH1A1 | 501 | 49 | 32 | 5 | 5 | 4 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| ALDH1A2 | 518 | 68 | 45 | 6 | 8 | 2 | 1 | 5 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| ALDH1A3 | 512 | 70 | 33 | 10 | 17 | 3 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ALDH1B1 | 517 | 36 | 0 | 19 | 10 | 0 | 3 | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 |
| ALDH1L1 | 902 | 124 | 63 | 26 | 19 | 7 | 4 | 1 | 0 | 2 | 1 | 0 | 1 | 0 | 0 |
| ALDH1L2 | 923 | 109 | 72 | 19 | 9 | 2 | 3 | 2 | 0 | 2 | 0 | 0 | 0 | 0 | 0 |
| ALDH2 | 517 | 47 | 26 | 9 | 9 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| ALDH3A1 | 453 | 86 | 42 | 24 | 12 | 0 | 4 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 |
| ALDH3A2 | 485 | 34 | 15 | 9 | 6 | 2 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| ALDH3B1 | 468 | 78 | 33 | 14 | 15 | 2 | 8 | 0 | 3 | 0 | 1 | 0 | 0 | 2 | 0 |
| ALDH3B2 | 385 | 84 | 35 | 32 | 8 | 1 | 3 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| ALDH4A1 | 563 | 102 | 51 | 23 | 17 | 2 | 3 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ALDH5A1 | 535 | 59 | 19 | 20 | 9 | 4 | 2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ALDH6A1 | 535 | 36 | 17 | 6 | 4 | 6 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ALDH7A1 | 539 | 74 | 44 | 13 | 9 | 1 | 1 | 4 | 1 | 0 | 0 | 1 | 0 | 0 | 0 |
| ALDH8A1 | 487 | 35 | 6 | 10 | 10 | 4 | 1 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ALDH9A1 | 494 | 62 | 22 | 22 | 7 | 2 | 1 | 6 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| ALDH16A1 | 802 | 136 | 63 | 35 | 33 | 2 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ALDH18A1 | 795 | 61 | 31 | 11 | 13 | 3 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 2Summary of common variants of ALDH isozymes. (A) Ratio of the type of common variants present in the 19 human ALDH isozymes. (B) Number of common loss-of-function variants in all 19 ALDH isozymes. The isozymes are listed in ascending order of the loss of function o/e ratio.
Gene constraint analysis of synonymous variants, missense variants, and loss-of-function variants of the 19 human ALDH genes. Exp. SNVs (Expected Variant Counts); Obs. SNVs (Observed Variant Counts). The constraint metrics (o/e ratio) is defined as the ratio between Obs. SNPs and Exp. SNVs. The parenthesis shows the 90% confidence interval (CI) for each of the o/e value. Genes that are less tolerant to certain class of genetic variants (value of o/e ratio <0.60) are indicated in bold faces.
| Synonymous Variants | Missense Variants | Loss of Function Variants | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | UniProtKB ID | No. of Amino Acid | Exp. SNVs | Obs. SNVs | Constraint Metrics (o/e Ratio) | Exp. SNVs | Obs. SNVs | Constraint Metrics (o/e Ratio) | Exp. SNVs | Obs. SNVs | Constraint Metrics (o/e Ratio) |
| ALDH1A1 | P00352 | 501 | 101.6 | 106.0 | 1.04 (0.89–1.23) | 276.4 | 199.0 | 0.72 (0.64–0.81) | 27.3 | 4.0 |
|
| ALDH1A2 | O94788 | 518 | 105.2 | 128.0 | 1.22 (1.05–1.41) | 283.1 | 215.0 | 0.76 (0.68–0.85) | 26.5 | 6.0 |
|
| ALDH1A3 | P47895 | 512 | 123.0 | 122.0 | 0.99 (0.85–1.15) | 301.2 | 192.0 | 0.64 (0.57–0.72) | 23.2 | 6.0 |
|
| ALDH18A1 | P54886 | 795 | 165.8 | 158.0 | 0.95 (0.84–1.09) | 441.2 | 325.0 | 0.74 (0.67–0.81) | 39.8 | 13.0 |
|
| ALDH4A1 | P30038 | 563 | 156.0 | 150.0 | 0.96 (0.84–1.10) | 354.5 | 357.0 | 1.01 (0.92–1.10) | 28.9 | 13.0 |
|
| ALDH3A2 | P51648 | 485 | 104.9 | 85.0 | 0.81 (0.68–0.97) | 277.0 | 245.0 | 0.88 (0.80–0.98) | 23.6 | 12.0 |
|
| ALDH1L1 | O75891 | 902 | 235.3 | 241.0 | 1.02 (0.92–1.14) | 552.0 | 512.0 | 0.93 (0.86–1.00) | 45.0 | 25.0 |
|
| ALDH3B1 | P43353 | 468 | 128.5 | 130.0 | 1.01 (0.88–1.17) | 306.2 | 259.0 | 0.85 (0.76–0.94) | 18.3 | 11.0 | 0.60 (0.38–0.99) |
| ALDH5A1 | P51649 | 535 | 119.5 | 112.0 | 0.94 (0.80–1.10) | 289.1 | 254.0 | 0.88 (0.79–0.97) | 23.4 | 14.0 | 0.60 (0.40–0.94) |
| ALDH6A1 | Q02252 | 535 | 103.6 | 87.0 | 0.84 (0.70–1.00) | 292.3 | 242.0 | 0.83 (0.74–0.92) | 29.5 | 19.0 | 0.64 (0.45–0.94) |
| ALDH1L2 | Q3SY69 | 923 | 183.0 | 167.0 | 0.91 (0.80–1.04) | 500.3 | 414.0 | 0.83 (0.76–0.90) | 49.1 | 34.0 | 0.69 (0.53–0.92) |
| ALDH2 | P05091 | 517 | 134.9 | 143.0 | 1.06 (0.92–1.22) | 316.1 | 250.0 | 0.79 (0.71–0.88) | 24.4 | 18.0 | 0.74 (0.51–1.09) |
| ALDH1B1 | P30837 | 517 | 140.5 | 133.0 | 0.95 (0.82–1.09) | 321.5 | 333.0 | 1.04 (0.95–1.13) | 13.0 | 10.0 | 0.77 (0.47–1.30) |
| ALDH9A1 | P49189 | 494 | 107.7 | 119.0 | 1.10 (0.95–1.29) | 292.0 | 312.0 | 1.07 (0.97–1.17) | 25.5 | 20.0 | 0.78 (0.55–1.14) |
| ALDH16A1 | Q8IZ83 | 802 | 228.6 | 228.0 | 1.00 (0.89–1.11) | 503.3 | 530.0 | 1.05 (0.98–1.13) | 43.1 | 35.0 | 0.81 (0.62–1.08) |
| ALDH7A1 | P49419 | 539 | 105.6 | 108.0 | 1.02 (0.87–1.20) | 295.8 | 278.0 | 0.94 (0.85–1.04) | 37.4 | 35.0 | 0.93 (0.71–1.24) |
| ALDH8A1 | Q9H2A2 | 487 | 122.0 | 115.0 | 0.94 (0.81–1.10) | 291.8 | 268.0 | 0.92 (0.83–1.02) | 18.2 | 18.0 | 0.99 (0.68–1.46) |
| ALDH3A1 | P30838 | 453 | 128.6 | 137.0 | 1.07 (0.93–1.23) | 287.1 | 273.0 | 0.95 (0.86–1.05) | 22.0 | 24.0 | 1.09 (0.79–1.53) |
| ALDH3B2 | P48448 | 385 | 114.9 | 123.0 | 1.07 (0.92–1.24) | 250.2 | 256.0 | 1.02 (0.92–1.14) | 16.0 | 18.0 | 1.13 (0.78–1.66) |