| Literature DB >> 27884205 |
Bing Yu1, Paul S de Vries1, Ginger A Metcalf2, Zhe Wang1, Elena V Feofanova1, Xiaoming Liu1, Donna Marie Muzny2, Lynne E Wagenknecht3, Richard A Gibbs2, Alanna C Morrison1, Eric Boerwinkle4,5.
Abstract
BACKGROUND: Blood levels of amino acids are important biomarkers of disease and are influenced by synthesis, protein degradation, and gene-environment interactions. Whole genome sequence analysis of amino acid levels may establish a paradigm for analyzing quantitative risk factors.Entities:
Keywords: Amino acids; Metabolomics; Multi-ethnic; Rare variants; Whole genome sequence
Mesh:
Substances:
Year: 2016 PMID: 27884205 PMCID: PMC5123402 DOI: 10.1186/s13059-016-1106-x
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Identified significant genetic associations with serum amino acid levels. Gene names with a single line underneath indicate the association was reported in previous studies for European ancestry; gene names with double lines underneath indicate the association was reported in previous studies for both African and European ancestry. Gene names shown in the single variant test were assigned according to the leading common variant annotations
Gene exon-based results demonstrating a significant association among both discovery (p < 4.6 × 10−8) and replication (p < 0.003) stages for the T5 burden test
| Metabolite | Gene | Discovery (AA) | Replication (EA) | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
| Beta | cMAC | VarExp |
| Beta | cMAC | VarExp | ||
| Dimethylglycine |
| 3.2 × 10−31 | 0.64 | 96 | 3.6% | 8.1 × 10−12 | 0.39 | 73 | 1.7% |
| N-acetylthreonine |
| 1.1 × 10−10 | 0.12 | 239 | 0.6% | 4.7 × 10−5 | 0.26 | 24 | 0.4% |
| N-acetylalanine |
| 4.1 × 10−41 | 0.16 | 239 | 1.5% | 3.9 × 10−15 | 0.25 | 24 | 0.6% |
| Asparagine |
| 1.1 × 10−10 | 0.34 | 157 | 1.4% | 2.7 × 10−5 | 0.38 | 58 | 0.9% |
| Indolelactate |
| 2.7 × 10−21 | 0.39 | 87 | 1.6% | 1.1 × 10−7 | 0.26 | 33 | 0.5% |
| Alpha-hydroxyisovalerate |
| 1.6 × 10−8 | 0.64 | 21 | 0.8% | 8.2 × 10−6 | 0.41 | 18 | 0.5% |
| Proline |
| 1.4 × 10−29 | 0.14 | 324 | 1.4% | 1.5 × 10−11 | 0.09 | 295 | 0.7% |
cMAC cumulative minor allele count, VarExp variance explained by the loci
Sliding windows demonstrating a significant association among both discovery (p < 1.1 × 10−9) and replication (p < 0.01) stages for the T5 burden test
| Metabolite | Discovery (AA) | Replication (EA) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Window (gene) |
| Beta | cMAC | VarExp | Window (gene) |
| Beta | cMAC | VarExp | |
| N-acetyl-1-methylhistidine | Chr2: 73744005–73748004 ( | 1.6 × 10−15 | 0.12 | 933 | 1.0% | Chr2: 73744005–73748004 ( | 0.0004 | 0.17 | 156 | 0.5% |
| N-acetyl-1-methylhistidine | Chr2: 73614005–73618004 ( | 6.2 × 10−11 | −0.11 | 728 | 0.7% | Chr2: 73614005–73618004 ( | 0.005 | −0.07 | 336 | 0.2% |
| Acisoga | Chr6: 132952009–132956008 ( | 9.4 × 10−10 | 0.06 | 1504 | 0.4% | Chr6: 132952009–132956008 ( | 0.009 | −0.04 | 764 | 0.1% |
cMAC cumulative minor allele count, VarExp variance explained by the loci
Single variant results demonstrating a significant association among both discovery (p < 7.1 × 10−10) and replication (p < 0.003) stages
| Metabolite | Variant information | Discovery (AA) | Replication (EA) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | SNP | Function | Chr:position | REF/ALT | MAF | Beta |
| Var Exp | MAF | Beta |
| Var Exp | |
| Glycine |
| rs1047891 | Missense | 2:211540507 | C/A | 0.37 | 0.09 | 4.5 × 10−19 | 1.3% | 0.31 | 0.16 | 4.9 × 10−45 | 3.6% |
| Dimethylglycine |
| rs933683 | Intronic | 5:78324003 | G/T | 0.44 | −0.15 | 2.3 × 10−14 | 1.9% | 0.29 | −0.09 | 9.5 × 10−6 | 0.7% |
| Asparagine |
| rs11131799 | Intronic | 4:178363378 | G/A | 0.49 | −0.14 | 2.4 × 10−10 | 2.5% | 0.36 | −0.26 | 3.9 × 10−23 | 4.5% |
| N-acetyl-1-methylhistidine |
| rs13538 | Missense | 2:73868328 | A/G | 0.48 | 0.34 | 3.3 × 10−75 | 9.7% | 0.23 | 0.51 | 3.4 × 10−85 | 14.2% |
| Glutarylcarnitine |
| rs8012 | Missense | 19:13010520 | A/G | 0.19 | −0.12 | 9.5 × 10−14 | 1.2% | 0.46 | −0.11 | 2.5 × 10−17 | 1.5% |
| N-acetyl phenylalanine |
| rs13431529 | Intronic | 2:73876041 | G/C | 0.49 | 0.09 | 4.3 × 10−10 | 1.0% | 0.23 | 0.06 | 1.2 × 10−5 | 0.4% |
| 3-Methoxytyrosine |
| rs11575302 | Silent | 7:50607694 | G/A | 0.15 | 0.15 | 2.5 × 10−17 | 1.5% | 0.02 | 0.19 | 1.4 × 10−7 | 0.5% |
| Indolepropionate |
| rs8044331 | Intronic | 16:20450302 | T/C | 0.42 | −0.17 | 5.3 × 10−10 | 1.8% | 0.22 | −0.11 | 0.001 | 0.5% |
| Alpha-hydroxyisovalerate |
| rs17023507 | UTR5 | 1:119923247 | C/T | 0.10 | −0.25 | 1.6 × 10−13 | 1.9% | 0.002 | −0.64 | 0.001 | 0.4% |
| Proline |
| rs1814288 | Intronic | 22:18923383 | C/T | 0.30 | −0.06 | 7.8 × 10−12 | 0.7% | 0.21 | −0.03 | 0.003 | 0.1% |
| Acisoga |
| rs2272996 | Missense | 6: 133015271 | T/C | 0.19 | 0.18 | 8.1 × 10−16 | 0.2% | 0.27 | 0.26 | 4.8 × 10−34 | 5.1% |
REF/ALT reference allele and alternative allele, MAF minor allele frequency, VarExp variance explained by the loci
Conditional analysis of selected regions adjusting for the lead common variant identified by previous genome-wide association studies
| Metabolite | Region | Type | GWAS Lead SNV | Discovery (AA) | Replication (EA) | ||
|---|---|---|---|---|---|---|---|
|
|
|
|
| ||||
| Indolelactate* |
| Gene | rs15676 | 1.3 × 10−20 | 1.1 × 10−20 | 2.1 × 10−6 | 4.1 × 10−6 |
| N-acetyl-1-methylhistidine | Chr2: 73744005–73748004 ( | Window | rs13538 | 1.6 × 10−15 | 0.005 | 4.0 × 10−4 | 0.2 |
| N-acetyl-1-methylhistidine | Chr2: 73614005–73618004 ( | Window | rs13538 | 6.2 × 10−11 | 0.9 | 0.005 | 0.8 |
| Asparagine* |
| Gene | rs4690522 | 6.8 × 10−10 | 9.1 × 10−10 | 1.5 × 10−5 | 6.0 × 10−8 |
| Dimethlyglycine* |
| Gene | rs248386 | 1.1 × 10−26 | 4.3 × 10−27 | 4.4 × 10−11 | 4.5 × 10−10 |
| Alpha-hydroxyisovalerate* |
| Gene | rs12141041 | 1.5 × 10−5 | 3.0 × 10−5 | 9.3 × 10−5 | 2.0 × 10−4 |
| Proline* |
| Gene | rs2540641 | 1.4 × 10−26 | 1.7 × 10−26 | 1.3 × 10−12 | 1.2 × 10−13 |
*Unadjusted results may differ from main analysis because only individuals with both exome sequencing and whole genome sequencing were included in the conditional analysis. SNV single nucleotide variant