| Literature DB >> 20162021 |
Taylor J Maxwell1, Matthew L Bendall, Jeffrey Staples, Todd Jarvis, Keith A Crandall.
Abstract
Genotype/phenotype association analyses (Treescan) with plasma lipid levels and functional site prediction methods (TreeSAAP and PolyPhen) were performed using sequence data for ANGPTL4 from 3,551 patients in the Dallas Heart Study. Biological assays of rare variants in phenotypic tails and results from a Treescan analysis were used as "known" variants to assess the site prediction abilities of PolyPhen and TreeSAAP. The E40K variant in European Americans and the R278Q variant in African Americans were significantly associated with multiple lipid phenotypes. Combining TreeSAAP and PolyPhen performed well to predict "known" functional variants while reducing noise from false positives.Entities:
Keywords: ANGPTL4; TreeSAAP; association studies; phylogenetics; selection; treescan
Mesh:
Substances:
Year: 2010 PMID: 20162021 PMCID: PMC2821009 DOI: 10.3390/ijms11010370
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1.Phylogenetic network showing relationships among sampled haplotypes. Edges are labeled with the base or amino acid change and colored based on results from significant.PolyPhen and TreeSAAP results. The nodes are colored according to the known effect of the variant on the protein as determined by in vitro assays [3,4]. Yellow nodes indicate variants that prevent secretion; orange nodes indicate variants that cause a nonfunctional protein to be secreted; white nodes were not tested in vitro; and gray is a synonymous substitution.
Haplotype frequencies for the haplotypes in Figure 1 overall and each population. All = combined; EA = European American, AA = African American, MA = Mexican American.
| h1 | 0.51242 | 0.54341 | 0.47735 | 0.54052 | 0.68493 |
| h2 | 0.26020 | 0.28447 | 0.21706 | 0.34828 | 0.23288 |
| h12 | 0.06475 | 0.14402 | 0.02102 | 0.05862 | 0.06164 |
| h13 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h14 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h15 | 0.00030 | 0.00102 | 0 | 0 | 0 |
| h16 | 0.04186 | 0.00153 | 0.07936 | 0.00517 | 0.00685 |
| h17 | 0.00105 | 0.00255 | 0.00030 | 0.00086 | 0 |
| h18 | 0.00045 | 0.00102 | 0 | 0.00086 | 0 |
| h19 | 0.01220 | 0 | 0.02399 | 0 | 0 |
| h23 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h24 | 0.02997 | 0.00051 | 0.05774 | 0.00259 | 0 |
| h25 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h26 | 0.00030 | 0 | 0.00059 | 0 | 0 |
| h35 | 0.00045 | 0 | 0.00089 | 0 | 0 |
| h36 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h37 | 0.00211 | 0 | 0.00415 | 0 | 0 |
| h39 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h40 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h41 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h42 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h44 | 0.00015 | 0 | 0 | 0.00086 | 0 |
| h45 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h53 | 0.00030 | 0 | 0.00059 | 0 | 0 |
| h54 | 0.05285 | 0.00204 | 0.10127 | 0.00431 | 0 |
| h55 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h56 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h57 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h58 | 0.00467 | 0.00153 | 0.00651 | 0.00517 | 0 |
| h63 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h64 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h72 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h73 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h82 | 0.00015 | 0.00051 | 0 | 0 | 0 |
| h90 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h91 | 0.00030 | 0 | 0.00059 | 0 | 0 |
| h92 | 0.00030 | 0.00051 | 0.00000 | 0 | 0.00685 |
| h93 | 0.00060 | 0 | 0.00118 | 0 | 0 |
| h94 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h95 | 0.00407 | 0 | 0 | 0.02328 | 0 |
| h96 | 0.00708 | 0.01277 | 0.00296 | 0.00948 | 0.00685 |
| h97 | 0.00030 | 0 | 0.00059 | 0 | 0 |
| h98 | 0.00015 | 0 | 0.00030 | 0 | 0 |
| h102 | 0.00015 | 0.00051 | 0 | 0 | 0 |
PolyPhen and TreeSAAP results for each missense polymorphisms used in the study. Each rare variant is defined by which part of the triglyceride phenotype distribution it was found (H = high, M = Middle, L= Low) according to Romeo et al. [3]. For the five common missense variants, Significant and NonSig (Nonsignificant) refers to phenotypic associations from the Treescan analyses. The Biological Assay column refers to assays in Table 3 of Romeo et al. [4]. A “-“ means no tests were performed. All significant PolyPhen predictions are in bold. All TreeSAAP properties considered significant with a score of 6 or more [13] are reported, all with an extreme value of 8 are in bold. The TreeSAAP property symbol key is provided below.
| M-1-T | M | - | NA | benign | |
| P-5-L | M | - | NA | benign | |
| E-40-K | Significant | - | 1.424 | benign | |
| M-41-I | NonSig | - | 1.16 | benign | |
| S-67-R | M | - | 1.563 | ||
| R-72-L | M | - | 1.958 | ||
| E-167-K | L | LPL Inhib | 0.194 | benign | |
| P-174-S | M | - | 1.715 | ||
| E-190-Q | NonSig | - | 0.243 | benign | |
| E-196-K | M | - | 1.541 | ||
| G-223-R | L | Secretion | 2.065 | ||
| R-230-C | M | - | 2.792 | ||
| F-237-V | M | - | 2.51 | ||
| P-251-T | H | Nothing | 1.781 | ||
| T-266-M | NonSig | - | 0.783 | benign | K0, Ht |
| R-278-Q | Significant | - | 0.644 | benign | pHi |
| V-291-M | M | - | 1.012 | benign | |
| L-293-M | M | - | 1.236 | benign | |
| E-296-V | M | - | 2.057 | ||
| P-307-S | M | - | 0.955 | benign | αc |
| V-308-M | M | - | 1.199 | benign | |
| R-336-C | L | Secretion | 2.255 | ||
| D-338-E | M | - | 1.626 | ||
| W-349-C | L | Secretion | 3.677 | ||
| G-361-R | L | Secretion | 2.274 | μ | |
| R-371-Q | H | Nothing | 1.558 | pHi | |
| R-384-W | L | Secretion | 2.304 | Br, Ht |
Figure 2.A schematic representation of ANGPTL4 coding region. The locations of variant sites are colored according to their affect on protein functionality as previously described [4]. Yellow sites prevent protein secretion, orange sites cause a non-functional protein to be secreted, and black sites were not tested in vitro. Amino acid sites identified by PolyPhen as “possibly damaging” are indicated in light red; “probably damaging” sites are shown in dark red. Radically changing (categories 6, 7, and 8) amino acid sites identified by TreeSAAP are shown in blue, with category 8 sites in dark blue.
A comparison of results between PolyPhen, TreeSAAP, and their combinations with “known” data. Strict PolyPhen only counts “probably damaging” as significant while Strict TreeSAAP only counts category 8 as significant. P-values are from a two-tailed Fisher’s exact test of a 2 by 2 table comparing the “Functional or Significant column to the “Middle or Not Sig” column. Sensitivity, specificity, alpha, and beta levels are from this comparison.
| Significant | 5 | 2 | 8 | 0.673 | 1.828 | 0.254 | 15.766 | |
| Not Significant | 3 | 0 | 9 | |||||
| 0.625 | 0.529 | 0.471 | 0.375 | |||||
| 493.08 | ||||||||
| Significant | 7 | 2 | 7 | 9.130 | 0.859 | 8 | ||
| Not Significant | 1 | 0 | 10 | |||||
| 0.875 | 0.588 | 0.412 | 0.125 | |||||
| Significant | 5 | 0 | 3 | 0.061 | 7.012 | 0.846 | 77.356 | |
| Not Significant | 3 | 2 | 14 | |||||
| 0.625 | 0.824 | 0.176 | 0.375 | |||||
| Significant | 5 | 0 | 5 | 0.194 | 3.762 | 0.505 | 34.675 | |
| Not Significant | 3 | 2 | 12 | |||||
| 0.625 | 0.706 | 0.294 | 0.375 | |||||
| Significant | 4 | 0 | 4 | 0.359 | 3.084 | 0.385 | 27.020 | |
| Not Significant | 4 | 2 | 13 | |||||
| 0.5 | 0.765 | 0.235 | 0.5 | |||||
| Significant | 3 | 0 | 2 | 0.283 | 4.192 | 0.369 | 64.438 | |
| Not Significant | 5 | 2 | 15 | |||||
| 0.375 | 0.882 | 0.118 | 0.625 | |||||
| 11.52 | 626.87 | |||||||
| Significant | 7 | 0 | 6 | 6 | 1.077 | 1 | ||
| Not Significant | 1 | 2 | 11 | |||||
| 0.875 | 0.647 | 0.353 | 0.125 | |||||