| Literature DB >> 25969697 |
Logan Dumitrescu1, Robert Goodloe1, Yukiko Bradford2, Eric Farber-Eger3, Jonathan Boston3, Dana C Crawford4.
Abstract
BACKGROUND: Biorepositories linked to de-identified electronic medical records (EMRs) have the potential to complement traditional epidemiologic studies in genotype-phenotype studies of complex human diseases and traits. A major challenge in meeting this potential is the use of EMR-derived data to extract phenotypes and covariates for genetic association studies. Unlike traditional epidemiologic data, EMR-derived data are collected for clinical care and are therefore highly variable across patients. The variability of clinical data coupled with the challenges associated with searching unstructured clinical notes requires the development of algorithms to extract phenotypes for analysis. Given the number of possible algorithms that could be developed for any one EMR-derived phenotype, we explored here the impact algorithm decision logic has on genetic association study results for a single quantitative trait, high density lipoprotein cholesterol (HDL-C).Entities:
Keywords: Electronic medical record; Genetic risk score; HDL-C; PAGE I study; eMERGE network
Year: 2015 PMID: 25969697 PMCID: PMC4428098 DOI: 10.1186/s13040-015-0048-2
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Lipid-lowering medication class and list of drugs
| Fibrates | Resisns | Statins |
|---|---|---|
| Gemfibrozil (Lopid®) | Cholestyramine (Questran®, Questran® Light, Prevalite®, Locholest®, Locholest® Light) | Atorvastatin (Lipitor®) |
| Fenofibrate (Antara®, Lofibra®, Tricor®, Triglide™) | Colestipol (Colestid®) | Fluvastatin (Lescol®) |
| Clofibrate (Atromid-S) | Colesevelam Hcl (WelChol®) | Lovastatin (Mevacor® and Altoprev™) |
| Pravastatin (Pravachol®) | ||
| Rosuvastatin Calcium (Crestor®) | ||
| Simvastatin (Zocor®) | ||
| Lovastatin + niacin (Advicor®) | ||
| Atorovastatin + amlodipine (Caduet®) | ||
| Simvastatin + ezetimibe (Vytorin™) |
Four major medication classes containing lipid-lowering medications were used to search the clinical notes: fibrates, niacin, resins, selective cholesterol absorption inhibitors (Ezetimibe or Zetia®), and statins (also known an HMG CoA reductase inhibitors). For each of the medication classes included in the search, we have listed the specific drugs considered, including both the generic and brand names.
SNPs used to calculate the genetic risk score for HDL-C in African Americans
| SNP | Gene of interest | Effect Allele | Effect on HDL-C†(mg/dl) |
|---|---|---|---|
| rs247617 |
| C | −0.111 |
| rs1077834 |
| A | −0.033 |
| rs10096633 |
| G | −0.042 |
| rs189069311 |
| A | −0.080 |
| rs255054 |
| A | −0.042 |
| rs6601299 |
| A | −0.063 |
| rs4810479 |
| G | −0.029 |
†Beta coefficients were drawn from meta-analysis results of PAGE African Americans [23].
EAGLE BioVU African American demographics for HDL-C
| Variable | No. Obs. | Median | IQR |
|---|---|---|---|
| Medical record length (years) | 4,912 | 3 | 8 |
| Clinic visits (N) | 4,912 | 3 | 6 |
| ICD-9 codes (N)† | 4,897 | 54 | 85 |
| All HDL-C (mg/dl) | 4,890 | 51 | 23 |
| First HDL-C (mg/dl) | 4,912 | 50 | 23 |
| Last HDL-C (mg/dl) | 4,912 | 49 | 22 |
| pre-medication HDL-C (mg/dl) | 4,074 | 51 | 23 |
| post-medication HDL-C (mg/dl) | 2,086 | 48 | 22 |
Number of observations (No. Obs.) as well as medians and interquartile ranges (IQR) are given for each variable. †Includes only unique ICD-9 codes per individual.
Figure 1Distribution of HDL-C risk alleles in EAGLE BioVU African Americans.
Figure 2Additive effects of HDL-C risk alleles on various HDL-C measurements. Effect sizes (betas) from the linear regression analysis with the unweighted GRS, adjusted for sex, are shown as expected HDL-C levels (in mg/dl; black diamonds), along with their 95% confidence intervals.