| Literature DB >> 31194788 |
Robert W Read1, Karen A Schlauch1, Gai Elhanan1, William J Metcalf1, Anthony D Slonim2, Ramsey Aweti3, Robert Borkowski3, Joseph J Grzymski1,2.
Abstract
In this study, we perform a full genome-wide association study (GWAS) to identify statistically significantly associated single nucleotide polymorphisms (SNPs) with three red blood cell (RBC) components and follow it with two independent PheWASs to examine associations between phenotypic data (case-control status of diagnoses or disease), significant SNPs, and RBC component levels. We first identified associations between the three RBC components: mean platelet volume (MPV), mean corpuscular volume (MCV), and platelet counts (PC), and the genotypes of approximately 500,000 SNPs on the Illumina Infimum DNA Human OmniExpress-24 BeadChip using a single cohort of 4,673 Northern Nevadans. Twenty-one SNPs in five major genomic regions were found to be statistically significantly associated with MPV, two regions with MCV, and one region with PC, with p<5x10-8. Twenty-nine SNPs and nine chromosomal regions were identified in 30 previous GWASs, with effect sizes of similar magnitude and direction as found in our cohort. The two strongest associations were SNP rs1354034 with MPV (p = 2.4x10-13) and rs855791 with MCV (p = 5.2x10-12). We then examined possible associations between these significant SNPs and incidence of 1,488 phenotype groups mapped from International Classification of Disease version 9 and 10 (ICD9 and ICD10) codes collected in the extensive electronic health record (EHR) database associated with Healthy Nevada Project consented participants. Further leveraging data collected in the EHR, we performed an additional PheWAS to identify associations between continuous red blood cell (RBC) component measures and incidence of specific diagnoses. The first PheWAS illuminated whether SNPs associated with RBC components in our cohort were linked with other hematologic phenotypic diagnoses or diagnoses of other nature. Although no SNPs from our GWAS were identified as strongly associated to other phenotypic components, a number of associations were identified with p-values ranging between 1x10-3 and 1x10-4 with traits such as respiratory failure, sleep disorders, hypoglycemia, hyperglyceridemia, GERD and IBS. The second PheWAS examined possible phenotypic predictors of abnormal RBC component measures: a number of hematologic phenotypes such as thrombocytopenia, anemias, hemoglobinopathies and pancytopenia were found to be strongly associated to RBC component measures; additional phenotypes such as (morbid) obesity, malaise and fatigue, alcoholism, and cirrhosis were also identified to be possible predictors of RBC component measures.Entities:
Mesh:
Year: 2019 PMID: 31194788 PMCID: PMC6564422 DOI: 10.1371/journal.pone.0218078
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Cohort characteristics.
| Age (years) | 47.24 ± 15.82 | |
| Male (%) | 1328 (28.24) | |
| African American (%) | 53 (1.12) | |
| Asian (%) | 100 (2.12) | |
| Caucasian (%) | 4,175 (89.34) | |
| Latino (%) | 138 (2.93) | |
| Native American (%) | 30 (0.64) | |
| Pacific Islander (%) | 11 (0.23) | |
| Unknown (%) | 168 (3.6) | |
| MPV (fL) | 10.58 ± 0.98 | [9, 12.9] (fL) |
| MCV (fL) | 91.53 ± 4.40 | [81.4, 97.8] (fL) |
| PC (K/uL) | 251.83 ± 61.92 | [164, 446] (K/uL) |
Table of cohort characteristics. Continuous variables are presented as mean ± SD; categorical variables are presented as counts and percentages. All values were standardized to the reference ranges of the most recent administered laboratory test. The median MPV value is 10.49; the median of the MCV values is 91.76; the median PC value is 246. The IQR of MPV is [9.90, 11.16]; the IQR of MCV is [89.06, 94.30]; the IQR of PC is [209.67, 286.67].
Fig 1MCV GWAS Manhattan plot.
Genome-wide association study results for MCV. The x-axis represents the genomic position of 498,916 SNPs. The y-axis represents -log10-transformed raw p-values of each genotypic association. The red horizontal line indicates the threshold of significance p = 5x10-8.
Statistically significant GWAS SNPs.
| rsID | Chrom | Cyto Region | Associated Gene | Minor Allele | MAF | β | (SE) | GWAS | Mutation Classification | RBC |
|---|---|---|---|---|---|---|---|---|---|---|
| rs10274553 | chr3 | p14.3 | ARHGEF3 | C | 49.74 | -0.1198 | 0.020 | 3.82x10-9 | intron | MPV |
| rs10509186 | chr3 | p14.3 | ARHGEF3 | T | 45.55 | -0.1187 | 0.021 | 7.75x10-9 | intron | MPV |
| rs10822186 | chr7 | q22.3 | NA | G | 49.22 | -0.1107 | 0.020 | 4.38x10-8 | unknown | MPV |
| rs11130549 | chr7 | q22.3 | NA | C | 34.11 | -0.1238 | 0.022 | 9.94x10-9 | unknown | MPV |
| rs12355784 | chr7 | q22.3 | NA | A | 45.46 | -0.1184 | 0.021 | 9.32x10-9 | unknown | MPV |
| rs1354034 | chr7 | q22.3 | NA | T | 41.14 | 0.1546 | 0.021 | 2.39x10-13 | unknown | MPV |
| rs1788103 | chr7 | q22.3 | NA | G | 48.18 | -0.1261 | 0.020 | 5.15x10-10 | unknown | MPV |
| rs1790588 | chr7 | q22.3 | NA | C | 48.01 | -0.1273 | 0.020 | 3.31x10-10 | unknown | MPV |
| rs1790974 | chr10 | q21.3 | JMJD1C | T | 43.82 | -0.1128 | 0.020 | 3.32x10-8 | intron | MPV |
| rs1935 | chr10 | q21.3 | JMJD1C | G | 45.58 | -0.1135 | 0.021 | 3.57x10-8 | intron | MPV |
| rs201979226 | chr10 | q21.3 | JMJD1C | C | 48.78 | 0.1183 | 0.020 | 5.89x10-9 | intron, near-gene-5 | MPV |
| rs342240 | chr10 | q21.3 | JMJD1C | A | 41.36 | 0.129 | 0.021 | 3.49x10-10 | intron, untranslated-3 | MPV |
| rs342275 | chr10 | q21.3 | JMJD1C | T | 41.9 | 0.1292 | 0.020 | 2.96x10-10 | intron | MPV |
| rs342293 | chr10 | q21.3 | JMJD1C | G | 44.31 | 0.1325 | 0.020 | 6.61x10-11 | missense | MPV |
| rs342296 | chr10 | q21.3 | REEP3 | A | 44.03 | 0.131 | 0.020 | 1.04x10-10 | intron | MPV |
| rs34818942 | chr12 | q24.31 | WDR66 | T | 7.29 | 0.254 | 0.039 | 7.77x10-11 | intron | MPV |
| rs386614085 | chr12 | q24.31 | RHOF | G | 45.45 | -0.1172 | 0.021 | 1.21x10-8 | intron | MPV |
| rs4379723 | chr18 | q22.2 | CD226 | C | 45.45 | -0.1172 | 0.021 | 1.29x10-8 | missense | MPV |
| rs763361 | chr18 | q22.2 | CD226 | T | 47.24 | -0.1273 | 0.020 | 3.26x10-10 | intron | MPV |
| rs7910927 | chr18 | q22.2 | CD226 | G | 45.52 | -0.1146 | 0.021 | 2.68x10-8 | intron | MPV |
| rs7961894 | chr18 | q22.2 | DOK6 | T | 10.24 | 0.2221 | 0.033 | 2.68x10-11 | untranslated-3 | MPV |
| rs218237 | chr4 | q12 | NA | T | 15.19 | 0.740 | 0.126 | 5.07x10-9 | unknown | MCV |
| rs9402686 | chr6 | q23.3 | NA | A | 24.89 | 0.647 | 0.104 | 4.60x10x-10 | unknown | MCV |
| rs7776054 | chr6 | q23.3 | NA | G | 24.28 | 0.645 | 0.104 | 6.65x10-10 | unknown | MCV |
| rs9399137 | chr6 | q23.3 | NA | C | 23.83 | 0.648 | 0.105 | 7.82x10-10 | unknown | MCV |
| rs7775698 | chr6 | q23.3 | NA | T | 24.23 | 0.642 | 0.104 | 8.24x10-10 | unknown | MCV |
| rs4895441 | chr6 | q23.3 | NA | G | 25.08 | 0.628 | 0.103 | 1.27x10-9 | unknown | MCV |
| rs111194878 | chr6 | q23.3 | NA | A | 25.41 | 0.622 | 0.103 | 1.47x10-9 | unknown | MCV |
| rs9373124 | chr6 | q23.3 | NA | C | 26.16 | 0.599 | 0.102 | 5.14x10-9 | unknown | MCV |
| rs855791 | chr22 | q12.3 | TMPRSS6 | A | 44.2 | -0.621 | 0.090 | 5.23x10-12 | missense | MCV |
| rs4820268 | chr22 | q12.3 | TMPRSS6 | G | 46.56 | -0.604 | 0.090 | 2.65x10-11 | coding-synon | MCV |
| rs5756504 | chr22 | q12.3 | TMPRSS6 | T | 36.93 | 0.567 | 0.092 | 7.77x10-10 | intron | MCV |
| rs130624 | chr22 | q12.3 | NA | G | 42.77 | 0.549 | 0.090 | 1.13x10-9 | unknown | MCV |
| rs5756506 | chr22 | q12.3 | TMPRSS6 | C | 36.92 | 0.563 | 0.092 | 1.15x10-9 | intron | MCV |
| rs386563505 | chr22 | q12.3 | NA | A | 40.75 | 0.525 | 0.091 | 7.12x10-9 | unknown | MCV |
| rs385893 | chr9 | p24.1 | NA | T | 49 | -7.744 | 1.258 | 8.04x10-10 | unknown | PC |
| rs10974808 | chr9 | p24.1 | RCL1 | G | 11.48 | 11.490 | 1.943 | 3.53x10-9 | intron | PC |
| rs423955 | chr9 | p24.1 | NA | C | 34.17 | -7.387 | 1.325 | 2.64x10-8 | near-gene-5 | PC |
This table lists the statistically significant SNPs associated in our cohort with MPV, MCV, and PC. Effect sizes and their standard deviations are presented in fL per each copy of the minor allele. Raw p-values generated by the GWAS are presented.
respiratory failure
; those with PC included GERD and other diseases of the esophagus. Our study also showed links with MPV-associated SNPs and skin cancer, hypoglycemia, hyperglyceridemia, IBS, among others. These associations are outlined in S3A–S3C Fig.PheWAS Results for MPV, MCV and PC.
| Phecode | Description | Group | RBC | β | SE | N | |
|---|---|---|---|---|---|---|---|
| 287.3 | Thrombocytopenia | hematopoietic | MPV | 0.75 | 0.11 | 9.06x10-12 | 4104 |
| 287 | Purpura and other hemorrhagic conditions | hematopoietic | MPV | 0.62 | 0.10 | 2.86x10-10 | 4124 |
| 286.3 | Coagulation defects complicating pregnancy or postpartum | hematopoietic | MPV | 2.47 | 0.39 | 3.66x10-10 | 4029 |
| 655 | Known or suspected fetal abnormality affecting mother | pregnancy complications | MPV | 0.56 | 0.11 | 8.85x10-7 | 4455 |
| 798 | Malaise and fatigue | symptoms | MPV | 0.16 | 0.03 | 5.21x10-6 | 4162 |
| 261 | Vitamin deficiency | endocrine/metabolic | MPV | 0.14 | 0.04 | 5.43x10-5 | 4049 |
| 61.4 | Vitamin D deficiency | endocrine/metabolic | MPV | 0.14 | 0.04 | 6.55x10-5 | 3992 |
| 282.8 | Other hemoglobinopathies | hematopoietic | MCV | -12.18 | 0.87 | 1.97x10-43 | 3751 |
| 282 | Hereditary hemolytic anemias | hematopoietic | MCV | -10.33 | 0.82 | 8.62x10-36 | 3754 |
| 280 | Iron deficiency anemias | hematopoietic | MCV | -3.76 | 0.36 | 1.30x10-25 | 3854 |
| 278 | Overweight, obesity and other hyperalimentation | endocrine/metabolic | MCV | -1.48 | 0.14 | 3.56x10-25 | 4365 |
| 278.1 | Obesity | endocrine/metabolic | MCV | -1.71 | 0.17 | 2.01x10-23 | 3874 |
| 280.1 | Iron deficiency anemias unspecified or not due to blood loss | hematopoietic | MCV | -3.81 | 0.38 | 5.03x10-23 | 3837 |
| 278.11 | Morbid obesity | endocrine/metabolic | MCV | -2.04 | 0.21 | 6.83x10-22 | 3540 |
| 281.9 | Deficiency anemias | hematopoietic | MCV | 8.73 | 1.10 | 2.60x10-15 | 3743 |
| 289.9 | Abnormality of red blood cells | hematopoietic | MCV | -7.30 | 1.08 | 1.94x10-11 | 3742 |
| 289 | Other diseases of blood and blood-forming organs | hematopoietic | MCV | 2.84 | 0.43 | 7.37x10-11 | 3827 |
| 317.11 | Alcoholic liver damage | mental disorders | MCV | 8.46 | 1.31 | 1.38x10-10 | 4009 |
| 281 | Other deficiency anemia | hematopoietic | MCV | 4.55 | 0.73 | 4.33x10-10 | 3759 |
| 317 | Alcohol-related disorders | mental disorders | MCV | 4.15 | 0.67 | 5.96x10-10 | 4041 |
| 317.1 | Alcoholism | mental disorders | MCV | 5.59 | 0.91 | 9.92x10-10 | 4021 |
| 571.8 | Liver abscess and sequelae of chronic liver disease | digestive | MCV | 8.01 | 1.50 | 9.85x10-8 | 3855 |
| 571.51 | Cirrhosis of liver without mention of alcohol | digestive | MCV | 7.22 | 1.42 | 3.61x10-7 | 3856 |
| 342 | Hemiplegia | neurological | MCV | -14.80 | 3.09 | 1.67x10-6 | 4060 |
| 573.2 | Liver replaced by transplant | digestive | MCV | 14.01 | 2.99 | 2.97x10-6 | 3849 |
| 571.81 | Portal hypertension | digestive | MCV | 9.80 | 2.12 | 4.00x10-6 | 3851 |
| 250.4 | Abnormal glucose | endocrine/metabolic | MCV | -0.88 | 0.19 | 6.30x10-6 | 3946 |
| 250 | Diabetes mellitus | endocrine/metabolic | MCV | -1.03 | 0.23 | 9.77x10-6 | 3697 |
| 250.21 | Type 2 diabetes with ketoacidosis | endocrine/metabolic | MCV | 12.99 | 3.04 | 1.94x10-5 | 3296 |
| 530.2 | Esophageal bleeding (varices/hemorrhage) | digestive | MCV | 8.18 | 1.91 | 1.96x10-5 | 3176 |
| 70.2 | Viral hepatitis B | infectious diseases | MCV | -17.81 | 4.31 | 3.63x10-5 | 4266 |
| 539 | Bariatric surgery | digestive | MCV | -1.98 | 0.48 | 3.71x10-5 | 4550 |
| 287.3 | Thrombocytopenia | hematopoietic | PC | -85.15 | 6.56 | 9.17x10-38 | 4104 |
| 287 | Purpura and other hemorrhagic conditions | hematopoietic | PC | -71.06 | 5.92 | 1.11x10-32 | 4124 |
| 278 | Overweight, obesity and other hyperalimentation | endocrine/metabolic | PC | 14.23 | 1.97 | 6.12x10-13 | 4366 |
| 284 | Aplastic anemia | hematopoietic | PC | -98.52 | 14.12 | 3.54 x10-12 | 3749 |
| 284.1 | Pancytopenia | hematopoietic | PC | -100.90 | 15.03 | 2.16x10-11 | 3747 |
| 278.1 | Obesity | endocrine/metabolic | PC | 14.80 | 2.35 | 3.18x10-10 | 3875 |
| 278.11 | Morbid obesity | endocrine/metabolic | PC | 17.70 | 2.92 | 1.56 x10-9 | 3541 |
| 571.51 | Cirrhosis of liver without mention of alcohol | digestive | PC | -120.03 | 19.85 | 1.61x10-9 | 3857 |
| 571.8 | Liver abscess and sequelae of chronic liver disease | digestive | PC | -121.58 | 21.03 | 7.96x10-9 | 3856 |
| 288.1 | Decreased white blood cell count | hematopoietic | PC | -32.91 | 5.82 | 1.68x10-8 | 3832 |
| 287.31 | Primary thrombocytopenia | hematopoietic | PC | -134.74 | 25.98 | 2.24x10-7 | 4028 |
| 286.3 | Coagulation defects complicating pregnancy or postpartum | hematopoietic | PC | -120.29 | 23.71 | 4.10x10-7 | 4029 |
| 655 | Known or suspected fetal abnormality affecting mother | pregnancy complications | PC | -35.93 | 7.09 | 4.27x10-7 | 4455 |
| 571.81 | Portal hypertension | digestive | PC | -143.14 | 29.73 | 1.53x10-6 | 3852 |
| 288.2 | Elevated white blood cell count | hematopoietic | PC | 22.82 | 4.99 | 4.88x10-6 | 3875 |
| 395.2 | Nonrheumatic aortic valve disorders | circulatory system | PC | -29.71 | 6.50 | 5.03x10-6 | 4019 |
| 280.1 | Iron deficiency anemias unspecified or not due to blood loss | hematopoietic | PC | 24.29 | 5.74 | 2.36x10-5 | 3838 |
| 555 | Inflammatory bowel disease and other gastroenteritis and colitis | digestive | PC | 33.57 | 7.99 | 2.70x10-5 | 3504 |
Table of phenotype groups (phecodes) reaching statistical significance (p<3.4x10-5) when associated to continuous MPV, MCV and PC component values. Phecodes and their description, effect sizes (β) of the regression, standard error (SE), and p-values are included. Each phecode group contains at least 20 cases. Note that two associations with MPV at slightly higher p-values (p = 5.43x10-5 and p = 6.55x10-5) are also included; these are presented in the Discussion.
Fig 2MCV PheWAS plot.
This figure illustrates the results of individual linear regression between incidence of phenotype groups (phecodes) and continuous MCV component measures. The model includes age, gender and ethnicity as covariates. Each point represents the p-value of the association between one of 1,488 phecodes with at least 20 cases assigned to it, and the MCV component measure. The horizontal red line represents the significance level p = 3.4x10-5.
Table presenting gene functions.
| Gene | Gene Description | Region | RBC | Function | Reference |
|---|---|---|---|---|---|
| Rho Guanine Nucleotide Exchange Factor 3 | p14.3 | MPV | Increases activity of Rho GTPases by catalyzing the release of bound GDP; may have a role in megakaryocytes maturation | [ | |
| Histone Demethylase | q21.3 | MPV | Possible hormone-dependent transcriptional activation | [ | |
| Receptor Accessory Protein 3 | q21.3 | MPV | Membrane protein | [ | |
| WD Repeat Domain 66 | q24.31 | MPV | May create and alter platelet volumes | [ | |
| Ras Homolog Family Member F | q24.31 | MPV | May regulate platelet filopodia formation | [ | |
| Cluster of Differentiation 226 | q22.22 | MPV | Catalyzes binding of activated platelets to endothelial cells; may have a role in in platelet signal transduction | [ | |
| Docking Protein 6 | q22.2 | MPV | Protein scaffolding | [ | |
| Transmembrane Protease, Serine 6 | q12.13 | MCV | Acts by cleaving hemojuvelin | [ | |
| RNA Terminal Phosphate Cyclase Like 1 | p24.1 | PC | rRNA processing | [ |
This table presents functions of genes associated to all SNPs found significantly associated to one or more RBC components in the GWASs.
Column identifiers for GWAS Results.
| Column name | Definition |
|---|---|
| Chromosome | |
| Individual SNP identifier | |
| Location of SNP on relative chromosome | |
| Alternative Allele | |
| Selected statistical test–ADD represents the additive effect | |
| Indicates the number of observations–non-missing genotypes | |
| The effect size for this variant, defined per copy of the A1 allele | |
| The standard error of the effect size | |
| Lower end of the 95% confidence interval for the effect size | |
| Upper end of the 95% confidence interval for the effect size | |
| The value of the test statistic | |
| The p-value for the association test |
Table describing the column headers for the results file of our genome wide associations. This summary results file only lists the top 10,000 SNPs in order to prevent a re-identification attack.