| Literature DB >> 30643251 |
Mengzhen Liu1, Yu Jiang2,3, Robbee Wedow4,5,6, Yue Li7,8, David M Brazel4,9,10, Fang Chen2,3, Gargi Datta1, Jose Davila-Velderrain7,8, Daniel McGuire2,3, Chao Tian11, Xiaowei Zhan12,13, Hélène Choquet14, Anna R Docherty15,16, Jessica D Faul17, Johanna R Foerster18, Lars G Fritsche18, Maiken Elvestad Gabrielsen19, Scott D Gordon20, Jeffrey Haessler21, Jouke-Jan Hottenga22, Hongyan Huang23,24, Seon-Kyeong Jang1, Philip R Jansen25,26, Yueh Ling2,9, Reedik Mägi27, Nana Matoba28, George McMahon29, Antonella Mulas30, Valeria Orrù30, Teemu Palviainen31, Anita Pandit18, Gunnar W Reginsson32, Anne Heidi Skogholt19, Jennifer A Smith17,33, Amy E Taylor29, Constance Turman23,24, Gonneke Willemsen22, Hannah Young1, Kendra A Young34, Gregory J M Zajac18, Wei Zhao33, Wei Zhou35, Gyda Bjornsdottir32, Jason D Boardman4,5,6, Michael Boehnke18, Dorret I Boomsma22, Chu Chen21, Francesco Cucca30, Gareth E Davies36, Charles B Eaton37, Marissa A Ehringer4,38, Tõnu Esko8,27, Edoardo Fiorillo30, Nathan A Gillespie15,20, Daniel F Gudbjartsson32,39, Toomas Haller27, Kathleen Mullan Harris40,41, Andrew C Heath42, John K Hewitt4,43, Ian B Hickie44, John E Hokanson34, Christian J Hopfer4,45, David J Hunter23,24,46, William G Iacono1, Eric O Johnson47, Yoichiro Kamatani28, Sharon L R Kardia33, Matthew C Keller4,43, Manolis Kellis7,8, Charles Kooperberg21, Peter Kraft23,24,48, Kenneth S Krauter4,9, Markku Laakso49,50, Penelope A Lind51, Anu Loukola31, Sharon M Lutz52, Pamela A F Madden42, Nicholas G Martin20, Matt McGue1, Matthew B McQueen4,38, Sarah E Medland51, Andres Metspalu27, Karen L Mohlke53, Jonas B Nielsen54, Yukinori Okada28,55, Ulrike Peters21,56, Tinca J C Polderman25, Danielle Posthuma25,57, Alexander P Reiner21,56, John P Rice58, Eric Rimm24,59, Richard J Rose60, Valgerdur Runarsdottir61, Michael C Stallings4,43, Alena Stančáková49, Hreinn Stefansson32, Khanh K Thai14, Hilary A Tindle62, Thorarinn Tyrfingsson61, Tamara L Wall63, David R Weir17, Constance Weisner14, John B Whitfield20, Bendik Slagsvold Winsvold64, Jie Yin14, Luisa Zuccolo29,65, Laura J Bierut58, Kristian Hveem19,66,67, James J Lee1, Marcus R Munafò65,68, Nancy L Saccone69, Cristen J Willer35,54,70, Marilyn C Cornelis71, Sean P David72, David A Hinds11, Eric Jorgenson14, Jaakko Kaprio31,73, Jerry A Stitzel4,38, Kari Stefansson32,74, Thorgeir E Thorgeirsson32, Gonçalo Abecasis18, Dajiang J Liu75,76, Scott Vrieze77.
Abstract
Tobacco and alcohol use are leading causes of mortality that influence risk for many complex diseases and disorders1. They are heritable2,3 and etiologically related4,5 behaviors that have been resistant to gene discovery efforts6-11. In sample sizes up to 1.2 million individuals, we discovered 566 genetic variants in 406 loci associated with multiple stages of tobacco use (initiation, cessation, and heaviness) as well as alcohol use, with 150 loci evidencing pleiotropic association. Smoking phenotypes were positively genetically correlated with many health conditions, whereas alcohol use was negatively correlated with these conditions, such that increased genetic risk for alcohol use is associated with lower disease risk. We report evidence for the involvement of many systems in tobacco and alcohol use, including genes involved in nicotinic, dopaminergic, and glutamatergic neurotransmission. The results provide a solid starting point to evaluate the effects of these loci in model organisms and more precise substance use measures.Entities:
Mesh:
Year: 2019 PMID: 30643251 PMCID: PMC6358542 DOI: 10.1038/s41588-018-0307-5
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1.Genetic correlations between substance use phenotypes and phenotypes from other large genome-wide association studies.
Genetic correlations between each of the phenotypes are shown in the first 5 rows, with heritability estimates displayed down the diagonal. All genetic correlations and heritability estimates were calculated using LD Score Regression. Blue shading represents negative genetic correlations, and red shading represents positive correlations, with increasing color intensity reflecting increasing strength of a correlation. A single asterisk reflects significant genetic correlations at the p<.05 level. Double asterisks reflect significant genetic correlations at the Bonferroni-correction p<.000278 level (corrected for 180 independent tests). Note that SmkCes was oriented such that higher scores reflected current smoking, and for AgeSmk lower scores reflect earlier ages of initiation, both of which are typically associated with negative outcomes. AgeSmk=Age of Initiation of Smoking; CigDay=Cigarettes per Day; SmkInit=Smoking Initiation; SmkCes=Smoking Cessation; DrnkWk=Drinks per Week.
Figure 2.Pleiotropy.
Depicted here are results from the multivariate analysis of pleiotropy. For each locus, the method returns the best fitting solution of which phenotypes were associated with that locus. All loci with one or more associated phenotypes are shown here. For example, every locus associated with AgeSmk was found to be pleiotropic for other phenotypes (green, blue, red, purple, and fuchsia bars), and no locus showed association with only AgeSmk (no dark grey bar for AgeSmk). When sample sizes are unequal across phenotypes, the method also improves power for those phenotypes with smaller samples. The total number of loci associated with each trait (whether pleiotropic or not) from these analyses was 40 (AgeSmk), 48 (SmkCes), 72 (CigDay), 111 (DrnkWk), and 278 (SmkInit). Full information is in Supplementary Table 11.
Figure 3.Heritability and polygenic prediction.
The light gray bars reflect SNP heritability, estimated with LD Score Regression. The light blue and gold bars reflect the predictive power of polygenic risk scores in Add Health and the Health and Retirement Study (HRS), respectively. Despite the 41-year generational gap between participants from these two studies, and major tobacco-related policy changes during that time, the polygenic scores are similarly predictive in both samples. Error bars are 95% confidence intervals estimated with 1000 bootstrapped repetitions. Dark gray bars represent the total phenotypic variance explained by only genome-wide significant SNPs. H2=heritability.
Figure 4.Correlations among exemplary DEPICT gene sets.
There were 68 clusters available for Smoking initiation and 10 for Drinks Per Week (CigDay, AgeSmk, and SmkCes did not have > 1 exemplary sets.) Blue shading represents positive correlations, and red shading represents negative correlations, with increasing color intensity reflecting increasing strength of a correlation. Cluster names are truncated for space, with a full list of all names in Supplementary Table 18. The number after each name is the number of gene sets in each cluster. The matrix naturally falls into three blue superclusters along the diagonal. The largest supercluster contains primarily gene sets related to neurotransmitter receptors, ion channels (sodium, potassium, calcium), learning/memory, and other aspects of CNS function. The middle supercluster includes gene sets defined by regulation of transcription and translation, including RNA binding and transcription factor activity. The final supercluster is composed primarily of gene sets related to development of the nervous system.
Nonsynonymous sentinel variants.
The sentinel variant in approximately 4% of loci was nonsynonymous. Shown here are all nonsynonymous sentinel variants, and all nonsynonymous variants in near-perfect LD with a sentinel variant. If the listed gene was also associated (through single variant or gene-based test) with another phenotype, that phenotype is listed in parentheses. Several genes have been implicated in previous studies of substance use/addiction, including CHRNA5, BDNF, GCKR, and ADH1B.
| Phenotype | Gene | rsID | Chr | Position | REF | ALT | AF | Beta | N | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| CigDay (SmkCes) | rs16969968[ | 15 | 78,882,925 | G | A | .34 | .075 | 1.2×10−278 | 330,721 | .34 | |
| CigDay | rs7766641 | 6 | 26,184,102 | G | A | .27 | −.014 | 2.9×10−10 | 335,553 | .78 | |
| CigDay (AgeSmk) | rs1024323 | 4 | 3,006,043 | C | T | .38 | −.012 | 8.7×10−9 | 337,334 | .17 | |
| SmkInit | rs462779[ | 6 | 111,695,887 | G | A | .81 | −.019 | 4.5×10−29 | 1,232,091 | .67 | |
| SmkInit (DrnkWk) | rs6265 | 11 | 27,679,916 | C | T | .20 | −.016 | 2.8×10−19 | 1,232,091 | .13 | |
| SmkInit | rs1139897 | 16 | 720,986 | G | A | .23 | −.012 | 1.8×10−15 | 1,232,091 | .61 | |
| SmkInit (DrnkWk) | rs6962772[ | 7 | 99,081,730 | A | G | .15 | −.015 | 2.1×10−14 | 1,232,091 | .92 | |
| SmkInit | rs4818005[ | 21 | 40,574,305 | A | G | .58 | −.010 | 3.9×10−14 | 1,232,091 | .75 | |
| SmkInit | rs6050446 | 20 | 25,195,509 | A | G | .97 | .035 | 8.8×10−13 | 1,225,969 | .33 | |
| SmkInit | rs17857342[ | 11 | 64,138,905 | T | G | .38 | −.010 | 9.8×10−12 | 1,232,091 | .16 | |
| SmkInit | rs147052174 | 1 | 179,783,167 | G | T | .02 | .037 | 2.3×10−10 | 1,232,091 | .59 | |
| SmkInit | rs34553878 | 9 | 134,907,263 | A | G | .11 | .016 | 1.2×10−9 | 1,232,091 | .28 | |
| SmkInit | rs45444697[ | 1 | 155033918 | C | T | .21 | .010 | 5.3×10−9 | 1,232,091 | .46 | |
| SmkInit | rs9481410[ | 6 | 97,677,118 | G | A | .76 | .010 | 1.1×10−8 | 1,232,091 | .04 | |
| SmkInit | rs62618693 | 11 | 32,956,492 | C | T | .04 | −.020 | 2.1×10−8 | 1,232,091 | 1.00 | |
| DrnkWk | rs1229984 | 4 | 100,239,319 | T | C | .96 | .060 | 2.2×10−308 | 941,280 | .05 | |
| DrnkWk | rs1260326 | 2 | 27,730,940 | T | C | .60 | .008 | 8.1×10−45 | 941,280 | .10 | |
| DrnkWk | rs13107325 | 4 | 103,188,709 | C | T | .07 | −.009 | 1.5×10−22 | 941,280 | .33 | |
| DrnkWk | rs28929474 | 14 | 94,844,947 | C | T | .02 | −.012 | 1.3×10−11 | 941,280 | .50 | |
| DrnkWk (SmkInit) | rs11692465 | 2 | 98,275,354 | G | A | .09 | .008 | 2.5×10−11 | 937,516 | .40 | |
| DrnkWk | rs3803800 | 17 | 7,462,969 | A | G | .79 | .004 | 1.5×10−10 | 941,280 | .67 | |
| DrnkWk | rs3748034 | 4 | 3,446,091 | G | T | .14 | −.005 | 1.7×10−8 | 941,280 | .65 |
Note: Phenotype abbreviations are defined in Figure 1. Chr=Chromosome; REF=reference allele; ALT=alternate allele; AF=allele frequency of ALT allele; Q=Cochrane’s Q statistic p-value.
These variants were not themselves sentinel, but were in near-perfect LD with a sentinel variant (R[2] >.99, from the 1000 Genomes European population). The scale of Beta is on the unit of the standard deviation of the phenotype. For binary phenotypes the standard deviation was calculated from the weighted average prevalence across all studies included in the meta-analysis (available in Supplementary Table 7).