Literature DB >> 21818250

Genomic regions identified by overlapping clusters of nominally-positive SNPs from genome-wide studies of alcohol and illegal substance dependence.

Catherine Johnson1, Tomas Drgon, Donna Walther, George R Uhl.   

Abstract

Declaring "replication" from results of genome wide association (GWA) studies is straightforward when major gene effects provide genome-wide significance for association of the same allele of the same SNP in each of multiple independent samples. However, such unambiguous replication is unlikely when phenotypes display polygenic genetic architecture, allelic heterogeneity, locus heterogeneity and when different samples display linkage disequilibria with different fine structures. We seek chromosomal regions that are tagged by clustered SNPs that display nominally-significant association in each of several independent samples. This approach provides one "nontemplate" approach to identifying overall replication of groups of GWA results in the face of difficult genetic architectures. We apply this strategy to 1 M SNP GWA results for dependence on: a) alcohol (including many individuals with dependence on other addictive substances) and b) at least one illegal substance (including many individuals dependent on alcohol). This approach provides high confidence in rejecting the null hypothesis that chance alone accounts for the extent to which clustered, nominally-significant SNPs from samples of the same racial/ethnic background identify the same sets of chromosomal regions. It identifies several genes that are also reported in other independent alcohol-dependence GWA datasets. There is more modest confidence in: a) identification of individual chromosomal regions and genes that are not also identified by data from other independent samples, b) the more modest overlap between results from samples of different racial/ethnic backgrounds and c) the extent to which any gene not identified herein is excluded, since the power of each of these individual samples is modest. Nevertheless, the strong overlap identified among the samples with similar racial/ethnic backgrounds supports contributions to individual differences in vulnerability to addictions that come from newer allelic variants that are common in subsets of current humans.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21818250      PMCID: PMC3144872          DOI: 10.1371/journal.pone.0019210

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Genome wide association (GWA) is a method of choice for identifying genes whose variants influence vulnerability to complex disorders. Declaring “replication” of individual results of genome wide association studies is straightforward when major gene effects provide associations between marker and phenotype that display the same phase and “genome wide” levels of significance (p ca 10−8) in each of several independent samples. However, such “template” replication for individual markers is unlikely to be achieved in many otherwise-reasonable samples for many phenotypes. Phenotypes and samples that display polygenic genetic architecture, allelic heterogeneity, locus heterogeneity and sample-to-sample differences in fine structures of linkage disequilibrium can provide especial difficulties for this “template” approach. These difficulties can be exacerbated when data comes from different genotyping platforms that do not assess allele frequencies for identical sets of SNPs. Much current genome wide association and linkage data suggests that we may have identified many or even most of the loci at which we might expect “template” analyses to identify reproducible genome wide significance in reasonably sized samples (see references below). Much of the risk attributable to genetic influences on common phenotypes appears likely to arise from polygenic influences whose properties are likely to provide many false negative results in searches for replicated “genome wide” significance in multiple independent samples that use “template” criteria for replication. Vulnerability to heavy use and development of dependence on alcohol and/or an illegal abused substance (“addiction vulnerability”) appears to be such a trait. The substantial genetic influences on addiction vulnerability are documented by data from family, adoption and twin studies [1], [2], [3], [4]. Twin studies also document shared heritable influences on vulnerability to dependence on addictive substances from different pharmacological classes, including alcohol and illegal drugs from several pharmacological classes [2], [3], [5]. Combined data from linkage and initial GWA studies [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19] suggest that much of the genetic influence on vulnerability to substance dependence is likely to be polygenic. We have developed a “nontemplate” strategy that identifies overall replication of sets of genome wide association (GWA) results in the face of difficulties with genetic architectures, samples and genotyping methods [9], [14], [20], [21]. Such an approach can complement meta-analyses that seek to combine data from single markers whose significance in single samples does not achieve genome wide significance. We now report application of this nontemplate strategy to identify overall replication of groups of results from GWA studies of samples of individuals with dependence on alcohol and illegal substances vs matched controls [21], (http://www.ncbi.nlm. nih.gov/gap). We separately compare data from independent samples of individuals with European-American genetic backgrounds and samples of individuals with African-American genetic backgrounds. These data come from individual genotyping and multiple-pool genotyping approaches that use 1 M SNP Illumina and Affymetrix platforms, respectively. The results focus attention on chromosomal regions that are identified by clusters of SNPs for which case vs control differences achieve nominal statistical significance in multiple samples from the same racial/ethnic group. We describe the high confidence with which this approach rejects the null hypothesis that clusters of nominally-significant SNPs from different samples of individuals from the same racial/ethnic group identify the same chromosomal regions with frequencies expected by chance. We note the more modest levels of confidence that this approach provides for identification of individual SNPs, individual chromosomal regions, individual genes and for the overlap between data from samples of the two racial/ethnic groups studied, except in genes in which we and other investigators have identified associations in independent samples. We discuss this work in light of its technical and analytic limitations and in its similarities with and differences from “template” GWA analyses and meta-analyses that seek reproducible associations of striking levels of significance at single SNP markers. The current “nontemplate” replication of sets of results may be useful in other settings in which the underlying properties of the disorder and of the samples create difficulties for searches for individual SNPs with replicated genome wide significance.

Materials and Methods

Subjects, genotyping and assignment of nominal significance of dependent vs control allele frequencies in each sample

1) dbGAP samples from the FSCD, COGA and COGEND studies

Genotypes from unrelated subjects who provided written consents and met DSM criteria for alcohol dependence and consenting control subjects with no evidence for dependence on any drug were assembled from three sets of subjects and deposited in dbGAP (http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000092.v1.p1). Family study of cocaine dependence (FSCD) subjects were recruited from treatment centers close to St. Louis. Mo; 55% of contacted subjects participated [22]. Community-based comparison subjects were recruited through driver's license records from the Missouri Family Registry and were matched to alcohol dependent subjects based on date of birth, ethnicity, gender, and zip code. Eighty percent of screened and eligible comparison subjects participated. Other participants came from individuals who participated in the Collaborative Study on the Genetics of Alcoholism (COGA) [23] and the Collaborative Study on the Genetics of Nicotine Dependence [10]. Dependent individuals displayed DSM (Diagnostic and Statistical Manual IV) dependence on alcohol. Controls, defined in dbGap variable phv00022939.v1.p1.c2 “final_type”, displayed no DSM dependence on alcohol, cocaine, marijuana, opioids or other drugs but may have evinced DSM nicotine dependence, FTND scores >4 and/or regular smoking as defined by smoking >100 cigarettes in their lives. We identified 1171 dependent and 1395 control unrelated European-American subjects and 652 dependent and 499 control unrelated African-American subjects for this analysis. Subjects were 45% male; 48% of the alcohol-dependent subjects were also dependent on cocaine. Genotyping for these samples was performed using Illumina 1 M SNP arrays at the Center for Inherited Disease Research (CIDR), with quality controls and principal components analysis (PCA) controls for racial/ethnic background available at the dbGAP website. Genotypes from dependent and control individuals were selected from dbGAP files, excluding SNPs with minor allele frequencies less than 0.01–0.02 (for European and African American samples, respectively) and those with missing call rates >5%. p values for each SNP were based on χ2 tests.

2) NIDA/MNB samples

European-American and African-American research volunteers, largely non treatment seeking, came to the NIDA research facility in Baltimore, Maryland between 1990 and 2007 in response to advertisements and referrals from other research volunteers. Subjects provided written informed consents, self-reported ethnicity data, drug use histories via the Drug Use Survey and DSMIII-R or IV diagnoses (Diagnostic and Statistical Manual) and were reimbursed for their time as previously described [6], [17], [21], [24]. Genotypes were assessed in DNA pools using Affymetrix 6.0 arrays and methods that we have extensively validated, as previously described [6], [7], [8], [9], [21]. Pooling 1) provided us with the maximal ability to protect the genetic confidentiality of subjects who volunteered for study of genetics of illegal behaviors, 2) allowed us to utilize DNAs from individuals who consented to participation in this study during time periods when consents did not explicitly describe studies using high densities of DNA markers, 3) allowed us to use methods that we have developed and validated in this and in previous work and 4) reduced costs. Many of these subjects would thus not have been available for studies that assessed substantial numbers of polymorphisms using individual genotyping. Nominal p values for each SNP were determined based on t tests that compared data from multiple abuser vs control pools that contained DNAs from 680 European-American and 940 African-American individuals who had mean ages of 32.8 and 34.0 and were 69.5 and 58.8% male, respectively, as described [21]. In addition, to provide additional validation for the pooling results for the SNPs that formed the basis of the clusters evaluated herein, we also performed individual genotyping using Affymetrix 6.0 arrays for the 155 African American research volunteers who constituted virtually all of the members of 8 DNA pools and who had consented to unlimited individual genotyping. These individual genotyping results all passed Affymetrix quality control standards and resulted in ≥98% call rates.

3) Identification of chromosomal regions containing clusters of SNPs with nominally-significant case vs control differences in single or multiple samples

We performed analyses based on previously-defined criteria using datasets of approximately 1 million SNPs [21]. We identified chromosomal regions of interest in individual samples by seeking regions in which at least 4 clustered SNPs displayed case vs control differences with nominal, p<0.05 levels of statistical significance. We defined clustering based on separation of each clustered SNP from the nearest nominally-significant SNP by ≤10 kb. We identified similarities between the results obtained from multiple samples by identifying the chromosomal regions that were tagged by such clustered, nominally positive SNPs in each of the samples of individuals from the same racial/ethnic groups. We identified genes for which these chromosomal intervals lay within the exons of the gene and/or in 10 kb of 5′ or 3′ flanking sequence.

4) Monte Carlo methods for assignment of levels of significance to: a) the extent of clustering in each sample and b) the degree to which clustered nominally-positive SNPs from multiple independent samples identify the same chromosomal regions

Monte Carlo methods were used to assign empirical statistical probabilities to two null hypotheses, starting with the sets of all SNPs and the nominally positive SNPs that displayed p<0.05 case vs control values. We first tested the null hypothesis that chromosomal clustering of these nominally positive SNPs occurred at the level expected by chance in these datasets. For each Monte Carlo trial that tested this null hypothesis, we randomly selected a number of “pseudo positive” SNPs from each dataset that matched the number that achieved nominal significance in the bona fide dataset. Thus, we constructed a list of autosomal SNPs assayed in each sample and assigned a number to each SNP that corresponded to its position on the list. To select the pseudopositive SNPs for each trial of the European-American datasets, we selected 75,413 random numbers for the NIDA (see below) and 49,843 random numbers for the dbGAP datasets. For the African American datasets, we used 83,330 and 45,325 random numbers, respectively. For each trial, the SNPs identified by the positions on the list that corresponded to these randomly-assigned numbers were then queried for the extent to which their results equaled or exceeded the results obtained for the actual dataset. In 10,000 such trials for each sample, we compared results concerning the extent of chromosomal clustering from these sets of pseudopositive SNPs to those for the true positive SNPs. These empirical Monte Carlo p values thus addressed the null hypothesis that the true positive SNPs from each single sample were randomly arrayed on the chromosomes. Of course, the clustering of SNPs that provided nominally-significant case vs control differences in each individual sample did not allow us to discern whether the haplotypes identified in such a manner were related to a) phenotypic differences or to b) stochastic differences in haplotype frequencies between case and control samples. Monte Carlo methods were also used to assign empirical statistical probabilities to a second null hypotheses: that the same chromosomal regions were identified by the clustered, nominally positive SNPs in independent samples with the frequencies expected by chance. In 10,000 trials from pairs of independent samples, we compared the extent of overlap between the chromosomal regions identified by the clustered, nominally-positive SNPs in each sample. The Monte Carlo p values that derive from these trials thus addressed the second null hypothesis that the chromosomal regions identified by clusters of nominally positive SNPs in each of multiple samples were identified only on stochastic bases that were unrelated to phenotype. Secondary analysis of dbGAP data used permutation approaches as implemented in PLINK (v1.06) (http://pngu.mgh.harvard.edu/purcell/plink/) [25]. We randomized assignment of the phenotypes to data derived from the current SNPs and analyzed the data from 3,000 permutation trials that addressed each of several null hypotheses (see below). To assess the power of our current approach we used current sample sizes and standard deviations, power calculator PS v2.1.31 [26], [27] and α = 0.05.

Results

As noted elsewhere [21], variation among the allele frequency estimates between pools from individuals of the same phenotype for each racial/ethnic group from the NIDA/MNB samples was +/− 0.02 (standard error of the mean SEM).

European-American samples

For the dbGAP data from European-Americans, χ2 tests displayed p<0.05 for 49,843 autosomal Illumina SNPs. For the NIDA/MNB European-American samples, 75,413 of the autosomal Affymetrix 6.0 SNPs displayed t values with p<0.05 in comparisons between data from substance dependent vs control samples [21].

Searches for genome wide significance in each European-American sample

We identified case vs control p values for t test results from NIDA/MNB samples and for χ2 results from dbGAP samples from unrelated individuals. Permutation testing for the dbGAP European-American samples revealed p<0.0003 (3,000 trials) for the number of SNPs with nominal case vs control p values<0.05. However, virtually none of these p values reached the 10−8 level deemed necessary for genome wide significance.

Searches for clustering of SNPs with nominally-significant case vs control differences in each European-American sample

We identified 3125 clusters of SNPs that displayed nominally significant, p<0.05 case vs control differences for χ2 results from dbGAP samples and 2931 clusters with nominally significant t test results from NIDA/MNB samples.

Searches for chromosomal regions identified by clustered SNPs with nominally-significant case vs control differences in both European-American samples

Two hundred four chromosomal regions contained clusters of nominally-significant SNPs from both of these two European-American samples. None of 10,000 Monte Carlo simulation trials that each began with random sets of SNPs selected from each of the datasets identified as many overlapping regions as found in the true dataset. The overall Monte Carlo p<0.0001 for the overlap noted in the true data thus provides very high levels of confidence that these independently-derived sets of results do not identify the same set of chromosomal regions by chance alone. Thus, the null hypothesis that the chromosomal regions identified by both samples are identified based only on stochastic grounds is falsified by these Monte Carlo data. In addition, none of 3,000 permutation trials provides data that identifies as many chromosomal regions from permutated data as those identified by the real datasets. Thus, the null hypothesis that the chromosomal regions identified by both samples are identified based only on stochastic grounds also nullified by permutation testing data. The genes that: a) lie in chromosomal regions identified by data from both European-American samples and b) display the most nominally-significant SNPs are listed in Table 1; the complete list of chromosomal regions identified in this way is listed in Table S1. The fraction of the genome occupied by these results is 210% of the size expected by chance, based on the fractions of the genome occupied by clustered nominally positive results from each of these two European-American samples (data not shown).
Table 1

Chromosomal regions and genes identified by clusters of SNPs that provide nominally-significant differences between individuals dependent on alcohol (dbGAP alcohol dependent v ctl) or at least one illegal substance (NIDA/MNB drug dependent v ctl) in subjects of European-American heritage.

dbGAP alcohol dependent v ctlNIDA/MNB drug dependent v ctl
ch # SNPs bp:start bp:end pmin SNPpmin # SNPs bp:begin bp:end pmin SNPpmin gene(s)
142000819920013168rs38203179.84E-0342001247720016730rs118109161.09E-02RNF186
175528859255299911rs4834627.99E-0455529755955314269rs121189861.60E-03PCSK9, USP24
185531097355329655rs6838805.67E-0455529755955314269rs121189861.60E-03PCSK9, USP24
166580902965825669rs112086741.36E-0246580628465815607rs17494997.20E-03LEPR
14156627725156642253rs127565703.68E-034156619836156635649rs46611293.97E-03OR10T2
19166920481166942595rs5247054.40E-039166920955166961651rs5773172.28E-04DPT
16166957917166974598rs10525915.06E-039166920955166961651rs5773172.28E-04DPT
112170613171170634300rs22271982.21E-047170608822170631953rs121459691.65E-03DNM3
14177978272177994423rs10524472.61E-035177993055178013737rs17543526.97E-03C1orf76
18199376526199391033rs66941222.04E-044199383345199385625rs75418845.23E-03TMEM9
19229429623229483170rs168540126.47E-034229444384229460142rs45673432.74E-03C1orf131, GNPAT
112243603976243639175rs11738375.29E-037243627708243641337rs9627867.14E-03KIF26B
115244056488244072421rs97282481.84E-044244061674244078022rs7802404.53E-03SMYD3
271992781819951465rs110966266.28E-0361993632019953950rs67093853.04E-03TTC32
2113825094638280657rs1834876.45E-0443826913238283958rs170147052.46E-03C2orf58
24166876339166885605rs44384976.47E-035166871909166879436rs127121571.33E-03SCN9A
216233405444233445376rs26759662.84E-035233429928233444593rs9559442.84E-04NGEF, TNRC15, UNQ830
31271362767170141rs13538285.56E-03571669237178151rs168654406.74E-03GRM7
3271092348611013807rs46847461.94E-0471094317210961153rs175834336.13E-03SLC6A11
341527036815283244rs13189371.44E-0441527444115283244rs124731731.31E-02CAPN7, SH3BP5
3122938252029425986rs130841472.17E-0392940729729432158rs27001653.91E-03RBMS3
393792418037957695rs98227613.39E-0373793504937958835rs67107821.79E-03CTDSPL
356263469862643369rs173562521.31E-0346262516462637893rs115632011.59E-02CADPS
36144082777144094347rs67789663.11E-034144085356144087277rs15132151.22E-02PCOLCE2
4654785925489229rs40177821.70E-02454850615489334rs68090022.83E-03STK32B
4760926456116331rs68507513.02E-031060837256111793rs45743092.61E-03JAKMIP1
442093328920944461rs175201301.52E-0242094338320960603rs133164809.42E-03KCNIP4
472203742722051143rs14630002.15E-0352204179122059150rs23504882.20E-04GPR125
445465908554667957rs22781414.16E-0345466197154671211rs76502513.75E-03GSX2
459014116690161886rs17957229.59E-0449014009790149157rs29036432.72E-03FAM13A1
4149577485695821573rs117240233.68E-0449580747295811400rs100270431.86E-03PDLIM5
47148002578148028761rs13967161.04E-034147999449148013733rs175871446.65E-05TTC29
44178492741178506247rs76890992.76E-025178492904178512644rs117317093.30E-04NEIL3
411185544039185583197rs7245288.75E-0411185570737185604825rs171481902.78E-03IRF2
45187423999187443174rs42418242.14E-035187416129187441931rs105181125.31E-04F11, KLKB1
571673377216752512rs22884331.71E-0291673710916755375rs100199421.17E-02MYO10
576087796760899218rs15508162.10E-0246088396160898674rs76769412.72E-03ZSWIM6
557828710378299426rs9219452.12E-02107825044378293007rs108663071.77E-02ARSB
5189610815796159991rs303333.95E-0349610838796124514rs29355984.75E-03CAST, ERAP1
5189610815796159991rs303333.95E-0359613871096151541rs121870403.59E-03CAST, ERAP1
54107328554107341302rs109009002.59E-026107334222107357187rs77106174.20E-03FBXL17
59156312645156338524rs68833178.94E-035156313948156338362rs168944580.03297TIMD4
58169753936169783643rs131751437.58E-044169755299169768102rs2983871.00E-02KCNIP1, KCNMB1
68125611562125629660rs37997323.45E-037125615011125629660rs10102841.03E-03HDDC2, TPD52L1
65128874118128891370rs173641181.09E-024128869961128879810rs19977811.02E-02PTPRK
610152750003152780038rs2149892.46E-045152756299152775286rs41433347.29E-03SYNE1
65167036150167052518rs47100813.99E-036167007415167038192rs168964072.82E-03RPS6KA2
771148544311520621rs69726152.45E-0361147901111498204rs94490672.42E-03THSD7A
782164727621678449rs64615933.91E-0342165252621661805rs10203207.00E-04DNAH11
752947861129483175rs13623642.94E-0252947260029486168rs5894691.10E-02CHN2
785046893450492914rs9637396.76E-04115049170250517353rs93989131.14E-02DDC, FIGNL1
745062382850640401rs125408749.46E-0495062451550661951rs170600995.62E-03GRB10
755065309650663588rs9807163.29E-0395062451550661951rs170600995.62E-03GRB10
714149948324150001479rs69465795.64E-034149950223149957752rs119727312.46E-03GIMAP6
74154283212154287670rs8787421.06E-024154278320154298576rs102370375.03E-03DPP6
8630307193047485rs10771536.75E-04430355163042040rs69427892.68E-02CSMD1
81034929833504244rs24693901.37E-03534916403497769rs126994724.43E-03CSMD1
8439250033934454rs19710782.50E-02939072903934466rs77895507.17E-03CSMD1
8441731584179668rs18475701.68E-02741663914177720rs78045951.56E-05CSMD1
851042615910433464rs70080871.39E-0241041581510426892rs43853777.64E-03UNQ9391
8101719842717226143rs70035033.03E-03101717741617230166rs125388922.44E-03MTMR7, VPS37A
9441932144216155rs109743901.37E-02441932144196671rs3416768.82E-03GLIS3
96100869363100891728rs15375045.46E-036100853373100876607rs42370436.49E-04COL15A1
95111559709111564648rs20258783.23E-036111545414111562096rs169385889.40E-03PALM2
910124177516124204757rs105134029.32E-035124194135124204757rs23193611.32E-02PTGS1
94128199379128209802rs48365372.19E-024128187181128205955rs69822244.57E-03FAM125B
94129336305129348849rs18917301.49E-024129346310129350356rs168953907.01E-03FAM129B
1081439770514415265rs70822191.14E-0471438566414426301rs169026921.08E-03FRMD4A
1045370144953715401rs11945163.30E-0245369674253702708rs169290923.32E-03PRKG1
1066160701061624482rs123559083.95E-0366158596161607010rs101191778.68E-03ANK3
10129728121997339124rs15364449.74E-0489728671297318201rs78737668.41E-03SORBS1
1011127458448127494379rs112446644.14E-034127471364127485187rs74700863.52E-03UROS
104127630157127646057rs44037251.92E-034127635773127646057rs15291921.13E-03FANK1
1142097134120972871rs107667611.69E-0372095959720974482rs18919834.11E-03NELL1
1162153069721556573rs49228474.62E-0362153807421567545rs171581391.23E-02NELL1
116122127741122146466rs128047118.64E-036122109340122129879rs122613266.31E-03UBASH3B
12418540051867513rs47658552.27E-02418517111860652rs169203341.58E-03CACNA2D4
12462106496216045rs31813013.26E-02561961756218155rs169244154.62E-03CD9
1262513687125157470rs73036691.02E-0242513068925143445rs177039182.11E-03CASC1, LRMP
1249390583393915516rs111078454.83E-0349389957193908819rs38167850.016554NDUFA12
126110129183110133727rs38092912.21E-026110118664110143457rs108294483.37E-03CUX2
1369917206199202794rs11254362.55E-0349917925199187733rs107499020.012953CLYBL
1492208130522111496rs12636631.19E-0282209935222112693rs10784025.22E-04 TRA@, TRAC, DAD1
14123228771432314819rs9103183.38E-0343229573732307125rs6088711.48E-03AKAP6
1465612203556145673rs71413052.45E-0245612913356137765rs2168521.17E-03C14orf101
1497278626872812034rs72027.26E-0377278652772791053rs123209554.52E-03PAPLN
14279397782294053547rs116260912.74E-0559403336094053376rs110512193.19E-04SERPINA12
144102682945102688800rs7192521.09E-025102686871102702856rs19085921.14E-03RPL21P12
1543174194431745419rs47796286.64E-0343174194431750247rs22325622.88E-02RYR3
1597752807477560994rs71699636.33E-0447753868877556997rs38111701.68E-03KIAA1024
1577805027278071496rs18798943.85E-0347804888178059873rs176859913.86E-03BCL2A1
15158750777487562668rs80281231.34E-0348751014287521605rs10916467.37E-04ABHD2
15158750777487562668rs80281231.34E-0348756249287564607rs125790031.91E-03RLBP1
1559155016091553882rs18720529.80E-0469153230891561784rs111079094.23E-03UNQ9370
1549867517898686110rs80296501.91E-0369867709298691878rs107783386.03E-03ADAMTS17
1569937242799380316rs24120043.22E-0349937575099389091rs49643536.79E-03LRRK1
16147919714179228642rs124482906.69E-0447920505179220547rs73181151.17E-03CDYL2
1648178451581789713rs176759332.75E-03178178335381829795rs95644362.34E-04CDH13
1668253506182542626rs22452226.97E-0348252393782537492rs95996464.77E-03OSGIN1
1648300859283017058rs2478052.20E-0248299273083011363rs73294342.20E-02ATP2C2
17122879768128835589rs9525401.74E-0442879768128806815rs175028181.40E-03ACCN1
17122879768128835589rs9525401.74E-0462882782828847221rs169595731.66E-03ACCN1
1745075214950762786rs124535448.62E-0355076065050785227rs3544451.95E-03HLF
1845387121853884508rs49413044.08E-0345387121853879450rs15316343.69E-03NEDD4L
1845528195955298778rs129612643.31E-0355528709155289937rs22938397.33E-04CCBE1
1845531723355327896rs72432441.90E-0255530197255322776rs171116879.60E-05CCBE1
1945681011056822600rs48028316.81E-0355680049456823545rs14028619.21E-03SIGLEC5
2071573813615756549rs61355622.77E-0351575644415767911rs116360911.04E-03MACROD2
2091584603015864192rs60343284.87E-04171582369115880560rs74964926.47E-03MACROD2
2043642251036431069rs126248433.51E-0343641105036428083rs8939092.23E-02LBP
20104205903242099777rs60313011.55E-0344208634742093700rs47787218.16E-04TOX2
2065114748251159764rs169975255.30E-0465115325851173380rs18658147.16E-05TSHZ2
2143911608639121078rs112541.72E-03233911608639156867rs99392139.26E-04ETS2
2154005179840064802rs81288506.09E-03124005719640094216rs15286015.25E-03IGSF5
21104050359240529634rs4479408.54E-03254048511040547818rs124485296.06E-03DSCAM
2164056735440589103rs28375452.06E-0264057299340603541rs123249551.03E-03DSCAM
2144680708046811595rs28393274.78E-0354678856346811098rs18627516.07E-04DIP2A
22152183247821835952rs57596212.41E-0362182483321834959rs20459254.17E-04RAB36
2244349424343510219rs57659301.85E-0254349416143512004rs133335802.50E-02PRR5

Columns list: chromosome, number of nominally-positive SNPs in dbGAP samples, beginning and end of the chromosomal region identified by clustered nominally-significant associations in dbGAP samples, SNP with the minimal p value found in the region, the p value for the SNP with the minimal p value in this region, similar data for the NIDA/MNB samples, and the gene(s) identified by these clustered SNPs.

Columns list: chromosome, number of nominally-positive SNPs in dbGAP samples, beginning and end of the chromosomal region identified by clustered nominally-significant associations in dbGAP samples, SNP with the minimal p value found in the region, the p value for the SNP with the minimal p value in this region, similar data for the NIDA/MNB samples, and the gene(s) identified by these clustered SNPs.

African American samples

45,325 SNPs displayed nominally-significant case vs control differences for dbGAP samples from African American individuals. For the NIDA/MNB African-American samples, 83,330 SNPs displayed “nominally significant” t values with p<0.05 from this racial/ethnic group. Permutation testing for the dbGAP African-Americans revealed p = 0.69 for the number of SNPs with nominal case vs control p values<0.05 (500 trials).

Searches for genome wide significance in each African-American sample

We identified case vs control p values for χ2 results from dbGAP samples and for t test results from NIDA/MNB pooled samples [21]. None of these p values approached the 10−8 level deemed necessary for genome wide significance.

Searches for clustering of SNPs with nominally-significant case vs control differences in each African-American sample

We identified clusters of SNPs that displayed nominally significant, p<0.05 case vs control differences for p values from χ2 results from dbGAP samples and t test results from NIDA/MNB samples (2026 and 3383 clusters, respectively).

Searches for chromosomal regions identified by clustered SNPs with nominally-significant case vs control differences in both African-American samples

One hundred twenty nine chromosomal regions were identified by clustered nominally-positive results from both of the two African-American samples. None of 10,000 Monte Carlo simulation trials that each began with random sets of SNPs selected from each of the datasets identified as many overlapping regions as found in the true dataset; hence Monte Carlo p<0.0001. Thus, the null hypothesis that the chromosomal regions identified by both African American samples are found based only on stochastic grounds is nullified by these Monte Carlo data. However, 199 of 200 permutation trials did provide data that identifies as many chromosomal regions from permutated data as those identified by the real datasets. Thus, the null hypothesis that the chromosomal regions identified by both samples are identified based only on stochastic grounds was not nullified by permutation testing, in ways that suggest that structure in the data may have contributed to the known propensity for permutation testing to overestimate false discovery rates in the presence of such structure [28], [29]. The genes that: a) lie in chromosomal regions identified by data from both African-American samples and b) display the most nominally-significant SNPs are listed in Table 2; the complete list of chromosomal regions identified in this way is listed in Table S2. The fraction of the genome occupied by these results is about 220% of that expected by chance, based on the fraction of the genome occupied by clustered nominally positive results from each of the African American samples (data not shown).
Table 2

Chromosomal regions and genes identified by clusters of SNPs that provide nominally-significant differences between individuals dependent on alcohol (dbGAP alcohol dependent v ctl) or at least one illegal substance (NIDA/MNB drug dependent v ctl) in subjects of African-American heritage.

dbGAP alcohol dependent v ctlNIDA/MNB drug dependent v ctl
ch # SNPs bp:begin bp:end pmin SNPpmin # SNPs bp:begin bp:end pmin SNPpmin gene(s)
192808423728106641rs66794321.94E-0242808580028094024rs172572524.41E-03C1orf38, RPA2
15160357908160384670rs13370721.36E-024160382664160395446rs121241051.52E-02NOS1AP
15182071303182087392rs104945701.11E-027182079430182107726rs118064975.41E-04RGL1
14212649607212665577rs107796141.19E-024212650752212655224rs170228662.21E-03PTPN14
24166129660166148793rs108037991.16E-044166123345166130256rs168509143.41E-03FAM130A2
24240548770240568964rs118937106.42E-034240542948240556386rs134246121.19E-03NDUFA10
381446985914483315rs111286996.25E-0351447045914486703rs172371321.77E-03SLC6A6
341638074416383669rs98359115.03E-0341637756316389202rs6899531.43E-02RFTN1
372172643721747061rs9575891.32E-0382171702921737614rs130776241.02E-02ZNF385D
374145156641467634rs14956924.32E-0574144747541465298rs120540142.48E-03ULK4
356263560662643369rs9788798.10E-0366261538362637642rs15120151.31E-03CADPS
34144385643144403567rs67766343.98E-0211144384755144408513rs67861296.79E-04PBX2P1
499369974093730844rs76828425.98E-0499368432093699740rs173196728.72E-06GRID2
499369974093730844rs76828425.98E-0449371969293727390rs170196083.45E-03GRID2
5575740837602448rs100355411.85E-02775912377615180rs13924812.56E-03ADCY2
554160847941624460rs6696848.81E-03104157503241617270rs6208764.99E-03TCP1L2
54148176930148194422rs126527575.08E-034148169604148179324rs21167142.06E-03ADRB2
54167355762167368572rs170695781.99E-035167361210167387579rs170696362.93E-04ODZ2
641222533012232841rs22282131.71E-0241222554112239453rs23275142.36E-03HIVEP1
643123438831247483rs11087469.83E-0443123097631243685rs95010632.86E-03CCHCR1, POU5F1, TCF19
6103227841132307330rs20712871.22E-0263228809832302370rs22694184.43E-04NOTCH4
674601616946032945rs47148925.78E-04104598704246031767rs93672289.12E-03CLIC5
66129830351129839777rs65696031.91E-046129838313129856864rs170574642.75E-03LAMA2
65147933198147957929rs77435386.08E-035147940910147969453rs94978163.43E-03SAMD5
66148834310148848906rs11241633.87E-035148822606148840150rs69276623.53E-03SASH1
67168744759168766408rs121975844.58E-034168765804168776041rs94562596.38E-03SMOC2
743774962537757420rs27091145.41E-0363775433237786876rs27091143.52E-03GPR141
710154494400154529011rs16190152.17E-034154510872154528905rs17301861.82E-02HTR5A
8514664411477325rs177486773.40E-03414683721478011rs176815303.86E-03DLGAP2
8635430653557725rs173266702.39E-03635373443551589rs170670799.79E-06CSMD1
8441941594209292rs101049109.77E-03442007014215192rs39909095.21E-03CSMD1
861726678117290933rs126763888.09E-0341728115617285461rs74600822.31E-02MTMR7
857234745272360941rs119915621.34E-0287235159272382875rs69898678.03E-04EYA1
84102824050102836428rs64687922.12E-027102818744102835352rs11253342.81E-03NCALD
86139357180139371129rs15124065.04E-047139358561139377143rs15124077.92E-05FAM135B
85141035897141054146rs69811653.17E-035141032458141044168rs8813782.54E-03NIBP
9471554277162630rs109760821.43E-02471409977157510rs9135817.73E-04JMJD2C
9494082779423131rs43426638.69E-031393837449435270rs108161247.94E-04PTPRD, RN7SLP2
97118770530118810301rs70420363.43E-046118810026118836028rs20502743.15E-03ASTN2
1046626083166274062rs12272441.90E-0276624556166275840rs107618661.42E-03ANXA2P3
10167449931474567822rs125735125.00E-0397454051374570789rs64806711.23E-03ECD, NUDT13
1049056403190575327rs118179781.33E-0249057012490575118rs49344232.08E-02ANKRD22, LIPM
105115300542115311814rs49188421.77E-026115301876115327383rs70939626.99E-03HABP2
106135216408135227438rs81814251.50E-024135209148135223425rs96299771.94E-03FLJ44653, SYCE1
1151225335712271091rs71062051.43E-0351224131712255138rs110222709.33E-03MICALCL
1142062097920632993rs16177692.17E-0372058673920621980rs22988261.57E-04SLC6A5
1148772700587740895rs47533591.68E-0348771988187730124rs6181439.42E-03CTSC
1168809168288116810rs19938429.39E-0358809502688109123rs28922934.54E-03GRM5
1149225907892271738rs96667891.52E-0259225980892275974rs124210522.14E-03FAT3
114122111119122121536rs49358042.30E-034122121115122128473rs15401131.29E-03UBASH3B
1256522134765236556rs107480538.18E-0446520887365229384rs79713704.95E-03GRIP1
1247101030171018781rs105066531.09E-02107098958171025118rs177831316.08E-04TRHDE
13179468317394714801rs42584817.94E-03139468485594734859rs95902131.54E-04ABCC4
135102501056102512255rs1573821.01E-0212102491230102519109rs15498362.17E-03SLC10A2
134108262545108277505rs95210651.24E-034108262052108276593rs3907907.41E-03MYO16
1468503965985062024rs18840091.20E-0348504350885064044rs177097141.19E-03FLRT2
1498507472485098018rs19554182.29E-0348508608985096268rs9856201.98E-04FLRT2
1542476718124769395rs285510168.86E-0442476122124768832rs48875291.07E-03GABRA5
1648146987581483050rs169588261.42E-03148146972481505974rs93195785.41E-05CDH13
1951181556211851258rs2862462.12E-0341181663211836425rs14663088.54E-03VN2R13P, VN2R14P, ZNF439, ZNF440
20467022466720263rs2357045.80E-03467140196722420rs130445795.60E-03BMP2
2061998376720012701rs60465938.04E-0351999708820023996rs98085945.69E-04C20orf26
20104890776848942090rs10626519.85E-0444891216748930175rs60961385.45E-03BCAS4
2055425203754262859rs60990571.10E-0375423312354261400rs61235684.10E-03MC3R
2172642220426447377rs124827534.85E-03122641936826465912rs99847641.61E-03APP
2164043572840454200rs28374681.36E-0264044650140459247rs119117493.25E-04DSCAM
2261569510215706432rs1656111.02E-0451568988115706432rs20751201.62E-04CECR8
2242918567429194610rs57531586.46E-03102916934629215980rs57531522.18E-04SEC14L3
22113572749635747143rs60005291.11E-0253573324835738081rs1305987.85E-03C22orf33, MPST, TST

Columns list: chromosome, number of nominally-positive SNPs in dbGAP samples, beginning and end of the chromosomal region identified by clustered nominally-significant associations in dbGAP samples, SNP with the minimal p value found in the region, the p value for the SNP with the minimal p value in this region, similar data for the NIDA/MNB samples, and the gene(s) identified by these clustered SNPs.

Columns list: chromosome, number of nominally-positive SNPs in dbGAP samples, beginning and end of the chromosomal region identified by clustered nominally-significant associations in dbGAP samples, SNP with the minimal p value found in the region, the p value for the SNP with the minimal p value in this region, similar data for the NIDA/MNB samples, and the gene(s) identified by these clustered SNPs.

Searches for genes identified by clustered SNPs with nominally-significant case vs control differences in all four samples

The clusters from both of the two African-American samples identified six genes that were also identified by clusters from both of the two European-American samples. CDH13, CSMD1 and DSCAM are three cell adhesion molecules that we have identified in many prior studies of addiction vulnerability and/or abilities to quit smoking (see below), while CADPS, MTMR7 and UBASH3B have been identified in fewer prior studies. This modest overlap contrasts with the larger overall overlap between the Affymetrix datasets for the African-American vs European American NIDA/MNB samples [21] and the Illumina datasets for the African-American vs European American dbGAP samples. In the latter case, we can identify 146 chromosomal regions, 88 of which contain 126 genes, in which overlapping results between the two racial/ethnic groups are found in ways not found by chance in 10,000 Monte Carlo simulation trials (data not shown).

Validation of pooling vs individual genotyping for SNPs whose results provided the clusters

We compared individual vs pooled allele frequency estimations for the ca. 500 SNPs that displayed minor allele frequencies >0.1 and provided clustered, nominally positive results in data from the NIDA pooled samples. The results from these SNPs displayed mean 0.66 Pearson correlation coefficients between data from pooled and individual genotyping. These correlations were more modest than those identified in validating studies for pooling that used larger ranges of expected allele frequencies. Thus, there was an average 0.19 range of expected values for these genotypes vs 0.9 range for the SNPs and pools used in initial studies that validated pooling with these Affymetrix 6.0 arrays) [21].

Discussion

Genome-wide association data of increasing richness is available for many complex disorders. Several of these GWA datasets contain relatively robust results at “oligogenic” loci that can also be identified, in many cases, by linkage-based approaches [30], [31], [32], [33]. Even moderately secure GWA identification of “polygenic” influences on disease, however, is likely to require replicated data from multiple independent samples. “Template” analyses seek SNPs that provide “genome wide significance” with the same phase of association in data from multiple independent samples. However, there have been no unanimous criteria for declaring replication of sets of data in circumstances in which no SNP achieves this level of statistical significance in each of multiple samples. We have focused on identification of statistical significance for sets of chromosomal regions that are each identified by sets of nominally-significant SNPs from several independent samples. This approach identifies chromosomal regions and genes that are very likely, as a group, to display bona fide association with individual differences in vulnerability to develop dependence on an addictive substance. This overall confidence derives from approaches that address distinct sets of null and/or alternative hypotheses to explain the results obtained. First, seeking chromosomal regions in each sample that are identified by at least 4 closely-spaced nominally-positive SNPs addresses the null hypothesis that the results obtained are randomly distributed across chromosomes. This initial process also addresses the alternative hypothesis that the nominally-positive SNPs are identified based on technical problems in correctly assigning allele frequency differences to case vs control sample comparisons (or in correctly identifying the true variances for these values). Of course, we would expect to see clustering of nominally-positive SNPs in each sample in regions in which there was either a) linkage disequilibrium between the SNPs studied and between these SNPs and functional variants that influenced addiction vulnerability or b) linkage disequilibrium between these SNPs and stochastic differences in haplotype frequencies in individual samples of cases vs those in a single sample of controls that are unrelated to the phenotype. The second way in which we seek replication identifies, in independent samples, many of the same chromosomal regions based on their content of clustered, nominally positive SNPs. This comparison addresses the null hypothesis that the clustering observed in each sample derives from stochastic case vs control differences in haplotype frequencies rather than case vs control differences that are truly related to differences in phenotypes. This comparison also provides additional support for our ability to reject the null and alternative hypotheses relating to assay noise. We thus identify more chromosomal regions and genes based on the overlap between the chromosomal regions identified by data from each sample than we would expect if the only reason for clustering of nominally positive SNPs in each sample was stochastic variation in the frequencies with which blocks of restricted haplotype diversity are found in cases vs controls that are unrelated to the phenotype. Availability of data from other recently-reported genome wide association studies also provides a third way in which we seek replication, based on identification by the current data, of more of the same genes that were identified in other reports from independent samples and different analyses than we would expect by chance. This comparison also provides additional means for us to refute then null hypothesis that the clustering observed in each sample derives from stochastic case vs control differences in haplotype frequencies rather than case vs control differences that are truly related to differences in phenotypes. In replicated samples that compared 500 k allele frequencies in alcohol dependent to population control samples, Treutlein and colleagues [15] have used a mixed analytic strategy to identify nine genes. Products of two of these genes, ADH1C and PECR, are likely to play direct roles in alcohol metabolism and thus provide weak candidates for overlap with data from the NIDA/MNB samples. Our current results identify three of the remaining seven genes: CDH13, ERAP and CAST. Based on chance, we should have identified fewer than one of these genes (0.07 genes on average). We have also recently begun analyses of a 500,000 SNP dataset supplied by these authors. We have identified chromosomal regions tagged by clusters of at least 3 SNPs which lie within 25 kb of each other that display nominally-significant case vs control differences in this sample, criteria that we have previously used for 500 k datasets. These analyses identify 30 of the genomic regions and 18 of the genes identified by both dbGAP and NIDA/MNB European American samples, providing more than 19 times the amount of overlap expected by chance (Uhl GR, Johnson C, Treutlein J, Cichon S, Ridinger M, Wodarz N, Soyka M, Zill P, Maier W, Moessner R, Gaebel W, Dahmen N, Fehr C, Scherbaum N, Steffens M, Ludwig KU, Frank J, Wichmann HE, Schreiber S, Dragano N, Sommer WH, Leonardi-Essmann F, Lourdusamy A, Gebicke-Haerter P, Wienker TF, Sullivan PF, Nöthen MM, Kiefer F, Spanagel R, Mann K, Rietschel M, unpublished observations, 2011). There are a number of important limitations that come from these samples, these analyses, and from the application of this approach to these datasets. The two distinct null hypotheses both require careful thinking about linkage disequilibrium, since it is easy to confuse data and analyses that bear on linkage disequilibrium among markers that display case control differences in single samples, the chromosomal regions that such markers label, and the chromosomal regions labeled by such sets of markers in multiple independent samples. Especial difficulties in clarity may arise since we anticipate true positive results that combine differences between cases and controls that are based on linkage disequilibrium among markers that display case vs control association with disease and between these markers and the functional allelic variants that provide the variation in gene function that influences phenotype. Without dissecting these differences, it is easy to come to the incorrect conclusion that the method described herein is only detecting the linkage disequilibrium structure and not disease association. There are other limitations. The NIDA/MNB samples, largely of individuals who were not seeking treatment, were recruited at a single site and compare dependent individuals with heavy levels of substance use to controls with modest or no substance use. These features might provide differences from the dbGAP samples which were recruited at a number of sites from largely treatment-seeking individuals or probands. The dbGAP samples compare alcohol dependent individuals to controls whose levels of illegal substance use do not produce dependence, but might be substantial. To parallel the recently-reported analysis of this data by Beirut and colleagues [16], we have included, in the control group, individuals who smoked significant numbers of cigarettes and/or display DSM or FTND dependence on nicotine. Reanalyses of the data from the dbGAP European American sample after excluding the 376 individuals with FTDN scores >4 and/or DSM nicotine dependence yields overlap with data from NIDA samples that is even stronger than that identified in the main analyses presented here, even though more than ¼ of the “controls” are removed from these analyses (Johnson et al, unpublished observations, 2010). Due to the small number of individuals with Asian or Hispanic racial/ethnic backgrounds in this sample, we have excluded them from the present analyses. This exclusion also renders our comparisons different from those used in the recent report of data from many of these same dbGAP individuals [16]. While Monte Carlo simulation tests weigh strongly against the null hypothesis that chance alone accounts for the degree to which the same genes are identified by data from each of the two samples from individuals of the same racial/ethnic background, permutation tests only reach high levels of statistical significance in rejecting this null hypothesis in the European American subjects. Principal components analyses suggest that much of the variance in this data is not due to phenotype or racial/ethnic group (Johnson et al, unpublished observations, 2010); such structure might account for the permutation results from the African American data [28], [29]. Based on statistical considerations, the present analyses are likely to provide many false negative results. The power of each of these samples to detect polygenic influences is moderate. The requirement for convergent identification of the same chromosomal region by data from both samples of the same racial/ethnic background provides a likelihood of even more false negative results. Case vs control allele frequency differences in the NIDA/MNB samples were genotyped using multiple DNA pools and an Affymetrix 6.0 platform, providing t tests that use information about both mean differences and variances. Case vs control differences in the dbGAP samples were assessed using Illumina platform genotyping of individual samples, yielding χ2 results without explicit assessment of variance. The requirement that at least 4 nominally-significant SNPs lie within 10 kb of each other cannot be fulfilled in a number of chromosomal regions or in a number of genes in which the density of SNPs is too low to meet this stringent requirement (see Supplement of [14] for list of the genes that cannot be assessed with these criteria using the Affymetrix platform). There are only about ¼ million autosomal SNPs that are shared between the ca. 900 K and 1 M autosomal SNPs evaluated by the Affymetrix and Illumina platforms, respectively, further exacerbating this problem in many genomic regions. Despite these limitations, there is highly-significant overall convergence between two comparisons of NIDA/MNB and dbGAP GWA data from substance-dependent individuals vs controls: one comparison in European-American subjects and another comparison in African-American subjects. For each of these comparisons, the degree to which clusters of nominally-positive SNPs identify the same chromosomal regions and genes is never found by chance in up to 10,000 Monte Carlo simulation trials. This evidence for replication, defined in this fashion, also provides striking contrasts to results from attempts to identify replication (and/or generalization) in other ways. For example, results that seek to identify the extent to which the same SNPs display nominally-significant associations with the same phase in each of these replicate samples within each racial/ethnic group identify about as many SNPs with these properties as expected by chance (data not shown). We have previously reported the apparent success of “nontemplate” analyses that are similar to those used herein when applied to data from four independent case vs control samples for bipolar disorder [20]. None of these bipolar vs control samples, individually, provided results with genome wide significance. These samples combined data from individual and pooled genotyping using different genotyping platforms. Despite these difficulties, the results of nontemplate analyses provided much more frequent identification of the same genomic regions and genes by clustered, nominally positive SNPs from multiple independent samples in bipolar disorder than we would anticipate by chance. Studies that focus on identifying “template” same-phase association with genome wide levels of significance in multiple independent samples appear most likely to succeed when oligogenic genetic architecture confers large association signals in each independent sample, when the same SNP sets are studied in each, when the disease exhibits little allelic or locus heterogeneity and when there are good matches between the fine patterns of linkage disequilibrium of the samples being studied. Apparent replication “failures” using this approach could thus relate to a number of features that include associations of modest magnitude, sample-to-sample differences in fine patterns of linkage disequilibrium, different amounts of information provided by markers with population-specific differences in allele frequencies, allelic heterogeneity and locus heterogeneity. Monte Carlo methods allow us to test the probabilities of chance clustering of nominally-positive SNPs and the chance of convergence between clusters identified in one sample with clusters identified in other samples. Our Monte Carlo approaches deploy an empirical method that uses the existing dataset as a source for randomly selected SNPs for each Monte Carlo trial. The results of these simulations provide strong overall confidence that these sets of results are not due to chance. By contrast, these approaches alone provide unequivocal identification of few individual SNPs or genes. This lack of unequivocal identification of individual SNPs is consistent with polygenic/allelic heterogeneity current working models for the genetic architecture of vulnerability to substance abuse [14], [34]. However, identification of associations at some loci, such as the CDH13 locus, in many independent samples (see below) makes it very highly unlikely that this locus does not harbor allelic variants that influence interactions between humans and addictive substances. Previous analyses that have compared the MNB/NIDA European-American to African-American results have identified genomic regions that are labeled by clustered, nominally-positive SNPs from both samples, supporting roles for some allelic variants that are likely to be old in relation to human history [6], [7], [9], [21]. Data from analyses that combine results from individuals with different racial/ethnic backgrounds also provide suggestive results in regions such as the GABA receptor gene cluster on chromosome 4 for evolutionarily-old variants [16], [35]. Identification in both studies of SNP markers whose allelic frequencies distinguish controls from addicts of different ethnicities supports “common disease/common allele” genetic architecture [36] for part of the genetics of addiction vulnerability. However, the substantially greater convergence, noted here, for data from the same racial/ethnic groups also points to possibly-substantial roles for variants that have been accumulated more recently in human populations that have been more separate until relatively recently. Genes identified by this work include those in several classes. When we compare the list of genes identified by these samples to functional classes as annotated in Gene ontology (GO) using Biobase, we find the greatest (9.2×10−9–1.2×10−6) statistical significance for overrepresentation of the genes whose products are involved with the following biological processes: signal transmission (57 observed/28 expected by chance), signaling process (57/28), cell communication (39/16), regulation of cellular process (90/57), regulation of localization (23/7), signaling (68/39), negative regulation of biological process (42/19), regulation of biological process (94/63), biological regulation (100/70) and synaptic transmission (16/4). CDH13 associations with addiction phenotypes have now been identified in both the four samples studied here and in a number of prior reports. We initially identified associations between substance dependence vulnerability and CDH13 variants in smaller subsets of COGA and MNB samples in studies that utilized earlier microarray types [9], [37]. We and others have subsequently identified such associations in several other samples for addiction-related phenotypes that include: a) vulnerability to substance dependence [12], independent replicated alcohol dependence datasets [15], [19], [38], b) individual differences in acute responses to alcohol administration [39] and c) abilities to quit smoking [24], [40], [41]. Allelic variants in CDH13, a glycophosphoinositol-anchored cadherin that is expressed in neurons that lie in interesting brain circuits, are thus very strong candidates to contribute to addiction-related phenotypes. The findings presented in the current report thus add the strong evidence for involvement of variants in several individual genes, add to the ongoing consideration of methods for comparing GWA datasets and enhance understanding of genetic underpinnings of human addiction. For addictions, as for many complex disorders, such data provides an increasingly rich basis for improved understanding and for personalized prevention and treatment strategies. The complete list of chromosomal regions that a) are identified by data from both European-American samples and b) display the most nominally-significant SNPs (genes listed in Table 1). (XLS) Click here for additional data file. The complete list of chromosomal regions that a) are identified by data from both African-American samples and b) display the most nominally-significant SNPs (genes listed in Table 2). (XLS) Click here for additional data file.
  39 in total

1.  A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data.

Authors:  Yang Xie; Wei Pan; Arkady B Khodursky
Journal:  Bioinformatics       Date:  2005-09-27       Impact factor: 6.937

2.  Pooled association genome scanning: validation and use to identify addiction vulnerability loci in two samples.

Authors:  Qing-Rong Liu; Tomas Drgon; Donna Walther; Catherine Johnson; Oxanna Poleskaya; Judith Hess; George R Uhl
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-09       Impact factor: 11.205

3.  Linkage disequilibrium, haplotype and association studies of a chromosome 4 GABA receptor gene cluster: candidate gene variants for addictions.

Authors:  Tomas Drgon; Claudio D'Addario; George R Uhl
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2006-12-05       Impact factor: 3.568

4.  Novel genes identified in a high-density genome wide association study for nicotine dependence.

Authors:  Laura Jean Bierut; Pamela A F Madden; Naomi Breslau; Eric O Johnson; Dorothy Hatsukami; Ovide F Pomerleau; Gary E Swan; Joni Rutter; Sarah Bertelsen; Louis Fox; Douglas Fugman; Alison M Goate; Anthony L Hinrichs; Karel Konvicka; Nicholas G Martin; Grant W Montgomery; Nancy L Saccone; Scott F Saccone; Jen C Wang; Gary A Chase; John P Rice; Dennis G Ballinger
Journal:  Hum Mol Genet       Date:  2006-12-07       Impact factor: 6.150

5.  Addiction molecular genetics: 639,401 SNP whole genome association identifies many "cell adhesion" genes.

Authors:  Qing-Rong Liu; Tomas Drgon; Catherine Johnson; Donna Walther; Judith Hess; George R Uhl
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2006-12-05       Impact factor: 3.568

6.  Pooled association genome scanning for alcohol dependence using 104,268 SNPs: validation and use to identify alcoholism vulnerability loci in unrelated individuals from the collaborative study on the genetics of alcoholism.

Authors:  Catherine Johnson; Tomas Drgon; Qing-Rong Liu; Donna Walther; Howard Edenberg; John Rice; Tatiana Foroud; George R Uhl
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2006-12-05       Impact factor: 3.568

7.  Complement factor H variant increases the risk of age-related macular degeneration.

Authors:  Jonathan L Haines; Michael A Hauser; Silke Schmidt; William K Scott; Lana M Olson; Paul Gallins; Kylee L Spencer; Shu Ying Kwan; Maher Noureddine; John R Gilbert; Nathalie Schnetz-Boutaud; Anita Agarwal; Eric A Postel; Margaret A Pericak-Vance
Journal:  Science       Date:  2005-03-10       Impact factor: 47.728

8.  A family study of alcohol dependence: coaggregation of multiple disorders in relatives of alcohol-dependent probands.

Authors:  John I Nurnberger; Ryan Wiegand; Kathleen Bucholz; Sean O'Connor; Eric T Meyer; Theodore Reich; John Rice; Marc Schuckit; Lucy King; Theodore Petti; Laura Bierut; Anthony L Hinrichs; Samuel Kuperman; Victor Hesselbrock; Bernice Porjesz
Journal:  Arch Gen Psychiatry       Date:  2004-12

Review 9.  Genetic heterogeneity of Alzheimer's disease: complexity and advances.

Authors:  Jean-Charles Lambert; Philippe Amouyel
Journal:  Psychoneuroendocrinology       Date:  2007-07-30       Impact factor: 4.905

10.  Molecular genetics of nicotine dependence and abstinence: whole genome association using 520,000 SNPs.

Authors:  George R Uhl; Qing-Rong Liu; Tomas Drgon; Catherine Johnson; Donna Walther; Jed E Rose
Journal:  BMC Genet       Date:  2007-04-03       Impact factor: 2.797

View more
  25 in total

1.  Shared additive genetic variation for alcohol dependence among subjects of African and European ancestry.

Authors:  Leslie A Brick; Matthew C Keller; Valerie S Knopik; John E McGeary; Rohan H C Palmer
Journal:  Addict Biol       Date:  2017-11-27       Impact factor: 4.280

Review 2.  Polymorphisms in sex steroid receptors: From gene sequence to behavior.

Authors:  Donna L Maney
Journal:  Front Neuroendocrinol       Date:  2017-07-10       Impact factor: 8.606

3.  Mouse Model for Protein Tyrosine Phosphatase D (PTPRD) Associations with Restless Leg Syndrome or Willis-Ekbom Disease and Addiction: Reduced Expression Alters Locomotion, Sleep Behaviors and Cocaine-Conditioned Place Preference.

Authors:  Jana Drgonova; Donna Walther; Katherine J Wang; G Luke Hartstein; Bryson Lochte; Juan Troncoso; Noriko Uetani; Yoichiro Iwakura; George R Uhl
Journal:  Mol Med       Date:  2015-07-14       Impact factor: 6.354

4.  Gene expression within the extended amygdala of 5 pairs of rat lines selectively bred for high or low ethanol consumption.

Authors:  William J McBride; Mark W Kimpel; Jeanette N McClintick; Zheng-Ming Ding; Petri Hyytia; Giancarlo Colombo; Tiebing Liang; Howard J Edenberg; Lawrence Lumeng; Richard L Bell
Journal:  Alcohol       Date:  2013-10-01       Impact factor: 2.405

5.  Genome-wide association studies of alcohol dependence, DSM-IV criterion count and individual criteria.

Authors:  Dongbing Lai; Leah Wetherill; Sarah Bertelsen; Caitlin E Carey; Chella Kamarajan; Manav Kapoor; Jacquelyn L Meyers; Andrey P Anokhin; David A Bennett; Kathleen K Bucholz; Katharine K Chang; Philip L De Jager; Danielle M Dick; Victor Hesselbrock; John Kramer; Samuel Kuperman; John I Nurnberger; Towfique Raj; Marc Schuckit; Denise M Scott; Robert E Taylor; Jay Tischfield; Ahmad R Hariri; Howard J Edenberg; Arpana Agrawal; Ryan Bogdan; Bernice Porjesz; Alison M Goate; Tatiana Foroud
Journal:  Genes Brain Behav       Date:  2019-06-04       Impact factor: 3.449

6.  Cdh13 and AdipoQ gene knockout alter instrumental and Pavlovian drug conditioning.

Authors:  C P King; L Militello; A Hart; C L St Pierre; E Leung; C L Versaggi; N Roberson; J Catlin; A A Palmer; J B Richards; P J Meyer
Journal:  Genes Brain Behav       Date:  2017-05-02       Impact factor: 3.449

7.  Cadherin 13: human cis-regulation and selectively-altered addiction phenotypes and cerebral cortical dopamine in knockout mice.

Authors:  Jana Drgonova; Donna Walther; G Luke Hartstein; Mohammad O Bukhari; Michael H Baumann; Jonathan Katz; Frank Scott Hall; Elizabeth R Arnold; Shaun Flax; Anthony Riley; Olga Rivero-Martin; Klaus-Peter Lesch; Juan Troncoso; Barbara Ranscht; George R Uhl
Journal:  Mol Med       Date:  2016-08-18       Impact factor: 6.354

8.  Genome-wide significant association signals in IPO11-HTR1A region specific for alcohol and nicotine codependence.

Authors:  Lingjun Zuo; Xiang-Yang Zhang; Fei Wang; Chiang-Shan R Li; Lingeng Lu; Liefu Ye; Heping Zhang; John H Krystal; Hong-Wen Deng; Xingguang Luo
Journal:  Alcohol Clin Exp Res       Date:  2012-12-06       Impact factor: 3.455

9.  Dissecting ancestry genomic background in substance dependence genome-wide association studies.

Authors:  Renato Polimanti; Can Yang; Hongyu Zhao; Joel Gelernter
Journal:  Pharmacogenomics       Date:  2015-08-12       Impact factor: 2.533

10.  Common genetic variants and gene expression associated with white matter microstructure in the human brain.

Authors:  Emma Sprooten; Emma E Knowles; D Reese McKay; Harald H Göring; Joanne E Curran; Jack W Kent; Melanie A Carless; Thomas D Dyer; Eugene I Drigalenko; Rene L Olvera; Peter T Fox; Laura Almasy; Ravi Duggirala; Peter Kochunov; John Blangero; David C Glahn
Journal:  Neuroimage       Date:  2014-04-13       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.