Literature DB >> 23977022

Population genomics of cardiometabolic traits: design of the University College London-London School of Hygiene and Tropical Medicine-Edinburgh-Bristol (UCLEB) Consortium.

Tina Shah1, Jorgen Engmann, Caroline Dale, Sonia Shah, Jon White, Claudia Giambartolomei, Stela McLachlan, Delilah Zabaneh, Alana Cavadino, Chris Finan, Andrew Wong, Antoinette Amuzu, Ken Ong, Tom Gaunt, Michael V Holmes, Helen Warren, Daniel I Swerdlow, Teri-Louise Davies, Fotios Drenos, Jackie Cooper, Reecha Sofat, Mark Caulfield, Shah Ebrahim, Debbie A Lawlor, Philippa J Talmud, Steve E Humphries, Christine Power, Elina Hypponen, Marcus Richards, Rebecca Hardy, Diana Kuh, Nicholas Wareham, Claudia Langenberg, Yoav Ben-Shlomo, Ian N Day, Peter Whincup, Richard Morris, Mark W J Strachan, Jacqueline Price, Meena Kumari, Mika Kivimaki, Vincent Plagnol, Frank Dudbridge, John C Whittaker, Juan P Casas, Aroon D Hingorani.   

Abstract

Substantial advances have been made in identifying common genetic variants influencing cardiometabolic traits and disease outcomes through genome wide association studies. Nevertheless, gaps in knowledge remain and new questions have arisen regarding the population relevance, mechanisms, and applications for healthcare. Using a new high-resolution custom single nucleotide polymorphism (SNP) array (Metabochip) incorporating dense coverage of genomic regions linked to cardiometabolic disease, the University College-London School-Edinburgh-Bristol (UCLEB) consortium of highly-phenotyped population-based prospective studies, aims to: (1) fine map functionally relevant SNPs; (2) precisely estimate individual absolute and population attributable risks based on individual SNPs and their combination; (3) investigate mechanisms leading to altered risk factor profiles and CVD events; and (4) use Mendelian randomisation to undertake studies of the causal role in CVD of a range of cardiovascular biomarkers to inform public health policy and help develop new preventative therapies.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23977022      PMCID: PMC3748096          DOI: 10.1371/journal.pone.0071345

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

After decades of inconsistent findings, firm genetic associations are finally being reported for risk of coronary heart disease (CHD), arterial aneurysmal disease, peripheral vascular disease and diabetes [1]–[5], and for major atherosclerotic risk factors (LDL-cholesterol [6], blood pressure (BP) [7], and smoking behaviour [8]). Moreover, variants have been identified for their associations with many other cardiovascular disease (CVD)-associated biomarkers of less certain causal relevance, including HDL-cholesterol and triglycerides [6], apolipoproteins [9], lipoprotein (a) [10], inflammation and coagulation factors [11], uric acid, and glucose [12]. This progress in cardiovascular genetics has been made possible by sequencing the human genome [13], [14], characterising the frequency and correlation between genetic variation (1000 Genomes Project and the human HapMap) and exploiting technical advances in array-based genotyping. The assembly of large case-control collections for genome wide-association studies (GWAS) using microarrays that genotype up to 1 million single nucleotide polymorphisms (SNPs), the use of stringent significance thresholds for association, and rigorous validation by replication in independent studies [15], [16], have all contributed to these major advances in genetic epidemiology. Conservative significance thresholds required to reduce false-positive associations can lead to trait-associated variants of modest effect being overlooked in insufficiently large samples. Identifying these loci is still valuable because of the insight provided on disease aetiology and possible drug targets. To this end, multiple GWAS are being pooled by consortia usually focusing on a single trait or disease end-point, e.g. diabetes (DIAGRAM [5]), CHD (CARDIoGRAM [17]), stroke (the Stroke Genetics Consortium [18]), BP (the International Consortium of BP-GWAS [19]), lipids (the Global Lipids Genetic Consortium [20]), plasma glucose and related traits (MAGIC [21]), smoking behaviour (Ox-GSK [8]), and electrocardiographic QT-interval (QT-IGC). The contributing studies represent a mix of study designs, and include geographically diverse populations with differing risk factor exposures, CVD event rates, and age ranges. These studies usually provide cross-sectional disease outcome and phenotype data. Genotyping arrays for GWAS provide broad coverage of the genome through SNPs that capture information on un-typed SNPs due to linkage disequilibrium (LD). Because of LD between variants, multiple associated SNPs are observed at each locus, mostly in non-coding DNA regions [22], which may span several genes, or may be remote from known genes. Replication usually centres on just one or two strongly associating SNPs. Consequently, determining the identity and number of causal variants is often difficult, owing to the potentially large number of unexamined variants. Fine mapping, using denser SNP association data from re-sequencing or custom gene/locus-centric SNP arrays, can help resolve causal genes and SNPs. Therefore, despite the success of GWAS, unanswered questions remain. These include: (1) the precise location of the causal SNP(s) and the gene(s) they influence; (2) the absolute (as opposed to the relative) effects of loci, and how these change with age or differing non-genetic exposures; (3) the combined influence of multiple disease- or trait-associated SNPs; (4) the full constellation of risk factors and biomarkers influenced by each SNP (pleiotropy); and (5) the precise mechanism by which many CVD-associated SNPs lead to disease. These residual uncertainties can be addressed by higher resolution SNP typing at associated loci using gene-centric arrays, such as the Metabochip [23], in highly phenotyped, prospective cohort studies with serial risk factor/biomarker measures and information on incident disease. The Metabochip provides improved imputation accuracy, as well as greater power for detecting associations with common variants in fine mapping regions (122,241 SNPs are on the array to fine-map 257 loci which showed genome-wide significant evidence for association with one or more of the 23 cardio-metabolic traits), compared to GWAS arrays due to the increased SNP density of the Metabochip in genic regions [23], therefore offering an advantage for genetic association analyses. In addition, SNPs were selected from the databases developed by the International HapMap Project (http://www.hapmap.org) and the 1000 Genomes Project (http://www.1000genomes.org), allowing inclusion of SNPs across a wide range of the allele frequency spectrum, including those with a minor allele frequency of less than 0.1%. The UCL-LSHTM-Edinburgh-Bristol (UCLEB) Consortium has been established to allow interrogation of genetic associations using the Metabochip. The consortium consists of 12 well-established prospective observational studies comprising over 30,000 participants: Northwick Park Heart Study II (NPHS II), Whitehall-II Study (WHII), British Regional Heart Study (BRHS), English Longitudinal Study of Ageing (ELSA), MRC National Survey of Health and Development (MRC NSHD), 1958 Birth Cohort (1958BC), Edinburgh Artery Study (EAS), Edinburgh Type 2 Diabetes Study (ET2DS), Edinburgh Heart Disease Prevention Study (EHDPS), Aspirin for Asymptomatic Atherosclerosis Trial (AAAT), Caerphilly Prospective Study (CaPS) and the British Women's Heart and Health Study (BWHHS). The unique properties of the consortium provide an opportunity to investigate genetic determinants of risk factors for CVD through Metabochip-wide association analyses (MWAs), as well as associations with measures of subclinical disease such as carotid intima media thickness (cIMT), and with clinical events. Integration of genotype, biomarker and disease outcome data can also provide insight into disease mechanisms and potential therapeutic targets.

Methods and Design

Cohorts

All 12 studies in the UCLEB consortium are UK-based with wide geographic representation. Participants are almost exclusively of European ancestry. Principal components analysis (PCA) has been used to investigate the presence of population structure and exclude outliers. The age at recruitment ranges from birth (MRC NSHD and 1958BC) to >90 years (ELSA), with most cohorts recruiting subjects in mid-life. Despite the range of ages at the time of recruitment, the current age of participants spans the 5th to 9th decades of life, a time at which the majority of common, non-communicable diseases manifest, rendering the consortium a valuable continuing source for cases of incident diseases. All studies have longitudinal follow up (range 5–62 years) and details of incident disease (see ). Each of the studies has a prospective cohort design, the MRC NSHD and 1958BC also being birth cohorts. The AAAT study was designed as a RCT but the prospective follow-up allows inclusion as a cohort study. All have a DNA repository with most studies already having published genetic analyses. There are various existing collaborative links between the studies; MRC NSHD, 1958BC, ELSA, CaPS and WHII in the HALCyon network; MRC NSHD, 1958BC, BRHS, ELSA, WHII, NPHS II and EAS as the UCL Genetics Consortium, and BRHS and BWHHS as sister studies with the same sampling frame and clinic procedures.
Table 1

Summary of recruitment, inclusion criteria and clinical assessments in the UCLEB studies.

StudyLSHTM – Bristol basedBristol basedUCL basedEdinburgh basedTotal
BWHHSCaPSMRC NSHDELSANPHS-II1958BCWH-IIBRHSEASEHDPS* ET2DSAAA trial
Study design ProspectiveProspectiveProspective birth cohortProspectiveProspectiveProspective birth cohortProspectiveProspectiveProspectiveProspectiveProspectiveProspective
Sampling Frame General practicesGeneral practicesBirth registerRespondents of HSEGeneral practicesBirth registerWorkplaceGeneral practicesGeneral practicesGeneral practicesDiabetes register (via General practices)General practices
% men 010050441005066100501005028
Baseline year 1999–20011979–198319462002–20031989–199419581985–19881978–19801987–19881985–19882006–20071998–2001
N at recruitment 428625125362120993052174161030877351592159210663350 67318
Number of resurveys 441–12 (life course) 1–4 (adulthood)65 annual998223Annual for CVD
Number of surveys with clinical data 151361533232
Years Follow up 1022671117512430202048
N with DNA 3800 1500†† 2700†† 5616†† 2775†† 8017†† 5008†† 3945†† 940†† 1200 1060 2833†† 39394
N with Metabochip 20241397246419820583934082453850010570 21474
N with 50k chip/custom chip 344300000545600000 8899
N with GWAS 000800005595000000 13595
Stored biological specimens
PlasmaYesYes2700YesNoYesYes3945 (2)NoNo1066Yes
UrineNo2700NoNoNoNoNoNoNo1066No
White cellsYesYesNoNoNoYesNoNoNoNo1066No
Lymphoid cell linesNo2500NoNoNoNoNoNoNoNoNo
DNAKBio; BristolKBio; BristolGeneService; KBioGeneService; KBioUCLBristolGeneService; KBioGeneService; KBioUCL; CRF EdinburghCRF EdinburghCRF EdinburghCRF Edinburgh

DNA extraction/standardisation in progress †DNA extracted at baseline ††DNA extracted at subsequent resurvey.

DNA extraction/standardisation in progress †DNA extracted at baseline ††DNA extracted at subsequent resurvey. Each of the studies has a defined sampling frame, inclusion criteria, and procedures for the collection and recording of demographic details, biological samples, and clinical measures. Each of the 12 cohorts has a wide range of clinical and biological measures with overlap across studies, facilitating pooled analyses with substantial power. Many have also used common or comparable measurement methods with many of the blood markers having been measured in the same laboratory. All studies follow participants for incident disease and mortality, and have an ongoing programme of clinical assessments/biological sampling. The UCLEB consortium currently includes Metabochip information [23] augmented by imputation using the 1000 Genomes (http://www.1000genomes.org) dataset (1 million typed and imputed SNPs) from 8 studies consisting of around 21,000 samples, more than 100 phenotypes. These include a maximum of 70 blood and other biomarkers (of which 19 are available in more than 15000 participants); carotid ultrasound measures of atherosclerosis from 7200 individuals across 5 studies (MRC NSHD, WHII, BRHS, EAS and ET2DS); around 2000 cases of prevalent and incident type 2 diabetes; and almost 6000 cardiovascular events (see and ).
Table 2

Measures available in the UCLEB consortium.

Number of aggregate measuresPhenotypes
>35,000SBP, DBP, Smoking, Total Cholesterol, Fibrinogen, BMI, Height, Weight, Alcohol consumption
>30,000LDL, HDL, Triglycerides, Social class, Physical activity
>25,000Waist-hip ratio, HbA1c, CRP, Respiratory function (FEV1, FVC and PEFR), von Willebrand factor
>20,000Glucose, Stress, Verbal memory, Factor VII
>15,000D-Dimer, Educational achievement, Viscosity, ECG, Tissue plasminogen activator, IL-6, Cortisol, Short term memory, Insulin, MRC respiratory questionnaire, White cell count, Creatinine, eGFR
>10,000Muscle function (Walking speed, Standing balance, Grip strength and Chair rises), Lp(a), Liver function (ALT, AST and GGT), ApoAI, ApoB, Mental flexibility (TMT), Homocysteine, Cognitive function (Mill Hill VS, Letter search/cancel, WMS logical/verbal memory, MMSE, Non-verbal reasoning, Processing speed (DST) and AH4)
>5,000Digitised ECG (PR interval, QRS duration, QT interval and indices of left ventricular hypertrophy), Pulse wave velocity, Haematocrit, Prothrombin, Ferritin, IGF-1, cIMT, Cotinine, Telomere length, I-CAM, skin folds, V-CAM, ApoE, Platelets
>2,500Arterial distensibility, Heart rate variability, ABPI, TNF-α, Bilirubin, Leptin, Dehydroepiandrosterone sulfate, Fibrin peptide A, Proteinuria, Factor VIII, Factor IX, Activated partial thromboplastin time, Activated protein C added to the Activated partial thromboplastin time, Activated protein C and Activated partial thromboplastin time, Alkaline phosphatase, Serum urea, Serum potassium, Serum sodium, Serum urate, Serum magnesium, Serum calcium, Serum phosphate, Total serum protein, Red blood cell count, Haemoglobin, Mean cell volume, Mean platelet volume, Neutrophils, Lymphocytes, Monocytes, Eosinophils, Basinophils, Vitamin C, Vitamin E, Beta-carotene, Adiponectin, IL-18, MMP-9, sCD40L, Natriuretic peptide, E-selectin, Flow-mediated dilation, Pulse-wave analysis, ApoAII
Table 3

Disease event definitions, incident and prevalent events by disease and medication use in the UCLEB consortium.

STUDYTotal Incident CHDTotal Prevalent CHDTotal Incident StrokeTotal Prevalent StrokeTotal Diabetes% on Lipid Lowering drugs% on BP drugs% on Glucose Drugs
BWHHS2351741571172299%34%4%
CaPS33911832140438<1%23%2%
MRC NSHD1094423733%45%8%
ELSA691864314921110%48%7%
1958BC* 1071%4%2%
WH-II69206awaiting data8527412%16%3%
BRHS369422195109awaiting data10%31%4%
EAS176251022173-21%3%
ET2DS401926622105785%82%81%
TOTAL 2469 1323 1656 543 2282

Total incident CHD =  incident non-fatal MI or revascularization plus fatal CHD (ICD codes I20–I25, I516).

Total prevalent CHD  =  prevalent non-fatal MI or revascularization.

Total incident stroke  =  incident non-fatal stroke (ischaemic & haemorrhagic combined, but excluding TIA) plus fatal stroke (ICD codes I60×, I61×, I62, I629 I63×, I64× I65× I66×, I67, I672, I678, I679, I69×, G46×, G450, G451, G452, G453).

Total diabetes defined by a combination of self-report, medical history review, use of glucose lowering medication, or fasting glucose >7 mmol/L.

1958BC are currently undertaking case ascertainment.

Total incident CHD =  incident non-fatal MI or revascularization plus fatal CHD (ICD codes I20–I25, I516). Total prevalent CHD  =  prevalent non-fatal MI or revascularization. Total incident stroke  =  incident non-fatal stroke (ischaemic & haemorrhagic combined, but excluding TIA) plus fatal stroke (ICD codes I60×, I61×, I62, I629 I63×, I64× I65× I66×, I67, I672, I678, I679, I69×, G46×, G450, G451, G452, G453). Total diabetes defined by a combination of self-report, medical history review, use of glucose lowering medication, or fasting glucose >7 mmol/L. 1958BC are currently undertaking case ascertainment.

UCL-based studies

Northwick Park Heart Study II (NPHS II) [24]

From April 1989 to April 1994, 3012 healthy Caucasian men, aged 51–60 years, registered with 9 general medical practices across the UK were recruited for prospective surveillance. All eligible subjects were free of a history of unstable angina, MI or evidence of silent infarction, coronary surgery, aspirin or anticoagulant therapy, cardiovascular disease, malignancy (except skin cancer other than melanoma), or any condition precluding informed consent. Non-fasting plasma samples were taken at baseline and annually until year 6. A standard ECG was recorded and coded according to Minnesota criteria at baseline and at 6 years. Baseline plus 5 annual repeat measures are available in the entire cohort on BMI, lipids, DBP, SBP, fibrinogen and FVII. Single measures of homocysteine, Lp(a), and CRP are available. Repeat phenotypes were measured at the same centres and are complete for all those mentioned. Smoking status is known. Endpoints for CHD were fatal and non-fatal MI based on WHO criteria, silent MI or coronary revascularization procedures and sudden unexplained death. Information on fatal cancers and diabetes has also been recorded. DNA was extracted from blood samples collected in 2000.

British Regional Heart Study (BRHS) [25]

From 1978 to 1980, 7735 men aged 40–59 were recruited from general practices across the UK. A wide range of phenotypic measures is available for established risk markers such as lipids, blood pressure and inflammatory and haemostatic markers. Most of these measures were taken both at recruitment and re-examination, which occurred in 1998–2000 when men were aged 60–79. At this re-examination 4252 participants attended and DNA was extracted for 3945. A case-control sample was selected using 1095 cases with prevalent data at re-examination or incident cases of CHD or stroke over the next 8 years and 1358 controls. The controls were frequency matched based on being in the same town and within the same fixed 5-year age band as the cases. Data on important behavioural variables such as cigarette and alcohol consumption, as well as physical activity, have been regularly collected through follow up. Well validated outcome variables such as major coronary heart disease, stroke, diabetes, and revascularisation, as well as cause-specific mortality, continue to be collected from medical records almost 30 years after recruitment. (http://www.ucl.ac.uk/pcph/research-groups-themes/brhs-pub).

Whitehall II Study (WHII) [26]

Whitehall II recruitment of 10,308 participants (70% men) between 1985 and 1988 involved 20 London based Civil service departments. Genetic samples were collected in 2004 from over 6,000 participants. The study is highly phenotyped for cardiovascular and other ageing related health outcomes, with 9 phases of follow up (5 with clinical assessment and biological sampling), over 20 years of follow up. A wide variety of health behaviour and environmental data are also collected and the participants are consented for linkage to recorded clinical data such as Hospital Episode Statistics (HES), the Office of National Statistics mortality data and the national registry of acute coronary syndromes in England and Wales (Myocardial Ischaemia National Audit Project). (http://www.ucl.ac.uk/whitehallII/).

English Longitudinal Study of Ageing (ELSA) [27]

This is a national cohort of participants (48% men) aged over 50 years recruited from the Health Surveys for England in 1998, 1999 and 2001. Genetic data were collected at Wave 2 of the study (2004/5). A wide range of phenotypic measures relevant to ageing are available. These measures were made at Wave 0 of the study (1998, 1999 and 2001) and at follow up (2004/5). Data on health behaviours and a wide range of health outcomes are available. Nearly all participants (97%) are also consented to linkage to routine data such as HES, which allows for the assessment of health outcomes and cause specific mortality. A case-control sample was selected using 412 cases and 1573 controls. Controls and cases were matched by sex and 5-year age bands at Wave 2. (http://www.ifs.org.uk/elsa/).

Medical Research Council National Survey of Health and Development (MRC NSHD) [28]

This is an on-going prospective birth cohort study consisting of all births in England, Scotland and Wales in one week in March 1946. The sample includes single, legitimate births whose fathers were in non-manual or agricultural occupations and a randomly selected one in four of all others, whose fathers were in manual labour. The original cohort, now 66 years of age, comprised 2,547 women and 2,815 men who have been followed-up over 20 times since their birth. The data collected to date include repeat cognitive function, physical, lifestyle and anthropomorphic measures, as well as blood analytes and other measures. The cohort recently completed a particularly intensive phase of clinical assessment and biological sampling with blood and urine sampling and analysis, and cardiac and vascular imaging [29]. DNA was extracted from blood samples collected in 1999. Follow-up for disease outcomes is by self-reports of doctor diagnosed events that have been validated against General Practice (GP) records. (http://www.nshd.mrc.ac.uk/).

1958 Birth cohort (1958BC) [30]

1958BC is a prospective birth cohort study consisting of all births in England, Scotland and Wales in one week in March 1958. Participants to the cohort have been followed-up regularly since birth with prospective information collected on a wide range of indicators related to health, health behaviour, lifestyle, growth and development. There have been 9 contacts with the participants since their birth (ages 7, 11, 16, 23, 33, 41, 45, 47, and 50 years). The biomedical survey at age 45 years included collection of blood samples and DNA from about 8000 participants. Immortalised cell lines providing an unlimited resource for future genetic studies have been created for 7500 participants. Follow-up for disease outcomes is by self-reports of doctor diagnosed events and record linkage for fatal events.

Bristol/London School-based studies

Caerphilly prospective study (CaPS) [31]

This study is based on men aged between 45 to 59 years who resided in the small South Wales town of Caerphilly between the examination dates of 1979 &1983. Of the 2818 eligible, 2512, (89%) were recruited. The men were studied at baseline (Phase 1) and each subsequent 5 year period (Phase 2–5) and have therefore been followed up for around 20 years. An additional 447 patients were recruited at phase 2. The cohort has a wide range of cardiovascular phenotypes and at phase 3, cognitive function was also assessed, which has been supplemented with clinical dementia and cognitive impairment at phase 5. DNA was extracted from blood samples collected in 1992–1994. Follow-up for disease outcomes is by self-report from participants, who are also linked to hospital episode discharge summaries for validation checks to comply with WHO criteria, as well as death certificates for fatal events. (http://www.bris.ac.uk/social-community-medicine/projects/caerphilly/about/).

British Women's Heart and Health Study (BWHHS) [32]

Established in 1999 as a study of women to parallel the BRHS, it used the same sampling frame and very similar clinic protocols to the 20 year follow-up of BRHS. From 1999–2001, 4286 women aged 60–79 were randomly selected from 23 general practices across the UK. A wide range of phenotypic measurements were obtained at baseline including anthropometry, blood pressure, ECG, lung function tests and fasting blood samples. Glucose, insulin, lipids, clotting and inflammatory markers have been assayed and stored serum is available. DNA was extracted from blood samples collected at baseline in 1999–2001. Data on sociodemographic and lifestyle variables such as cigarette smoking, alcohol consumption, diet, physical activity, reproductive health, education, occupation, quality of life, and activities of daily living have been collected repeatedly (baseline, years 3, 7, and 12). Follow-up for disease outcomes is by biennial medical record review (with validation checks) and cancer registrations and death certificates obtained from the National Health Service (NHS) Central Registration. Detailed follow-up is collected on coronary heart disease, stroke, type 2 diabetes, pulmonary embolism, deep vein thrombosis and cancer events. The UCLEB case-control sample was selected using 523 cases with prevalent disease at recruitment or incident cases of CHD or stroke up to 2010. 1501 controls were frequency matched based on being in the same town and within the same fixed 5-year age band as the cases. (http://www.lshtm.ac.uk/eph/ncde/research/bwhhs/index.html).

Edinburgh-based studies

Edinburgh Artery Study (EAS) [33]

At baseline (August 1987-September 1988), an age-stratified random sample of men and women, aged 55–74 years, was selected from the age-sex registers of ten general practices with catchment populations spread geographically and socioeconomically throughout the city of Edinburgh. Subjects were excluded if they were unfit to participate (e.g. due to severe mental illness or terminal disease). These exclusions were replaced by other randomly sampled subjects. The study population is almost exclusively European. DNA was extracted at 5 years follow-up. Physical examinations were performed by specially trained research nurses using standardised operating procedures. The quality of clinical measurements were checked before and during the study by repeat measurements taken intermittently by the study co-ordinator. Individual observer measurements were assessed for drift. Blood assays were performed in accredited laboratories using international standards. Subjects have been followed up for 20 years for cardiovascular events, using repeat self-reporting questionnaires, record linkage for hospitalisations and deaths, and validation of events against pre-specified criteria through searching of hospital and GP notes. Comprehensive clinical examination was repeated at 5 and 12 years after commencement of the study, resulting in repeat measurements of several key variables.

Edinburgh Heart Disease Prevention Study (EHDPS) [34]

At baseline (1985–1987), all men aged 30–59 years and registered with one of two general practices in the city of Edinburgh, were invited to participate in this study. The response rate was 69% and follow-up of non-responders showed that there was no substantial bias. DNA was extracted at baseline. Physical examinations were performed by a specially trained research nurse using standardised operating procedures. Blood assays were performed in accredited laboratories using international standards. Subjects have been followed up after 20 years for cardiovascular events, using repeat self-reporting questionnaires and record linkage for hospitalisations and deaths.

Edinburgh Type 2 Diabetes Study (ET2DS) [35]

At baseline (August 2006 to August 2007), an age-stratified random sample of men and women with type 2 diabetes, aged 60–74 years, was selected from the Lothian Diabetes Register (LDR), a comprehensive database of subjects with known type 2 diabetes living in Lothian. Subjects were excluded if they did not meet WHO criteria for type 2 diabetes, or if they were physically unable to complete the clinical and cognitive examination. The study population is almost exclusively European. DNA was extracted at baseline. Physical examinations were performed by specially trained research nurses using standardised operating procedures. The quality of measurements was checked using observation of research staff by study investigators and inter-observer variability assessments were made for key variables. Blood assays were performed in accredited laboratories using international standards. Retrospective data on cardiovascular disease and selected physical and biochemical variables were retrieved using record linkage for hospitalisations and deaths since 1985 and using data from the LDR. Subjects returned for further clinical examination after one year and were examined again after they had participated for 4 years.

Asymptomatic Atherosclerosis Aspirin Trial (AAAT) [36]

At baseline (1998 to 2000), all men and women aged 50–75 and registered with participating general practices spread throughout Edinburgh, Glasgow and Lanarkshire (83% of all practices within study area), were invited to participate in this study. Subjects were excluded if they had a history of MI, stroke, angina or peripheral arterial disease (PAD) and included if they had an ankle brachial index of 0.9 or less. They were therefore a healthy, but moderately increased risk population, in terms of cardiovascular disease. Subjects were further excluded if they were currently using aspirin or other antiplatelet or had a contraindication to aspirin therapy. Subjects were followed up for 8 years for cardiovascular events, using annual contacts with subjects, record linkage for hospitalisations and rigorous validation of events against pre-defined criteria using hospital and GP notes. Physical examination was performed at baseline and after 5 years by specially trained research nurses using standard operating procedures, and blood assays were performed in accredited laboratories using international standards. DNA was extracted 3 months after recruitment.

Work flow and organisation

The workflow developed to co-ordinate genotyping, merge and collate with harmonised phenotype and disease end-point data is summarised in .

Organisation and governance

Informed consent was obtained for all subjects included in UCLEB research. Written approval from individual Research Ethics Committees to use anonymised individual level data has been obtained by each participating study. All data obtained and generated within the UCLEB consortium are fully anonymised, contain no personal identifiers and adhere to the contributing studies restrictions on deductive disclosure. Senior investigators from each participating study were invited to join the UCLEB steering group to discuss key projects to be carried forward and to assess the potential for external collaborations. An analysis group meets weekly focusing on methods development and analyses. There are regular weekly teleconferences organised from the coordinating centre at UCL during which the whole consortium is updated on various aspects of genotype and phenotype data management, and on-going and proposed analyses. An access-restricted Google-hosted website has been set up to facilitate sharing of minutes, analysis plans, project proposals and other useful resources. The cohorts included in the UCLEB consortium have individual policies and mechanisms for data sharing and all have an excellent track record in this regard. Most of the studies have contributed to the highly cited Emerging Risk Factors Collaboration reports on CV risk factors and biomarkers, and to GWAS consortia and other large-scale genetic discovery efforts. Opportunities for data sharing are maximised because requests to individual studies are also possible, e.g. for projects that focus on certain study designs or where outcomes are available only in a subset. The proposals to access UCLEB data are evaluated by members of the steering committee using a standardised data request form. Aggregate SNP data for specifically requested traits are shared with external collaborators according to the external project analysis plan.

Genotyping

Around 21,000 individuals across the cohorts have been typed using Metabochip (see ), a genotyping platform consisting of ∼200,000 SNPs, which cover the loci identified by GWAS in cardiometabolic diseases, and rare variants from the 1000 Genomes Project [23]. This will be supplemented by SNP data from a whole genome array in the 1958 Birth Cohort, the 50 k HumanCVD Beadchip [37] in WHII and BWHHS, and prior candidate and GWAS replication work in all studies. The NPHS-II, EHDPS and AAAT studies are available for new bespoke genotyping e.g. to validate associations from UCLEB samples genotyped by Metabochip. This will yield a powerful aggregate dataset rich in genetic and phenotypic detail.
Table 4

Information on genotyped and imputed SNPs.

WHII (UCL Genomics)* WHII (Cambridge)* CaPSEASET2DSBRHSBWHHSMRC NSHD1958BCELSA
Genotyping
Array ReaderiScanBeadArrayiScanBeadstation S225iScaniScaniScanBeadArrayiScaniScan
GenomeStudio Versionv2010.1v2010.1v2010.3v2009.1v2010.3v2010.3v2010.3v2010.1NAv2010.3
Clustering AlgorithmGenTrain, followed by reclustering based on dataCustom – using top 100 samples to create cluster file and apply it to the set.GenTrain, followed by reclustering based on dataGenTrain 2.0, followed by reclustering based on dataGenTrain, followed by reclustering based on dataGenTrain, followed by reclustering based on dataGenTrain, followed by reclustering based on dataCustom – using top 100 samples to create cluster file and apply it to the set.GenTrain, followed by reclustering based on dataGenTrain, followed by reclustering based on data
GenCall Threshold0.150.150.150.150.150.150.150.150.150.15
Allele FormatPlus StrandPlus StrandPlus StrandPlus StrandPlus StrandPlus StrandPlus StrandPlus StrandPlus StrandPlus Strand
Total SNPs196725196725196725196725196725196725196725196725196725196725
Total Samples100824051411863107524542068248858402007
Sample Call Rate threshold0.950.950.95no QC on supplied data0.950.950.950.950.950.95
Samples passing call rate 0.95100523881376814105023811994247558132004
Samples passing all QC30781349764100723421980246455601883
SNP Call Rate threshold0.950.95no QC on supplied data0.950.950.950.950.950.95
SNPs passing call rate 0.95193203192040186324192701185419190386150443183675191478
Post-Imputation
No. polymorphic SNPs with Rsq ≥0.3383865740974683967944409977140099874079402411322738645844085542
No. polymorphic SNPs with Rsq ≥0.8121780213094371265020131284912712041298293131179113028631303682

The WHII study was typed at two centres with different array readers, which were compared and shown to be concordant.

The WHII study was typed at two centres with different array readers, which were compared and shown to be concordant. Duplicate samples have been genotyped to compute the error rate. Initial quality control analysis on genotyped data identified any problem samples that have been subsequently excluded in further analysis. These included: checks for discordance between reported and genetically-determined ethnicity, replicate concordance, sample mix-up (unknown duplicates and comparison to previously genotyped data where available), gender ambiguity and cryptic relatedness (see ).

Imputation using data from 1000 Genomes

Although coverage of the genome is less comprehensive using the Metabochip than a whole genome array, imputation against the 1000 genomes European ancestry reference sample extends coverage from 200,000 to approximately 1 million SNPs, when the R2 is ≥0.8 (see ), with dense coverage of loci of interest for cardiometabolic disease, including approximately 70% of the druggable genome. Imputation using Metabochip data served to fine-map in and around regions covered by the array. However, the gene-centric nature of Metabochip means that there are extensive intergenic regions of the genome that have no SNP coverage, and therefore, imputation provided little additional information in these regions. Our imputation process was based on the strategy summarised at: http://genome.sph.umich.edu/wiki/Minimac:_1000_Genomes_Imputation_Cookbook, and proceeded in three distinct stages: chunking, phasing and imputing.

Chunking

To speed up the overall imputation process, the genome was broken into overlapping chunks specified by reference to the physical map. Each chunk consisted of 1000 consecutive SNPs and consecutive chunks overlapped by 250 SNPs. The final chunk in each chromosome consisted of any remaining SNPs (less than 1000) plus a 250 SNP overlap with the penultimate chunk.

Phasing

Each chunk was phased using the program MACH1 (downloaded from: http://www.sph.umich.edu/csg/abecasis/MACH/download/), which implements a Markov Chain Monte Carlo (MCMC) Haplotyping algorithm [38], [39]. The phasing process can be set to consider all possible haplotypic states for the genotypes but this is too computationally intensive to be practical. We therefore phased each chunk considering 500 states and the number of rounds of MCMC was set to 30.

Imputation

Phased haplotypes were used as a basis for imputation of untyped SNPs using the method described by Li et al. 2010 [39], which used an external set of reference haplotypes (1000 genomes, February 2012 release, CEU haplotype set) to infer the most likely allele call for untyped loci. The method was implemented using the software Minimac (downloaded from: http://www.sph.umich.edu/csg/cfuchsb/minimac-beta-2013.7.17.tgz. Following imputation, chunks were reassembled into full chromosomes. The genotypes for SNPs that lay in overlapped regions were taken from the chunk in which the R2 statistic for imputation quality was greatest. The UCLEB data was collated in a dosage format, which is readily utilised by the analytical package PLINK [40], as well as in probability format that can be used by the R Package snpStats (downloaded from: http://www.bioconductor.org/packages/2.10/bioc/html/snpStats.html), which offers a useful range of analytical functions coupled with the flexibility of the R programming environment for data manipulation and further analysis.

Phenotype and disease endpoints

Phenotypic data available for sharing from each study were harmonized to agreed units and categories determined by individual study and phenotype data managers. Data were then collated at the UCLEB coordinating centre ( ). Disease definitions were as follows: CHD is all non-fatal myocardial infarction (MI) or any revascularisation procedure (coronary artery bypass surgery or angioplasty) and fatal CHD. Stroke is all non-fatal stroke (ischaemic & haemorrhagic combined) excluding TIA and fatal stroke. Fatal events are defined or matched according to a pre-defined list of ICD codes. Type II diabetes is based on either self-reported, medical history review, taking glucose lowering medication, or a fasting glucose >7 mmol/L. Medication data were also collated, including those on lipid lowering drugs (statins or other medication), blood pressure lowering drugs, and glucose drugs ( ).

Analyses

Analysis plans have been written that can be adapted by each working group for specific analyses, and Stata, PLINK and R syntax are available for these analyses. In order to address the core aims of UCLEB, five broad analytical approaches will be applied. Metabochip-wide genetic discovery analysis to be run individually on each study, from which study-specific estimates will be pooled by fixed effects meta-analysis weighting by inverse variance using the METAL program, as well as by random effects meta-analysis, to generate summary estimates for each trait. Medication use for the main drug classes affecting cardiometabolic traits (cholesterol, blood pressure and glucose) has been carefully recorded. Previously validated methods [41], [42] for adjusting genetic associations for medication use will be applied. Heterogeneity will be estimated by Cochran's Q statistic and the I2 value (and 95% confidence interval; CI). SNPs most likely to mark or represent causal sites will be identified using variable selection methods, as well as conditional analyses to identify independent SNPs at each locus applying locus-specific Bonferroni correction. Analysis of multiple phenotypes (phenome scans) for improving power and investigating pleiotropy. Considering the extra information provided by the covariance between the traits allows the joint analysis of multiple phenotypes, in order to increase power. One such implementation is MultiPhen [43], in which the genotype of each SNP is regressed on a group of phenotypes. Pleiotropy will be explored in two ways; (i) using Bayesian statistical methodology, where the association between multiple phenotypes and multiple SNPs will be analysed simultaneously using the GUESS software [44]; and (ii) through phenome scans based on associations reported in prior studies, where the association of a selection of known disease associated genetic loci is tested for the large number of available phenotypes in the UCLEB data to establish common pathways and functional links. Application of genetic associations in risk prediction models. The absolute, relative and attributable genetic risk of CVD events (overall, and separately for fatal/non-fatal CHD and stroke) will be estimated and compared with the associations of non-genetic risk factors using SNPs identified by prior GWAS that are included in Metabochip. Analyses will use Cox proportional hazards models and derived hazard ratios (HRs). Risks will be estimated within study and pooled by random effects meta-analysis with exploration of heterogeneity and effect modification by strata of age, CVD risk factors, and by geographic region. We will generate standard prediction metrics (e.g. C-statistics, net-reclassification, integrated discrimination index) that will compare the benefit of genotypes for CVD prediction with established non-genetic risk scores. The large number of incident CV events in the UCLEB consortium (>5000) will allow precise risk estimates to be obtained and minimise potential false-positive findings. Mendelian randomisation (MR) analyses. MR studies typically quantify and compare three associations: (i) biomarker-disease in prospective cohort studies; (ii) genotype-biomarker in cross-sectional or prospective cohort studies; and (iii) genotype-disease in prevalent case-control studies or studies of incident cases and controls nested within a cohort study. Triangulation of the risk estimates provides evidence on causation, with the magnitude of the causal association being estimated by instrumental variables regression. Despite emerging successes of MR, two limitations remain. First, many biomarker-associated SNPs are of weak effect, compromising power and necessitating large sample sizes. Second, although SNPs identified for an index association with a single CVD risk factor/biomarker are not generally associated with exogenous exposures influencing CVD risk (e.g. diet, physical activity, socioeconomic status), they are frequently associated with a diverse range of endogenous biomarkers. This can compromise interpretation of an MR analysis based on a single locus. The issue of power is addressed in UCLEB both through use of gene scores and through external collaboration with other studies and consortia to ensure case numbers are not limiting. The use of genetic instruments comprising SNPs from different genetic loci, each independently associated with the biomarker of interest, increases power, since each SNP contributes incrementally to the marker variance, and also helps reduce non-specificity, because the relative genetic effect on traits other than biomarker of interest tends to attenuate. Genetic effects on risk factor/biomarker trajectories. Analysis of risk factor trajectories will involve generation of standard deviation scores for each trait, cross-sectional descriptive analyses at different ages, and the development of hierarchical mixed models which account for correlation between repeated measures, as well as age-by-genotype interaction tests. We will use multiple imputation for the management of missing data. gives additional information on the analyses that have been prioritised by UCLEB.

Power calculations

Power and sample size calculations were conducted to evaluate the ability of the UCLEB datasets to discover novel loci and to test the effect of loci validated for an index trait on other traits and outcomes. Conservative alpha values of 1×10−7 (GWAS level) were adopted for the former and 10−4 for the latter. All sample size estimates correspond to the total number of individuals and assume a 10-year CVD event rate of 16%, based on average age, follow-up period and gender of participants in UCLEB cohorts.

Quantitative traits

For quantitative traits, a good approximation for the sample size is given by N = (zα+zβ)2/q2 where zα and zβ are values of the standard normal distribution for specified alpha and beta error and q2 is the amount of variance explained, which is determined by effect size and minor allele frequency in the case of single SNPs. In addition to alpha and sample size, power depends on the variance explained by a SNP, which is in turn related to minor allele frequency (MAF, pAu) and effect size (d) and is given by q2 = 2pAu(1-pAu)d2. Tables S1a and S1b present the resulting power estimates in order to be able to discover SNPs that explain as little as 0.5% of the variance with 80% power. The tables provide power for sample sizes in the range 2,500 and 20,000 to detect variance in a quantitative trait of 0.5–5% at a p-value of 10−7 (), and a p-value of 10−4 ().

Events

For disease events, power depends additionally on incidence rate. A differentiation is made between common alleles (see Tables S2a and S2b) and rare alleles (see Tables S2c and S2d). Higher MAFs are often accompanied by smaller effect sizes while rare alleles have a higher likelihood of larger effects. As illustrated by Tables S2a–d, the combination of very small MAF and very small effect leads to a substantial inflation of the effect of sample size on power. With an overall sample of 30,000 subjects, and a 16% 10-year event rate, power estimates mostly exceed 80% for a range of plausible effect sizes. For example, with a total sample of 27,323 individuals among whom the 10-year event rate is 16%, power to detect a common SNP with MAF of 15% and odds ratio of 1.2 is 80%. With regard to rare alleles that have a frequency of only 1%, achieving a power of 80% for an odds ratio of 2, requires 18,456 subjects.

Discussion

UCLEB is a large-scale epidemiological resource bringing together diverse expertise across studies, scientific advisors, phenotype experts, statistical experts and analysts. UCLEB has an established research governance structure, data sharing arrangements, steering, operations and analysis groups, a centralised, secure, access-managed genetic and phenotypic data repository, and agreed analysis protocols. The strength of cohort-based analyses is that genetic loci can be identified for every quantitative trait recorded in sufficiently large numbers. UCLEB has more than 100 traits indexing 16 organs and biological systems. Moreover, as well as the established 1-trait: 1-Metabochip-wide analysis model, there is an opportunity to integrate Metabochip information, for example, to identify quantitative traits with overlapping genetic regulation and genetic loci with effects on multiple pathways. Heritability estimates provide an indication of the overall genetic contribution, but not the number of genes influencing a trait or the size of effect. Despite similar heritability, genes for some traits such as blood inflammation markers [45], [46] have been identified by moderately-sized GWAS, while others such as BMI [47] and height [48] have required much larger data sets to identify genes of smaller effect with statistical confidence. In general, traits more proximal in the pathway from genome variation to disease (e.g. blood proteins and metabolites) are likely to be identified with smaller sample sizes than distal (higher order) phenotypes such as ECG parameters and BP. However, expansion of the population resources in both domains would be beneficial even for traits where some genetic loci have already been identified (e.g. LDL-cholesterol), because of the large residual unexplained phenotypic variance. For the majority of traits in this consortium, the genetic determinants have yet to be fully characterised. By exploiting measures of continuous phenotypes where available, case-control GWAS have not only provided insight on disease-associated loci but have also started to uncover genetic effects on quantitative traits. However, highly-phenotyped, population-based prospective studies are particularly suited for genetic analysis of quantitative traits because of: (1) the range of measures; (2) longitudinal follow-up that also allows exploration of genetic effects on trajectories, and at critical periods; (3) the addition of new phenotypic detail with new waves of resurvey and data collection, which enrich the resource, so that the value of the investment in genotyping extends beyond any single trait or outcome; (4) assessment of context dependent genetic effects because of exhaustive information on diet, records (and sometimes objective measures) of smoking and physical activity, as well as education, employment, medication and many other environmental factors [49], [50]; and (5) repeated measures not only of many of the outcomes of interest but also some of the important environmental modifiers (e.g. smoking, alcohol consumption, physical activity and social indices), which will help improve precision by allowing control for regression dilution bias, and will facilitate analysis of gene-environment interaction [51]. Together with the use of the Metabochip SNP array that more densely captures genome variation, including rare and copy number variants, this should allow more comprehensive evaluation of genetic effects on disease-relevant traits. A number of large consortia have already been assembled to discover and subsequently fine map genetic associations of cardiometabolic traits and disease end points [52]–[54] using Metabochip. UCLEB studies have already contributed to some of these efforts, but these are typically based on sharing of summary level data on a narrow range of phenotypes or disease end-points. The added value of the UCLEB consortium comes from the ability to undertake discovery of the genetic loci for less commonly available phenotypes not currently the subject of consortium based efforts, extend the associations of known cardiometabolic loci to a wider range of phenotypes (phenome scans), to investigate the effects of known genetic variants on biomarker trajectories and variability (where repeat measures are available), to evaluate the predictive utility of known disease associated genetic variants for incident CVD events, and to generate and optimise genetic tools for Mendelian randomisation analyses based on effect size and specificity for the trait of interest. In this respect, the UCLEB is most similar in design, organisation, exposure measures and aims to two other assemblies of population based studies engaged in collaborative genomic research. These are the National Heart Lung and Blood Institute Candidate Gene Association Resource (CARe) [55], and the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium [56].

Conclusions

In conclusion, UCLEB provides a stable, long-term resource for large-scale, integrated genomics analyses, with the potential to add proteomic and metabolomic technologies as they emerge. The archived biological samples will facilitate more comprehensive phenomic analysis to match the breadth of genomic data. The integration of multiple layers of –omics data within the framework of cohort studies should eventually lead to a more comprehensive understanding of the mechanisms of common disease [57]. We recognise that very large sample sizes are required for assessments of gene-environment interactions (more so for disease outcomes than quantitative traits) and are likely to require “consortia of consortia”. Therefore, UCLEB will also continue to collaborate externally and contribute to the wider research efforts focused around specific diseases and phenotypes. UCLEB will do this by building on an already extensive network of successful links with other investigators in the UK and internationally, that will permit powerful large-scale analyses. UCLEB workflow. (DOCX) Click here for additional data file. Prioritised analyses in UCLEB. (DOCX) Click here for additional data file. Table S1a: Power for discovery for quantitative traits. Table S1b: Power for translation for quantitative traits (DOCX) Click here for additional data file. Table S2a: Power for discovery for common alleles. Table S2b: Power for translation for common alleles. Table S2c: Power for discovery for rare alleles. Table S2d: Power for translation for rare alleles. (DOCX) Click here for additional data file.
  54 in total

Review 1.  Genotype imputation.

Authors:  Yun Li; Cristen Willer; Serena Sanna; Gonçalo Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

2.  Large-scale association analysis identifies new risk loci for coronary artery disease.

Authors:  Panos Deloukas; Stavroula Kanoni; Christina Willenborg; Martin Farrall; Themistocles L Assimes; John R Thompson; Erik Ingelsson; Danish Saleheen; Jeanette Erdmann; Benjamin A Goldstein; Kathleen Stirrups; Inke R König; Jean-Baptiste Cazier; Asa Johansson; Alistair S Hall; Jong-Young Lee; Cristen J Willer; John C Chambers; Tõnu Esko; Lasse Folkersen; Anuj Goel; Elin Grundberg; Aki S Havulinna; Weang K Ho; Jemma C Hopewell; Niclas Eriksson; Marcus E Kleber; Kati Kristiansson; Per Lundmark; Leo-Pekka Lyytikäinen; Suzanne Rafelt; Dmitry Shungin; Rona J Strawbridge; Gudmar Thorleifsson; Emmi Tikkanen; Natalie Van Zuydam; Benjamin F Voight; Lindsay L Waite; Weihua Zhang; Andreas Ziegler; Devin Absher; David Altshuler; Anthony J Balmforth; Inês Barroso; Peter S Braund; Christof Burgdorf; Simone Claudi-Boehm; David Cox; Maria Dimitriou; Ron Do; Alex S F Doney; NourEddine El Mokhtari; Per Eriksson; Krista Fischer; Pierre Fontanillas; Anders Franco-Cereceda; Bruna Gigante; Leif Groop; Stefan Gustafsson; Jörg Hager; Göran Hallmans; Bok-Ghee Han; Sarah E Hunt; Hyun M Kang; Thomas Illig; Thorsten Kessler; Joshua W Knowles; Genovefa Kolovou; Johanna Kuusisto; Claudia Langenberg; Cordelia Langford; Karin Leander; Marja-Liisa Lokki; Anders Lundmark; Mark I McCarthy; Christa Meisinger; Olle Melander; Evelin Mihailov; Seraya Maouche; Andrew D Morris; Martina Müller-Nurasyid; Kjell Nikus; John F Peden; N William Rayner; Asif Rasheed; Silke Rosinger; Diana Rubin; Moritz P Rumpf; Arne Schäfer; Mohan Sivananthan; Ci Song; Alexandre F R Stewart; Sian-Tsung Tan; Gudmundur Thorgeirsson; C Ellen van der Schoot; Peter J Wagner; George A Wells; Philipp S Wild; Tsun-Po Yang; Philippe Amouyel; Dominique Arveiler; Hanneke Basart; Michael Boehnke; Eric Boerwinkle; Paolo Brambilla; Francois Cambien; Adrienne L Cupples; Ulf de Faire; Abbas Dehghan; Patrick Diemert; Stephen E Epstein; Alun Evans; Marco M Ferrario; Jean Ferrières; Dominique Gauguier; Alan S Go; Alison H Goodall; Villi Gudnason; Stanley L Hazen; Hilma Holm; Carlos Iribarren; Yangsoo Jang; Mika Kähönen; Frank Kee; Hyo-Soo Kim; Norman Klopp; Wolfgang Koenig; Wolfgang Kratzer; Kari Kuulasmaa; Markku Laakso; Reijo Laaksonen; Ji-Young Lee; Lars Lind; Willem H Ouwehand; Sarah Parish; Jeong E Park; Nancy L Pedersen; Annette Peters; Thomas Quertermous; Daniel J Rader; Veikko Salomaa; Eric Schadt; Svati H Shah; Juha Sinisalo; Klaus Stark; Kari Stefansson; David-Alexandre Trégouët; Jarmo Virtamo; Lars Wallentin; Nicholas Wareham; Martina E Zimmermann; Markku S Nieminen; Christian Hengstenberg; Manjinder S Sandhu; Tomi Pastinen; Ann-Christine Syvänen; G Kees Hovingh; George Dedoussis; Paul W Franks; Terho Lehtimäki; Andres Metspalu; Pierre A Zalloua; Agneta Siegbahn; Stefan Schreiber; Samuli Ripatti; Stefan S Blankenberg; Markus Perola; Robert Clarke; Bernhard O Boehm; Christopher O'Donnell; Muredach P Reilly; Winfried März; Rory Collins; Sekar Kathiresan; Anders Hamsten; Jaspal S Kooner; Unnur Thorsteinsdottir; John Danesh; Colin N A Palmer; Robert Roberts; Hugh Watkins; Heribert Schunkert; Nilesh J Samani
Journal:  Nat Genet       Date:  2012-12-02       Impact factor: 38.330

3.  Rationale, design, and methodology of the Women's Genome Health Study: a genome-wide association study of more than 25,000 initially healthy american women.

Authors:  Paul M Ridker; Daniel I Chasman; Robert Y L Zee; Alex Parker; Lynda Rose; Nancy R Cook; Julie E Buring
Journal:  Clin Chem       Date:  2007-12-10       Impact factor: 8.327

4.  Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: Design of prospective meta-analyses of genome-wide association studies from 5 cohorts.

Authors:  Bruce M Psaty; Christopher J O'Donnell; Vilmundur Gudnason; Kathryn L Lunetta; Aaron R Folsom; Jerome I Rotter; André G Uitterlinden; Tamara B Harris; Jacqueline C M Witteman; Eric Boerwinkle
Journal:  Circ Cardiovasc Genet       Date:  2009-02

5.  Meta-analysis of Dense Genecentric Association Studies Reveals Common and Uncommon Variants Associated with Height.

Authors:  Matthew B Lanktree; Yiran Guo; Muhammed Murtaza; Joseph T Glessner; Swneke D Bailey; N Charlotte Onland-Moret; Guillaume Lettre; Halit Ongen; Ramakrishnan Rajagopalan; Toby Johnson; Haiqing Shen; Christopher P Nelson; Norman Klopp; Jens Baumert; Sandosh Padmanabhan; Nathan Pankratz; James S Pankow; Sonia Shah; Kira Taylor; John Barnard; Bas J Peters; Cliona M Maloney; Maximilian T Lobmeyer; Alice Stanton; M Hadi Zafarmand; Simon P R Romaine; Amar Mehta; Erik P A van Iperen; Yan Gong; Tom S Price; Erin N Smith; Cecilia E Kim; Yun R Li; Folkert W Asselbergs; Larry D Atwood; Kristian M Bailey; Deepak Bhatt; Florianne Bauer; Elijah R Behr; Tushar Bhangale; Jolanda M A Boer; Bernhard O Boehm; Jonathan P Bradfield; Morris Brown; Peter S Braund; Paul R Burton; Cara Carty; Hareesh R Chandrupatla; Wei Chen; John Connell; Chrysoula Dalgeorgou; Anthonius de Boer; Fotios Drenos; Clara C Elbers; James C Fang; Caroline S Fox; Edward C Frackelton; Barry Fuchs; Clement E Furlong; Quince Gibson; Christian Gieger; Anuj Goel; Diederik E Grobbee; Claire Hastie; Philip J Howard; Guan-Hua Huang; W Craig Johnson; Qing Li; Marcus E Kleber; Barbara E K Klein; Ronald Klein; Charles Kooperberg; Bonnie Ky; Andrea Lacroix; Paul Lanken; Mark Lathrop; Mingyao Li; Vanessa Marshall; Olle Melander; Frank D Mentch; Nuala J Meyer; Keri L Monda; Alexandre Montpetit; Gurunathan Murugesan; Karen Nakayama; Dave Nondahl; Abiodun Onipinla; Suzanne Rafelt; Stephen J Newhouse; F George Otieno; Sanjey R Patel; Mary E Putt; Santiago Rodriguez; Radwan N Safa; Douglas B Sawyer; Pamela J Schreiner; Claire Simpson; Suthesh Sivapalaratnam; Sathanur R Srinivasan; Christine Suver; Gary Swergold; Nancy K Sweitzer; Kelly A Thomas; Barbara Thorand; Nicholas J Timpson; Sam Tischfield; Martin Tobin; Maciej Tomaszewski; Maciej Tomaszweski; W M Monique Verschuren; Chris Wallace; Bernhard Winkelmann; Haitao Zhang; Dongling Zheng; Li Zhang; Joseph M Zmuda; Robert Clarke; Anthony J Balmforth; John Danesh; Ian N Day; Nicholas J Schork; Paul I W de Bakker; Christian Delles; David Duggan; Aroon D Hingorani; Joel N Hirschhorn; Marten H Hofker; Steve E Humphries; Mika Kivimaki; Debbie A Lawlor; Kandice Kottke-Marchant; Jessica L Mega; Braxton D Mitchell; David A Morrow; Jutta Palmen; Susan Redline; Denis C Shields; Alan R Shuldiner; Patrick M Sleiman; George Davey Smith; Martin Farrall; Yalda Jamshidi; David C Christiani; Juan P Casas; Alistair S Hall; Pieter A Doevendans; Jason D Christie; Gerald S Berenson; Sarah S Murray; Thomas Illig; Gerald W Dorn; Thomas P Cappola; Eric Boerwinkle; Peter Sever; Daniel J Rader; Muredach P Reilly; Mark Caulfield; Philippa J Talmud; Eric Topol; James C Engert; Kai Wang; Anna Dominiczak; Anders Hamsten; Sean P Curtis; Roy L Silverstein; Leslie A Lange; Marc S Sabatine; Mieke Trip; Danish Saleheen; John F Peden; Karen J Cruickshanks; Winfried März; Jeffrey R O'Connell; Olaf H Klungel; Cisca Wijmenga; Anke Hilse Maitland-van der Zee; Eric E Schadt; Julie A Johnson; Gail P Jarvik; George J Papanicolaou; Struan F A Grant; Patricia B Munroe; Kari E North; Nilesh J Samani; Wolfgang Koenig; Tom R Gaunt; Sonia S Anand; Yvonne T van der Schouw; Nicole Soranzo; Garret A Fitzgerald; Alex Reiner; Robert A Hegele; Hakon Hakonarson; Brendan J Keating
Journal:  Am J Hum Genet       Date:  2010-12-30       Impact factor: 11.025

6.  Health inequalities among British civil servants: the Whitehall II study.

Authors:  M G Marmot; G D Smith; S Stansfeld; C Patel; F North; J Head; I White; E Brunner; A Feeney
Journal:  Lancet       Date:  1991-06-08       Impact factor: 79.321

7.  Cohort Profile: The 1946 National Birth Cohort (MRC National Survey of Health and Development).

Authors:  Michael Wadsworth; Diana Kuh; Marcus Richards; Rebecca Hardy
Journal:  Int J Epidemiol       Date:  2005-10-04       Impact factor: 7.196

8.  The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits.

Authors:  Benjamin F Voight; Hyun Min Kang; Jun Ding; Cameron D Palmer; Carlo Sidore; Peter S Chines; Noël P Burtt; Christian Fuchsberger; Yanming Li; Jeanette Erdmann; Timothy M Frayling; Iris M Heid; Anne U Jackson; Toby Johnson; Tuomas O Kilpeläinen; Cecilia M Lindgren; Andrew P Morris; Inga Prokopenko; Joshua C Randall; Richa Saxena; Nicole Soranzo; Elizabeth K Speliotes; Tanya M Teslovich; Eleanor Wheeler; Jared Maguire; Melissa Parkin; Simon Potter; N William Rayner; Neil Robertson; Kathleen Stirrups; Wendy Winckler; Serena Sanna; Antonella Mulas; Ramaiah Nagaraja; Francesco Cucca; Inês Barroso; Panos Deloukas; Ruth J F Loos; Sekar Kathiresan; Patricia B Munroe; Christopher Newton-Cheh; Arne Pfeufer; Nilesh J Samani; Heribert Schunkert; Joel N Hirschhorn; David Altshuler; Mark I McCarthy; Gonçalo R Abecasis; Michael Boehnke
Journal:  PLoS Genet       Date:  2012-08-02       Impact factor: 5.917

9.  Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways.

Authors:  Robert A Scott; Vasiliki Lagou; Ryan P Welch; Eleanor Wheeler; May E Montasser; Jian'an Luan; Reedik Mägi; Rona J Strawbridge; Emil Rehnberg; Stefan Gustafsson; Stavroula Kanoni; Laura J Rasmussen-Torvik; Loïc Yengo; Cecile Lecoeur; Dmitry Shungin; Serena Sanna; Carlo Sidore; Paul C D Johnson; J Wouter Jukema; Toby Johnson; Anubha Mahajan; Niek Verweij; Gudmar Thorleifsson; Jouke-Jan Hottenga; Sonia Shah; Albert V Smith; Bengt Sennblad; Christian Gieger; Perttu Salo; Markus Perola; Nicholas J Timpson; David M Evans; Beate St Pourcain; Ying Wu; Jeanette S Andrews; Jennie Hui; Lawrence F Bielak; Wei Zhao; Momoko Horikoshi; Pau Navarro; Aaron Isaacs; Jeffrey R O'Connell; Kathleen Stirrups; Veronique Vitart; Caroline Hayward; Tõnu Esko; Evelin Mihailov; Ross M Fraser; Tove Fall; Benjamin F Voight; Soumya Raychaudhuri; Han Chen; Cecilia M Lindgren; Andrew P Morris; Nigel W Rayner; Neil Robertson; Denis Rybin; Ching-Ti Liu; Jacques S Beckmann; Sara M Willems; Peter S Chines; Anne U Jackson; Hyun Min Kang; Heather M Stringham; Kijoung Song; Toshiko Tanaka; John F Peden; Anuj Goel; Andrew A Hicks; Ping An; Martina Müller-Nurasyid; Anders Franco-Cereceda; Lasse Folkersen; Letizia Marullo; Hanneke Jansen; Albertine J Oldehinkel; Marcel Bruinenberg; James S Pankow; Kari E North; Nita G Forouhi; Ruth J F Loos; Sarah Edkins; Tibor V Varga; Göran Hallmans; Heikki Oksa; Mulas Antonella; Ramaiah Nagaraja; Stella Trompet; Ian Ford; Stephan J L Bakker; Augustine Kong; Meena Kumari; Bruna Gigante; Christian Herder; Patricia B Munroe; Mark Caulfield; Jula Antti; Massimo Mangino; Kerrin Small; Iva Miljkovic; Yongmei Liu; Mustafa Atalay; Wieland Kiess; Alan L James; Fernando Rivadeneira; Andre G Uitterlinden; Colin N A Palmer; Alex S F Doney; Gonneke Willemsen; Johannes H Smit; Susan Campbell; Ozren Polasek; Lori L Bonnycastle; Serge Hercberg; Maria Dimitriou; Jennifer L Bolton; Gerard R Fowkes; Peter Kovacs; Jaana Lindström; Tatijana Zemunik; Stefania Bandinelli; Sarah H Wild; Hanneke V Basart; Wolfgang Rathmann; Harald Grallert; Winfried Maerz; Marcus E Kleber; Bernhard O Boehm; Annette Peters; Peter P Pramstaller; Michael A Province; Ingrid B Borecki; Nicholas D Hastie; Igor Rudan; Harry Campbell; Hugh Watkins; Martin Farrall; Michael Stumvoll; Luigi Ferrucci; Dawn M Waterworth; Richard N Bergman; Francis S Collins; Jaakko Tuomilehto; Richard M Watanabe; Eco J C de Geus; Brenda W Penninx; Albert Hofman; Ben A Oostra; Bruce M Psaty; Peter Vollenweider; James F Wilson; Alan F Wright; G Kees Hovingh; Andres Metspalu; Matti Uusitupa; Patrik K E Magnusson; Kirsten O Kyvik; Jaakko Kaprio; Jackie F Price; George V Dedoussis; Panos Deloukas; Pierre Meneton; Lars Lind; Michael Boehnke; Alan R Shuldiner; Cornelia M van Duijn; Andrew D Morris; Anke Toenjes; Patricia A Peyser; John P Beilby; Antje Körner; Johanna Kuusisto; Markku Laakso; Stefan R Bornstein; Peter E H Schwarz; Timo A Lakka; Rainer Rauramaa; Linda S Adair; George Davey Smith; Tim D Spector; Thomas Illig; Ulf de Faire; Anders Hamsten; Vilmundur Gudnason; Mika Kivimaki; Aroon Hingorani; Sirkka M Keinanen-Kiukaanniemi; Timo E Saaristo; Dorret I Boomsma; Kari Stefansson; Pim van der Harst; Josée Dupuis; Nancy L Pedersen; Naveed Sattar; Tamara B Harris; Francesco Cucca; Samuli Ripatti; Veikko Salomaa; Karen L Mohlke; Beverley Balkau; Philippe Froguel; Anneli Pouta; Marjo-Riitta Jarvelin; Nicholas J Wareham; Nabila Bouatia-Naji; Mark I McCarthy; Paul W Franks; James B Meigs; Tanya M Teslovich; Jose C Florez; Claudia Langenberg; Erik Ingelsson; Inga Prokopenko; Inês Barroso
Journal:  Nat Genet       Date:  2012-08-12       Impact factor: 38.330

10.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes.

Authors:  Andrew P Morris; Benjamin F Voight; Tanya M Teslovich; Teresa Ferreira; Ayellet V Segrè; Valgerdur Steinthorsdottir; Rona J Strawbridge; Hassan Khan; Harald Grallert; Anubha Mahajan; Inga Prokopenko; Hyun Min Kang; Christian Dina; Tonu Esko; Ross M Fraser; Stavroula Kanoni; Ashish Kumar; Vasiliki Lagou; Claudia Langenberg; Jian'an Luan; Cecilia M Lindgren; Martina Müller-Nurasyid; Sonali Pechlivanis; N William Rayner; Laura J Scott; Steven Wiltshire; Loic Yengo; Leena Kinnunen; Elizabeth J Rossin; Soumya Raychaudhuri; Andrew D Johnson; Antigone S Dimas; Ruth J F Loos; Sailaja Vedantam; Han Chen; Jose C Florez; Caroline Fox; Ching-Ti Liu; Denis Rybin; David J Couper; Wen Hong L Kao; Man Li; Marilyn C Cornelis; Peter Kraft; Qi Sun; Rob M van Dam; Heather M Stringham; Peter S Chines; Krista Fischer; Pierre Fontanillas; Oddgeir L Holmen; Sarah E Hunt; Anne U Jackson; Augustine Kong; Robert Lawrence; Julia Meyer; John R B Perry; Carl G P Platou; Simon Potter; Emil Rehnberg; Neil Robertson; Suthesh Sivapalaratnam; Alena Stančáková; Kathleen Stirrups; Gudmar Thorleifsson; Emmi Tikkanen; Andrew R Wood; Peter Almgren; Mustafa Atalay; Rafn Benediktsson; Lori L Bonnycastle; Noël Burtt; Jason Carey; Guillaume Charpentier; Andrew T Crenshaw; Alex S F Doney; Mozhgan Dorkhan; Sarah Edkins; Valur Emilsson; Elodie Eury; Tom Forsen; Karl Gertow; Bruna Gigante; George B Grant; Christopher J Groves; Candace Guiducci; Christian Herder; Astradur B Hreidarsson; Jennie Hui; Alan James; Anna Jonsson; Wolfgang Rathmann; Norman Klopp; Jasmina Kravic; Kaarel Krjutškov; Cordelia Langford; Karin Leander; Eero Lindholm; Stéphane Lobbens; Satu Männistö; Ghazala Mirza; Thomas W Mühleisen; Bill Musk; Melissa Parkin; Loukianos Rallidis; Jouko Saramies; Bengt Sennblad; Sonia Shah; Gunnar Sigurðsson; Angela Silveira; Gerald Steinbach; Barbara Thorand; Joseph Trakalo; Fabrizio Veglia; Roman Wennauer; Wendy Winckler; Delilah Zabaneh; Harry Campbell; Cornelia van Duijn; Andre G Uitterlinden; Albert Hofman; Eric Sijbrands; Goncalo R Abecasis; Katharine R Owen; Eleftheria Zeggini; Mieke D Trip; Nita G Forouhi; Ann-Christine Syvänen; Johan G Eriksson; Leena Peltonen; Markus M Nöthen; Beverley Balkau; Colin N A Palmer; Valeriya Lyssenko; Tiinamaija Tuomi; Bo Isomaa; David J Hunter; Lu Qi; Alan R Shuldiner; Michael Roden; Ines Barroso; Tom Wilsgaard; John Beilby; Kees Hovingh; Jackie F Price; James F Wilson; Rainer Rauramaa; Timo A Lakka; Lars Lind; George Dedoussis; Inger Njølstad; Nancy L Pedersen; Kay-Tee Khaw; Nicholas J Wareham; Sirkka M Keinanen-Kiukaanniemi; Timo E Saaristo; Eeva Korpi-Hyövälti; Juha Saltevo; Markku Laakso; Johanna Kuusisto; Andres Metspalu; Francis S Collins; Karen L Mohlke; Richard N Bergman; Jaakko Tuomilehto; Bernhard O Boehm; Christian Gieger; Kristian Hveem; Stephane Cauchi; Philippe Froguel; Damiano Baldassarre; Elena Tremoli; Steve E Humphries; Danish Saleheen; John Danesh; Erik Ingelsson; Samuli Ripatti; Veikko Salomaa; Raimund Erbel; Karl-Heinz Jöckel; Susanne Moebus; Annette Peters; Thomas Illig; Ulf de Faire; Anders Hamsten; Andrew D Morris; Peter J Donnelly; Timothy M Frayling; Andrew T Hattersley; Eric Boerwinkle; Olle Melander; Sekar Kathiresan; Peter M Nilsson; Panos Deloukas; Unnur Thorsteinsdottir; Leif C Groop; Kari Stefansson; Frank Hu; James S Pankow; Josée Dupuis; James B Meigs; David Altshuler; Michael Boehnke; Mark I McCarthy
Journal:  Nat Genet       Date:  2012-08-12       Impact factor: 38.330

View more
  30 in total

1.  Liver Function and Risk of Type 2 Diabetes: Bidirectional Mendelian Randomization Study.

Authors:  N Maneka G De Silva; Maria Carolina Borges; Aroon D Hingorani; Jorgen Engmann; Tina Shah; Xiaoshuai Zhang; Jian'an Luan; Claudia Langenberg; Andrew Wong; Diana Kuh; John C Chambers; Weihua Zhang; Marjo-Ritta Jarvelin; Sylvain Sebert; Juha Auvinen; Tom R Gaunt; Deborah A Lawlor
Journal:  Diabetes       Date:  2019-05-14       Impact factor: 9.461

Review 2.  Axes of a revolution: challenges and promises of big data in healthcare.

Authors:  Smadar Shilo; Hagai Rossman; Eran Segal
Journal:  Nat Med       Date:  2020-01-13       Impact factor: 53.440

3.  Mendelian randomization identifies blood metabolites previously linked to midlife cognition as causal candidates in Alzheimer's disease.

Authors:  Jodie Lord; Bradley Jermy; Rebecca Green; Andrew Wong; Jin Xu; Cristina Legido-Quigley; Richard Dobson; Marcus Richards; Petroula Proitsi
Journal:  Proc Natl Acad Sci U S A       Date:  2021-04-20       Impact factor: 11.205

4.  Sixty-five common genetic variants and prediction of type 2 diabetes.

Authors:  Philippa J Talmud; Jackie A Cooper; Richard W Morris; Frank Dudbridge; Tina Shah; Jorgen Engmann; Caroline Dale; Jon White; Stela McLachlan; Delilah Zabaneh; Andrew Wong; Ken K Ong; Tom Gaunt; Michael V Holmes; Debbie A Lawlor; Marcus Richards; Rebecca Hardy; Diana Kuh; Nicholas Wareham; Claudia Langenberg; Yoav Ben-Shlomo; S Goya Wannamethee; Mark W J Strachan; Meena Kumari; John C Whittaker; Fotios Drenos; Mika Kivimaki; Aroon D Hingorani; Jacqueline F Price; Steve E Humphries
Journal:  Diabetes       Date:  2014-12-04       Impact factor: 9.461

5.  Psychometric precision in phenotype definition is a useful step in molecular genetic investigation of psychiatric disorders.

Authors:  M K Xu; D Gaysina; J H Barnett; L Scoriels; L N van de Lagemaat; A Wong; M Richards; T J Croudace; P B Jones
Journal:  Transl Psychiatry       Date:  2015-06-30       Impact factor: 6.222

6.  Extension of GWAS results for lipid-related phenotypes to extreme obesity using electronic health record (EHR) data and the Metabochip.

Authors:  Ankita Parihar; G Craig Wood; Xin Chu; Qunjan Jin; George Argyropoulos; Christopher D Still; Alan R Shuldiner; Braxton D Mitchell; Glenn S Gerhard
Journal:  Front Genet       Date:  2014-08-05       Impact factor: 4.599

7.  Metabolic profiles of socio-economic position: a multi-cohort analysis.

Authors:  Oliver Robinson; Alice R Carter; Mika Ala-Korpela; Juan P Casas; Nishi Chaturvedi; Jorgen Engmann; Laura D Howe; Alun D Hughes; Marjo-Riitta Järvelin; Mika Kähönen; Ville Karhunen; Diana Kuh; Tina Shah; Yoav Ben-Shlomo; Reecha Sofat; Chung-Ho E Lau; Terho Lehtimäki; Usha Menon; Olli Raitakari; Andy Ryan; Rui Providencia; Stephanie Smith; Julie Taylor; Therese Tillin; Jorma Viikari; Andrew Wong; Aroon D Hingorani; Mika Kivimäki; Paolo Vineis
Journal:  Int J Epidemiol       Date:  2021-07-09       Impact factor: 7.196

8.  Plasma urate concentration and risk of coronary heart disease: a Mendelian randomisation analysis.

Authors:  Jon White; Reecha Sofat; Gibran Hemani; Tina Shah; Jorgen Engmann; Caroline Dale; Sonia Shah; Felix A Kruger; Claudia Giambartolomei; Daniel I Swerdlow; Tom Palmer; Stela McLachlan; Claudia Langenberg; Delilah Zabaneh; Ruth Lovering; Alana Cavadino; Barbara Jefferis; Chris Finan; Andrew Wong; Antoinette Amuzu; Ken Ong; Tom R Gaunt; Helen Warren; Teri-Louise Davies; Fotios Drenos; Jackie Cooper; Shah Ebrahim; Debbie A Lawlor; Philippa J Talmud; Steve E Humphries; Christine Power; Elina Hypponen; Marcus Richards; Rebecca Hardy; Diana Kuh; Nicholas Wareham; Yoav Ben-Shlomo; Ian N Day; Peter Whincup; Richard Morris; Mark W J Strachan; Jacqueline Price; Meena Kumari; Mika Kivimaki; Vincent Plagnol; John C Whittaker; George Davey Smith; Frank Dudbridge; Juan P Casas; Michael V Holmes; Aroon D Hingorani
Journal:  Lancet Diabetes Endocrinol       Date:  2016-01-16       Impact factor: 44.867

9.  Mendelian Randomisation study of the influence of eGFR on coronary heart disease.

Authors:  Pimphen Charoen; Dorothea Nitsch; Jorgen Engmann; Tina Shah; Jonathan White; Delilah Zabaneh; Barbara Jefferis; Goya Wannamethee; Peter Whincup; Amy Mulick Cassidy; Tom Gaunt; Ian Day; Stela McLachlan; Jacqueline Price; Meena Kumari; Mika Kivimaki; Eric Brunner; Claudia Langenberg; Yoav Ben-Shlomo; Aroon Hingorani; John Whittaker; Juan Pablo Casas; Frank Dudbridge
Journal:  Sci Rep       Date:  2016-06-24       Impact factor: 4.379

10.  Variant rs10911021 that associates with coronary heart disease in type 2 diabetes, is associated with lower concentrations of circulating HDL cholesterol and large HDL particles but not with amino acids.

Authors:  Katherine E Beaney; Jackie A Cooper; Stela McLachlan; S Goya Wannamethee; Barbara J Jefferis; Peter Whincup; Yoav Ben-Shlomo; Jacqueline F Price; Meena Kumari; Andrew Wong; Ken Ong; Rebecca Hardy; Diana Kuh; Mika Kivimaki; Antti J Kangas; Pasi Soininen; Mika Ala-Korpela; Fotios Drenos; Steve E Humphries
Journal:  Cardiovasc Diabetol       Date:  2016-08-22       Impact factor: 9.951

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.