| Literature DB >> 20360734 |
Nick Craddock, Matthew E Hurles, Niall Cardin, Richard D Pearson, Vincent Plagnol, Samuel Robson, Damjan Vukcevic, Chris Barnes, Donald F Conrad, Eleni Giannoulatou, Chris Holmes, Jonathan L Marchini, Kathy Stirrups, Martin D Tobin, Louise V Wain, Chris Yau, Jan Aerts, Tariq Ahmad, T Daniel Andrews, Hazel Arbury, Anthony Attwood, Adam Auton, Stephen G Ball, Anthony J Balmforth, Jeffrey C Barrett, Inês Barroso, Anne Barton, Amanda J Bennett, Sanjeev Bhaskar, Katarzyna Blaszczyk, John Bowes, Oliver J Brand, Peter S Braund, Francesca Bredin, Gerome Breen, Morris J Brown, Ian N Bruce, Jaswinder Bull, Oliver S Burren, John Burton, Jake Byrnes, Sian Caesar, Chris M Clee, Alison J Coffey, John M C Connell, Jason D Cooper, Anna F Dominiczak, Kate Downes, Hazel E Drummond, Darshna Dudakia, Andrew Dunham, Bernadette Ebbs, Diana Eccles, Sarah Edkins, Cathryn Edwards, Anna Elliot, Paul Emery, David M Evans, Gareth Evans, Steve Eyre, Anne Farmer, I Nicol Ferrier, Lars Feuk, Tomas Fitzgerald, Edward Flynn, Alistair Forbes, Liz Forty, Jayne A Franklyn, Rachel M Freathy, Polly Gibbs, Paul Gilbert, Omer Gokumen, Katherine Gordon-Smith, Emma Gray, Elaine Green, Chris J Groves, Detelina Grozeva, Rhian Gwilliam, Anita Hall, Naomi Hammond, Matt Hardy, Pile Harrison, Neelam Hassanali, Husam Hebaishi, Sarah Hines, Anne Hinks, Graham A Hitman, Lynne Hocking, Eleanor Howard, Philip Howard, Joanna M M Howson, Debbie Hughes, Sarah Hunt, John D Isaacs, Mahim Jain, Derek P Jewell, Toby Johnson, Jennifer D Jolley, Ian R Jones, Lisa A Jones, George Kirov, Cordelia F Langford, Hana Lango-Allen, G Mark Lathrop, James Lee, Kate L Lee, Charlie Lees, Kevin Lewis, Cecilia M Lindgren, Meeta Maisuria-Armer, Julian Maller, John Mansfield, Paul Martin, Dunecan C O Massey, Wendy L McArdle, Peter McGuffin, Kirsten E McLay, Alex Mentzer, Michael L Mimmack, Ann E Morgan, Andrew P Morris, Craig Mowat, Simon Myers, William Newman, Elaine R Nimmo, Michael C O'Donovan, Abiodun Onipinla, Ifejinelo Onyiah, Nigel R Ovington, Michael J Owen, Kimmo Palin, Kirstie Parnell, David Pernet, John R B Perry, Anne Phillips, Dalila Pinto, Natalie J Prescott, Inga Prokopenko, Michael A Quail, Suzanne Rafelt, Nigel W Rayner, Richard Redon, David M Reid, Susan M Ring, Neil Robertson, Ellie Russell, David St Clair, Jennifer G Sambrook, Jeremy D Sanderson, Helen Schuilenburg, Carol E Scott, Richard Scott, Sheila Seal, Sue Shaw-Hawkins, Beverley M Shields, Matthew J Simmonds, Debbie J Smyth, Elilan Somaskantharajah, Katarina Spanova, Sophia Steer, Jonathan Stephens, Helen E Stevens, Millicent A Stone, Zhan Su, Deborah P M Symmons, John R Thompson, Wendy Thomson, Mary E Travers, Clare Turnbull, Armand Valsesia, Mark Walker, Neil M Walker, Chris Wallace, Margaret Warren-Perry, Nicholas A Watkins, John Webster, Michael N Weedon, Anthony G Wilson, Matthew Woodburn, B Paul Wordsworth, Allan H Young, Eleftheria Zeggini, Nigel P Carter, Timothy M Frayling, Charles Lee, Gil McVean, Patricia B Munroe, Aarno Palotie, Stephen J Sawcer, Stephen W Scherer, David P Strachan, Chris Tyler-Smith, Matthew A Brown, Paul R Burton, Mark J Caulfield, Alastair Compston, Martin Farrall, Stephen C L Gough, Alistair S Hall, Andrew T Hattersley, Adrian V S Hill, Christopher G Mathew, Marcus Pembrey, Jack Satsangi, Michael R Stratton, Jane Worthington, Panos Deloukas, Audrey Duncanson, Dominic P Kwiatkowski, Mark I McCarthy, Willem Ouwehand, Miles Parkes, Nazneen Rahman, John A Todd, Nilesh J Samani, Peter Donnelly.
Abstract
Copy number variants (CNVs) account for a major proportion of human genetic polymorphism and have been predicted to have an important role in genetic susceptibility to common disease. To address this we undertook a large, direct genome-wide study of association between CNVs and eight common human diseases. Using a purpose-designed array we typed approximately 19,000 individuals into distinct copy-number classes at 3,432 polymorphic CNVs, including an estimated approximately 50% of all common CNVs larger than 500 base pairs. We identified several biological artefacts that lead to false-positive associations, including systematic CNV differences between DNAs derived from blood and cell lines. Association testing and follow-up replication analyses confirmed three loci where CNVs were associated with disease-IRGM for Crohn's disease, HLA for Crohn's disease, rheumatoid arthritis and type 1 diabetes, and TSPAN8 for type 2 diabetes-although in each case the locus had previously been identified in single nucleotide polymorphism (SNP)-based studies, reflecting our observation that most common CNVs that are well-typed on our array are well tagged by SNPs and so have been indirectly explored through SNP studies. We conclude that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis of common human diseases.Entities:
Mesh:
Year: 2010 PMID: 20360734 PMCID: PMC2892339 DOI: 10.1038/nature08979
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
Summary of the discovery source for genomic regions targeted on the WTCCC CNV genotyping array
GSV CNVs were prioritised according to extent of polymorphism in European discovery samples. See Online Methods for full details of other sources.
| Source of Loci | Number of | Number of | Number of | |
|---|---|---|---|---|
|
| ||||
| CNVs | GSV Discovery Project | 10,835 | 10,217 | 3,096 |
| Affymetrix 500k | 18 | 14 | 12 | |
| Affymetrix 6.0 | 83 | 81 | 47 | |
| Illumina 1M | 82 | 81 | 18 | |
| WTCCC CNV Loci | 231 | 209 | 108 | |
| Novel Sequence | Novel Insert Regions | 292 | 292 | 151 |
|
| ||||
| Total | 11,541 | 10,894 | 3,432 | |
Figure 1Flow-chart showing which CNVs are included on the array
The chart shows the reasons for CNVs being removed from consideration (the column of arrows and text to the right of the figure) from those originally targeted on the array and the number of CNVs remaining at each stage of filtering.
Figure 2Illustrative CNVs
Histograms of three multiallelic CNVs (one per row) previously reported to be associated with autoimmune diseases: Beta-Defensin (CNVR3771.10), CCL3L1 (CNVR7077.12) and FCGR3B (CNVR383.1), showing 6, 5, and 4 fitted copy number classes respectively. The histogram of normalised intensity ratios is shown for one control and the three autoimmune collections. Histograms are overlaid by the fitted distribution used to model each class (variously the red, blue, light green, cyan, magenta and dark green curves). In all such figures, the area under the fitted curve of a particular colour is the same for all collections at the same CNV.
Figure 3Genome-wide association results
Distribution of −log10(p) along the 23 chromosomes where p is the p-value for the one degree-of-freedom test of association for each disease. The x-axis shows the chromosomes numbered from 1 (on the left) to X (on the right). CNVs included in these plots were filtered on the basis of a clustering quality score (see SoM for details) and manual inspection of the most significant associations. The two apparent associations on chromosome 2 for rheumatoid arthritis and type 1 diabetes result from a dispersed duplication in which the variation is actually located within the HLA locus (see Box).
Replicated CNV associations and those at replicated loci
Only one of the several associated CNVs mapping to the HLA in the reference sequence is shown for each of RA, T1D and CD. Further details of replication assays and methods are given in the supplementary material. AC_000138.1_44 is a novel sequence insertion present in the Venter genome sequence but not in the reference sequence and hence no chromosomal location is presented. Fitted number of classes – the number of diploid copy-number classes. P-value - Combined Controls – the p value from the frequentist association test combining UKBS and 58C as controls. log – the log10 of the Bayes Factor from the Bayesian association analysis combining UKBS and 58C as controls. OR - Combined Controls – The odds ratio estimated for each additional copy of the CNV based on both UKBS and 58C as controls. Extended Reference refers to the analogous quantities calculated in comparing cases of the disease in question with UKBS, 58C, and aetiologically-unrelated cases. Control MAF – The minor allele frequency in controls (UKBS +58C). Case MAF – The minor allele frequency in cases. Minor allele frequency is only estimated for CNVs with 3 or fewer copy number classes.
| Disease | CNV | Chromosome | Start | Length | Locus | Fitted | P-value | P-value | log10(BF) | log10(BF) | OR | OR | Control | Case | Replication: | Replication |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| T2D | CNVR5583.1 | 12 | 69,818,942 | 1.0 |
| 3 | 3.9E-05 | 2.5E-06 | 2.8 | 4.3 | 0.85 | 0.85 | 0.40 | 0.36 | 4549 / 5579 | 3.9E-05 |
| CD | CNVR2646.1 | 5 | 150,157,836 | 3.9 |
| 3 | 1.1E-07 | 5.5E-05 | 5.8 | 4.1 | 0.68 | 0.75 | 0.07 | 0.10 | 6894 / 7977 | 7.5E-11 |
| CD | CNVR2647.1 | 5 | 150,183,562 | 20.1 |
| 3 | 1.0E-07 | 4.3E-05 | 6.1 | 3.8 | 0.68 | 0.76 | 0.07 | 0.10 | 6894 / 7977 | 3.9E-10 |
| CD | CNVR2841.20 | 6 | 31,416,574 | 5.1 | HLA | 3 | 1.7E-05 | 1.1E-05 | 3.6 | 3.9 | 0.80 | 0.82 | 0.19 | 0.23 | NA | NA |
| T1D | CNVR2845.46 | 6 | 32,582,950 | 6.7 | HLA | 2 | 8.0E-153 | 2.1E-196 | 125.5 | 154.4 | 0.20 | 0.26 | 0.14 | 0.01 | NA | NA |
| RA | CNVR2845.14 | 6 | 32,609,209 | 4.0 | HLA | 4 | 1.4E-39 | 8.1E-60 | 51.5 | 73.5 | 1.77 | 1.83 | NA | NA | NA | NA |
| RA | CNVR1065.1 | 2⇒6 | 179,004,449 | 0.8 | HLA | 3 | 6.8E-49 | 1.6E-69 | 51.0 | 73.7 | 1.85 | 1.94 | 0.36 | 0.49 | NA | NA |
| T1D | CNVR1065.1 | 2⇒6 | 179,004,449 | 0.8 | HLA | 3 | 1.3E-29 | 1.1E-39 | 28.0 | 38.4 | 1.62 | 1.61 | 0.36 | 0.47 | NA | NA |
| RA | AC_000138.1 | NA | NA | 5.6 | HLA | 3 | 8.3E-04 | 1.1E-05 | 1.3 | 2.7 | 0.87 | 0.86 | 0.25 | 0.28 | 3398 / 2743 | 1.1E-03 |
| T1D | AC_000138.1 | NA | NA | 5.6 | HLA | 3 | 2.0E-31 | 2.7E-45 | 31.0 | 45.1 | 0.59 | 0.57 | 0.25 | 0.36 | 3883 / 2649 | 7.3E-50 |
| CD | CNVR7113.6 | 17 | 40,930,407 | 33.9 | Chr17inv | 3 | 1.2E-03 | 5.8E-04 | 1.4 | 1.6 | 1.15 | 1.14 | 0.24 | 0.21 | 4978 / 6069 | 8.6E-05 |
| T1D | CNVR7113.6 | 17 | 40,930,407 | 33.9 | Chr17inv | 3 | 1.6E-03 | 7.5E-04 | 1.0 | 1.2 | 1.13 | 1.12 | 0.24 | 0.21 | 7911 / 9395 | 4.6E-06 |
Replication sample includes WTCCC samples
Box Figure 1
Box Figure 2
Box Figure 3