MOTIVATION: Genotype imputation has become an indispensable step in genome-wide association studies (GWAS). Imputation accuracy, which directly influences downstream analysis, has been shown to improve with re-sequencing-based reference panels; however, this comes at the cost of a high computational burden, because sequencing a large number of individuals yields a huge number of potentially imputable markers (tens of millions). There is therefore an increasing need for access to imputation quality information without actually conducting imputation. To meet this need, we have established a publicly available SNP and indel imputability database that provides direct access to imputation accuracy information for markers identified by the 1000 Genomes Project, across four major populations and multiple GWAS genotyping platforms.
RESULTS: SNP and indel imputability information can be retrieved through a user-friendly interface by providing the ID(s) of the desired variant(s) or by specifying the desired genomic region. The query results can be refined by selecting the relevant GWAS genotyping platform(s). This is the first database to provide variant imputability information specific to each continental group and to each genotyping platform. In Filipino individuals from the Cebu Longitudinal Health and Nutrition Survey, our database achieves an area under the receiver operating characteristic curve of 0.97, 0.91, 0.88 and 0.79 for markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. Specifically, by filtering out 48.6% of markers based on the imputability information in our database (corresponding to a reduction of up to 48.6% in the computational cost of actual imputation), we can remove 77%, 58%, 51% and 42% of the poorly imputed markers at the cost of only 0.3%, 0.8%, 1.5% and 4.6% of the well-imputed markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively.
AVAILABILITY: http://www.unc.edu/~yunmli/imputability.html
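The trade-off reported in RESULTS (how many poorly imputed markers a pre-imputation filter removes, how many well-imputed markers it costs, and how well the scores separate the two groups) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Marker` fields, the 0.5 cutoffs, the definition of "poorly imputed" as an observed quality below the cutoff, and the toy values. It does not reproduce the database's actual schema, thresholds or data.

```python
from dataclasses import dataclass


@dataclass
class Marker:
    marker_id: str
    predicted_quality: float  # hypothetical imputability score looked up beforehand
    observed_quality: float   # hypothetical quality measured after real imputation


def filter_tradeoff(markers, predicted_cutoff=0.5, observed_cutoff=0.5):
    """Summarize what a pre-imputation filter removes and what it costs."""
    if not markers:
        raise ValueError("need at least one marker")
    # Markers that would be skipped because their predicted quality is low.
    filtered = [m for m in markers if m.predicted_quality < predicted_cutoff]
    # Ground truth from actual imputation (assumed definition of poor/well).
    poor = [m for m in markers if m.observed_quality < observed_cutoff]
    well = [m for m in markers if m.observed_quality >= observed_cutoff]
    poor_removed = [m for m in poor if m.predicted_quality < predicted_cutoff]
    well_lost = [m for m in well if m.predicted_quality < predicted_cutoff]
    return {
        "fraction_filtered": len(filtered) / len(markers),
        "poor_removed": len(poor_removed) / len(poor) if poor else 0.0,
        "well_lost": len(well_lost) / len(well) if well else 0.0,
    }


def auc(markers, observed_cutoff=0.5):
    """Chance that a well-imputed marker outscores a poorly imputed one (ties count half)."""
    well = [m.predicted_quality for m in markers if m.observed_quality >= observed_cutoff]
    poor = [m.predicted_quality for m in markers if m.observed_quality < observed_cutoff]
    if not well or not poor:
        raise ValueError("need at least one well-imputed and one poorly imputed marker")
    wins = sum(1.0 if w > p else 0.5 if w == p else 0.0 for w in well for p in poor)
    return wins / (len(well) * len(poor))


# Toy data, invented for this example only.
toy_markers = [
    Marker("m1", 0.9, 0.95),
    Marker("m2", 0.8, 0.85),
    Marker("m3", 0.4, 0.70),  # well imputed, but the filter would drop it
    Marker("m4", 0.3, 0.20),
    Marker("m5", 0.2, 0.10),
    Marker("m6", 0.6, 0.30),  # poorly imputed, but the filter would keep it
]

summary = filter_tradeoff(toy_markers)
print(summary)
print(auc(toy_markers))
```

On this toy set the filter drops half of the markers, removes two of the three poorly imputed markers, and loses one of the three well-imputed markers; the pairwise AUC is 8/9. In practice the same summary would be computed separately within each minor allele frequency bin, as the abstract reports its figures per bin.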