Literature DB >> 31504792

Legacy Data Confound Genomics Studies.

Luke Anderson-Trocmé1,2, Rick Farouni1,2, Mathieu Bourgey1,2, Yoichiro Kamatani3, Koichiro Higasa3, Jeong-Sun Seo4,5, Changhoon Kim4, Fumihiko Matsuda3, Simon Gravel1,2.   

Abstract

Recent reports have identified differences in the mutational spectra across human populations. Although some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data are used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower quality data from the early phases of the 1kGP thus continue to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  batch effect; imputation; mutational signature; population genetics; reference cohorts; statistical genetics

Mesh:

Year:  2020        PMID: 31504792     DOI: 10.1093/molbev/msz201

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  6 in total

Review 1.  Inferring evolutionary dynamics of mutation rates through the lens of mutation spectrum variation.

Authors:  Jedidiah Carlson; William S DeWitt; Kelley Harris
Journal:  Curr Opin Genet Dev       Date:  2020-06-30       Impact factor: 5.578

2.  Nonparametric coalescent inference of mutation spectrum history and demography.

Authors:  William S DeWitt; Kameron Decker Harris; Aaron P Ragsdale; Kelley Harris
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-25       Impact factor: 11.205

3.  Evolutionary Genetic Signatures of Selection on Bone-Related Variation within Human and Chimpanzee Populations.

Authors:  Daryn A Stover; Genevieve Housman; Anne C Stone; Michael S Rosenberg; Brian C Verrelli
Journal:  Genes (Basel)       Date:  2022-01-21       Impact factor: 4.141

4.  Similarity-Based Analysis of Allele Frequency Distribution among Multiple Populations Identifies Adaptive Genomic Structural Variants.

Authors:  Marie Saitou; Naoki Masuda; Omer Gokcumen
Journal:  Mol Biol Evol       Date:  2022-03-02       Impact factor: 8.800

5.  Assessing the role of rare genetic variants in drug-resistant, non-lesional focal epilepsy.

Authors:  Stefan Wolking; Claudia Moreau; Mark McCormack; Roland Krause; Martin Krenn; Samuel Berkovic; Gianpiero L Cavalleri; Norman Delanty; Chantal Depondt; Michael R Johnson; Bobby P C Koeleman; Wolfram S Kunz; Holger Lerche; Anthony G Marson; Terence J O'Brien; Slave Petrovski; Josemir W Sander; Graeme J Sills; Pasquale Striano; Federico Zara; Fritz Zimprich; Sanjay M Sisodiya; Simon L Girard; Patrick Cossette
Journal:  Ann Clin Transl Neurol       Date:  2021-05-21       Impact factor: 4.511

6.  Analysis of the Batch Effect Due to Sequencing Center in Population Statistics Quantifying Rare Events in the 1000 Genomes Project.

Authors:  Iago Maceda; Oscar Lao
Journal:  Genes (Basel)       Date:  2021-12-24       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.