| Literature DB >> 31020325 |
Santiago Alvarez Prado1, Isabelle Sanchez2, Llorenç Cabrera-Bosquet1, Antonin Grau1, Claude Welcker1, François Tardieu1, Nadine Hilgert2.
Abstract
Based on case studies, we discuss the extent to which genome-wide association studies (GWAS) are affected by outlier plants, i.e. those deviating from the expected distribution on a multi-criteria basis. Using a raw dataset consisting of daily measurements of leaf area, biomass, and plant height for thousands of plants, we tested three different cleaning methods for their effects on genetic analyses. No-cleaning resulted in the highest number of dubious quantitative trait loci, especially at loci with highly unbalanced allelic frequencies. A trade-off was identified between the risk of false-positives (with no-cleaning and/or a low threshold for minor allele frequency) and the risk of missing interesting rare alleles. Cleaning can lower the risk of the latter by making it possible to choose a higher threshold in GWAS.Entities:
Keywords: Allele frequency; genetic analysis; outliers; phenomics; quantitative trait loci; statistical analysis
Year: 2019 PMID: 31020325 PMCID: PMC6685653 DOI: 10.1093/jxb/erz191
Source DB: PubMed Journal: J Exp Bot ISSN: 0022-0957 Impact factor: 6.992