A novel approach to estimating heterozygosity from low-coverage genome sequence.

Katarzyna Bryc, Nick Patterson, David Reich.

Abstract

High-throughput shotgun sequence data make it possible in principle to accurately estimate population genetic parameters without confounding by SNP ascertainment bias. One such statistic of interest is the proportion of heterozygous sites within an individual's genome, which is informative about inbreeding and effective population size. However, in many cases, the available sequence data of an individual are limited to low coverage, preventing the confident calling of genotypes necessary to directly count the proportion of heterozygous sites. Here, we present a method for estimating an individual's genome-wide rate of heterozygosity from low-coverage sequence data, without an intermediate step that calls genotypes. Our method jointly learns the shared allele distribution between the individual and a panel of other individuals, together with the sequencing error distributions and the reference bias. We show our method works well, first, by its performance on simulated sequence data and, second, on real sequence data where we obtain estimates using low-coverage data consistent with those from higher coverage. We apply our method to obtain estimates of the rate of heterozygosity for 11 humans from diverse worldwide populations and through this analysis reveal the complex dependency of local sequencing coverage on the true underlying heterozygosity, which complicates the estimation of heterozygosity from sequence data. We show how we can use filters to correct for the confounding arising from sequencing depth. We find in practice that ratios of heterozygosity are more interpretable than absolute estimates and show that we obtain excellent conformity of ratios of heterozygosity with previous estimates from higher-coverage data.
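The core idea of estimating heterozygosity without calling genotypes can be illustrated with a much simpler model than the one described in the abstract. The sketch below is not the authors' method: it uses no reference panel and does not learn error distributions or reference bias. It assumes that every site is either homozygous for the reference allele or heterozygous, that the per-read error rate `err` is known, and that reads at a heterozygous site show the alternate allele with probability 0.5. Under those assumptions, the genome-wide heterozygosity `h` is the mixing weight of a two-component binomial mixture over per-site read counts, fitted here by expectation-maximization. All function and parameter names are hypothetical.

```python
import math
import random
from collections import Counter


def binom_pmf(k, n, p):
    """Probability of k alternate-allele reads out of n reads."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def estimate_heterozygosity(sites, err=0.01, iters=200):
    """Estimate the fraction of heterozygous sites by EM.

    sites: iterable of (n, k) pairs, where n is the read depth at a site
           and k is the number of reads carrying the alternate allele.
    err:   assumed known per-read error rate (an assumption of this sketch).
    """
    if not 0.0 < err < 0.5:
        raise ValueError("err must lie strictly between 0 and 0.5")
    counts = Counter(sites)  # group identical (n, k) pairs for speed
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no sites supplied")
    h = 0.1  # arbitrary starting value for the mixing weight
    for _ in range(iters):
        expected_het = 0.0
        for (n, k), c in counts.items():
            hom = (1 - h) * binom_pmf(k, n, err)  # homozygous component
            het = h * binom_pmf(k, n, 0.5)        # heterozygous component
            expected_het += c * het / (hom + het)  # E-step: posterior weight
        h = expected_het / total                   # M-step: update weight
    return h


def simulate_sites(num_sites, h, err, max_depth, rng):
    """Simulate low-coverage read counts under the same toy model."""
    sites = []
    for _ in range(num_sites):
        is_het = rng.random() < h
        n = rng.randint(1, max_depth)  # low, variable depth per site
        p = 0.5 if is_het else err
        k = sum(rng.random() < p for _ in range(n))
        sites.append((n, k))
    return sites


if __name__ == "__main__":
    rng = random.Random(0)
    data = simulate_sites(20000, h=0.05, err=0.01, max_depth=6, rng=rng)
    print(estimate_heterozygosity(data, err=0.01))
```

Because each site contributes a posterior probability of being heterozygous rather than a hard genotype call, sites with only one or two reads still carry information. The simulated heterozygosity of 0.05 is far higher than human values and is chosen only so that a small simulation gives a stable estimate.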

Keywords:  heterozygosity; low-coverage sequence data

Year:  2013        PMID: 23934885      PMCID: PMC3781980          DOI: 10.1534/genetics.113.154500

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562

