Literature DB >> 26929272

High rates of phasing errors in highly polymorphic species with low levels of linkage disequilibrium.

Marek Bukowicki1, Susanne U Franssen1, Christian Schlötterer1.   

Abstract

Short read sequencing of diploid individuals does not permit the direct inference of the sequence on each of the two homologous chromosomes. Although various phasing software packages exist, they were primarily tailored for and tested on human data, which differ from other species in factors that influence phasing, such as SNP density, amounts of linkage disequilibrium (LD) and sample sizes. Despite becoming increasingly popular for other species, the reliability of phasing in non-human data has not been evaluated to a sufficient extent. We scrutinized the phasing accuracy for Drosophila melanogaster, a species with high polymorphism levels and reduced LD relative to humans. We phased two D. melanogaster populations and compared the results to the known haplotypes. The performance increased with size of the reference panel and was highest when the reference panel and phased individuals were from the same population. Full genomic SNP data and inclusion of sequence read information also improved phasing. Despite humans and Drosophila having similar switch error rates between polymorphic sites, the distances between switch errors were much shorter in Drosophila with only fragments <300-1500 bp being correctly phased with ≥95% confidence. This suggests that the higher SNP density cannot compensate for the higher recombination rate in D. melanogaster. Furthermore, we show that populations that have gone through demographic events such as bottlenecks can be phased with higher accuracy. Our results highlight that statistically phased data are particularly error prone in species with large population sizes or populations lacking suitable reference panels.
© 2016 John Wiley & Sons Ltd.

Entities:  

Keywords:  Drosophila melanogaster; computational phasing; haplotype analysis; single nucleotide polymorphism; switch errors

Mesh:

Year:  2016        PMID: 26929272      PMCID: PMC4916997          DOI: 10.1111/1755-0998.12516

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  29 in total

1.  Detecting identity by descent and estimating genotype error rates in sequence data.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2013-10-24       Impact factor: 11.025

2.  Phasing of many thousands of genotyped samples.

Authors:  Amy L Williams; Nick Patterson; Joseph Glessner; Hakon Hakonarson; David Reich
Journal:  Am J Hum Genet       Date:  2012-08-10       Impact factor: 11.025

3.  The Drosophila melanogaster Genetic Reference Panel.

Authors:  Trudy F C Mackay; Stephen Richards; Eric A Stone; Antonio Barbadilla; Julien F Ayroles; Dianhui Zhu; Sònia Casillas; Yi Han; Michael M Magwire; Julie M Cridland; Mark F Richardson; Robert R H Anholt; Maite Barrón; Crystal Bess; Kerstin Petra Blankenburg; Mary Anna Carbone; David Castellano; Lesley Chaboub; Laura Duncan; Zeke Harris; Mehwish Javaid; Joy Christina Jayaseelan; Shalini N Jhangiani; Katherine W Jordan; Fremiet Lara; Faye Lawrence; Sandra L Lee; Pablo Librado; Raquel S Linheiro; Richard F Lyman; Aaron J Mackey; Mala Munidasa; Donna Marie Muzny; Lynne Nazareth; Irene Newsham; Lora Perales; Ling-Ling Pu; Carson Qu; Miquel Ràmia; Jeffrey G Reid; Stephanie M Rollmann; Julio Rozas; Nehad Saada; Lavanya Turlapati; Kim C Worley; Yuan-Qing Wu; Akihiko Yamamoto; Yiming Zhu; Casey M Bergman; Kevin R Thornton; David Mittelman; Richard A Gibbs
Journal:  Nature       Date:  2012-02-08       Impact factor: 49.962

4.  Circumventing heterozygosity: sequencing the amplified genome of a single haploid Drosophila melanogaster embryo.

Authors:  Charles H Langley; Marc Crepeau; Charis Cardeno; Russell Corbett-Detig; Kristian Stevens
Journal:  Genetics       Date:  2011-03-24       Impact factor: 4.562

5.  PoPoolation2: identifying differentiation between populations using sequencing of pooled DNA samples (Pool-Seq).

Authors:  Robert Kofler; Ram Vinay Pandey; Christian Schlötterer
Journal:  Bioinformatics       Date:  2011-10-23       Impact factor: 6.937

6.  The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population.

Authors:  Justin B Lack; Charis M Cardeno; Marc W Crepeau; William Taylor; Russell B Corbett-Detig; Kristian A Stevens; Charles H Langley; John E Pool
Journal:  Genetics       Date:  2015-01-27       Impact factor: 4.562

Review 7.  Genomic data as the "hitchhiker's guide" to cattle adaptation: tracking the milestones of past selection in the bovine genome.

Authors:  Yuri T Utsunomiya; Ana M Pérez O'Brien; Tad S Sonstegard; Johann Sölkner; José F Garcia
Journal:  Front Genet       Date:  2015-02-10       Impact factor: 4.599

8.  Patterns of linkage disequilibrium and long range hitchhiking in evolving experimental Drosophila melanogaster populations.

Authors:  Susanne U Franssen; Viola Nolte; Ray Tobler; Christian Schlötterer
Journal:  Mol Biol Evol       Date:  2014-11-17       Impact factor: 16.240

9.  Population Genomics of sub-saharan Drosophila melanogaster: African diversity and non-African admixture.

Authors:  John E Pool; Russell B Corbett-Detig; Ryuichi P Sugino; Kristian A Stevens; Charis M Cardeno; Marc W Crepeau; Pablo Duchen; J J Emerson; Perot Saelao; David J Begun; Charles H Langley
Journal:  PLoS Genet       Date:  2012-12-20       Impact factor: 5.917

10.  A bioinformatics workflow for detecting signatures of selection in genomic data.

Authors:  Murray Cadzow; James Boocock; Hoang T Nguyen; Phillip Wilcox; Tony R Merriman; Michael A Black
Journal:  Front Genet       Date:  2014-08-26       Impact factor: 4.599

View more
  8 in total

1.  Estimating the Timing of Multiple Admixture Pulses During Local Ancestry Inference.

Authors:  Paloma Medina; Bryan Thornlow; Rasmus Nielsen; Russell Corbett-Detig
Journal:  Genetics       Date:  2018-09-11       Impact factor: 4.562

2.  Breeding Top Genotypes and Accelerating Response to Recurrent Selection by Selecting Parents with Greater Gametic Variance.

Authors:  Piter Bijma; Yvonne C J Wientjes; Mario P L Calus
Journal:  Genetics       Date:  2019-11-26       Impact factor: 4.562

3.  Genomic variation from an extinct species is retained in the extant radiation following speciation reversal.

Authors:  David Frei; Rishi De-Kayne; Oliver M Selz; Ole Seehausen; Philine G D Feulner
Journal:  Nat Ecol Evol       Date:  2022-02-24       Impact factor: 19.100

4.  Exploring Evolutionary Relationships Across the Genome Using Topology Weighting.

Authors:  Simon H Martin; Steven M Van Belleghem
Journal:  Genetics       Date:  2017-03-24       Impact factor: 4.562

5.  Shifting the paradigm in Evolve and Resequence studies: From analysis of single nucleotide polymorphisms to selected haplotype blocks.

Authors:  Neda Barghi; Christian Schlötterer
Journal:  Mol Ecol       Date:  2019-02       Impact factor: 6.185

6.  Linkage Disequilibrium Estimation in Low Coverage High-Throughput Sequencing Data.

Authors:  Timothy P Bilton; John C McEwan; Shannon M Clarke; Rudiger Brauning; Tracey C van Stijn; Suzanne J Rowe; Ken G Dodds
Journal:  Genetics       Date:  2018-03-27       Impact factor: 4.562

7.  Detecting selected haplotype blocks in evolve and resequence experiments.

Authors:  Kathrin A Otte; Christian Schlötterer
Journal:  Mol Ecol Resour       Date:  2020-09-06       Impact factor: 7.090

8.  Digging into the Genomic Past of Swiss Honey Bees by Whole-Genome Sequencing Museum Specimens.

Authors:  Melanie Parejo; David Wragg; Dora Henriques; Jean-Daniel Charrière; Andone Estonba
Journal:  Genome Biol Evol       Date:  2020-12-06       Impact factor: 3.416

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.