Motivation: Knowledge of haplotypes, i.e. phased and ordered marker alleles on a chromosome, is essential to answer many questions in genetics and genomics. By generating short pieces of DNA sequence, high-throughput modern sequencing technologies make estimation of haplotypes possible for single individuals. In polyploids, however, haplotype estimation methods usually require deep coverage to achieve sufficient accuracy. This often renders sequencing-based approaches too costly to be applied to large populations needed in studies of Quantitative Trait Loci. Results: We propose a novel haplotype estimation method for polyploids, TriPoly, that combines sequencing data with Mendelian inheritance rules to infer haplotypes in parent-offspring trios. Using realistic simulations of both short and long-read sequencing data for banana (Musa acuminata) and potato (Solanum tuberosum) trios, we show that TriPoly yields more accurate progeny haplotypes at low coverages compared to existing methods that work on single individuals. We also apply TriPoly to phase Single Nucleotide Polymorphisms on chromosome 5 for a family of tetraploid potato with 2 parents and 37 offspring sequenced with an RNA capture approach. We show that TriPoly haplotype estimates differ from those of the other methods mainly in regions with imperfect sequencing or mapping difficulties, as it does not rely solely on sequence reads and aims to avoid phasings that are not likely to have been passed from the parents to the offspring. Availability and implementation: TriPoly has been implemented in Python 3.5.2 (also compatible with Python 2.7.3 and higher) and can be freely downloaded at https://github.com/EhsanMotazedi/TriPoly. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: Knowledge of haplotypes, i.e. phased and ordered marker alleles on a chromosome, is essential to answer many questions in genetics and genomics. By generating short pieces of DNA sequence, high-throughput modern sequencing technologies make estimation of haplotypes possible for single individuals. In polyploids, however, haplotype estimation methods usually require deep coverage to achieve sufficient accuracy. This often renders sequencing-based approaches too costly to be applied to large populations needed in studies of Quantitative Trait Loci. Results: We propose a novel haplotype estimation method for polyploids, TriPoly, that combines sequencing data with Mendelian inheritance rules to infer haplotypes in parent-offspring trios. Using realistic simulations of both short and long-read sequencing data for banana (Musa acuminata) and potato (Solanum tuberosum) trios, we show that TriPoly yields more accurate progeny haplotypes at low coverages compared to existing methods that work on single individuals. We also apply TriPoly to phase Single Nucleotide Polymorphisms on chromosome 5 for a family of tetraploid potato with 2 parents and 37 offspring sequenced with an RNA capture approach. We show that TriPoly haplotype estimates differ from those of the other methods mainly in regions with imperfect sequencing or mapping difficulties, as it does not rely solely on sequence reads and aims to avoid phasings that are not likely to have been passed from the parents to the offspring. Availability and implementation: TriPoly has been implemented in Python 3.5.2 (also compatible with Python 2.7.3 and higher) and can be freely downloaded at https://github.com/EhsanMotazedi/TriPoly. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Sven Schrinner; Rebecca Serra Mari; Richard Finkers; Paul Arens; Björn Usadel; Tobias Marschall; Gunnar W Klau Journal: iScience Date: 2022-05-25
Authors: David Gerard; Luis Felipe Ventorim Ferrão; Antonio Augusto Franco Garcia; Matthew Stephens Journal: Genetics Date: 2018-09-05 Impact factor: 4.562
Authors: Alejandro Thérèse Navarro; Giorgio Tumino; Roeland E Voorrips; Paul Arens; Marinus J M Smulders; Eric van de Weg; Chris Maliepaard Journal: BMC Bioinformatics Date: 2022-02-14 Impact factor: 3.169
Authors: Sven D Schrinner; Rebecca Serra Mari; Jana Ebler; Mikko Rautiainen; Lancelot Seillier; Julia J Reimer; Björn Usadel; Tobias Marschall; Gunnar W Klau Journal: Genome Biol Date: 2020-09-21 Impact factor: 13.583