Literature DB >> 30541067

Implementation of a Stirling number estimator enables direct calculation of population genetics tests for large sequence datasets.

Swaine L Chen1,2.   

Abstract

MOTIVATION: Stirling numbers enter into the calculation of several population genetics statistics, including Fu's Fs. However, as alignments become large (≥50 sequences), the Stirling numbers required rapidly exceed the standard floating point range. Another recursive method for calculating Fu's Fs suffers from floating point underflow issues.
RESULTS: I implemented an estimator for Stirling numbers that has the advantage of being uniformly applicable to the full parameter range for Stirling numbers. I used this to create a hybrid Fu's Fs calculator that accounts for floating point underflow. My new algorithm is hundreds of times faster than the recursive method. This algorithm now enables accurate calculation of statistics such as Fu's Fs for very large alignments.
AVAILABILITY AND IMPLEMENTATION: An R implementation is available at http://github.com/swainechen/hfufs. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2019        PMID: 30541067     DOI: 10.1093/bioinformatics/bty1012

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  1 in total

1.  A Faster and More Accurate Algorithm for Calculating Population Genetics Statistics Requiring Sums of Stirling Numbers of the First Kind.

Authors:  Swaine L Chen; Nico M Temme
Journal:  G3 (Bethesda)       Date:  2020-11-05       Impact factor: 3.154

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.