
Applications of species accumulation curves in large-scale biological data analysis.

Chao Deng, Timothy Daley, Andrew D Smith

Abstract

The species accumulation curve, or collector's curve, of a population gives the expected number of observed species or distinct classes as a function of sampling effort. Species accumulation curves allow researchers to assess and compare diversity across populations or to evaluate the benefits of additional sampling. Traditional applications have focused on ecological populations, but emerging large-scale applications, for example in DNA sequencing, are orders of magnitude larger and present new challenges. We developed a method to estimate accumulation curves for predicting the complexity of DNA sequencing libraries. This method uses rational function approximations to a classical non-parametric empirical Bayes estimator due to Good and Toulmin [Biometrika, 1956, 43, 45-63]. Here we demonstrate how the same approach can be highly effective in other large-scale applications involving biological data sets. These include estimating microbial species richness, immune repertoire size, and k-mer diversity for genome assembly applications. We show how the method can be modified to address populations containing an effectively infinite number of species, where saturation cannot practically be attained. We also introduce a flexible suite of tools, implemented as an R package, that makes these methods broadly accessible.
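To make the starting point concrete, the classical Good and Toulmin estimator mentioned above can be sketched in a few lines of Python. This is only an illustration of the textbook power series, not the authors' R package or their rational function approximation: the function names are hypothetical, and the sketch is restricted to extrapolations of at most one additional sample of the initial size (0 <= t <= 1), the range in which the raw series is generally considered well behaved. Here n_j denotes the number of species observed exactly j times in the initial sample.

```python
from collections import Counter


def counts_histogram(observations):
    """Return {j: n_j}, where n_j is the number of species seen exactly j times."""
    per_species = Counter(observations)          # species -> times observed
    return dict(Counter(per_species.values()))   # j -> number of species with count j


def good_toulmin_new_species(hist, t):
    """Good-Toulmin estimate of the number of NEW species expected in a
    further sample that is t times the size of the initial one.

    Uses the alternating power series sum_j (-1)^(j+1) * t^j * n_j.
    This sketch refuses t > 1, where the raw series can behave erratically;
    handling that regime is what the rational function approximation in
    the abstract is for, and it is not implemented here.
    """
    if not 0 <= t <= 1:
        raise ValueError("this sketch only supports 0 <= t <= 1")
    return sum((-1) ** (j + 1) * t ** j * n_j for j, n_j in hist.items())


def expected_distinct(hist, t):
    """Point on the accumulation curve at (1 + t) times the initial effort."""
    observed = sum(hist.values())  # distinct species already seen
    return observed + good_toulmin_new_species(hist, t)
```

For example, the toy sample `list("aaabbcdde")` contains five distinct species with n_1 = 2, n_2 = 2 and n_3 = 1, so doubling the sampling effort (t = 1) gives an estimated 2 - 2 + 1 = 1 new species, or 6 distinct species in total.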

Keywords:  accumulation region; immune repertoire; microbiome diversity; rational function approximation; species accumulation curve; species richness

Year:  2015        PMID: 27252899      PMCID: PMC4885658          DOI: 10.1007/s40484-015-0049-7

Source DB:  PubMed          Journal:  Quant Biol        ISSN: 2095-4689


