Literature DB >> 35556139

Towards Large-Scale Integrative Taxonomy (LIT): Resolving the Data Conundrum for Dark Taxa.

Emily Hartop1,2,3, Amrita Srivathsan3,4, Fredrik Ronquist5, Rudolf Meier3,4.   

Abstract

New, rapid, accurate, scalable, and cost-effective species discovery and delimitation methods are needed for tackling "dark taxa," here defined as groups for which $<$10$\%$ of all species are described and the estimated diversity exceeds 1,000 species. Species delimitation for these taxa should be based on multiple data sources ("integrative taxonomy") but collecting multiple types of data risks impeding a discovery process that is already too slow. We here develop large-scale integrative taxonomy (LIT), an explicit method where preliminary species hypotheses are generated based on inexpensive data that can be obtained quickly and cost-effectively. These hypotheses are then evaluated based on a more expensive type of "validation data" that is only obtained for specimens selected based on objective criteria applied to the preliminary species hypotheses. We here use this approach to sort 18,000 scuttle flies (Diptera: Phoridae) into 315 preliminary species hypotheses based on next-generation sequencing barcode (313 bp) clusters (using objective clustering [OC] with a 3$\%$ threshold). These clusters are then evaluated with morphology as the validation data. We develop quantitative indicators for predicting which barcode clusters are likely to be incongruent with morphospecies by randomly selecting 100 clusters for in-depth validation with morphology. A linear model demonstrates that the best predictors for incongruence between barcode clusters and morphology are maximum p-distance within the cluster and a newly proposed index that measures cluster stability across different clustering thresholds. A test of these indicators using the 215 remaining clusters reveals that these predictors correctly identify all clusters that are incongruent with morphology. In our study, all morphospecies are true or disjoint subsets of the initial barcode clusters so that all incongruence can be eliminated by varying clustering thresholds. This leads to a discussion of when a third data source is needed to resolve incongruent grouping statements. The morphological validation step in our study involved 1,039 specimens (5.8$\%$ of the total). The formal LIT protocol we propose would only have required the study of 915 (5.1$\%$: 2.5 specimens per species), as we show that clusters without signatures of incongruence can be validated by only studying two specimens representing the most divergent haplotypes. To test the generality of our results across different barcode clustering techniques, we establish that the levels of incongruence are similar across OC, Automatic Barcode Gap Discovery (ABGD), Poisson Tree Processes (PTP), and Refined Single Linkage (RESL) (used by Barcode of Life Data System to assign Barcode Index Numbers [BINs]). OC and ABGD achieved a maximum congruence score with the morphology of 89$\%$ while PTP was slightly less effective (84$\%$). RESL could only be tested for a subset of the specimens because the algorithm is not public. BINs based on 277 of the original 1,714 haplotypes were 86$\%$ congruent with morphology while the values were 89$\%$ for OC, 74$\%$ for PTP, and 72$\%$ for ABGD. [Biodiversity discovery; dark taxa; DNA barcodes; integrative taxonomy.].
© The Author(s) 2022. Published by Oxford University Press on behalf of the Society of Systematic Biologists.

Entities:  

Mesh:

Year:  2022        PMID: 35556139      PMCID: PMC9558837          DOI: 10.1093/sysbio/syac033

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   9.160


  53 in total

1.  The perils of DNA barcoding and the need for integrative taxonomy.

Authors:  Kipling W Will; Brent D Mishler; Quentin D Wheeler
Journal:  Syst Biol       Date:  2005-10       Impact factor: 15.683

2.  DNA barcodes distinguish species of tropical Lepidoptera.

Authors:  Mehrdad Hajibabaei; Daniel H Janzen; John M Burns; Winnie Hallwachs; Paul D N Hebert
Journal:  Proc Natl Acad Sci U S A       Date:  2006-01-17       Impact factor: 11.205

3.  Species concepts and species delimitation.

Authors:  Kevin De Queiroz
Journal:  Syst Biol       Date:  2007-12       Impact factor: 15.683

4.  Scaling laws predict global microbial diversity.

Authors:  Kenneth J Locey; Jay T Lennon
Journal:  Proc Natl Acad Sci U S A       Date:  2016-05-02       Impact factor: 11.205

5.  A MinION™-based pipeline for fast and cost-effective DNA barcoding.

Authors:  Amrita Srivathsan; Bilgenur Baloğlu; Wendy Wang; Wei X Tan; Denis Bertrand; Amanda H Q Ng; Esther J H Boey; Jayce J Y Koh; Niranjan Nagarajan; Rudolf Meier
Journal:  Mol Ecol Resour       Date:  2018-04-19       Impact factor: 7.090

6.  The Swedish Malaise Trap Project: A 15 Year Retrospective on a Countrywide Insect Inventory.

Authors:  Dave Karlsson; Emily Hartop; Mattias Forshage; Mathias Jaschhof; Fredrik Ronquist
Journal:  Biodivers Data J       Date:  2020-01-21

7.  Fast and sensitive mapping of nanopore sequencing reads with GraphMap.

Authors:  Ivan Sović; Mile Šikić; Andreas Wilm; Shannon Nicole Fenlon; Swaine Chen; Niranjan Nagarajan
Journal:  Nat Commun       Date:  2016-04-15       Impact factor: 14.919

8.  Flies from L.A., The Sequel: A further twelve new species of Megaselia (Diptera: Phoridae) from the BioSCAN Project in Los Angeles (California, USA).

Authors:  Emily A Hartop; Brian V Brown; R Henry L Disney
Journal:  Biodivers Data J       Date:  2016-04-14

9.  A Sequel to Sanger: amplicon sequencing that scales.

Authors:  Paul D N Hebert; Thomas W A Braukmann; Sean W J Prosser; Sujeevan Ratnasingham; Jeremy R deWaard; Natalia V Ivanova; Daniel H Janzen; Winnie Hallwachs; Suresh Naik; Jayme E Sones; Evgeny V Zakharov
Journal:  BMC Genomics       Date:  2018-03-27       Impact factor: 3.969

10.  Rapid, large-scale species discovery in hyperdiverse taxa using 1D MinION sequencing.

Authors:  Amrita Srivathsan; Emily Hartop; Jayanthi Puniamoorthy; Wan Ting Lee; Sujatha Narayanan Kutty; Olavi Kurina; Rudolf Meier
Journal:  BMC Biol       Date:  2019-11-29       Impact factor: 7.431

View more
  1 in total

1.  Community assembly and metaphylogeography of soil biodiversity: Insights from haplotype-level community DNA metabarcoding within an oceanic island.

Authors:  Carmelo Andújar; Paula Arribas; Heriberto López; Yurena Arjona; Antonio Pérez-Delgado; Pedro Oromí; Alfried P Vogler; Brent C Emerson
Journal:  Mol Ecol       Date:  2022-06-21       Impact factor: 6.622

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.