Literature DB >> 27528420

Strategies to improve usability and preserve accuracy in biological sequence databases.

Johan Bengtsson-Palme1,2, Fredrik Boulund3,4,5, Robert Edström4,6, Amir Feizi7, Anna Johnning3,4, Viktor A Jonsson4, Fredrik H Karlsson7, Chandan Pal8,3, Mariana Buongermino Pereira3,4, Anna Rehammar4, José Sanchez4,9, Kemal Sanli10, Kaisa Thorell5.   

Abstract

Biology is increasingly dependent on large-scale analysis, such as proteomics, creating a requirement for efficient bioinformatics. Bioinformatic predictions of biological functions rely upon correctly annotated database sequences, and the presence of inaccurately annotated or otherwise poorly described sequences introduces noise and bias to biological analyses. Accurate annotations are, for example, pivotal for correct identification of polypeptide fragments. However, standards for how sequence databases are organized and presented are currently insufficient. Here, we propose five strategies to address fundamental issues in the annotation of sequence databases: (i) to clearly separate experimentally verified and unverified sequence entries; (ii) to enable a system for tracing the origins of annotations; (iii) to separate entries with high-quality, informative annotation from less useful ones; (iv) to integrate automated quality-control software whenever such tools exist; and (v) to facilitate postsubmission editing of annotations and metadata associated with sequences. We believe that implementation of these strategies, for example as requirements for publication of database papers, would enable biology to better take advantage of large-scale data.
© 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Annotation; Bioinformatics; Databases; Functional prediction; Sequencing; Standards

Mesh:

Year:  2016        PMID: 27528420     DOI: 10.1002/pmic.201600034

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  10 in total

1.  On expert curation and scalability: UniProtKB/Swiss-Prot as a case study.

Authors:  Sylvain Poux; Cecilia N Arighi; Michele Magrane; Alex Bateman; Chih-Hsuan Wei; Zhiyong Lu; Emmanuel Boutet; Hema Bye-A-Jee; Maria Livia Famiglietti; Bernd Roechert; The UniProt Consortium
Journal:  Bioinformatics       Date:  2017-11-01       Impact factor: 6.937

2.  Metaxa2 Database Builder: enabling taxonomic identification from metagenomic or metabarcoding data using any genetic marker.

Authors:  Johan Bengtsson-Palme; Rodney T Richardson; Marco Meola; Christian Wurzbacher; Émilie D Tremblay; Kaisa Thorell; Kärt Kanger; K Martin Eriksson; Guillaume J Bilodeau; Reed M Johnson; Martin Hartmann; R Henrik Nilsson
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.937

3.  The diversity of uncharacterized antibiotic resistance genes can be predicted from known gene variants-but not always.

Authors:  Johan Bengtsson-Palme
Journal:  Microbiome       Date:  2018-07-07       Impact factor: 14.650

Review 4.  Prospects and challenges of implementing DNA metabarcoding for high-throughput insect surveillance.

Authors:  Alexander M Piper; Jana Batovska; Noel O I Cogan; John Weiss; John Paul Cunningham; Brendan C Rodoni; Mark J Blacket
Journal:  Gigascience       Date:  2019-08-01       Impact factor: 6.524

Review 5.  Pulse Crop Genetics for a Sustainable Future: Where We Are Now and Where We Should Be Heading.

Authors:  Nurul Amylia Sahruzaini; Nur Ardiyana Rejab; Jennifer Ann Harikrishna; Nur Kusaira Khairul Ikram; Ismanizan Ismail; Hazel Marie Kugan; Acga Cheng
Journal:  Front Plant Sci       Date:  2020-04-30       Impact factor: 5.753

6.  Developing a non-destructive metabarcoding protocol for detection of pest insects in bulk trap catches.

Authors:  Jana Batovska; Alexander M Piper; Isabel Valenzuela; John Paul Cunningham; Mark J Blacket
Journal:  Sci Rep       Date:  2021-04-12       Impact factor: 4.996

Review 7.  Next-generation sequencing and PCR technologies in monitoring the hospital microbiome and its drug resistance.

Authors:  Carolina Cason; Maria D'Accolti; Irene Soffritti; Sante Mazzacane; Manola Comar; Elisabetta Caselli
Journal:  Front Microbiol       Date:  2022-07-28       Impact factor: 6.064

8.  Best practice data life cycle approaches for the life sciences.

Authors:  Philippa C Griffin; Jyoti Khadake; Kate S LeMay; Suzanna E Lewis; Sandra Orchard; Andrew Pask; Bernard Pope; Ute Roessner; Keith Russell; Torsten Seemann; Andrew Treloar; Sonika Tyagi; Jeffrey H Christiansen; Saravanan Dayalan; Simon Gladman; Sandra B Hangartner; Helen L Hayden; William W H Ho; Gabriel Keeble-Gagnère; Pasi K Korhonen; Peter Neish; Priscilla R Prestes; Mark F Richardson; Nathan S Watson-Haigh; Kelly L Wyres; Neil D Young; Maria Victoria Schneider
Journal:  F1000Res       Date:  2017-08-31

9.  Pre- and post-sequencing recommendations for functional annotation of human fecal metagenomes.

Authors:  Michelle L Treiber; Diana H Taft; Ian Korf; David A Mills; Danielle G Lemay
Journal:  BMC Bioinformatics       Date:  2020-02-24       Impact factor: 3.169

10.  On the complementarity of DNA barcoding and morphology to distinguish benign endemic insects from possible pests: the case of Dirioxa pornia and the tribe Acanthonevrini (Diptera: Tephritidae: Phytalmiinae) in Australia.

Authors:  Francesco Martoni; Isabel Valenzuela; Mark J Blacket
Journal:  Insect Sci       Date:  2020-05-08       Impact factor: 3.262

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.