Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Strategies to improve usability and preserve accuracy in biological sequence databases.

Literature DB >> 27528420

Strategies to improve usability and preserve accuracy in biological sequence databases.

Johan Bengtsson-Palme^1,2, Fredrik Boulund^3,4,5, Robert Edström^4,6, Amir Feizi⁷, Anna Johnning^3,4, Viktor A Jonsson⁴, Fredrik H Karlsson⁷, Chandan Pal^8,3, Mariana Buongermino Pereira^3,4, Anna Rehammar⁴, José Sanchez^4,9, Kemal Sanli¹⁰, Kaisa Thorell⁵.

Abstract

Biology is increasingly dependent on large-scale analysis, such as proteomics, creating a requirement for efficient bioinformatics. Bioinformatic predictions of biological functions rely upon correctly annotated database sequences, and the presence of inaccurately annotated or otherwise poorly described sequences introduces noise and bias to biological analyses. Accurate annotations are, for example, pivotal for correct identification of polypeptide fragments. However, standards for how sequence databases are organized and presented are currently insufficient. Here, we propose five strategies to address fundamental issues in the annotation of sequence databases: (i) to clearly separate experimentally verified and unverified sequence entries; (ii) to enable a system for tracing the origins of annotations; (iii) to separate entries with high-quality, informative annotation from less useful ones; (iv) to integrate automated quality-control software whenever such tools exist; and (v) to facilitate postsubmission editing of annotations and metadata associated with sequences. We believe that implementation of these strategies, for example as requirements for publication of database papers, would enable biology to better take advantage of large-scale data.

Keywords: Annotation; Bioinformatics; Databases; Functional prediction; Sequencing; Standards

Mesh：

Year: 2016 PMID： 27528420 DOI： 10.1002/pmic.201600034

Source DB: PubMed Journal: Proteomics ISSN： 1615-9853 Impact factor: 3.984

Keyword Cloud
Cited

10 in total

1. On expert curation and scalability: UniProtKB/Swiss-Prot as a case study.

Authors: Sylvain Poux; Cecilia N Arighi; Michele Magrane; Alex Bateman; Chih-Hsuan Wei; Zhiyong Lu; Emmanuel Boutet; Hema Bye-A-Jee; Maria Livia Famiglietti; Bernd Roechert; The UniProt Consortium
Journal: Bioinformatics Date: 2017-11-01 Impact factor: 6.937

2. Metaxa2 Database Builder: enabling taxonomic identification from metagenomic or metabarcoding data using any genetic marker.

Authors: Johan Bengtsson-Palme; Rodney T Richardson; Marco Meola; Christian Wurzbacher; Émilie D Tremblay; Kaisa Thorell; Kärt Kanger; K Martin Eriksson; Guillaume J Bilodeau; Reed M Johnson; Martin Hartmann; R Henrik Nilsson
Journal: Bioinformatics Date: 2018-12-01 Impact factor: 6.937

3. The diversity of uncharacterized antibiotic resistance genes can be predicted from known gene variants-but not always.

Authors: Johan Bengtsson-Palme
Journal: Microbiome Date: 2018-07-07 Impact factor: 14.650

Review 4. Prospects and challenges of implementing DNA metabarcoding for high-throughput insect surveillance.

Authors: Alexander M Piper; Jana Batovska; Noel O I Cogan; John Weiss; John Paul Cunningham; Brendan C Rodoni; Mark J Blacket
Journal: Gigascience Date: 2019-08-01 Impact factor: 6.524

Review 5. Pulse Crop Genetics for a Sustainable Future: Where We Are Now and Where We Should Be Heading.

Authors: Nurul Amylia Sahruzaini; Nur Ardiyana Rejab; Jennifer Ann Harikrishna; Nur Kusaira Khairul Ikram; Ismanizan Ismail; Hazel Marie Kugan; Acga Cheng
Journal: Front Plant Sci Date: 2020-04-30 Impact factor: 5.753

6. Developing a non-destructive metabarcoding protocol for detection of pest insects in bulk trap catches.

Authors: Jana Batovska; Alexander M Piper; Isabel Valenzuela; John Paul Cunningham; Mark J Blacket
Journal: Sci Rep Date: 2021-04-12 Impact factor: 4.996

Review 7. Next-generation sequencing and PCR technologies in monitoring the hospital microbiome and its drug resistance.

Authors: Carolina Cason; Maria D'Accolti; Irene Soffritti; Sante Mazzacane; Manola Comar; Elisabetta Caselli
Journal: Front Microbiol Date: 2022-07-28 Impact factor: 6.064

8. Best practice data life cycle approaches for the life sciences.

Authors: Philippa C Griffin; Jyoti Khadake; Kate S LeMay; Suzanna E Lewis; Sandra Orchard; Andrew Pask; Bernard Pope; Ute Roessner; Keith Russell; Torsten Seemann; Andrew Treloar; Sonika Tyagi; Jeffrey H Christiansen; Saravanan Dayalan; Simon Gladman; Sandra B Hangartner; Helen L Hayden; William W H Ho; Gabriel Keeble-Gagnère; Pasi K Korhonen; Peter Neish; Priscilla R Prestes; Mark F Richardson; Nathan S Watson-Haigh; Kelly L Wyres; Neil D Young; Maria Victoria Schneider
Journal: F1000Res Date: 2017-08-31

9. Pre- and post-sequencing recommendations for functional annotation of human fecal metagenomes.

Authors: Michelle L Treiber; Diana H Taft; Ian Korf; David A Mills; Danielle G Lemay
Journal: BMC Bioinformatics Date: 2020-02-24 Impact factor: 3.169

10. On the complementarity of DNA barcoding and morphology to distinguish benign endemic insects from possible pests: the case of Dirioxa pornia and the tribe Acanthonevrini (Diptera: Tephritidae: Phytalmiinae) in Australia.

Authors: Francesco Martoni; Isabel Valenzuela; Mark J Blacket
Journal: Insect Sci Date: 2020-05-08 Impact factor: 3.262

10 in total