Literature DB >> 33070442

TOA: A software package for automated functional annotation in non-model plant species.

Fernando Mora-Márquez1, Víctor Chano1, José Luis Vázquez-Poletti2, Unai López de Heredia1.   

Abstract

The increase of sequencing capacity provided by high-throughput platforms has made it possible to routinely obtain large sets of genomic and transcriptomic sequences from model and non-model organisms. Subsequent genomic analysis and gene discovery in next-generation sequencing experiments are, however, bottlenecked by functional annotation. One common way to perform functional annotation of sets of sequences obtained from next-generation sequencing experiments, is by searching for homologous sequences and accessing the related functional information deposited in genomic databases. Functional annotation is especially challenging for non-model organisms, like many plant species. In such cases, existing free and commercial general-purpose applications may not offer complete and accurate results. We present TOA (Taxonomy-oriented annotation), a Python-based user-friendly open source application designed to establish functional annotation pipelines geared towards non-model plant species that can run in Linux/Mac computers, HPCs and cloud servers. TOA performs homology searches against proteins stored in the PLAZA databases, NCBI RefSeq Plant, Nucleotide Database and Non-Redundant Protein Sequence Database, and outputs functional information from several ontology systems: Gene Ontology, InterPro, EC, KEGG, Mapman and MetaCyc. The software performance was validated by comparing the runtimes, total number of annotated sequences and accuracy of the functional information obtained for several plant benchmark data sets with TOA and other functional annotation solutions. TOA outperformed the other software in terms of number of annotated sequences and accuracy of the annotation and constitutes a good alternative to improve functional annotation in plants. TOA is especially recommended for gymnosperms or for low quality sequence data sets of non-model plants.
© 2020 John Wiley & Sons Ltd.

Keywords:  GO terms; bioinformatic analysis; functional annotation; genomic databases; non-model plant species; transcriptomics

Year:  2020        PMID: 33070442     DOI: 10.1111/1755-0998.13285

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  5 in total

Review 1.  A simple guide to de novo transcriptome assembly and annotation.

Authors:  Venket Raghavan; Louis Kraft; Fantin Mesny; Linda Rigerte
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  ddRAD Sequencing-Based Identification of Genomic Boundaries and Permeability in Quercus ilex and Q. suber Hybrids.

Authors:  Unai López de Heredia; Fernando Mora-Márquez; Pablo G Goicoechea; Laura Guillardín-Calvo; Marco C Simeone; Álvaro Soto
Journal:  Front Plant Sci       Date:  2020-09-04       Impact factor: 5.753

3.  NGScloud2: optimized bioinformatic analysis using Amazon Web Services.

Authors:  Fernando Mora-Márquez; José Luis Vázquez-Poletti; Unai López de Heredia
Journal:  PeerJ       Date:  2021-04-16       Impact factor: 2.984

4.  Comparative Transcriptome Analyses of Different Rheum officinale Tissues Reveal Differentially Expressed Genes Associated with Anthraquinone, Catechin, and Gallic Acid Biosynthesis.

Authors:  Lipan Zhou; Jiangyan Sun; Tianyi Zhang; Yadi Tang; Jie Liu; Chenxi Gao; Yunyan Zhai; Yanbing Guo; Li Feng; Xinxin Zhang; Tao Zhou; Xumei Wang
Journal:  Genes (Basel)       Date:  2022-09-05       Impact factor: 4.141

5.  Transcriptomic Complexity in Strawberry Fruit Development and Maturation Revealed by Nanopore Sequencing.

Authors:  Qing Chen; Ximeng Lin; Wenlu Tang; Qian Deng; Yan Wang; Yuanxiu Lin; Wen He; Yunting Zhang; Mengyao Li; Ya Luo; Yong Zhang; Xiaorong Wang; Haoru Tang
Journal:  Front Plant Sci       Date:  2022-07-13       Impact factor: 6.627

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.