Sofia N Barreira, Anh-Dao Nguyen, Mark T Fredriksen, Tyra G Wolfsberg, R Travis Moreland, Andreas D Baxevanis.
Abstract
To address the void in the availability of high-quality proteomic data traversing the animal tree, we have implemented a pipeline for generating de novo assemblies based on publicly available data from the NCBI Sequence Read Archive, yielding a comprehensive collection of proteomes from 100 species spanning 21 animal phyla. We have also created the Animal Proteome Database (AniProtDB), a resource providing open access to this collection of high-quality metazoan proteomes, along with information on predicted proteins and protein domains for each taxonomic classification and the ability to perform sequence similarity searches against all proteomes generated using this pipeline. This solution vastly increases the utility of these data by removing the barrier to access for research groups who do not have the expertise or resources to generate these data themselves and enables the use of data from nontraditional research organisms that have the potential to address key questions in biomedicine. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2021.
Keywords: comparative genomics; genome analysis; proteomes; sequence databases
Year: 2021 PMID: 34048573 PMCID: PMC8476134 DOI: 10.1093/molbev/msab165
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 8.800