| Literature DB >> 31235882 |
Martin Steinegger1,2,3, Milot Mirdita4, Johannes Söding5.
Abstract
The open-source de novo protein-level assembler, Plass ( https://plass.mmseqs.com ), assembles six-frame-translated sequencing reads into protein sequences. It recovers 2-10 times more protein sequences from complex metagenomes and can assemble huge datasets. We assembled two redundancy-filtered reference protein catalogs, 2 billion sequences from 640 soil samples (soil reference protein catalog) and 292 million sequences from 775 marine eukaryotic metatranscriptomes (marine eukaryotic reference catalog), the largest free collections of protein sequences.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31235882 DOI: 10.1038/s41592-019-0437-4
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547