| Literature DB >> 35528780 |
Helena L Pound1, Eric R Gann1, Steven W Wilhelm1.
Abstract
Harmful algal blooms are increasing in duration and severity globally, resulting in increased research interest. The use of genetic sequencing technologies has provided a wealth of opportunity to advance knowledge, but also poses a risk to that knowledge if handled incorrectly. The vast numbers of sequence processing tools and protocols provide a method to test nearly every hypothesis, but each method has inherent strengths and weaknesses. Here, we tested six methods to classify and quantify metatranscriptomic activity from a harmful algal bloom dominated by Microcystis spp. Three online tools were evaluated (Kaiju, MG-RAST, and GhostKOALA) in addition to three local tools that included a command line BLASTx approach, recruitment of reads to individual Microcystis genomes, and recruitment to a combined Microcystis composite genome generated from sequenced isolates with complete, closed genomes. Based on the analysis of each tool presented in this study, two recommendations are made that are dependent on the hypothesis to be tested. For researchers only interested in the function and physiology of Microcystis spp., read recruitments to the composite genome, referred to as "Frankenstein's Microcystis", provided the highest total estimates of transcript expression. However, for researchers interested in the entire bloom microbiome, the online GhostKOALA annotation tool, followed by subsequent read recruitments, provided functional and taxonomic characterization, in addition to transcript expression estimates. This study highlights the critical need for careful evaluation of methods before data analysis.Entities:
Keywords: Kaiju; MG-RAST; Microcystis; RNA sequencing; bioinformatics; ghostKOALA; harmful algal bloom; metatranscriptome
Year: 2021 PMID: 35528780 PMCID: PMC9075346 DOI: 10.1002/lom3.10465
Source DB: PubMed Journal: Limnol Oceanogr Methods ISSN: 1541-5856 Impact factor: 3.162
Comparison of methods analyzed in this study. Y indicates a requirement or function provided, while N indicates a lack of requirement or output provided.
| BLASTx | Kaiju | MG‐RAST | Ghost KOALA | Individual genomes | Frankenstein genome | |
|---|---|---|---|---|---|---|
| Community taxonomy | Y | Y | Y | Y | N | N |
| Gene function | Y | N | Y | Y | Y | Y |
| User‐defined database | Y | N | N | N | Y | Y |
| Input | Contigs | Reads | Reads | Contigs | Reads | Reads |
| Additional read recruitment | Y | N | N | Y | N | N |
| Online platform | N | Y | Y | Y | N | N |
| Example references | Moniruzzaman et al. ( | Chen et al. ( | Zhang et al. ( | Xie et al. ( | Harke and Gobler ( | This study |
Fig. 1The sum of all reads in all samples that were recruited to or classified as Microcystis in each method.