| Literature DB >> 33109102 |
Bai-Ling Chen1, Wuttichai Mhuantong2, Shih-Hsin Ho3, Jo-Shu Chang4,5,6, Xin-Qing Zhao7, Feng-Wu Bai1.
Abstract
BACKGROUND: Scenedesmus obliquus belongs to green microalgae and is widely used in aquaculture as feed, which is also explored for lipid production and bioremediation. However, genomic studies of this microalga have been very limited. Cell self-flocculation of microalgal cells can be used as a simple and economic method for harvesting biomass, and it is of great importance to perform genome-scale studies for the self-flocculating S. obliquus strains to promote their biotechnological applications.Entities:
Keywords: Cell self-flocculation; Comparative genomics; Genome assembly and annotation; Green microalgae; Scenedesmus obliquus
Mesh:
Substances:
Year: 2020 PMID: 33109102 PMCID: PMC7590803 DOI: 10.1186/s12864-020-07142-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Morphological observation of self-flocculation S. obliquus AS-6-11 by SEM
Genomic information of the reported Scenedesmus strainsa
| Strains | Genome size (Mbp) | GC content (%) | Contig numbers | N50 value (bp) | Sequencing technology | Gene number | Reference/ BioProjects |
|---|---|---|---|---|---|---|---|
| 93.2 | 56.8 | 4727 | 37,561 | Illumina HiSeq | – | [ | |
| 38.2 | 61.4 | – | 42,815 | Illumina HiSeq 2000 | 8652 | [ | |
| 23.4 | 53.6 | 20,139 | 1571 | 454 | 20,139 | PRJNA498405 | |
| 65.4 | 63.2 | 13,425 | 8094 | Ion Proton | 13,514 | [ | |
| 108.7 | 56.8 | 9191 | – | Illumina Hiseq2000 | – | [ | |
| 208.0 | 56.7 | 2705 | 155,544 | PacBio | – | [ | |
| 172.3 | 52.0 | 2772 | 94,410 | PacBio | 31,964 | This study |
a- means information not available
Summary of the S. obliquus AS-6-11 genome annotation
| Protein database | Annotated protein numbers |
|---|---|
| NR | 19,847 |
| SWISS-PROT | 13,099 |
| Pfam | 13,612 |
| GO | 11,734 |
| KEGG | 3302 |
Fig. 2Gene functional annotation analysis of S. obliquus AS-6-11 genome. a The top 20 of GO categories enriched by gene functional annotation analysis of S. obliquus AS-6-11 genome; b The top 20 of KEGG pathways enriched by gene functional annotation analysis of S. obliquus AS-6-11 genome. BP: Biological process; CC: Cellular component; EP: Environmental information processing; GP: Genetic information processing; ME: Metabolism; MF: Molecular function; SP: Signaling and cellular processes
Analysis of gene numbers of the key metabolic pathways among the five microalgae
| KEGG pathways | |||||
|---|---|---|---|---|---|
| Fatty acid biosynthesis | 23 | 26 | 27 | 24 | 26 |
| Fatty acid elongation | 7 | 8 | 10 | 8 | 8 |
| Fatty acid degradation | 16 | 21 | 15 | 18 | 18 |
| Steroid biosynthesis | 9 | 12 | 14 | 9 | 10 |
| Steroid hormone biosynthesis | 5 | 4 | 4 | 4 | 3 |
| Glycerolipid metabolism | 28 | 28 | 30 | 28 | 21 |
| Glycerophospholipid metabolism | 35 | 37 | 35 | 32 | 30 |
| Ether lipid metabolism | 5 | 9 | 7 | 6 | 5 |
| Sphingolipid metabolism | 18 | 16 | 14 | 16 | 17 |
| Arachidonic acid metabolism | 14 | 13 | 13 | 10 | 7 |
| Alpha-linolenic acid metabolism | 10 | 13 | 14 | 11 | 9 |
| Biosynthesis of unsaturated fatty acids | 10 | 15 | 14 | 12 | 11 |
| Carotenoid biosynthesis | 12 | 11 | 14 | 12 | 10 |
Fig. 3The orthologous gene clusters in the five microalgae. a. The shared and different orthologous gene clusters; b. The size of each orthologous gene clusters
Fig. 4Venn diagram of shared/unique gene families among the five microalgae
Analysis of the top 10 GPI-anchored CWPs with signal peptidesa
| Protein name | GPI probability (%) | pI | Mw (KDa) | SMART analysis | Subcellular localization sites |
|---|---|---|---|---|---|
| Sco00011036 | 99.82 | 6.75 | 7.13 | TMR | vacu: 8, chlo: 2, plas: 2, extr: 1, golg: 1 |
| Sco00023226 | 99.73 | 5.38 | 30.84 | TMR | vacu: 8, plas: 4, extr: 2 |
| Sco00002357 | 99.65 | 6.36 | 9.68 | TMR | extr: 7, E.R.: 3.5, E.R._plas: 3, mito: 2, plas: 1.5 |
| Sco00003994 | 99.51 | 4.95 | 21.28 | – | extr: 12, mito: 1, E.R.: 1 |
| Sco00022819 | 99.47 | 8.59 | 29.73 | TMR | extr: 11, mito: 2, vacu: 1 |
| Sco00000470 | 99.41 | 9.58 | 14.83 | TMR | vacu: 7, plas: 3, extr: 2, E.R.: 1, golg: 1 |
| Sco00000669 | 99.33 | 8.48 | 6.10 | – | extr: 12, mito: 1, plas: 1 |
| Sco00004618 | 99.02 | 7.51 | 15.28 | TMR | extr: 9, vacu: 3, chlo: 2 |
| Sco00003952 | 98.73 | 7.51 | 78.84 | TMR | plas: 11, vacu: 2, E.R.: 1 |
| Sco00008125 | 98.71 | 5.22 | 8.87 | – | plas: 11, extr: 11, vacu: 2, nucl: 1, cyto: 1, E.R.: 1 |
achlo chloroplast, cyto cytoplasmic, E.R. endoplasmic reticulum, extr secreted, golg golgi apparatus, mito mitochondrial matrix, plas membrane protein, TMR Transmembrane region, vacu vacuolar. ‘-’ represented no information available
Fig. 5The phylogenetic tree (a) and conserved motifs (b) of the fasciclin domain proteins in S. obliquus AS-6-11
Analysis of predicted extracellular secreted fasciclin domain proteins in S. obliquus AS-6-11a
| Protein name | pI | Mw (KDa) | Signal peptide | The most similar homologous protein and the source organism | Identity to the most similar sequence |
|---|---|---|---|---|---|
| Sco00000123 | 7.5 | 32.5 | – | hypothetical protein MNEG_1104 [ | 63.00% |
| Sco00000322–1 | 9.2 | 43.4 | 1 | hypothetical protein A1O9_09854 [ | 40.00% |
| Sco00000322–2 | 8.8 | 35.1 | 1 | fasciclin domain-containing protein [ | 61.36% |
| Sco00000402–1 | 8.9 | 23.4 | 1 | hypothetical protein DI09_43p180 [ | 38.78% |
| Sco00001432 | 7.7 | 44.1 | 1 | Nex18 symbiotically induced [ | 52.08% |
| Sco00002253 | 7.1 | 80.0 | – | hypothetical protein MNEG_2497 [ | 46.00% |
| Sco00003534 | 4.2 | 29.9 | 1 | astaxanthin binding fasciclin family protein [ | 48.09% |
| Sco00003587 | 7.6 | 84.4 | 1 | – | |
| Sco00004297 | 6.3 | 16.4 | – | hypothetical protein Rsub_06992 [ | 61.54% |
| Sco00009020 | 8.9 | 18.3 | – | fasciclin domain-containing protein [ | 61.36% |
| 8.9 | 33.8 | 1 | fasciclin-like protein [ | 38.75% | |
| 7.7 | 26.7 | 1 | fasciclin [ | 46.94% | |
| Sco00022879 | 5.2 | 12.2 | – | beta-Ig-H3/fasciclin [ | 64.84% |
| Sco00000669 | 4.9 | 14.5 | – | fasciclin domain-containing protein [ | 53.85% |
aThe protein-coding genes that encode GPI-anchored CWPs were shown in bold font; ‘-’ represented no information available
Fig. 6Amino acid analysis of potential flocculation proteins in S. obliquus AS-6-11. a. Amino acid analysis of the four fasciclin domain proteins distributed in GPI-anchored CWPs; b. Amino acid differences of CAP domain between partial S. cerevisiae Pry1p and S. obliquus AS-6-11 D-mannose binding lectin. The same amino acids in Fig. 6 b were shown in red font and the amino acid using the yellow background represented the start/stop of the CAP domain. CAP domain: cysteine-rich secretory proteins domain; FAS1: Four repeated domains in the fasciclin I family of proteins; SP: Signal peptide; TE: Transmembrane region