| Literature DB >> 31118944 |
Tao Liu1,2, Xumin Wang2, Guoliang Wang3,4, Shangang Jia5, Guiming Liu6, Guangle Shan4, Shan Chi1,7, Jing Zhang8, Yahui Yu1, Ting Xue9, Jun Yu3,4.
Abstract
Saccharina, as one of the most important brown algae (Phaeophyceae) with multicellular thallus, has a very remarkable evolutionary history, and globally accounts for most of the economic marine aquaculture production worldwide. Here, we present the 580.5 million base pairs of genome sequence of Saccharina japonica, whose current assembly contains 35,725 protein-coding genes. In a comparative analysis with Ectocarpus siliculosus, the integrated virus sequence suggested the genome evolutionary footprints, which derived from their co-ancestry and experienced genomic arrangements. Furthermore, the gene expansion was found to be an important strategy for functional evolution, especially with regard to extracelluar components, stress-related genes, and vanadium-dependent haloperoxidases, and we proposed a hypothesis that gene duplication events were the main driving force for the evolution history from multicellular filamentous algae to thallus algae. The sequenced Saccharina genome paves the way for further molecular studies and is useful for genome-assisted breeding of S. japonica and other related algae species.Entities:
Keywords: Saccharina japonica; extracellular components; genome sequencing; halogen biosynthesis; phylogenetic analysis; virus genome
Year: 2019 PMID: 31118944 PMCID: PMC6507550 DOI: 10.3389/fgene.2019.00378
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Genome statistics of S. japonica and E. siliculosus.
| Genome size (Mb) | 580.5 | 195.8 |
| Number of scaffolds > 2 kb | 11,156 | 1561 |
| Supercontig (scaffold) N50 (bp) | 123,490 | 504,428 |
| Number of contigs | 418,683 | 14,403 |
| Contig N50 (bp) | 4733 | 32,862 |
| % of cDNAs matching the genome | 91.28% | 97.40% |
| G+C content | 48.72% | 53.60% |
| % of repeated sequences | 46.03% | 22.70% |
| Number of genes | 35,725 | 16,256 |
| Average gene length (bp) | 5591 | 6859 |
| Average coding sequence length (bp) | 1152 | 1563 |
| Number of introns | 131,519 | 113,619 |
| Average intron length (bp) | 1203 | 703.8 |
| Average number of introns per gene | 4.63 | 6.98 |
| Number of exons | 167,244 | 129,875 |
| Average exon length (bp) | 250.27 | 242.2 |
| Number of single exon genes | 7051 | 856 |
| Number of genes (Blast | 27,251(76.3%) | 10,278 (63.2%) |
| Number of genes with expressed sequence support | 30,595 (85.64%) | 9601 (59%) |
Orthologous gene number in six algal species.
| Gene number | 35,725 | 16,534 | 9053 | 10,402 | 11,776 | 11,501 |
| Gene number in orthogroups | 16,954 | 13,063 | 6460 | 8316 | 10,127 | 7832 |
| Number of unassigned genes | 18,771 | 3471 | 2593 | 2086 | 1649 | 3669 |
| Number of orthogroups containing species | 9658 | 9615 | 5229 | 6237 | 7598 | 4531 |
| Number of species–specific orthogroups | 66 | 17 | 4 | 20 | 16 | 50 |
| Gene number in species–specific orthogroups | 435 | 61 | 8 | 109 | 69 | 191 |
FIGURE 1The viral EsV-1 genome and its fragments integrated in the S. japonica and E. siliculosus genomes. The genes highlighted in red bar are found in both S. japonica and E. siliculosus genomes.
FIGURE 2Phylogenetic tree topologies of housekeeping genes using MrBayes software. (A) EF1α, (B) GAPDH, (C) Actin, (D) TUA, and (E) TUB. ChlP, Chlorophyta; Rho, Rhodophyta; Fun, Fungi; Pha, Phaeophyceae; Dia, Diatom; Oom, Oomycetes; Cya, Cyanobacteria; Bac, Bacteria; Arc, Archaea; Hap, Haptophyta; Cry, Cryptophyta; Gla, Glaucophyta.
Gene number of different metabolisms and gene families in sequenced Heterokontophyta algae.
| Cellulose synthase | 14 | 9 | 3 | – | – | - |
| Glycosyltransferase | 194 | 88 | 11 | 62 | 78 | 129 |
| Glycosyl hydrolase | 107 | 41 | 14 | 30 | 55 | 85 |
| Sulfotransferase | 8 | 13 | 5 | 24 | 34 | 38 |
| Mannitol synthesis | 6 | 7 | 2 | 1 | – | 2 |
| Alginate synthesis (except GTs) | 96 | 34 | 2 | 3 | 2 | 1 |
| Mannuronate C5-epimerase | 84 | 26 | – | – | – | – |
| Fucose synthesis (except GTs and STs) | 5 | 5 | 2 | 4 | 3 | 4 |
| Protein kinase | 369 | 258 | 55 | 84 | 132 | 258 |
Gene copies in the important gene pathways of S. japonica and E. siliculosus.
| Pathway | Gene | ||
|---|---|---|---|
| Halogen biosynthesis | 89 | 1 | |
| Mannitol biosynthesis | 2 | 3 | |
| 2 | 2 | ||
| Alginate and fucoidan biosynthesis | 3 | 4 | |
| 1 | 2 | ||
| Alginate biosynthesis | 2 | 3 | |
| 1 | 1 | ||
| Fucoidan biosynthesis | 2 | 2 | |
| 1 | 1 |
Gene number of Hsp gene families in sequenced Saccharina variety “Rongfu,” S. japonica and E. siliculosus.
| Total | ||||||
|---|---|---|---|---|---|---|
| 1 | 2 | 12 | 20 | 7 | 42 | |
| 1 | 2 | 8 | 17 | 7 | 35 | |
| 1 | 1 | 8 | 11 | 4 | 25 |
Gene number of the calcium-based signaling system in sequenced Saccharina variety “Rongfu,” S. japonica and E. siliculosus.
| Ca2+ Influx system | Ca2+ Efflux system | Calcium decoding system | Total | |||||
|---|---|---|---|---|---|---|---|---|
| 5 | 5 | 1 | 2 | 3 | 5 | 21 | 41 | |
| 3 | 4 | 1 | 2 | 3 | 5 | 18 | 36 | |
| 3 | 4 | 1 | 2 | 3 | 3 | 21 | 37 | |