Literature DB >> 27501718

A draft genome of the brown alga, Cladosiphon okamuranus, S-strain: a platform for future studies of 'mozuku' biology.

Koki Nishitsuji1, Asuka Arimoto2, Kenji Iwai3, Yusuke Sudo3, Kanako Hisata2, Manabu Fujie4, Nana Arakaki4, Tetsuo Kushiro5, Teruko Konishi6, Chuya Shinzato2, Noriyuki Satoh2, Eiichi Shoguchi1.   

Abstract

The brown alga, Cladosiphon okamuranus (Okinawa mozuku), is economically one of the most important edible seaweeds, and is cultivated for market primarily in Okinawa, Japan. C. okamuranus constitutes a significant source of fucoidan, which has various physiological and biological activities. To facilitate studies of seaweed biology, we decoded the draft genome of C. okamuranus S-strain. The genome size of C. okamuranus was estimated as ∼140 Mbp, smaller than genomes of two other brown algae, Ectocarpus siliculosus and Saccharina japonica Sequencing with ∼100× coverage yielded an assembly of 541 scaffolds with N50 = 416 kbp. Together with transcriptomic data, we estimated that the C. okamuranus genome contains 13,640 protein-coding genes, approximately 94% of which have been confirmed with corresponding mRNAs. Comparisons with the E. siliculosus genome identified a set of C. okamuranus genes that encode enzymes involved in biosynthetic pathways for sulfated fucans and alginate biosynthesis. In addition, we identified C. okamuranus genes for enzymes involved in phlorotannin biosynthesis. The present decoding of the Cladosiphon okamuranus genome provides a platform for future studies of mozuku biology.
© The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  Cladosiphon okamuranus; brown alga; genes for enzymes of polysaccharide synthesis; genome decoding

Mesh:

Substances:

Year:  2016        PMID: 27501718      PMCID: PMC5144679          DOI: 10.1093/dnares/dsw039

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


1. Introduction

Taxonomically, brown algae (Phaeophyceae) belong to the Stramenopiles and include multicellular species. They are photosynthetic organisms with chloroplasts surrounded by four membranes, suggesting that they originated from a symbiotic relationship between two eukaryotes. Most brown algae contain the pigment, fucoxanthin, which is responsible for the distinctive greenish-brown color that gives brown algae their common name. Phaeophytes include many types of seaweed in the Northern Hemisphere, and they are important members of marine ecosystems, both because they create habitats for other organisms and because they provide food. Cladosiphon okamuranus (Chordariales, Phaeophyceae), Okinawa mozuku in Japanese, is one of the important edible seaweeds. In Okinawa, C. okamuranus has been cultivated for more than 35 years by several fishermen’s associations, including those in Onna and Chinen Villages. This cultivation history has established several strains of mozuku that have similar morphology and texture. It was reported in the 36th annual report of the Japanese Cabinet Office that approximately 20 kilotons of mozuku (C. okamuranus and Nemacystus decipiens (‘Itomozuku’)) are produced annually, yielding approximately 4 billion Japanese yen in 2006. In addition, C. okamuranus and N. decipiens are sources of fucoidan, a sulfated polysaccharide found in the cell-wall matrix of brown algae that has anti-coagulant, anti-thrombin-like, and tumor-suppressant activities. Brown algae also produce alginates., Due to their biological significance, genomes of two species of brown algae have been decoded: Ectocarpus siliculosus (Order Ectocarpales) and Saccharina japonica (Order Laminariales). The genome size of former is approximately 214 Mbp with 16,256 predicted protein-coding genes, while that of the latter is 545 Mbp with 18,733 predicted protein-coding genes. Several genetic features of the two brown algae have been characterized to understand their biology., A close phylogenetic relationship between Ectocarpales and Chordariales has been reported., Given its importance for fisheries, food, and possible pharmaceuticals, we decoded the draft genome of Cladosiphon okamuranus S-strain, (Order Chordariales).

2. Materials and methods

Strain and DNA extraction

The S-strain of Cladosiphon okamuranus (‘Shikenjo-kabu’) has been maintained as a stock culture at the Okinawa Prefectural Fisheries Research and Extension Center, Okinawa, Japan. It is cultivated at 22.5° C with a 12-h light-dark cycle in sea water containing 0.5% KW21 (Daiichi Seimo Co., Ltd). The life cycle of C. okamuranus includes both haploid (n) and diploid (2n) generations (Fig. 1). 2n germlings mature into sporophytes that are harvested for market. For DNA extraction, 2n germlings of C. okamuranus were frozen in liquid nitrogen and crushed to powder with a frozen-cell-crusher, Cryo-Press (Microtec Co., Ltd). Genomic DNA was extracted from the powder using an extraction kit, DNA-Suisui-VS (Rizo Co., Ltd).
Figure 1

A diagram showing the life cycle of the brown alga, Cladosiphon okamuranus. The alga has n and 2n generations. Cladosiphon okamuranus is cultivated and sporophytes are harvested for market. Genomic DNA was extracted from 2n germlings, while RNA was extracted from 2n germlings and 2n sporophytes.

A diagram showing the life cycle of the brown alga, Cladosiphon okamuranus. The alga has n and 2n generations. Cladosiphon okamuranus is cultivated and sporophytes are harvested for market. Genomic DNA was extracted from 2n germlings, while RNA was extracted from 2n germlings and 2n sporophytes.

Genome sequencing and assembly

The Illumina platform (Miseq and Hiseq 2500) was used for sequencing. Libraries were prepared according to slight modifications of protocols provided by the manufacturer. Fragmented genomic DNA was further purified using Blue Pippin (Sage Science). A paired-end library consisting of clones ∼720 bp was prepared for the Miseq using a TruSeq DNA PCR-Free LT Sample Prep Kit (Illumina), and 3-kb and 8-kb mate-pair libraries were prepared for the Hiseq 2500 using a Nextera Mate Pair Sample Prep Kit (Illumina), respectively (Supplementary Table S1). Longer reads were obtained by using more reagent kits for the Hiseq. K-mer counting and estimation of genome size were performed using JELLYFISH 2.2.0 software., Adapter sequences were trimmed from all reads using Trimmomatic-0.30. Paired-end reads of high quality (quality value ≥ 20) were assembled de novo using Newbler 2.9 (GS Assembler) to create contigs. Then subsequent scaffolding of the Newbler output was performed using SSPACE 3.0, based on Illumina mate-pair information. Gaps inside scaffolds were closed using GapCloser 1.12. Diploid sequences of gap-closed scaffolds were merged with Haplomerger-2-20151124. CEGMA 2.5 software was used to evaluate genome assembly. The mitochondrial genome was generated with the IDBA_UD 1.1.1 assembler.

Transcriptome analyses

RNA was isolated from sporophytes (2–5 cm) and 2n germlings (Fig. 1). Total RNA was extracted following the instructions of the manufacturer using DNase and an RNeasy Plant mini kit (QIAGEN). Transcriptome libraries were prepared using a TruSeq Stranded mRNA Library Prep kit (Illumina). RNAs were sequenced as per the manufacturer’s instructions for the Illumina Hiseq 2500. Only sequences of high quality (quality value ≥ 20) were assembled, using Velvet 1.2.10 and Oases 0.2.08.

Gene model prediction

A set of gene model predictions (C. okamuranus Gene Model ver. 1) was generated using AUGUSTUS 3.2.1. AUGUSTUS was trained on the 9120 transcriptome contigs recommended by PASA 2.0.2 for this purpose. Gene models were produced by running AUGUSTUS on a repeat-masked genome, produced with RepeatMasker 4.0.6 (http://ftp.genome.washington.edu/RM/RepeatMasker.html) and refined with PASA.

Transposable elements and repetitive sequences

Repetitive sequences were detected as described previously. Tandem repeats were detected using Tandem Repeat Finder (version 4.04), and then classified using RepeatModeler-1.1.8 (http://www.repeatmasker.org/RepeatModeler.html). A de novo repeats library was generated with RepeatScout (version 1.0.5). Transposons and SINE in the scaffold were identified with CENSOR (version 4.2) using Blast searches against the Repbase TE library (version 16.05).

Gene annotation and identification

Three approaches, individually or in combination, were used to annotate protein-coding genes in the C. okamuranus genome. The primary approach for identification of putative C. okamuranus orthologous genes was reciprocal BLAST analysis. This was carried out using mutual best hits of genes of E. siliculosus against the C. okamuranus gene models (BLASTP) or the assembly (TBLASTN). A second approach used for genes encoding proteins with one or more specific protein domains, was to screen the models against the Pfam database (Pfam-A.hmm, release 24.0; http://pfam.sanger.ac.uk), which contains approximately 11,000 conserved domains using HMMER (hmmer3). In the case of complex multigene families, a third annotation method was employed; sets of related sequences were subjected to phylogenetic analyses in order to more precisely determine orthologous relationships between proteins of C. okamuranus and E. siliculosus. For this purpose, amino acid sequences were aligned using MAFFT with default options. Gaps and ambiguous areas were excluded using SeaView version 4.5.3 manually. Based on alignment datasets, phylogenetic trees were constructed by the maximum likelihood method. A maximum likelihood phylogenetic tree was constructed with MEGA 5.2 using the best model with 1,000 bootstrap duplications.

Genome browser

A genome browser has been established for the assembled genome sequences using the JavaScript-based Genome Browser (JBrowse) 1.11.6. The assembled sequence and gene models are accessible at http://marinegenomics.oist.jp/gallery/.

Gene expression analysis

RNA-seq reads were aligned to the genome using TopHat-2.0.9, while FPKM values were calculated from the aligned results using Cufflinks v2.0.0. Gene expression levels were visualized with R 3.2.2 (https://www.r-project.org/) using the pheatmap 1.0.8 package.,

3. Results and discussion

The Illumina Miseq (average library size, 720 bp) generated a total of 9.75 gigabase pairs (Gbp) of paired-end sequence data (read length, 309 ×2 bases) and the Hiseq2500 generated 5.79 Gbp of mate-pair sequence data (read length, 283 ×2 bases); 5.02 Gbp for the 3-kb library (physical coverage, 177×) and 0.78 Gbp for the 8-kb library (physical coverage, 73×), respectively. A total of 15.6 Gbp of sequence data was obtained (Supplementary Table S1). As mentioned below, the genome size of Cladosiphon okamuranus was estimated as approximately 140 Mb (Supplementary Fig. S1A). Total reads of 15.6 Gbp correspond to approximately 100x sequencing coverage of the genome. Comparison of draft genome assemblies of three species of brown algae, Cladosiphon okamuranus (Order Chordariales), Ectocarpus siliculosus (Order Ectocarpales), and Saccharina japonica (Order Laminariales) aThe present study. bCock et al. cYe et al. Illumina whole-genome, shotgun paired-end reads were assembled de novo with Newbler. The assembler generated 31,858 contigs with an N50 size of 21.7 kb (Table 1). The longest contig was 943.8 kb and approximately 40% of the sequences were covered by contigs >2 kb. Subsequent scaffolding of the 31,858 contigs of Newbler output was performed with SSPACE using Illumina mate-pair sequence information (Supplementary Table S1). Gaps inside scaffolds were closed with GapCloser. As a result, the final assembly contained 732 scaffolds with an N50 size of 536 kb. The total scaffold length reached 170 Mb.
Table 1.

Comparison of draft genome assemblies of three species of brown algae, Cladosiphon okamuranus (Order Chordariales), Ectocarpus siliculosus (Order Ectocarpales), and Saccharina japonica (Order Laminariales)

Cladosiphon okamuranusaEctocarpus siliculosusbSaccharina japonicac
Genome size (Mb)140214545
Total assembled length (Mb)129.9195.8537
Number of scaffolds5411,56113,327
N50 scaffold size (kb)416504252
Number of contigs31,85814,04329,670
N50 Contig size (bp)21,70532,86258,867
Number of genes13,64016,25618,733
GC Contents (%)545450
Repeated sequences (%)4.122.739
Average gene length (bp)7,6966,8599,587
Average coding sequence length (bp)2,0041,563
Average number of introns per gene8.266.98
Average intron length (bp)522703.81,057
Cegma Partial (%)91.991.579.0
Cegma Completeness (%)84.377.445.6

aThe present study.

bCock et al.

cYe et al.

Bacterial contaminant scaffolds were identified in the genome of an Ectocarpus siliculosus strain. To detect scaffolds of likely bacterial origin, 732 scaffolds were classified into one large group (129.9 Mb containing 541 scaffolds) and seven smaller groups (3.6–8.0 Mb) using MaxBin software, which bins assembled microbial scaffolds, based on tetranucleotide frequencies, scaffold coverage levels and marker sequences. Using transcriptome mapping of C. okamuranus (See later) and similarities to Ectocarpus genes, it was confirmed that the large group contained Cladosiphon okamuranus scaffolds. All scaffolds of smaller groups had no mapped transcriptomes that showed exon–intron structures. For 541 scaffolds of C. okamuranus, the number of contigs was 2,774 and the N50 of contigs was 88 kb. The longest scaffold was 2.8 Mbp, and the scaffold N50 was 416 kb (Table 1). Approximately 87% of scaffold sequences were covered with contigs of >20 kb. Chloroplast and mitochondrial genome sequences were included in scaffold IDs, Cok_S_s60 (599,436 bp) and Cok_S_s1074 (36,249 bp), respectively (data not shown). CEGMA analysis was carried out to evaluate the accuracy of the assembled genome (129.9 Mb). CEGMA reported 91.9% sequences (CEGMA partial) with an average degree of completeness for all sequences of 84.3%. On the other hand, CEGMA completeness values for genome sequences of E. siliculosus and S. japonica are 77.4% and 45.6% (Table 1), respectively, suggesting the assembled genome of C. okamuranus is the highest quality brown algal genome to date.

Genome size

The genome size of C. okamuranus was estimated by K-mer analysis (K-mer = 25). The peak appeared around 47 (Supplementary Fig. S1A). The estimated genome size was ∼140 Mb. The C. okamuranus genome is the smallest phaeophyte genome sequenced to date (Table 1). The genome size of E. siliculosus was estimated to be 214 Mbp while that of S. japonica was 545 Mb (Table 1).,

GC content

GC content of the C. okamuranus genome was estimated to be ∼54% (Supplementary Fig. S1B and Table 1). GC contents for E. siliculosus and S. japonica were 54% and 50%, respectively; thus, the three brown algae are similar in this regard (Table 1).

RNAseq, clustering, and mapping

Transcriptomic analysis is essential to determine which genes are expressed by a given organism. RNAs extracted from germlings and sporophytes were sequenced on an Illumina Hiseq 2500 (read length, 329 ×2 bases from both samples) (Supplementary Table S1). About 44.8 giga nucleotide reads were obtained. An assembly of transcriptome sequences yielded 59,590 contigs (92.8 mega nucleotides (nts)) with an N50 size of 2,231 nts. Of those, 58,095 (97.4%) had a blat alignment (with default settings) to the assembled genome sequences (129.9 Mb). 37,299 of the 59,590 contigs had predicted ORFs from start to stop codons of at least 450 nts. Of these putatively full-length RNA-seq contigs, 36,648 (98.6%) had a blat alignment to assembled scaffolds (129.9 Mb). These data were used to produce gene models for annotation.

Gene modeling

Assembled RNAseqs of C. okamuranus, algae EST datasets available on NCBI and putative protein coding loci found with PASA, were incorporated as AUGUSTUS ‘hints.’ The number of gene models was 13,640 (Table 1). This is somewhat smaller than the 16,256 predicted genes of E. siliculosus and the 18,733 of S. japonica, respectively, although this value is comparable. The 12,813 genes of C. okamuranus were supported by mRNA sequences. The average (arithmetic mean) length of C. okamuranus genes that included both exons and introns, was 7,696 bp, and that of exons translated to form proteins (coding sequence) was 2,004 bp. The average lengths of C. okamuranus 5′ and 3′UTRs were 911 bp and 459 bp, respectively. Cock et al. noted that E. siliculosus genes are intron-rich, with an average number of introns per gene of 6.98 and an average intron length of 704 bp (Table 1). This was the case in the C. okamuranus genome; the average number of introns per gene was 8.26 and the average intron length was 522 bp (Table 1).

Transposable elements and other genomic components

We examined the proportion of transposable elements and repetitive elements in the assembled mozuku genome. DNA transposons and retrotransposons apparently account for 0.539% and 2.491% of the C. okamuranus genome, respectively (Supplementary Table S2). DNA transposons included hAT (0.104% of the assembled sequences), EnSpm (0.090%), Helitron (0.084%), PIF-Harbinger (0.056%) and others, while retrotransposons included LTR retrotransposons such as Gypsy (1.096%), Copia (0.752%) and Bel_Pao (0.115%), and non-LTR retrotransposons such as CR1 (0.040%) and I (0.018%). The percentages of LINE (long interspersed nuclear elements) such as L1 (0.123%), Tx1, and Jockey, and SINE (short interspersed nuclear elements) were smaller in the C. okamuranus genome, compared to other brown algal genomes. Including unclassified repeats, 4.1% of the C. okamuranus genome was composed of repetitive sequences (Table 1). This ratio is lower than in the two other brown algae, 22.7% for E. siliculosus and 39% for S. japonica, respectively. It may be that the high frequency of repeated sequences in brown algal genomes is not universal among the Phaeophyceae.

Identification of Cladosiphon okamuranus genes

In order to examine the utility of the assembled genome as a platform for future studies in mozuku biology, we searched for genes in the assembled C. okamuranus genome. By comparing genes found in the E. siliculosus genome, we found the corresponding mozuku genes associated with transcription factors, signaling molecules, and enzymes involved in polysaccharide biosynthesis and phlorotannin biosynthesis.

3.7.1 Transcription factors

Transcription factors play pivotal roles in many biological processes. We examined transcription factors (TFs) by searching conserved protein domains using hmmer3 and the Pfam database (e-value cutoff < e − 5). The number of domains that have been identified in transcription factors was determined and compared with those in the E. siliculosus genome. These domains include HSF, Myb, bZIP, Zinc Finger, bHLH, CCAAT-binding, homeobox, AP2-EREBP, Nin-like, TAF, E2F-DP, CBF/NF-Y/archaeal and Sigma-70 r2/r3/r4 (Table 2).
Table 2.

Number of transcription factors (TFs) in the brown algae, Cladosiphon okamuranus and Ectocarpus siliculosus

Species
FamilyCladosiphon okamuranusEctocarpus siliculosus
Myb58(27.1%)a59(22.7%)
CBF/NF-Y/archaeal35(16.3%)42(16.2%)
Zn_finger, C2H2-type23(10.7%)30(11.5%)
Zn_finger, CCCH-type22(10.3%)24(9.2%)
bZIP17(7.9%)23(8.8%)
HSF13(6.1%)22(8.5%)
TAF10(4.7%)18(6.9%)
Sigma-70 r2/r3/r49(4.2%)10(3.8%)
Zn_finger, TAZ-type7(3.3%)7(2.7%)
Nin-like5(2.3%)1(0.4%)
bHLH3(1.4%)2(0.8%)
Homeobox3(1.4%)7(2.7%)
AP2-EREBP3(1.4%)4(1.5%)
E2F-DP3(1.4%)4(1.5%)
CXC/tesmin3(1.4%)4(1.5%)
Zn_finger, GATA-type0(0.0%)1(0.4%)
Fungal TRF0(0.0%)2(0.8%)
Total214260

aPercentage of the family within the brown algal genome.

Number of transcription factors (TFs) in the brown algae, Cladosiphon okamuranus and Ectocarpus siliculosus aPercentage of the family within the brown algal genome. This analysis showed that the C. okamuranus genome contains 214 transcription factor genes (Table 2), fewer than the 260 found in the E. siliculosus genome. However, the ratio of transcription factor genes to total genes was ∼1.6% in both genomes. The most abundant TF in both genomes was the Myb family, with 58 and 59 genes in the C. okamuranus and E. siliculosus genomes, respectively. The next most abundant TFs were CBF/NF-Y/archaeal (35), Zinc Finger C2H2-type (23) and Zinc Finger CCCH-type (22). The C. okamuranus genome likely contains three genes for bHLH domains, three for homeobox domains and 17 genes for bZIP domains. The expression level of bZIP genes seems to be similar in germlings and sporophytes, whereas some of CBF/NF-Y/archaeal genes may have different expression levels (Fig. 2A). Expression and function of phaeophyte TF genes should be expressly examined in future studies.
Figure 2

Heat maps compared with gene expression levels of germlings and sporophytes of Cladsiphon okamuranus based on FPKM values. Gene IDs for the genome browser are indicated by sXXX_XXXXX. (A) Expression levels of transcription factors shown in Table 2. (B) Expression levels of receptor kinase genes shown in Supplementary Fig. S2. (C) Expression levels of C. okamuranus genes potentially contributing to biosynthetic pathways of sulfated fucans, alginates, and phlorotannin shown in Figs. 3- -5 and Supplementary Table S3.

Heat maps compared with gene expression levels of germlings and sporophytes of Cladsiphon okamuranus based on FPKM values. Gene IDs for the genome browser are indicated by sXXX_XXXXX. (A) Expression levels of transcription factors shown in Table 2. (B) Expression levels of receptor kinase genes shown in Supplementary Fig. S2. (C) Expression levels of C. okamuranus genes potentially contributing to biosynthetic pathways of sulfated fucans, alginates, and phlorotannin shown in Figs. 3- -5 and Supplementary Table S3.

3.7.2 Signaling molecules

Cell–cell signaling molecules play pivotal roles in organismal development and cellular physiological activity. Cock et al. specifically analyzed membrane-spanning receptor kinase genes, since receptor kinases have been shown to function in developmental processes in both animals and green plants. They showed that Ectocarpus receptor kinases form a monophyletic clade, as in the case of animals and plants, suggesting that brown algae evolved independently of other taxa. The present domain analysis, using the Pfam database, showed that the C. okamuranus and E. siliculosus genomes contain 286 and 338 genes for protein kinase domains, which are both equivalent to about 2.1% of total genes. Cock et al. also identified five genes for stramenopile-lineage-specific receptor kinases in E. siliculosus. The C. okamuranus genome likely contains at least two genes for brown algal-specific receptor kinases (Cok_S_s144_12294 and Cok_S_s043_7434), which have orthologous relationships with E. siliculosus genes (Supplementary Fig. S2). Three other genes are additional candidates (Cok_S_s126_11689, Cok_S_s097_10483 and Cok_S_s044_7515). All five genes are found on different scaffolds. RNA-seq reads demonstrate that Cok_S_s144_12294 is expressed in both germlings and sporophytes (Fig. 2B).

3.7.3. Genes associated with biosynthesis of polysaccharides

Genes encoding enzymes for polysaccharide metabolism in brown algae have been predicted in the E. siliculosus genome. Using informatics methods, we investigated gene families for biosynthesis of sulphated fucans and alginates, compounds characteristic of brown algae. Genes encoding enzymes involved in biosynthetic pathways are apparently conserved between C. okamuranus and E. siliculosus, although downstream enzymes of the polysaccharide biosynthesis pathways are likely to be expanded in each lineage.

3.7.3.1 Fucoidans

Fucoidans are a family of sulfated homo- and heteropolysaccharides of brown algae that include L-fucose residues. The family comprises a broad spectrum of polysaccharides, from compounds with high uronic acid content and very little fucose and sulfate, to almost pure α-l-fucan, in which the dominant monosaccharide is fucose. GDP-mannose and l-fucose are original sources of GDP-fucose, which is then transformed from fucan to sulfated fucan (Fig. 3A). Seven enzymes are likely involved in this pathway (Fig. 3A).
Figure 3

Identification of all genes in the predicted biosynthetic pathway of sulfated fucans in the draft genome of Cladosiphon okamuranus. (A) A schematic representation of the biosynthetic pathway of sulphated fucans in brown algae, based on the description of Michel et al. Genes encoding each of the enzymes in the genomes of Cladosiphon okamuranus (C.ok) and Ectocarpus siliculosus (E.si), and the number of identified genes is shown in broken squares. (B) The location of candidate genes for l-fucosinase (FK) and GDP-fucose pyrophosphorylase (GFPP) in the C. okamuranus genome, suggesting the presence of an enzymatic gene cluster having shared synteny with E. siliculosus (http://bioinformatics.psb.ugent.be/orcae/overview/Ectsi). Two enzymatic genes that contain sulphotransferase and hydrolase domains are clustered with a gene (magenta) for a bifunctional enzyme with FK and GHMP-kinase domains. Genes coloured grey encode conserved proteins with RNA-binding domains.

Identification of all genes in the predicted biosynthetic pathway of sulfated fucans in the draft genome of Cladosiphon okamuranus. (A) A schematic representation of the biosynthetic pathway of sulphated fucans in brown algae, based on the description of Michel et al. Genes encoding each of the enzymes in the genomes of Cladosiphon okamuranus (C.ok) and Ectocarpus siliculosus (E.si), and the number of identified genes is shown in broken squares. (B) The location of candidate genes for l-fucosinase (FK) and GDP-fucose pyrophosphorylase (GFPP) in the C. okamuranus genome, suggesting the presence of an enzymatic gene cluster having shared synteny with E. siliculosus (http://bioinformatics.psb.ugent.be/orcae/overview/Ectsi). Two enzymatic genes that contain sulphotransferase and hydrolase domains are clustered with a gene (magenta) for a bifunctional enzyme with FK and GHMP-kinase domains. Genes coloured grey encode conserved proteins with RNA-binding domains. Examination of genes for enzymes in both the C. okamuranus and E. siliculosus genomes identified two genes encoding GDP-mannose 4,6-dehydratase and one for GDP-l-fucose synthase (Fig. 3A and Supplementary Table S3). The presence of a candidate gene (FK-GFPP) for a bifunctional enzyme possessing both l-fucokinase and GDP-fucose pyrophosphorylase activities was identified in the C. okamuranus genome (Fig. 3A). These enzyme genes were expressed at germling and sporophyte stages (Fig. 2C). Comparison of the C. okamuranus and E. siliculosus genomes revealed syntenic localization of enzyme genes, suggesting the presence of an enzymatic gene cluster in both species (Fig. 3B). Two other enzymatic genes that contain a sulfotransferase domain and a hydrolase domain are clustered with FK-GFPP. The 3′ region of FK-GFPP contains an ankyrin-repeat domain (Fig. 3B). In addition, the C. okamuranus and E. siliculosus genomes contain four genes for fucosyltransferase, and six genes for sulfotransferase, respectively (Fig. 3A). Identification of genes that encode enzymes involved in sulfated fucan biosynthesis suggests that C. okamuranus provides a good source of fucoidans, which should be further characterized in future studies.

3.7.3.2. Alginates

Alginates are cell-wall constituents of brown algae. They are chain-forming heteropolysaccharides consisting of blocks of mannuronic acid and glucuronic acid. Composition of the blocks varies by species and by the part of the thallus from which the extraction is made. Enzymes involved in the alginate biosynthetic pathway, including GDP-mannose 6-dehydrogenase (GMD) and mannuronate C5-epimerase (MC5E), have been identified in genomes of E. siliculosus and C. okamuranus (Fig. 4A and Supplementary Table S3). The number of enzymatic genes is comparable in both species. Some of MC5E-coding genes likely have different expression levels in germlings and sporophytes (Fig. 2C). These expression data will be useful for functional analyses of these enzymes.
Figure 4

Predicted genes related to the alginate biosynthetic pathway in brown algae and conservation of a key enzyme, GDP-mannose 6-dehydrogenase (GMD). (A) A schematic representation of the alginate biosynthetic pathway in brown algae. As in Fig. 3, identified gene numbers in broken squares are followed by species identifiers, C.ok and E.si. (B) A molecular phylogenetic analysis of GMD revealing the conservation of a key enzyme for the alginate biosynthesis pathway between Cladosiphon okamuranus and Ectocarpus siliculosus. The maximum-likelihood (ML) tree was constructed with a WAG + G model. Numbers at nodes indicate more than 70% bootstrap support. Brown stars indicate Cladosiphon GMD proteins that were presumably transferred horizontally from bacteria. UDP glucose 6-dehydrogenase (UGD) is potentially involved in fucoidan biosynthesis, rather than alginate biosynthesis.

Predicted genes related to the alginate biosynthetic pathway in brown algae and conservation of a key enzyme, GDP-mannose 6-dehydrogenase (GMD). (A) A schematic representation of the alginate biosynthetic pathway in brown algae. As in Fig. 3, identified gene numbers in broken squares are followed by species identifiers, C.ok and E.si. (B) A molecular phylogenetic analysis of GMD revealing the conservation of a key enzyme for the alginate biosynthesis pathway between Cladosiphon okamuranus and Ectocarpus siliculosus. The maximum-likelihood (ML) tree was constructed with a WAG + G model. Numbers at nodes indicate more than 70% bootstrap support. Brown stars indicate Cladosiphon GMD proteins that were presumably transferred horizontally from bacteria. UDP glucose 6-dehydrogenase (UGD) is potentially involved in fucoidan biosynthesis, rather than alginate biosynthesis. Previous studies showed that E. siliculosus and S. japonica possess three genes for GMDs of possible bacterial origin. Examination of the C. okamuranus genome and further molecular phylogenetic analyses indicated that the C. okamuranus genome also contains three GMD genes for this key enzyme (Fig. 4A and B and Supplementary Table S3). Twenty-four MC5E genes of C. okamuranus have homology to E. siliculosus genes (Supplementary Table S3). Our survey suggests that MC5E genes are independently expanded in each lineage.

3.7.3.3. Phlorotannin biosynthesis

Brown algal phenols attract considerable attention due to the wide variety of biological activities and potential health benefits of phlorotannins., Phlorotannins are known only from brown algae and are structural analogs of condensed tannins, such as anthocyanidins and other flavonoid derivatives, a diverse class of metabolites with a vast array of functions in terrestrial plants. They probably serve multiple ecological functions in brown algae, such as antifouling substances and chemical defenses against herbivory, in addition to providing UV protection for intertidal seaweeds. Their chemical structures are based on aryl-aryl and/or diaryl ether linkages of phloroglucinol (1,3,5-trihydroxybenzene) units and are rather complex. Polymerization leads to a wide range of molecular masses (10 and 100 kDa)., Polyketide synthase (PKS) is a key gene for phlorotannin biosynthesis. It has recently been reported that E. siliculosus PKS1 was classified as a Type-III PKS. We surveyed PKS genes in the C. okamuranus genome and found two candidate genes (Fig. 5 and Supplementary Table S3). Molecular phylogenetic analysis indicated that C. okamuranus Cok_S_597_t3 is the ortholog of E. siliculosus PKS1, a key gene in phlorotannin biosynthesis. E. siliculosus has three type III PKS genes. On the other hand, C. okamuranus has only two (PKS1 and PKS2) and has probably lost PKS3. PKS1 and PKS2 are expressed in germlings and sporophytes of C. okamuranus (Fig. 2C). Type III PKS genes of brown algae probably originated with bacteria.
Figure 5

A molecular phylogenetic tree of two proteins similar to type III PKS in the Cladosiphon okamuranus genome. The ML tree was constructed with a WAG + G + I model. Proteins are indicated by brown stars. Other protein sequences refer to Meslet-Cladiere et al..

A molecular phylogenetic tree of two proteins similar to type III PKS in the Cladosiphon okamuranus genome. The ML tree was constructed with a WAG + G + I model. Proteins are indicated by brown stars. Other protein sequences refer to Meslet-Cladiere et al..

4. Data availability

A genome browser has been established at: http://marinegenomics.oist.jp/gallery/. Gene annotations from domain searches and Blast2GO are available at: (http://marinegenomics.oist.jp/gallery/).

Accession numbers

All Illumina reads (Supplementary Table S1) are available from DRA accession no. DRA004654. Genome sequences of the S-strain have been deposited with accession nos. BDDF01000001-01004525 (contigs) and DF977685-DF978416 (scaffolds).
  41 in total

Review 1.  Repbase Update, a database of eukaryotic repetitive elements.

Authors:  J Jurka; V V Kapitonov; A Pavlicek; P Klonowski; O Kohany; J Walichiewicz
Journal:  Cytogenet Genome Res       Date:  2005       Impact factor: 1.636

2.  Principles of protein and lipid targeting in secondary symbiogenesis: euglenoid, dinoflagellate, and sporozoan plastid origins and the eukaryote family tree.

Authors:  T Cavalier-Smith
Journal:  J Eukaryot Microbiol       Date:  1999 Jul-Aug       Impact factor: 3.346

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  Using native and syntenically mapped cDNA alignments to improve de novo gene finding.

Authors:  Mario Stanke; Mark Diekhans; Robert Baertsch; David Haussler
Journal:  Bioinformatics       Date:  2008-01-24       Impact factor: 6.937

5.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

6.  CENSOR--a program for identification and elimination of repetitive elements from DNA sequences.

Authors:  J Jurka; P Klonowski; V Dagman; P Pelton
Journal:  Comput Chem       Date:  1996-03

7.  Pathway of algnic acid synthesis in the marine brown alga, Fucus gardneri Silva.

Authors:  T Y Lin; W Z Hassid
Journal:  J Biol Chem       Date:  1966-11-25       Impact factor: 5.157

8.  The sequence and de novo assembly of the giant panda genome.

Authors:  Ruiqiang Li; Wei Fan; Geng Tian; Hongmei Zhu; Lin He; Jing Cai; Quanfei Huang; Qingle Cai; Bo Li; Yinqi Bai; Zhihe Zhang; Yaping Zhang; Wen Wang; Jun Li; Fuwen Wei; Heng Li; Min Jian; Jianwen Li; Zhaolei Zhang; Rasmus Nielsen; Dawei Li; Wanjun Gu; Zhentao Yang; Zhaoling Xuan; Oliver A Ryder; Frederick Chi-Ching Leung; Yan Zhou; Jianjun Cao; Xiao Sun; Yonggui Fu; Xiaodong Fang; Xiaosen Guo; Bo Wang; Rong Hou; Fujun Shen; Bo Mu; Peixiang Ni; Runmao Lin; Wubin Qian; Guodong Wang; Chang Yu; Wenhui Nie; Jinhuan Wang; Zhigang Wu; Huiqing Liang; Jiumeng Min; Qi Wu; Shifeng Cheng; Jue Ruan; Mingwei Wang; Zhongbin Shi; Ming Wen; Binghang Liu; Xiaoli Ren; Huisong Zheng; Dong Dong; Kathleen Cook; Gao Shan; Hao Zhang; Carolin Kosiol; Xueying Xie; Zuhong Lu; Hancheng Zheng; Yingrui Li; Cynthia C Steiner; Tommy Tsan-Yuk Lam; Siyuan Lin; Qinghui Zhang; Guoqing Li; Jing Tian; Timing Gong; Hongde Liu; Dejin Zhang; Lin Fang; Chen Ye; Juanbin Zhang; Wenbo Hu; Anlong Xu; Yuanyuan Ren; Guojie Zhang; Michael W Bruford; Qibin Li; Lijia Ma; Yiran Guo; Na An; Yujie Hu; Yang Zheng; Yongyong Shi; Zhiqiang Li; Qing Liu; Yanling Chen; Jing Zhao; Ning Qu; Shancen Zhao; Feng Tian; Xiaoling Wang; Haiyin Wang; Lizhi Xu; Xiao Liu; Tomas Vinar; Yajun Wang; Tak-Wah Lam; Siu-Ming Yiu; Shiping Liu; Hemin Zhang; Desheng Li; Yan Huang; Xia Wang; Guohua Yang; Zhi Jiang; Junyi Wang; Nan Qin; Li Li; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Gane Ka-Shu Wong; Maynard Olson; Xiuqing Zhang; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Nature       Date:  2009-12-13       Impact factor: 49.962

9.  Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels.

Authors:  Marcel H Schulz; Daniel R Zerbino; Martin Vingron; Ewan Birney
Journal:  Bioinformatics       Date:  2012-02-24       Impact factor: 6.937

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  22 in total

1.  Genome-Wide Identification and Analysis of the Aureochrome Gene Family in Saccharina japonica and a Comparative Analysis with Six Other Algae.

Authors:  Yukun Wu; Pengyan Zhang; Zhourui Liang; Yanmin Yuan; Maohong Duan; Yi Liu; Di Zhang; Fuli Liu
Journal:  Plants (Basel)       Date:  2022-08-11

2.  Diversity and evolution of cytochromes P450 in stramenopiles.

Authors:  Linhong Teng; Xiao Fan; David R Nelson; Wentao Han; Xiaowen Zhang; Dong Xu; Hugues Renault; Gabriel V Markov; Naihao Ye
Journal:  Planta       Date:  2018-10-19       Impact factor: 4.116

3.  Rethinking the evolution of eukaryotic metabolism: novel cellular partitioning of enzymes in stramenopiles links serine biosynthesis to glycolysis in mitochondria.

Authors:  Melania Abrahamian; Meenakshi Kagda; Audrey M V Ah-Fong; Howard S Judelson
Journal:  BMC Evol Biol       Date:  2017-12-04       Impact factor: 3.260

4.  Evolution and Expansion of the Prokaryote-Like Lipoxygenase Family in the Brown Alga Saccharina japonica.

Authors:  Linhong Teng; Wentao Han; Xiao Fan; Dong Xu; Xiaowen Zhang; Simon M Dittami; Naihao Ye
Journal:  Front Plant Sci       Date:  2017-11-28       Impact factor: 5.753

5.  Comparative genomics of four strains of the edible brown alga, Cladosiphon okamuranus.

Authors:  Koki Nishitsuji; Asuka Arimoto; Yoshitaka Yonashiro; Kanako Hisata; Manabu Fujie; Mayumi Kawamitsu; Eiichi Shoguchi; Noriyuki Satoh
Journal:  BMC Genomics       Date:  2020-06-26       Impact factor: 3.969

6.  Genome-wide analysis of the Saccharina japonica sulfotransferase genes and their transcriptional profiles during whole developmental periods and under abiotic stresses.

Authors:  Chang Lu; Zhanru Shao; Pengyan Zhang; Delin Duan
Journal:  BMC Plant Biol       Date:  2020-06-11       Impact factor: 4.215

7.  A siphonous macroalgal genome suggests convergent functions of homeobox genes in algae and land plants.

Authors:  Asuka Arimoto; Koki Nishitsuji; Yoshimi Higa; Nana Arakaki; Kanako Hisata; Chuya Shinzato; Noriyuki Satoh; Eiichi Shoguchi
Journal:  DNA Res       Date:  2019-04-01       Impact factor: 4.458

8.  Distribution of natural ingredients suggests a complex network of metabolic transport between source and sink tissues in the brown alga Fucus vesiculosus.

Authors:  Claudia Birkemeyer; Natalia Osmolovskaya; Ludmila Kuchaeva; Elena Tarakhovskaya
Journal:  Planta       Date:  2018-09-12       Impact factor: 4.116

9.  The highly efficient NDH-dependent photosystem I cyclic electron flow pathway in the marine angiosperm Zostera marina.

Authors:  Ying Tan; Quan Sheng Zhang; Wei Zhao; Zhe Liu; Ming Yu Ma; Ming Yu Zhong; Meng Xin Wang
Journal:  Photosynth Res       Date:  2020-03-09       Impact factor: 3.429

10.  Two divergent Symbiodinium genomes reveal conservation of a gene cluster for sunscreen biosynthesis and recently lost genes.

Authors:  Eiichi Shoguchi; Girish Beedessee; Ipputa Tada; Kanako Hisata; Takeshi Kawashima; Takeshi Takeuchi; Nana Arakaki; Manabu Fujie; Ryo Koyanagi; Michael C Roy; Masanobu Kawachi; Michio Hidaka; Noriyuki Satoh; Chuya Shinzato
Journal:  BMC Genomics       Date:  2018-06-14       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.