| Literature DB >> 25265387 |
Jianzi Huang1, Xiang Lu2, Wanke Zhang2, Rongfeng Huang3, Shouyi Chen2, Yizhi Zheng1.
Abstract
Avicennia marina is a widely distributed mangrove species that thrives in high-salinity habitats. It plays a significant role in supporting coastal ecosystem and holds unique potential for studying molecular mechanisms underlying ecological adaptation. Despite and sometimes because of its numerous merits, this species is facing increasing pressure of exploitation and deforestation. Both study on adaptation mechanisms and conservation efforts necessitate more genomic resources for A. marina. In this study, we used Illumina sequencing of an A. marina foliar cDNA library to generate a transcriptome dataset for gene and marker discovery. We obtained 40 million high-quality reads and assembled them into 91,125 unigenes with a mean length of 463 bp. These unigenes covered most of the publicly available A. marina Sanger ESTs and greatly extended the repertoire of transcripts for this species. A total of 54,497 and 32,637 unigenes were annotated based on homology to sequences in the NCBI non-redundant and the Swiss-prot protein databases, respectively. Both Gene Ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis revealed some transcriptomic signatures of stress adaptation for this halophytic species. We also detected an extraordinary amount of transcripts derived from fungal endophytes and demonstrated the utility of transcriptome sequencing in surveying endophyte diversity without isolating them out of plant tissues. Additionally, we identified 3,423 candidate simple sequence repeats (SSRs) from 3,141 unigenes with a density of one SSR locus every 8.25 kb sequence. Our transcriptomic data will provide valuable resources for ecological, genetic and evolutionary studies in A. marina.Entities:
Mesh:
Year: 2014 PMID: 25265387 PMCID: PMC4181315 DOI: 10.1371/journal.pone.0108785
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics for the unigenes of Avicennia marina.
| Category | Value |
| Total number of unigenes | 91,125 |
| Mean length of unigenes (bp) | 463 |
| N50 of unigenes (bp) | 528 |
| Total length of unigenes (bp) | 42,187,266 |
| Average number of reads per unigene | 154 |
| Average coverage (rpkm) | 22.5 |
| Number of unigenes with Nr annotations | 54,497 |
| Number of unigenes with Swiss-Prot annotations | 32,637 |
| Number of unigenes with GO annotations | 11,472 |
| Number of unigenes with COG annotations | 15,852 |
| Number of unigenes with KEGG annotations | 23,966 |
Figure 1Comparison of the length distribution of unigenes with or without hits.
Annotation Sources for Avicennia marina unigenes.
| Annotation sources | Number of matched unigenes |
| Plants | 49,344 |
| Fungi | 4,874 |
| -Ascomycetes | 4,525 |
|
| 2,232 |
|
| 544 |
|
| 341 |
|
| 277 |
|
| 180 |
| -Basidiomycetes | 276 |
| -other fungi | 73 |
| Algae | 262 |
| Other eukaryotes | 17 |
Figure 2Species distribution of Nr annotation results.
Comparison of the characteristics of plant-derived and fungus-derived unigenes.
| Item | Plant-derived unigenes | Fungus-derived unigenes |
| Mean length of unigenes (bp) | 572 | 405 |
| Average number of reads per unigene | 224 | 44 |
| Average coverage (rpkm) | 29.5 | 6.1 |
| Number of unigenes with GO annotations | 10,040 | 1,352 |
| Number of SSR-containing unigenes | 1,583 | 98 |
Figure 3Gene Ontology classifications of plant-derived and fungus-derived unigenes.
Figure 4Clusters of orthologous group classification.
The top 10 KEGG pathways with the highest numbers of Avicennia marina unigenes.
| Rank | Pathway | Number of unigenes | Pathway ID |
| 1 | Metabolic pathways | 5949 (24.82%) | ko01100 |
| 2 | Biosynthesis of secondary metabolites | 3103 (12.95%) | ko01110 |
| 3 | Microbial metabolism in diverse environments | 1492 (6.23%) | ko01120 |
| 4 | Plant-pathogen interaction | 1377 (5.75%) | ko04626 |
| 5 | Spliceosome | 977 (4.08%) | ko03040 |
| 6 | Carbon metabolism | 881 (3.68%) | ko01200 |
| 7 | Ribosome | 828 (3.45%) | ko03010 |
| 8 | Biosynthesis of amino acids | 796 (3.32%) | ko01230 |
| 9 | Starch and sucrose metabolism | 637 (2.66%) | ko00500 |
| 10 | Protein processing in endoplasmic reticulum | 614 (2.56%) | ko04141 |
*Percentages of unigenes in each pathway among 23,966 unigenes with KEGG annotations are shown in parentheses.
Summary of SSRs identified from the Avicennia marina transcriptome.
| Searching item | Number |
| Total number of unigenes examined | 88,271 |
| Total number of identified SSRs | 3,423 |
| Number of SSR containing unigenes | 3,141 |
| Number of unigenes containing more than one SSR | 255 |
| Number of SSRs present in compound formation | 198 |
| Di-nucleotide repeats | 2,214 |
| Tri-nucleotide repeats | 952 |
| Tetra-nucleotide repeats | 75 |
| Penta-nucleotide repeats | 63 |
| Hexa-nucleotide repeats | 119 |
Figure 5Frequency distribution of SSRs based on motif sequence types.