| Literature DB >> 24415958 |
Dongmei Wang1, Kang Ning1, Jing Li1, Jianqiang Hu1, Danxiang Han2, Hui Wang3, Xiaowei Zeng1, Xiaoyan Jing1, Qian Zhou1, Xiaoquan Su1, Xingzhi Chang1, Anhui Wang1, Wei Wang1, Jing Jia1, Li Wei1, Yi Xin1, Yinghe Qiao1, Ranran Huang1, Jie Chen1, Bo Han1, Kangsup Yoon4, Russell T Hill3, Yonathan Zohar5, Feng Chen3, Qiang Hu4, Jian Xu1.
Abstract
Oleaginous microEntities:
Mesh:
Substances:
Year: 2014 PMID: 24415958 PMCID: PMC3886936 DOI: 10.1371/journal.pgen.1004094
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Structural features of the six Nannochloropsis genomes.
(A) Whole-genome based phylogeny of Nannochloropsis. A maximum likelihood (consensus) tree was generated using the PhyML program (JTT model) with 1,000 replicates based on the 1,085 six-way single-copy orthologous gene sets identified from the six Nannochloropsis genomes (). Percentages of replicate trees in the bootstrap test are shown next to the branches. (B) Genome divergence in Nannochloropsis. For each pair of genomes that consists of IMET1 and another Nannochloropsis strain, the percentages of strain-specific proteins versus their discrepancies in the full-length 18S rDNA sequence were plotted. Ten S. cerevisiae strains, eight E. coli strains, nine Synechococcus strains and two each of Ostreococcus and Micromonas were also included. For prokaryotic organisms including E. coli and Synechococcus, the percentages of strain-specific proteins were plotted against the discrepancies in 16S rDNA sequences. (C) The number of genes from the Nannochloropsis genomes and the Nannochloropsis core, with successive inclusion of each additional strain. (D) Functional categorization of Nannochloropsis core proteins. GO Slim terms corresponding to each GO term are presented.
Genomic features of the Nannochloropsis genomes.
| Microalgae |
|
|
|
|
|
|
|
| ||||||
| Assembly size (Mb) | 30.1 | 35.5 | 30.1 | 34.5 | 26.9 | 29.0 |
| G+C content (%) | 53.7 | 53.8 | 54.6 | 54.2 | 54.2 | 54.2 |
| Predicted gene number | 9,915 | 7,268 | 8,060 | 7,254 | 6,562 | 8,738 |
| Genes having blast hits in NCBI NR database | 6,853 | 3,584 | 4,014 | 3,557 | 3,789 | 5,467 |
| Genes with functional annotation | 5,121 | 2,528 | 2,798 | 3,321 | 1,378 | 3,704 |
| Predicted tRNA number | 80 | 79 | 85 | 82 | 85 | 85 |
| Predicted LTR number | 26 | 24 | 6 | 25 | 4 | 33 |
| LTR length (Kbp) | 24.3 | 9.4 | 2.3 | 25.6 | 3.4 | 41.8 |
|
| ||||||
| Size (bases) | 38,057 | 38,057 | 38,791 | 38,444 | 41, 907 | 38,981 |
| G+C content (%) | 31.9 | 31.9 | 32.0 | 31.8 | 31.4 | 31.4 |
| Predicted gene number | 35 | 35 | 35 | 35 | 36 | 35 |
| Predicted tRNA number | 28 | 28 | 28 | 28 | 28 | 26 |
|
| ||||||
| Size (bases) | 117,548 | 117,634 | 117,672 | 117,463 | 114,883 | 114,937 |
| G+C content (%) | 33.6 | 33.6 | 33.4 | 33.4 | 33.1 | 33.0 |
| Predicted gene number | 126 | 126 | 126 | 126 | 123 | 122 |
| Predicted tRNA number | 34 | 34 | 34 | 34 | 33 | 28 |
, data cited from the genome paper of N. gaditana CCMP526 [5].
Figure 2Structure of paralogous groups in each Nannochloropsis strain.
Each circle represents a paralogous group. The area of the circle is proportional to the size of the paralogous group. The top 15 largest paralogous groups in each of the six Nannochloropsis genomes and the other three model microalgae (including T. pseudonana, C. merolae and C. reinhardtii) are shown. The largest paralogous groups in the six Nannochloropsis genomes and the other three model microalgae, including T. pseudonana, C. merolae and C. reinhardtii. The color of the circle represents the functions (as defined by the associated GO Slim terms in biological process) encoded by the paralogous group. The paralogous groups in each Nannochloropsis strain are relatively small in size.
Figure 3Functional conservation and variation of the Nannochloropsis genomes.
For each genome, the numbers of genes assigned to each GO term and its subcategory terms are shown. The color scheme, defined by the scale bar on the top, represents the degree of relative enrichment or depletion for each functional category as compared to C. reinhardtii. The p values of enrichment or depletion were calculated using a binomial test corrected by FDR for multiple comparisons. IMET1, N. oceanica IMET1; 531, N. oceanica CCMP531; 529, N. granulata CCMP529; 525, N. oculata CCMP525; 526, N. gaditana CCMP526; 537, N. salina CCMP537; cre, C. reinhardtii.
Figure 4Enrichment of lipid biosynthesis genes in each of the six Nannochloropsis strains.
(A) The gene dose expansion in N. oceanica IMET1 as compared to C. reinhardtii. In the schema, enzymes in each reaction node in the Nannochloropsis and C. reinhardtii lipid biosynthesis pathways are represented as yellow and blue circles, respectively. Length of radius stands for gene dose. Putative HGT genes in each node in IMET1 are shown in purple. Chlamydomonas genes were not investigated for HGT events here. (B) The expansion in gene dose was conserved among the Nannochloropsis genomes. Each colored cell in the heatmap represents the gene copy numbers in each of the Nannochloropsis strains and in C. reinhardtii. The scale of the color bar ranges from 1 (the lowest copy number among the genomes) to 13 (the highest copy number).
Figure 5Divergent phylogenetic origins of Nannochloropsis DGAT-2 genes.
(A) Gene doses of DGAT-1 and DGAT-2 in Nannochloropsis and several model organisms. The numbers of DGAT-1 and DGAT-2 genes in each genome are indicated by different colors. N. oceanica IMET1 is marked by red star. (B) Schema illustrating the divergent phylogenetic origins of Nannochloropsis DGAT-2 from the red/green algae-related endosymbionts and potential secondary host. The protein names listed in each individual schematic cell indicate the DGAT-2 genes that were likely to be encoded in the genome. Abbreviations: N, nucleus; C, chloroplast; M, mitochondria.