| Literature DB >> 25012212 |
Chunfang Gao, Yun Wang, Yue Shen, Dong Yan, Xi He, Junbiao Dai1, Qingyu Wu.
Abstract
BACKGROUND: MicroEntities:
Mesh:
Substances:
Year: 2014 PMID: 25012212 PMCID: PMC4111847 DOI: 10.1186/1471-2164-15-582
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The contigs and scaffolds of sp. 0710 genome
| Contig | Scaffold | |||
|---|---|---|---|---|
| Size (bp) | Number | Size (bp) | Number | |
| N90 | 9361 | 654 | 87711 | 74 |
| N50 | 35091 | 195 | 285534 | 24 |
| Longest | 150212 | - | 1144347 | - |
| Total_Size | 21873284 | - | 22929133 | - |
| Number (>100 bp) | - | 1419 | - | 406 |
| Number (>2 kb) | - | 1017 | - | 187 |
Genomic features of sequenced chlorophyte green algae
| CPRO | CVAR | CSUB | CREI | VCAR | MPUS | OTAR | |
|---|---|---|---|---|---|---|---|
| Taxonomic class | T | T | T | C | C | M | M |
| Assembly length (Mb) | 22.9 | 46.2 | 48.8 | 121 | 138 | 21.9 | 12.6 |
| GC content (%) | 63 | 67 | 53 | 64 | 56 | 65 | 58 |
| Repeat sequences (%) | 6.1 | 8.9 | 7.2 | 16.7 | 23.8 | 8.8 | 5.1 |
| Number of gene | 7,039 | 9,791 | 9,851 | 15,143 | 14,520 | 10,575 | 8,166 |
| Average gene length (bp) | 2,863 | 2,928 | 3,503 | 4,312 | 5,269 | 1,557 | nd |
| Average number of exons per gene | 5.72 | 7.3 | 8.2 | 8.33 | 7.78 | 1.9 | 1.57 |
| Average exon length (bp) | 207 | 170 | 182 | 190 | 194 | 731 | 750 |
| Mean length of introns (bp) | 246 | 209 | 240 | 373 | 491 | 187 | 103 |
| Coding sequence ratio (%) | 3.2 | 4.7 | 5.0 | 8.0 | 9.5 | 2.1 | 1.6 |
CPRO: Chlorella protothecoides sp.0710; CVAR: Chlorella variabilis NC64A; CSUB: Coccomyxa subellipsoidea C-169; CREI: Chlamydomonas reinhardtii; VCAR: Volvox carteri; MPUS: Micromonas pusilla CCMP1545; OTAR: Ostreococcus tauri.
T: Trebouxiophyceae; C: Chlorophyceae; M: Mamiellophyceae.
Figure 1sp. 0710 is closely related to NC64A. A). Genes in Cp0710, C. variabilis NC64A, and C. subellipsoidea C-169 were annotated by GO. Each bar represents the number of genes, and the different species are color-coded. B) The proteins in Cp0710 were searched by BLASTP against the TrEMBL protein database. The number of best-hit proteins in each organism is presented. C) A maximum likelihood phylogenetic tree was constructed using 300 single-copy orthologous genes shared in the seven species. D) The gene families in the three sequenced Trebouxiophyceae strains were compared. The Venn diagram shows the shared and unique gene families among the species. The number of gene families is indicated in black and the total number of genes is in yellow. CPRO: C. protothecoides, CVAR: C. variabilis NC64A; CSUB: C. subellipsoidea C-169.
Figure 2Cp0710 has a compact genome. (A) Comparison of genes in three sequenced Trebouxiophyceae. Genes are classified as core orthologues, pairwise orthologues, or homologous and unique genes, and were marked with different colors. The height of the bar represents the number of genes in each category. (B) Synteny comparison between Cp0710 and C. variabilis NC64A. The conserved blocks of synteny are indicated by a different color. Scaffold ID of Cp0710 and C. variabilis NC64A are indicated in green and red, respectively. Only scaffolds with a length greater than 1 Mbp and 400 Kbp in Cp0710 and C. variabilis NC64A, respectively, and the lengths of syntenic regions greater than 50 Kbp are shown.
Figure 3Multiple metabolic pathways are differentially regulated under heterotrophic conditions. (A) The whole KEGG pathway map after integrating the RNA-seq data. The gray lines indicate pathways not present in Cp0710 and the blue lines represent pathways that showed no significant changes in transcription. Pathways enriched in genes that are upregulated or downregulated are labeled in red and green, respectively. Significance is defined as log2 (fold-change) >1 and P < 0.01. Transcripts for enzymes with multiple subunits or isoforms were summed. (B) Selected pathways that are uniformly changed in heterotrophic cells. Only carbohydrate metabolism, energy metabolism, and lipid metabolism are listed here. See Additional file 1 Figure S9 for a complete list of affected pathways.
Figure 4Glycolysis and the triacylglycerol biosynthetic pathway in Cp0710. (A) Change of enzyme expression in glycolysis under heterotrophic growth conditions. The name of each enzyme is in red and the corresponding protein ID in Cp0710 is shown in blue. Values in brackets indicate changes in RNA (left) and protein (right) levels, respectively. U: only identified in heterotrophic approach (upregulation). D: only identified in autotrophic approach (downregulation). N: not detected. (B) Change in enzyme expression in the triacylglycerol biosynthetic pathway under heterotrophic growth conditions. The color code is the same as in (A). Abbreviations of enzymes are as follows: HT, H+/hexose co-transporter; HK, hexokinase; GPI, glucose-6-phosphate isomerase; PFK, 6-phosphofructokinase; ALDO, fructose-bisphosphate aldolase; TPI, triose phosphate isomerase; GAPDH, glyceraldehyde 3-phosphate dehydrogenase; PGK, phosphoglycerate kinase; PGAM, phosphoglycerate mutase; ENO, enolase; PK, pyruvate kinase; Accase, acetyl-CoA carboxylase; MCAT, acyl-carrier protein (ACP) S-malonyltransferase; KAS, 3-oxoacyl-ACP synthase II; KAR, 3-oxoacyl-ACP reductase; HD, 3-hydroxyacyl-ACP dehydratase; ENR, enoyl-ACP reductase I; FATA, fatty acyl-ACP thioesterase A; ACS, long-chain acyl-CoA synthetase; G3PDH, glycerol-3-phosphate dehydrogenase; GPAT, glycerol-3-phosphate O-acyltransferase 3/4; LPAAT, lysophosphatidate acyltransferase; PAP, phosphatidate phosphatase; DGAT, diacylglycerol O-acyltransferase 1; MLDP, major lipid droplet protein.