| Literature DB >> 22675597 |
Supapon Cheevadhanarak, Kalyanee Paithoonrangsarid, Peerada Prommeenate, Warunee Kaewngam, Apiluck Musigkain, Somvong Tragoonrung, Satoshi Tabata, Takakazu Kaneko, Jeerayut Chaijaruwanich, Duangjai Sangsrakru, Sithichoke Tangphatsornruang, Juntima Chanprasert, Sissades Tongsima, Kanthida Kusonmano, Wattana Jeamton, Sudarat Dulsawat, Amornpan Klanchui, Tayvich Vorapreeda, Vasunun Chumchua, Chiraphan Khannapho, Chinae Thammarongtham, Vethachai Plengvidhya, Sanjukta Subudhi, Apiradee Hongsthong, Marasri Ruengjitchatchawalya, Asawin Meechai, Jittisak Senachak, Morakot Tanticharoen.
Abstract
Arthrospira platensis is a cyanobacterium that is extensively cultivated outdoors on a large commercial scale for consumption as a food for humans and animals. It can be grown in monoculture under highly alkaline conditions, making it attractive for industrial production. Here we describe the complete genome sequence of A. platensis C1 strain and its annotation. The A. platensis C1 genome contains 6,089,210 bp including 6,108 protein-coding genes and 45 RNA genes, and no plasmids. The genome information has been used for further comparative analysis, particularly of metabolic pathways, photosynthetic efficiency and barriers to gene transfer.Entities:
Year: 2012 PMID: 22675597 PMCID: PMC3368399 DOI: 10.4056/sigs.2525955
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1The phylogenetic tree of 51 cyanobacterial concatenated ribosomal proteins. The main topology is in agreement with earlier inferences of the phylogeny of this taxon with the 16s rRNA based on the GTR+G+I substitution model [7]. The tree is built using the Neighbor-Joining method and 1,000 re-samplings to calculate bootstrap values. A. platensis C1 was clustered together with other strains in the order Oscillatoriales and was clearly separated from related species in the order Nostocales. The conserved, concatenated ribosomal protein phylogenetic tree indicated the monophyly of this Arthrospira genus.
Classification and general features of A. platensis C1 according to the MIGS recommendations [8]
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current Classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Spiral | TAS [ | |
| Motility | None | ||
| Sporulation | None | ||
| Temperature range | 20 – 40 | TAS [ | |
| Optimum temperature | 30-35 | TAS [ | |
| MIGS-22 | Relationship to Oxygen | Aerobic | TAS [ |
| Carbon source | Phototroph, Mixotroph | TAS [ | |
| Energy source | Phototroph | TAS [ | |
| MIGS-6 | Habitat (EnvO) | Fresh water | TAS [ |
| MIGS-6.1 | Temperature | 20 - 40 | TAS [ |
| MIGS-6.2 | pH | 8.0 -10.0 | TAS [ |
| MIGS-6.3 | Salinity | 0.06 | TAS [ |
| MIGS-10 | Extrachromosomal elements | None | TAS [ |
| MIGS-11 | Estimated Size | 6.08 Mb | IDA |
| MIGS-14 | Known Pathogenicity | None | NAS |
| MIGS-15 | Biotic Relationship | Free living | NAS |
| MIGS-4 | Geographic Location | Not reported | NAS |
| MIGS-4.1 | Latitude | Not reported | NAS |
| MIGS-4.2 | Longitude | Not reported | NAS |
| MIGS-4.3 | Depth | Not reported | NAS |
| MIGS-4.4 | Altitude | Not reported | NAS |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [15]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | Genomic libraries: Sanger (one each of 2 and 5 kb library), standard 454 pyrosequence library and BAC library |
| MIGS-29 | Sequencing platforms | 454 and Sanger Technology |
| MIGS-31.2 | Sequence coverage | 28× |
| MIGS-30 | Assemblers | Newbler version 2.3, Phrap |
| MIGS-31.3 | Contigs | 63 |
| MIGS-32 | Gene calling method | Glimmer 3.0 |
| GOLD ID | Gi09635 | |
| NCBI project ID | 67617 | |
| Database: IMG-GEBA | 2507262036 | |
| Source material identifier | PCC9438 | |
| Project relevance | Biotechnological |
Genome statistics
| Attributed | Value | % of Total |
|---|---|---|
| Genome size (bp) | 6,089,210 | 100.00% |
| DNA coding region (bp) | 4,951,337 | 81.31% |
| G+C content (bp) | 2,651,568 | 44.68% |
| DNA scaffolds | 1 | 100.00% |
| Total genes | 6,153 | 100.00% |
| Protein coding genes | 6,108 | 100.00% |
| RNA genes | 45 | 0.73% |
| Genes with protein function prediction | 3,757 | 61.06% |
| Genes with enzymes | 952 | 15.47% |
| Genes with Transporter Classification | 345 | 5.61% |
| Genes assigns to KEGG pathways | 1,012 | 16.45% |
| Genes assigned to KEGG Orthology (KO) | 1,837 | 29.86% |
| Genes assigned to COGs | 3,459 | 56.22% |
| Genes assigned to Pfam | 3,529 | 57.35% |
| Genes assigned to TIGRfam | 1,180 | 19.18% |
| Genes assigned to InterPro | 4,244 | 68.97% |
| Genes assigned in paralog clusters | 1,048 | 17.03% |
| Genes assigned with signal peptides | 570 | 9.26% |
| Genes assigned with transmembrane proteins | 1,094 | 17.78% |
Number of genes associated with the general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 161 | 4.22 | Translation, ribosomal structure and biogenesis |
| A | 0 | - | RNA processing and modification |
| K | 164 | 4.29 | Transcription |
| L | 459 | 12.02 | Replication, recombination and repair |
| B | 2 | 0.05 | Chromatin structure and dynamics |
| D | 49 | 1.28 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | - | Nuclear structure |
| V | 98 | 2.57 | Defense mechanisms |
| T | 337 | 8.82 | Signal transduction mechanisms |
| M | 234 | 6.13 | Cell wall/membrane/envelope biogenesis |
| N | 54 | 1.41 | Cell motility |
| Z | 0 | - | Cytoskeleton |
| W | 0 | - | Extracellular structures |
| U | 71 | 1.86 | Intracellular trafficking, secretion, and vesicular transport |
| O | 180 | 4.71 | Posttranslational modification, protein turnover, chaperones |
| C | 181 | 4.74 | Energy production and conversion |
| G | 149 | 3.9 | Carbohydrate transport and metabolism |
| E | 201 | 5.26 | Amino acid transport and metabolism |
| F | 66 | 1.73 | Nucleotide transport and metabolism |
| H | 157 | 4.11 | Coenzyme transport and metabolism |
| I | 69 | 1.81 | Lipid transport and metabolism |
| P | 148 | 3.88 | Inorganic ion transport and metabolism |
| Q | 71 | 1.86 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 525 | 13.75 | General function prediction only |
| S | 443 | 11.6 | Function unknown |
| - | 2694 | 43.78 | Not in COG |
Figure 2Graphical circular map of the chromosome of Arthrospira platensis C1. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Genome statistics comparison among Arthrospira spp.
| Genome Name | | | | | |
|---|---|---|---|---|---|
| Genome size (bp) | 6,089,210 | 6,788,435 | 6,003,314 | 4,997,563 | 6,145,553 |
| Total genes | 6,153 | 6,676 | 5,730 | 5,401 | 5,718 |
| Protein coding genes | 6,108 | 6,630 | 5,690 | 5,370 | 5,675 |
| Protein with function prediction | 3,757 | 2,542 | 3,315 | 3,023 | 3,023 |
| RNA genes | 45 | 46 | 40 | 31 | 43 |
| Enzymes | 952 | 905 | 889 | 816 | 882 |
| % Enzymes | 15.47% | 13.56% | 15.51% | 15.11% | 15.42% |
| Transporter Classification | 345 | NA | NA | NA | NA |
| %Transporter Classification | 5.61% | NA | NA | NA | NA |
| KEGG pathways | 1,012 | 993 | 931 | 904 | 954 |
| % KEGG pathways | 16.45% | 14.87% | 16.25% | 16.74% | 16.68% |
| KEGG Orthology (KO) | 1,837 | 1,702 | 1,623 | 1,492 | 1,658 |
| % KEGG Orthology (KO) | 29.86% | 25.49% | 28.32% | 27.62% | 29.00% |
| COGs | 3,459 | 3,570 | 3,306 | 3,050 | 3,234 |
| % COGs | 56.22% | 53.48% | 57.70% | 56.47% | 56.56% |
| Pfam | 3,529 | 3,598 | 3,564 | 3,431 | 3,526 |
| % Pfam | 57.35% | 53.89% | 62.20% | 63.53% | 61.66% |
| TIGRfam | 1,180 | 1,213 | 1,160 | 1,185 | 1,203 |
| % TIGRfam | 19.18% | 18.17% | 20.24% | 21.94% | 21.04% |
| InterPro | 4,244 | 3,969 | 3,938 | 4,207 | 4,294 |
| % InterPro | 68.97% | 59.45% | 68.73% | 77.89% | 75.10% |
| signal peptides | 570 | 1,319 | 545 | 559 | 1,150 |
| % signal peptides | 9.26% | 19.76% | 9.51% | 10.35% | 20.11% |
| Transmembrane proteins | 1,094 | 1,123 | 1,053 | 1,094 | 1,057 |
| % Transmembrane proteins | 17.78% | 16.82% | 18.38% | 20.26% | 18.49% |
| COG clusters | 1,569 | 1,566 | 1,491 | 1,489 | 1,472 |
| KOG clusters | 729 | 723 | 722 | 737 | 717 |
| Pfam clusters | 1,740 | 1,732 | 1,702 | 1,729 | 1,730 |
| TIGRfam clusters | 932 | 936 | 902 | 924 | 931 |
NA: Information not available.