| Literature DB >> 27536339 |
William J Kelly1, Diana M Pacheco1, Dong Li1, Graeme T Attwood1, Eric Altermann1, Sinead C Leahy1.
Abstract
Methanobrevibacter millerae SM9 was isolated from the rumen of a sheep maintained on a fresh forage diet, and its genome has been sequenced to provide information on the phylogenetic diversity of rumen methanogens with a view to developing technologies for methane mitigation. It is the first rumen isolate from the Methanobrevibacter gottschalkii clade to have its genome sequence completed. The 2.54 Mb SM9 chromosome has an average G + C content of 31.8 %, encodes 2269 protein-coding genes, and harbors a single prophage. The overall gene content is comparable to that of Methanobrevibacter ruminantium M1 and the type strain of M. millerae (ZA-10(T)) suggesting that the basic metabolism of these two hydrogenotrophic rumen methanogen species is similar. However, M. millerae has a larger complement of genes involved in methanogenesis including genes for methyl coenzyme M reductase II (mrtAGDB) which are not found in M1. Unusual features of the M. millerae genomes include the presence of a tannase gene which shows high sequence similarity with the tannase from Lactobacillus plantarum, and large non-ribosomal peptide synthase genes. The M. millerae sequences indicate that methane mitigation strategies based on the M. ruminantium M1 genome sequence are also likely to be applicable to members of the M. gottschalkii clade.Entities:
Keywords: Methane; Methanobrevibacter millerae; Methanogen; Rumen
Year: 2016 PMID: 27536339 PMCID: PMC4987999 DOI: 10.1186/s40793-016-0171-9
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Morphology of M. millerae SM9. Micrograph of M. millerae SM9 cells captured at 100× magnification using UV illumination to show F420 fluorescence
Fig. 2Phylogenetic tree highlighting the position of M. millerae SM9 relative to the type strains of the other species within the genus Methanobrevibacter. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model [38]. The tree with the highest log likelihood (−4507.7026) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites [5 categories (+G, parameter = 0.2484)]. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 15 nucleotide sequences. All positions with less than 95 % site coverage were eliminated. That is, fewer than 5 % alignment gaps, missing data, and ambiguous bases were allowed at any position. There were a total of 1206 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [39]. Species with strain genome sequencing projects registered in the Genomes Online Database (GOLD) [40] are labeled with an asterisk
Classification and general features of Methanbtevibacter millerae SM9 [41]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain: Archaea | TAS [ | |
| Phylum: | TAS [ | ||
| Class: | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: | TAS [ | ||
| Species: | TAS [ | ||
| strain: SM9 | |||
| Gram stain | Positive | TAS [ | |
| Cell shape | Coccobacilli | IDA | |
| Motility | Non-motile | NAS | |
| Sporulation | Not reported | IDA | |
| Temperature range | 36–42 °C | NAS | |
| Optimum temperature | 38 °C | NAS | |
| pH range; Optimum | 7.0–8.0; 6.8 | NAS | |
| Carbon source | CO2, Acetate | IDA | |
| MIGS-6 | Habitat | Sheep rumen | TAS [ |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Anaerobic | IDA |
| MIGS-15 | Biotic relationship | Symbiont | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Palmerston North, New Zealand | IDA |
| MIGS-5 | Sample collection | Not reported | |
| MIGS-4.1 | Latitude | −40.35 (40°21'00"S) | IDA |
| MIGS-4.2 | Longitude | +175.61 (175°36'36"E) | IDA |
| MIGS-4.4 | Altitude | 30 M | IDA |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [47]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality, closed genome |
| MIGS-28 | Libraries used | Paired-end and mate pair libraries |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium chemistry |
| MIGS-31.2 | Fold coverage | 213× |
| MIGS-30 | Assemblers | Newbler |
| MIGS-32 | Gene calling method | Glimmer and BLASTX |
| Locus Tag | sm9 | |
| Genbank ID | CP011266 | |
| Genbank Date of Release | 22nd December 2015 | |
| GOLD ID | Gp0007703 | |
| BIOPROJECT | PRJNA49589 | |
| MIGS 13 | Source Material Identifier |
|
| Project relevance | Ruminant methane emissions |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 2,543,538 | 100.00 |
| DNA coding (bp) | 2,225,085 | 87.48 |
| DNA G + C (bp) | 809,122 | 31.81 |
| DNA scaffolds | 1 | 100.00 |
| Total genes | 2,370 | 100.00 |
| Protein coding genes | 2,269 | 95.73 |
| RNA genes | 47 | 1.98 |
| Pseudo genes | 54 | 2.28 |
| Genes with function prediction | 1,568 | 66.16 |
| Genes assigned to COGs | 1,470 | 64.79 |
| Genes with Pfam domains | 1,951 | 85.99 |
| Genes with signal peptides | 135 | 5.95 |
| Genes with transmembrane helices | 544 | 23.98 |
| CRISPR repeats | 2 |
Number of genes associated with the general COG functional categories
| Code | Value | % of totala | Description |
|---|---|---|---|
| J | 145 | 6.39 | Translation |
| A | 0 | 0.00 | RNA processing and modification |
| K | 88 | 3.88 | Transcription |
| L | 129 | 5.69 | Replication, recombination and repair |
| B | 3 | 0.13 | Chromatin structure and dynamics |
| D | 6 | 0.26 | Cell cycle control, mitosis and meiosis |
| V | 37 | 1.63 | Defense mechanisms |
| T | 15 | 0.66 | Signal transduction mechanisms |
| M | 67 | 2.95 | Cell wall/membrane biogenesis |
| N | 4 | 0.18 | Cell motility |
| U | 9 | 0.40 | Intracellular trafficking and secretion |
| O | 45 | 1.98 | Posttranslational modification, protein turnover, chaperones |
| C | 162 | 7.14 | Energy production and conversion |
| G | 48 | 2.12 | Carbohydrate transport and metabolism |
| E | 114 | 5.02 | Amino acid transport and metabolism |
| F | 46 | 2.03 | Nucleotide transport and metabolism |
| H | 90 | 3.97 | Coenzyme transport and metabolism |
| I | 28 | 1.23 | Lipid transport and metabolism |
| P | 59 | 2.60 | Inorganic ion transport and metabolism |
| Q | 25 | 1.10 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 205 | 9.03 | General function prediction only |
| S | 145 | 6.39 | Function unknown |
| - | 800 | 35.21 | Not in COGs |
aThe total is based on the total number of protein coding genes in the genome
Fig. 3Genome atlas for M. millerae SM9. The circles from the outside represent: (1) forward and reverse coding domain sequences, the colour coding of the CDS represent different Clusters of Orthologous Groups categories; (2) rRNA and tRNA; (3) % GC plot; (4) GC skew [(GC)/(G + C)]
Fig. 4Synteny analysis. Alignment of the M. millerae SM9 genome against the draft genome of M. millerae ZA-10T (a) and the complete genome of M ruminantium M1 (b). Whenever the two sequences agree, a coloured line or dot is plotted. If the two sequences were perfectly identical, a single line would go from the bottom left to the top right. Units displayed in base-pairs