| Literature DB >> 26779305 |
Hannes Horn1, Alexander Keller2, Ulrich Hildebrandt3, Peter Kämpfer4, Markus Riederer3, Ute Hentschel1.
Abstract
The Gram-positive actinomycete Williamsia sp. ARP1 was originally isolated from the Arabidopsis thaliana phyllosphere. Here we describe the general physiological features of this microorganism together with the draft genome sequence and annotation. The 4,745,080 bp long genome contains 4434 protein-coding genes and 70 RNA genes. To our knowledge, this is only the second reported genome from the genus Williamsia and the first sequenced strain from the phyllosphere. The presented genomic information is interpreted in the context of an adaptation to the phyllosphere habitat.Entities:
Keywords: Adaption; Annotation; Arabidopsis thaliana; Assembly; Draft genome; Next generation sequencing; Phyllosphere; Whole genome sequencing; Williamsia sp. ARP1
Year: 2016 PMID: 26779305 PMCID: PMC4715301 DOI: 10.1186/s40793-015-0122-x
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 116S rRNA gene based maximum likelihood phylogenetic tree highlighting the position of Williamsia sp. ARP1 within the suborder Corynebacterineae. The tree is based on 16 s rRNA sequences comprising the genera Williamsia, Gordonia, Mycobacterium, Dietzia, Tsukamurella, Rhodococcus and Frankia as an outgroup. The Williamsia sp. ARP1 is highlighted in bold text to show its position. The maximum-likelihood phylogenetic tree was generated using PhyML with the GTR substitution model. Numbers at the nodes are percentages of 1000 bootstrap replicates. Genbank accession numbers are indicated in parentheses; type strains are tagged with a superscripted T. The scale bar represents 0.06 substitutions per nucleotide position
Classification and general features of Williamsia sp. ARP1 [34]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| (Type) strain: ARP1 | IDA | ||
| Gram stain | Positive | IDA | |
| Cell shape | Coccoid to rod-like | IDA | |
| Motility | Non-motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | 4–36 °C | IDA | |
| Optimum temperature | 25–30 °C | IDA | |
| pH range; Optimum | Not reported | NAS | |
| Carbon source | organic carbon | IDA | |
| MIGS-6 | Habitat | Phyllosphere | IDA |
| MIGS-6.3 | Salinity | 1.0–6.0 % | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Commensal | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | Würzburg, Germany | IDA |
| MIGS-5 | Sample collection | 2012 | IDA |
| MIGS-4.1 | Latitude | 49.766556 | IDA |
| MIGS-4.2 | Longitude | 9.931768 | IDA |
| MIGS-4.3 | Depth | Plant surface | IDA |
| MIGS-4.4 | Altitude | 198 m above sea level | IDA |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [77]
Fig. 2General characteristics of Williamsia sp. ARP1. a The morphology of the colonies after three days of growth on LB-agar at 30 °C. b Image of Williamsia sp. ARP1 using scanning electron microscopy
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Draft genome |
| MIGS-28 | Libraries used | One Illumina paired-end library (400 bp insert size) |
| MIGS 29 | Sequencing platforms | Illumina MiSeq |
| MIGS 31.2 | Fold coverage | 65× |
| MIGS 30 | Assemblers | SPAdes 3.0, SSPACE 3.0 |
| MIGS 32 | Gene calling method | Prodigal 2.6.1 |
| Genbank ID | JXYP00000000 | |
| Locus Tag | TU34 | |
| GenBank Date of Release | July 1, 2015 | |
| GOLD ID | Gp0118481 | |
| BIOPROJECT | PRJNA272726 | |
| MIGS 13 | Source Material Identifier | DSM 46827 |
| Project relevance | Phyllosphere, Environmental |
Genome statistics
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 4,745,080 | 100.00 |
| DNA coding (bp) | 4,347,123 | 91.61 |
| DNA G+C (bp) | 3,256,678 | 68.63 |
| DNA scaffolds | 50 | |
| Total genes | 4509 | 100.00 |
| Protein coding genes | 4438 | 98.42 |
| RNA genes | 71 | 1.57 |
| tRNA genes | 45 | 1.00 |
| rRNA genes | 5 | 0.01 |
| rRNA operons | 1a | |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | NA | |
| Genes with function prediction | 3505 | 77.73 |
| Genes assigned to COGs | 2207 | 48.95 |
| Genes with Pfam domains | 1330 | 29.50 |
| Genes with TIGRFAM domains | 793 | 17.59 |
| Genes with signal peptides | 334 | 7.41 |
| Genes with transmembrane helices | 1140 | 25.28 |
| CRISPR repeats | 2 | 0.04 |
aOnly one RNA operon appears to be complete
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 143 | 3.17 | Translation, ribosomal structure, and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 183 | 4.06 | Transcription |
| L | 85 | 1.89 | Replication, recombination, and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 0 | 0.00 | Cell cycle control, Cell division, chromosome partitioning |
| V | 31 | 0.69 | Defense mechanisms |
| T | 74 | 1.64 | Signal transduction mechanisms |
| M | 102 | 2.26 | Cell wall/membrane biogenesis |
| N | 11 | 0.24 | Cell motility |
| U | 18 | 0.40 | Intracellular trafficking and secretion |
| O | 79 | 1.75 | Posttranslational modification, protein turnover, chaperones |
| C | 184 | 4.08 | Energy production and conversion |
| G | 125 | 2.77 | Carbohydrate transport and metabolism |
| E | 226 | 5.01 | Amino acid transport and metabolism |
| F | 66 | 1.46 | Nucleotide transport and metabolism |
| H | 118 | 2.62 | Coenzyme transport and metabolism |
| I | 194 | 4.30 | Lipid transport and metabolism |
| P | 154 | 3.42 | Inorganic ion transport and metabolism |
| Q | 141 | 3.13 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 346 | 7.67 | General function prediction only |
| S | 184 | 4.08 | Function unknown |
| - | 2231 | 49.48 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 3Graphical circular map of the Williamsia sp. ARP1 genome. Starting from the outmost circle and moving inwards, each ring of the circle contains information of the genome: genes on the forward strand (colored according to their COG categories), CDS on the forward strand (blue arrows), CDS on the reverse strand (blue arrows), genes on the reverse strand (colored according to their COG categories), tRNA and rRNA genes on both strands (green and orange), GC content (black), GC skew (green and purple) and genome region by kbp
Fig. 4Metabolic subsystems of Williamsia sp. ARP1 annotated through the RAST webserver
Used actinomycete reference genomes in this study
| Species | Strain | Accession number | Genome Size [Mbp] | G+C content |
|---|---|---|---|---|
|
| D3 | NZ_AYTE000000000.1 | 5.62 | 64.60 |
|
| CP001802.1 | 5.21 | 67.00 | |
|
| VH2 | NC_016906.1 | 5.67 | 66.96 |
Fig. 5Comparison of COG classes between strain ARP1 and reference genomes. The color keys provide the relative percentage of each COG class per genome. The dendrogram is based on correlation analysis