Jayanti Saha1, Barnan K Saha1, Monalisha Pal Sarkar2, Vivek Roy1, Parimal Mandal2, Ayon Pal1.
Abstract
Soil is a diversified and complex ecological niche, home to a myriad of microorganisms, particularly bacteria. The physico-chemical complexities of soil result in a plethora of physiological variations among the different types of soil-dwelling bacteria, giving rise to a wide variation in genome structure and complexity. This makes it attractive to analyze and compare the genomes of a large number of soil bacteria to comprehend their genome complexity and evolution. In this study, a combined analysis of codon usage and molecular phylogenetics was performed on the whole genomes and key housekeeping genes, namely infB (translation initiation factor 2), trpB (tryptophan synthase, beta subunit), atpD (ATP synthase, beta subunit), and rpoB (RNA polymerase, beta subunit), of 92 soil bacterial species spread across the entire eubacterial domain and residing in different soil types. The results indicated a direct relationship of genome size with codon bias and coding frequency in the studied bacteria. The codon usage profile of trpB was found to be relatively different from that of the other housekeeping genes, with a large number of bacteria having a greater percentage of genes with Nc values less than the Nc of trpB. The overall codon usage bias profile also indicated that the codon usage bias in the key housekeeping genes of soil bacteria was mainly due to selectional pressure and not mutation. The analysis of hydrophobicity of the gene product encoded by the rpoB coding sequences demonstrated tight clustering across all the soil bacteria, suggesting conservation of protein structure for maintenance of form and function. The phylogenetic affinities inferred using the 16S rRNA gene and the housekeeping genes demonstrated conflicting signals, with the trpB gene being the noisiest one. The housekeeping gene atpD was found to show the least amount of evolutionary change in the soil bacteria considered in this study, except in two Clostridium species. The phylogenetic and codon usage analysis of the soil bacteria consistently demonstrated the relatedness of Azotobacter chroococcum with different species of the genus Pseudomonas.
Keywords: atpD gene; codon usage bias (CUB); housekeeping genes; infB gene; molecular phylogenetics; rpoB gene; soil bacteria; trpB gene
Year: 2019 PMID: 31921071 PMCID: PMC6928123 DOI: 10.3389/fmicb.2019.02896
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
List of genomic Nc (Ncavg) and genomic GC3 (GC3avg) along with standard deviation (SD) in the 92 species of soil bacteria analyzed in this study.
| Ncavg | SD | GC3avg | SD | |
| --- | --- | --- | --- | --- |
| 46.19 | 6.19 | 0.74 | 0.1 | |
| 41.72 | 7.09 | 0.8 | 0.1 | |
| 35.58 | 5.62 | 0.91 | 0.08 | |
| 32.94 | 3.46 | 0.91 | 0.05 | |
| 37.3 | 6.82 | 0.85 | 0.091 | |
| 32.9 | 4.39 | 0.94 | 0.054 | |
| 49.1 | 5.73 | 0.7 | 0.108 | |
| 47.23 | 5.83 | 0.75 | 0.098 | |
| 45.43 | 5.05 | 0.3 | 0.08 | |
| 45.34 | 6.13 | 0.77 | 0.092 | |
| 37.17 | 6.68 | 0.89 | 0.1 | |
| 49.53 | 5.1 | 0.65 | 0.098 | |
| 35.29 | 5.87 | 0.9 | 0.072 | |
| 44.88 | 5.8 | 0.75 | 0.072 | |
| 44.47 | 5.15 | 0.71 | 0.08 | |
| 34.37 | 7.17 | 0.89 | 0.102 | |
| 47.55 | 4.76 | 0.27 | 0.072 | |
| 52.53 | 4.92 | 0.49 | 0.115 | |
| 49.48 | 5.2 | 0.31 | 0.082 | |
| 46.97 | 4.92 | 0.29 | 0.077 | |
| 53.14 | 4.4 | 0.48 | 0.084 | |
| 46.17 | 4.9 | 0.26 | 0.074 | |
| 51.82 | 5.22 | 0.37 | 0.097 | |
| 52.36 | 4.88 | 0.43 | 0.1 | |
| 48.77 | 5.19 | 0.32 | 0.082 | |
| 53 | 5.25 | 0.4 | 0.101 | |
| 48.32 | 4.86 | 0.29 | 0.08 | |
| 48.95 | 5.13 | 0.32 | 0.08 | |
| 51.19 | 4.92 | 0.36 | 0.08 | |
| 50.99 | 4.92 | 0.33 | 0.08 | |
| 52.62 | 5.13 | 0.41 | 0.11 | |
| 50.57 | 5.14 | 0.34 | 0.08 | |
| 44.4 | 5.19 | 0.25 | 0.07 | |
| 51.81 | 5.07 | 0.4 | 0.09 | |
| 52.98 | 5.44 | 0.4 | 0.1 | |
| 52.52 | 5.07 | 0.4 | 0.1 | |
| 52.65 | 4.84 | 0.49 | 0.12 | |
| 52.62 | 4.97 | 0.41 | 0.11 | |
| 47.5 | 4.98 | 0.62 | 0.11 | |
| 46.09 | 3.99 | 0.44 | 0.09 | |
| 46.69 | 5.54 | 0.74 | 0.09 | |
| 46.81 | 5.74 | 0.71 | 0.13 | |
| 34.79 | 6.23 | 0.9 | 0.08 | |
| 33.65 | 5.27 | 0.92 | 0.07 | |
| 49.15 | 4.75 | 0.6 | 0.08 | |
| 34.18 | 6.51 | 0.9 | 0.11 | |
| 34.29 | 6.16 | 0.9 | 0.09 | |
| 39.63 | 3.87 | 0.15 | 0.05 | |
| 37.22 | 4.12 | 0.13 | 0.06 | |
| 35.93 | 3.34 | 0.1 | 0.05 | |
| 38.64 | 4.05 | 0.13 | 0.06 | |
| 35.46 | 3.89 | 0.12 | 0.05 | |
| 38.86 | 4.27 | 0.17 | 0.06 | |
| 37.26 | 4.01 | 0.12 | 0.06 | |
| 35.92 | 4.2 | 0.13 | 0.05 | |
| 36.22 | 3.75 | 0.13 | 0.05 | |
| 50.5 | 4.43 | 0.59 | 0.1 | |
| 51.08 | 4.72 | 0.59 | 0.11 | |
| 53.33 | 3.97 | 0.46 | 0.08 | |
| 51.36 | 4.06 | 0.47 | 0.1 | |
| 44.65 | 5.05 | 0.26 | 0.09 | |
| 48.92 | 5.39 | 0.49 | 0.15 | |
| 45.9 | 5.75 | 0.74 | 0.07 | |
| 30.94 | 3.68 | 0.93 | 0.05 | |
| 30.37 | 3.84 | 0.95 | 0.06 | |
| 32.76 | 3.68 | 0.9 | 0.05 | |
| 31.66 | 3.79 | 0.92 | 0.05 | |
| 29.99 | 3.91 | 0.95 | 0.05 | |
| 30.68 | 3.93 | 0.94 | 0.05 | |
| 53.58 | 4.32 | 0.47 | 0.11 | |
| 51.29 | 4.55 | 0.58 | 0.1 | |
| 42.46 | 6.34 | 0.81 | 0.09 | |
| 42.18 | 5.47 | 0.82 | 0.08 | |
| 36.42 | 4.14 | 0.88 | 0.05 | |
| 35.12 | 4.34 | 0.89 | 0.05 | |
| 40.16 | 6.64 | 0.81 | 0.1 | |
| 30.22 | 6.2 | 0.93 | 0.09 | |
| 40.55 | 6.41 | 0.8 | 0.1 | |
| 37.69 | 6.15 | 0.81 | 0.08 | |
| 35.46 | 4.96 | 0.87 | 0.07 | |
| 34.37 | 6.43 | 0.87 | 0.09 | |
| 44.3 | 6.45 | 0.76 | 0.08 | |
| 33.27 | 4.15 | 0.91 | 0.06 | |
| 32.89 | 4.08 | 0.92 | 0.06 | |
| 31.82 | 4.24 | 0.92 | 0.06 | |
| 32.47 | 4.94 | 0.92 | 0.07 | |
| 31.06 | 3.98 | 0.94 | 0.05 | |
| 33.76 | 3.92 | 0.9 | 0.05 | |
| 32.37 | 3.56 | 0.92 | 0.05 | |
| 36.2 | 5.29 | 0.89 | 0.07 | |
| 52.1 | 4.53 | 0.5 | 0.11 | |
| 51.65 | 6.35 | 0.42 | 0.1 |
Nc profile of the four housekeeping genes in 92 soil bacterial species.
| 37.50 | 8.69 | 34.19 | 12.00 | 37.28 | 8.91 | 38.91 | 7.28 | |
| 33.24 | 8.48 | 33.32 | 8.40 | 38.09 | 3.63 | 37.54 | 4.18 | |
| 29.18 | 6.40 | 27.56 | 8.02 | 35.89 | −0.31 | 29.49 | 6.09 | |
| 29.64 | 3.30 | 28.27 | 4.67 | 31.67 | 1.27 | 30.44 | 2.50 | |
| 32.41 | 4.89 | 32.68 | 4.62 | 34.50 | 2.80 | 32.43 | 4.87 | |
| 29.66 | 3.24 | 28.49 | 4.41 | 29.47 | 3.43 | 30.33 | 2.57 | |
| 49.59 | −0.49 | 40.55 | 8.55 | 49.24 | −0.14 | 56.32 | −7.22 | |
| 44.97 | 2.26 | 44.24 | 2.99 | 45.08 | 2.15 | 45.71 | 1.52 | |
| 37.48 | 7.95 | 35.81 | 9.62 | 36.15 | 9.28 | 38.95 | 6.48 | |
| 39.19 | 6.15 | 39.77 | 5.57 | 39.83 | 5.51 | 45.63 | −0.29 | |
| 28.95 | 8.22 | 27.24 | 9.93 | 36.35 | 0.82 | 29.38 | 7.79 | |
| 46.53 | 3.00 | 43.34 | 6.19 | 47.62 | 1.91 | 46.88 | 2.65 | |
| 30.07 | 5.22 | 29.60 | 5.69 | 29.22 | 6.07 | 30.09 | 5.20 | |
| 34.75 | 10.13 | 32.46 | 12.42 | 37.37 | 7.51 | 39.47 | 5.41 | |
| 38.28 | 6.19 | 35.55 | 8.92 | 40.07 | 4.40 | 44.19 | 0.28 | |
| 32.16 | 2.21 | 27.19 | 7.18 | 33.18 | 1.19 | 26.72 | 7.65 | |
| 40.58 | 6.97 | 0.00 | 47.55 | 43.52 | 4.03 | 49.97 | −2.42 | |
| 47.54 | 4.99 | 44.16 | 8.37 | 49.10 | 3.43 | 54.72 | −2.19 | |
| 46.67 | 2.81 | 39.11 | 10.37 | 42.59 | 6.89 | 46.55 | 2.93 | |
| 40.70 | 6.27 | 40.87 | 6.10 | 40.85 | 6.12 | 44.80 | 2.17 | |
| 51.29 | 1.85 | 50.22 | 2.92 | 48.68 | 4.46 | 53.87 | −0.73 | |
| 43.14 | 3.03 | 37.06 | 9.11 | 39.46 | 6.71 | 48.39 | −2.22 | |
| 46.28 | 5.54 | 42.63 | 9.19 | 48.19 | 3.63 | 48.87 | 2.95 | |
| 47.66 | 4.70 | 42.53 | 9.83 | 47.23 | 5.13 | 52.57 | −0.21 | |
| 39.18 | 9.59 | 36.91 | 11.86 | 40.47 | 8.30 | 48.72 | 0.05 | |
| 47.47 | 5.53 | 37.72 | 15.28 | 43.34 | 9.66 | 57.20 | −4.20 | |
| 42.97 | 5.35 | 40.65 | 7.67 | 40.26 | 8.06 | 49.34 | −1.02 | |
| 38.87 | 10.08 | 37.08 | 11.87 | 39.92 | 9.03 | 46.71 | 2.24 | |
| 48.89 | 2.30 | 44.07 | 7.12 | 48.58 | 2.61 | 50.84 | 0.35 | |
| 43.49 | 7.50 | 40.37 | 10.62 | 44.14 | 6.85 | 52.62 | −1.63 | |
| 48.77 | 3.85 | 43.63 | 8.99 | 48.95 | 3.67 | 51.61 | 1.01 | |
| 42.98 | 7.59 | 37.15 | 13.42 | 44.27 | 6.30 | 49.54 | 1.03 | |
| 39.35 | 5.05 | 35.43 | 8.97 | 39.16 | 5.24 | 44.55 | −0.15 | |
| 46.80 | 5.01 | 40.39 | 11.42 | 47.68 | 4.13 | 53.81 | −2.00 | |
| 46.13 | 6.85 | 38.89 | 14.09 | 45.46 | 7.52 | 56.43 | −3.45 | |
| 50.58 | 1.94 | 44.96 | 7.56 | 47.69 | 4.83 | 47.03 | 5.49 | |
| 0.00 | 52.65 | 43.92 | 8.73 | 50.79 | 1.86 | 55.11 | −2.46 | |
| 47.94 | 4.68 | 44.60 | 8.02 | 50.12 | 2.50 | 52.54 | 0.08 | |
| 38.05 | 9.45 | 33.31 | 14.19 | 41.15 | 6.35 | 0.00 | 47.50 | |
| 48.30 | −2.21 | 40.32 | 5.77 | 48.50 | −2.41 | 47.32 | −1.23 | |
| 39.78 | 6.91 | 37.56 | 9.13 | 42.08 | 4.61 | 44.61 | 2.08 | |
| 45.03 | 1.78 | 39.33 | 7.48 | 45.72 | 1.09 | 45.06 | 1.75 | |
| 29.79 | 5.00 | 28.80 | 5.99 | 28.86 | 5.93 | 29.06 | 5.73 | |
| 28.67 | 4.98 | 28.79 | 4.86 | 29.24 | 4.41 | 28.95 | 4.70 | |
| 48.43 | 0.72 | 47.96 | 1.19 | 47.60 | 1.55 | 46.72 | 2.43 | |
| 27.84 | 6.34 | 30.31 | 3.87 | 28.12 | 6.06 | 30.65 | 3.53 | |
| 28.85 | 5.44 | 31.12 | 3.17 | 30.90 | 3.39 | 32.31 | 1.98 | |
| 37.93 | 1.70 | 32.25 | 7.38 | 36.29 | 3.34 | 36.16 | 3.47 | |
| 37.19 | 0.03 | 40.54 | −3.32 | 33.63 | 3.59 | 0.00 | 37.22 | |
| 34.60 | 1.33 | 32.30 | 3.63 | 32.98 | 2.95 | 37.88 | −1.95 | |
| 37.91 | 0.73 | 32.81 | 5.83 | 32.89 | 5.75 | 0.00 | 38.64 | |
| 35.12 | 0.34 | 34.29 | 1.17 | 33.71 | 1.75 | 0.00 | 35.46 | |
| 36.54 | 2.32 | 35.42 | 3.44 | 34.25 | 4.61 | 34.67 | 4.19 | |
| 34.68 | 2.58 | 33.44 | 3.82 | 33.13 | 4.13 | 33.06 | 4.20 | |
| 34.28 | 1.64 | 32.60 | 3.32 | 33.72 | 2.20 | 0.00 | 35.92 | |
| 35.06 | 1.16 | 0.00 | 36.22 | 32.03 | 4.19 | 0.00 | 36.22 | |
| 49.66 | 0.84 | 46.19 | 4.31 | 53.08 | −2.58 | 52.28 | −1.78 | |
| 50.07 | 1.01 | 44.07 | 7.01 | 49.09 | 1.99 | 40.78 | 10.30 | |
| 47.97 | 5.36 | 55.38 | −2.05 | 48.30 | 5.03 | 51.08 | 2.25 | |
| 51.15 | 0.21 | 38.74 | 12.62 | 50.29 | 1.07 | 50.63 | 0.73 | |
| 38.61 | 6.04 | 36.86 | 7.79 | 39.47 | 5.18 | 45.79 | −1.14 | |
| 44.04 | 4.88 | 41.36 | 7.56 | 47.33 | 1.59 | 0.00 | 48.92 | |
| 32.11 | 13.79 | 32.41 | 13.49 | 41.29 | 4.61 | 43.07 | 2.83 | |
| 28.58 | 2.36 | 27.12 | 3.82 | 31.49 | −0.55 | 27.21 | 3.73 | |
| 28.45 | 1.92 | 27.53 | 2.84 | 30.84 | −0.47 | 28.37 | 2.00 | |
| 30.03 | 2.73 | 28.85 | 3.91 | 31.03 | 1.73 | 28.90 | 3.86 | |
| 28.28 | 3.38 | 27.16 | 4.50 | 30.71 | 0.95 | 27.30 | 4.36 | |
| 26.00 | 3.99 | 24.83 | 5.16 | 27.74 | 2.25 | 28.44 | 1.55 | |
| 28.94 | 1.74 | 27.46 | 3.22 | 31.37 | −0.69 | 26.91 | 3.77 | |
| 49.89 | 3.69 | 45.09 | 8.49 | 48.37 | 5.21 | 56.33 | −2.75 | |
| 51.59 | −0.30 | 50.25 | 1.04 | 51.56 | −0.27 | 54.07 | −2.78 | |
| 30.39 | 12.07 | 32.25 | 10.21 | 31.70 | 10.76 | 35.06 | 7.40 | |
| 33.13 | 9.05 | 32.84 | 9.34 | 31.66 | 10.52 | 35.01 | 7.17 | |
| 32.32 | 4.10 | 29.28 | 7.14 | 28.07 | 8.35 | 34.78 | 1.64 | |
| 0.00 | 35.12 | 28.38 | 6.74 | 28.08 | 7.04 | 31.71 | 3.41 | |
| 32.71 | 7.45 | 34.20 | 5.96 | 36.25 | 3.91 | 30.43 | 9.73 | |
| 28.05 | 2.17 | 27.58 | 2.64 | 29.73 | 0.49 | 25.44 | 4.78 | |
| 34.57 | 5.98 | 34.30 | 6.25 | 36.57 | 3.98 | 31.77 | 8.78 | |
| 30.26 | 7.44 | 30.27 | 7.42 | 31.31 | 6.38 | 27.35 | 10.34 | |
| 32.14 | 3.32 | 28.44 | 7.02 | 33.78 | 1.68 | 29.83 | 5.63 | |
| 31.75 | 2.62 | 32.11 | 2.26 | 32.16 | 2.21 | 25.44 | 8.93 | |
| 31.52 | 12.78 | 32.43 | 11.87 | 37.16 | 7.14 | 34.63 | 9.67 | |
| 28.77 | 4.50 | 27.90 | 5.37 | 30.74 | 2.53 | 27.33 | 5.94 | |
| 28.66 | 4.23 | 29.79 | 3.10 | 29.07 | 3.82 | 29.09 | 3.80 | |
| 28.64 | 3.18 | 27.09 | 4.73 | 29.33 | 2.49 | 27.47 | 4.35 | |
| 28.33 | 4.14 | 28.36 | 4.11 | 29.46 | 3.01 | 26.90 | 5.57 | |
| 27.22 | 3.84 | 26.76 | 4.30 | 29.34 | 1.72 | 26.76 | 4.30 | |
| 28.95 | 4.81 | 30.20 | 3.56 | 30.55 | 3.21 | 28.55 | 5.21 | |
| 28.18 | 4.19 | 29.01 | 3.36 | 31.01 | 1.36 | 28.32 | 4.05 | |
| 31.69 | 4.51 | 29.17 | 7.03 | 31.76 | 4.44 | 31.79 | 4.41 | |
| 44.23 | 7.87 | 40.85 | 11.25 | 44.28 | 7.82 | 51.70 | 0.40 | |
| 35.91 | 15.74 | 54.74 | −3.09 | 37.30 | 14.35 | 51.51 | 0.14 |
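In the Nc profile table above, the second value of each pair appears to be the genomic average Nc of the species minus the Nc of the gene; for the first row, 46.19 − 37.50 = 8.69. Entries where the gene Nc is 0.00 carry the full genomic Nc as the difference, which probably marks a value that was not available. The following minimal Python sketch reproduces that arithmetic for the first row. The function name `nc_offsets` is hypothetical and not taken from the study.

```python
def nc_offsets(genomic_nc, gene_ncs):
    """Return genomic Nc minus each gene Nc, rounded to two decimals.

    A positive offset means the gene is more biased (lower Nc) than the
    genome average; a negative offset means it is less biased.
    """
    return [round(genomic_nc - nc, 2) for nc in gene_ncs]


# First species row: genomic Ncavg = 46.19, followed by the four gene Nc values.
first_row = nc_offsets(46.19, [37.50, 34.19, 37.28, 38.91])
print(first_row)
```

The printed offsets can be compared against the second, fourth, sixth, and eighth cells of the first table row.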
GC3 profile of the four housekeeping genes in 92 soil bacterial species.
| 0.78 | −0.04 | 0.76 | −0.02 | 0.81 | −0.07 | 0.86 | −0.12 | |
| 0.88 | −0.08 | 0.86 | −0.06 | 0.81 | −0.01 | 0.87 | −0.07 | |
| 0.96 | −0.05 | 0.97 | −0.06 | 0.88 | 0.03 | 1.00 | −0.09 | |
| 0.93 | −0.02 | 0.92 | −0.01 | 0.83 | 0.08 | 0.92 | −0.01 | |
| 0.84 | 0.01 | 0.82 | 0.03 | 0.82 | 0.03 | 0.89 | −0.04 | |
| 0.88 | 0.06 | 0.87 | 0.07 | 0.92 | 0.02 | 0.95 | −0.01 | |
| 0.66 | 0.04 | 0.79 | −0.09 | 0.67 | 0.03 | 0.55 | 0.15 | |
| 0.73 | 0.02 | 0.78 | −0.03 | 0.74 | 0.01 | 0.79 | −0.04 | |
| 0.17 | 0.13 | 0.16 | 0.14 | 0.15 | 0.15 | 0.25 | 0.05 | |
| 0.83 | −0.06 | 0.75 | 0.02 | 0.83 | −0.06 | 0.79 | −0.02 | |
| 0.97 | −0.08 | 0.96 | −0.07 | 0.86 | 0.03 | 0.99 | −0.10 | |
| 0.66 | −0.01 | 0.67 | −0.02 | 0.61 | 0.04 | 0.71 | −0.06 | |
| 0.86 | 0.04 | 0.86 | 0.04 | 0.88 | 0.02 | 0.93 | −0.03 | |
| 0.73 | 0.02 | 0.67 | 0.08 | 0.70 | 0.05 | 0.71 | 0.04 | |
| 0.63 | 0.08 | 0.57 | 0.14 | 0.63 | 0.08 | 0.67 | 0.04 | |
| 0.86 | 0.03 | 0.87 | 0.02 | 0.82 | 0.07 | 0.96 | −0.07 | |
| 0.20 | 0.07 | 0.00 | 0.27 | 0.22 | 0.05 | 0.31 | −0.04 | |
| 0.31 | 0.18 | 0.32 | 0.17 | 0.41 | 0.08 | 0.44 | 0.05 | |
| 0.25 | 0.06 | 0.13 | 0.18 | 0.21 | 0.10 | 0.28 | 0.03 | |
| 0.16 | 0.13 | 0.14 | 0.15 | 0.20 | 0.09 | 0.22 | 0.07 | |
| 0.46 | 0.02 | 0.44 | 0.04 | 0.53 | −0.05 | 0.57 | −0.09 | |
| 0.17 | 0.09 | 0.13 | 0.13 | 0.15 | 0.11 | 0.29 | −0.03 | |
| 0.28 | 0.09 | 0.22 | 0.15 | 0.29 | 0.08 | 0.31 | 0.06 | |
| 0.35 | 0.08 | 0.25 | 0.18 | 0.31 | 0.12 | 0.42 | 0.01 | |
| 0.18 | 0.14 | 0.13 | 0.19 | 0.20 | 0.12 | 0.30 | 0.02 | |
| 0.27 | 0.13 | 0.16 | 0.24 | 0.26 | 0.14 | 0.41 | −0.01 | |
| 0.21 | 0.08 | 0.18 | 0.11 | 0.20 | 0.09 | 0.26 | 0.03 | |
| 0.18 | 0.14 | 0.13 | 0.19 | 0.18 | 0.14 | 0.28 | 0.04 | |
| 0.33 | 0.03 | 0.25 | 0.11 | 0.32 | 0.04 | 0.41 | −0.05 | |
| 0.23 | 0.10 | 0.16 | 0.17 | 0.21 | 0.12 | 0.42 | −0.09 | |
| 0.33 | 0.08 | 0.32 | 0.09 | 0.32 | 0.09 | 0.54 | −0.13 | |
| 0.21 | 0.13 | 0.14 | 0.20 | 0.23 | 0.11 | 0.37 | −0.03 | |
| 0.18 | 0.07 | 0.12 | 0.13 | 0.16 | 0.09 | 0.31 | −0.06 | |
| 0.24 | 0.16 | 0.19 | 0.21 | 0.33 | 0.07 | 0.40 | 0.00 | |
| 0.26 | 0.14 | 0.15 | 0.25 | 0.29 | 0.11 | 0.44 | −0.04 | |
| 0.33 | 0.07 | 0.29 | 0.11 | 0.29 | 0.11 | 0.35 | 0.05 | |
| 0.00 | 0.49 | 0.31 | 0.18 | 0.39 | 0.10 | 0.45 | 0.04 | |
| 0.32 | 0.09 | 0.32 | 0.09 | 0.31 | 0.10 | 0.49 | −0.08 | |
| 0.29 | 0.33 | 0.24 | 0.38 | 0.48 | 0.14 | 0.00 | 0.62 | |
| 0.54 | −0.10 | 0.42 | 0.02 | 0.47 | −0.03 | 0.48 | −0.04 | |
| 0.81 | −0.07 | 0.80 | −0.06 | 0.76 | −0.02 | 0.76 | −0.02 | |
| 0.58 | 0.13 | 0.45 | 0.26 | 0.62 | 0.09 | 0.76 | −0.05 | |
| 0.86 | 0.04 | 0.81 | 0.09 | 0.86 | 0.04 | 0.94 | −0.04 | |
| 0.87 | 0.05 | 0.85 | 0.07 | 0.86 | 0.06 | 0.96 | −0.04 | |
| 0.55 | 0.05 | 0.51 | 0.09 | 0.55 | 0.05 | 0.65 | −0.05 | |
| 0.88 | 0.02 | 0.80 | 0.10 | 0.88 | 0.02 | 0.94 | −0.04 | |
| 0.90 | 0.00 | 0.82 | 0.08 | 0.84 | 0.06 | 0.95 | −0.05 | |
| 0.10 | 0.05 | 0.05 | 0.10 | 0.08 | 0.07 | 0.19 | −0.04 | |
| 0.09 | 0.04 | 0.12 | 0.01 | 0.08 | 0.05 | 0.00 | 0.13 | |
| 0.08 | 0.02 | 0.05 | 0.05 | 0.02 | 0.08 | 0.05 | 0.05 | |
| 0.07 | 0.06 | 0.05 | 0.08 | 0.06 | 0.07 | 0.00 | 0.13 | |
| 0.09 | 0.03 | 0.07 | 0.05 | 0.07 | 0.05 | 0.00 | 0.12 | |
| 0.09 | 0.08 | 0.42 | −0.25 | 0.08 | 0.09 | 0.09 | 0.08 | |
| 0.07 | 0.05 | 0.43 | −0.31 | 0.04 | 0.08 | 0.11 | 0.01 | |
| 0.08 | 0.05 | 0.43 | −0.30 | 0.06 | 0.07 | 0.00 | 0.13 | |
| 0.10 | 0.03 | 0.00 | 0.13 | 0.05 | 0.08 | 0.00 | 0.13 | |
| 0.54 | 0.05 | 0.71 | −0.12 | 0.55 | 0.04 | 0.70 | −0.11 | |
| 0.55 | 0.04 | 0.43 | 0.16 | 0.60 | −0.01 | 0.74 | −0.15 | |
| 0.33 | 0.13 | 0.61 | −0.15 | 0.38 | 0.08 | 0.50 | −0.04 | |
| 0.43 | 0.04 | 0.23 | 0.24 | 0.42 | 0.05 | 0.59 | −0.12 | |
| 0.10 | 0.16 | 0.05 | 0.21 | 0.14 | 0.12 | 0.32 | −0.06 | |
| 0.33 | 0.16 | 0.29 | 0.20 | 0.47 | 0.02 | 0.00 | 0.49 | |
| 0.86 | −0.12 | 0.81 | −0.07 | 0.76 | −0.02 | 0.77 | −0.03 | |
| 0.91 | 0.02 | 0.95 | −0.02 | 0.83 | 0.10 | 0.98 | −0.05 | |
| 0.93 | 0.02 | 0.95 | 0.00 | 0.88 | 0.07 | 0.98 | −0.03 | |
| 0.89 | 0.01 | 0.88 | 0.02 | 0.80 | 0.10 | 0.94 | −0.04 | |
| 0.93 | −0.01 | 0.95 | −0.03 | 0.84 | 0.08 | 0.96 | −0.04 | |
| 0.96 | −0.01 | 0.96 | −0.01 | 0.92 | 0.03 | 0.98 | −0.03 | |
| 0.92 | 0.02 | 0.93 | 0.01 | 0.83 | 0.11 | 0.96 | −0.02 | |
| 0.35 | 0.12 | 0.28 | 0.19 | 0.30 | 0.17 | 0.38 | 0.09 | |
| 0.39 | 0.19 | 0.39 | 0.19 | 0.56 | 0.02 | 0.56 | 0.02 | |
| 0.94 | −0.13 | 0.89 | −0.08 | 0.91 | −0.10 | 0.91 | −0.10 | |
| 0.89 | −0.07 | 0.90 | −0.08 | 0.91 | −0.09 | 0.91 | −0.09 | |
| 0.85 | 0.03 | 0.85 | 0.03 | 0.91 | −0.03 | 0.90 | −0.02 | |
| 0.00 | 0.89 | 0.88 | 0.01 | 0.84 | 0.05 | 0.93 | −0.04 | |
| 0.61 | 0.20 | 0.61 | 0.20 | 0.60 | 0.21 | 0.86 | −0.05 | |
| 0.86 | 0.07 | 0.83 | 0.10 | 0.85 | 0.08 | 0.97 | −0.04 | |
| 0.63 | 0.17 | 0.60 | 0.20 | 0.63 | 0.17 | 0.90 | −0.10 | |
| 0.78 | 0.03 | 0.68 | 0.13 | 0.77 | 0.04 | 0.92 | −0.11 | |
| 0.81 | 0.06 | 0.86 | 0.01 | 0.79 | 0.08 | 0.89 | −0.02 | |
| 0.75 | 0.12 | 0.64 | 0.23 | 0.74 | 0.13 | 0.96 | −0.09 | |
| 0.82 | −0.06 | 0.72 | 0.04 | 0.71 | 0.05 | 0.82 | −0.06 | |
| 0.88 | 0.03 | 0.88 | 0.03 | 0.78 | 0.13 | 0.95 | −0.04 | |
| 0.90 | 0.02 | 0.86 | 0.06 | 0.84 | 0.08 | 0.96 | −0.04 | |
| 0.88 | 0.04 | 0.90 | 0.02 | 0.84 | 0.08 | 0.95 | −0.03 | |
| 0.89 | 0.03 | 0.86 | 0.06 | 0.84 | 0.08 | 0.97 | −0.05 | |
| 0.94 | 0.00 | 0.93 | 0.01 | 0.86 | 0.08 | 0.99 | −0.05 | |
| 0.90 | 0.00 | 0.90 | 0.00 | 0.80 | 0.10 | 0.98 | −0.08 | |
| 0.94 | −0.02 | 0.94 | −0.02 | 0.86 | 0.06 | 0.96 | −0.04 | |
| 0.93 | −0.04 | 0.92 | −0.03 | 0.91 | −0.02 | 0.93 | −0.04 | |
| 0.31 | 0.19 | 0.26 | 0.24 | 0.34 | 0.16 | 0.57 | −0.07 | |
| 0.22 | 0.20 | 0.51 | −0.09 | 0.19 | 0.23 | 0.44 | −0.02 |
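GC3 is the fraction of codons carrying G or C at the third position. The sketch below is a simplified illustration in Python, not the authors' pipeline: it counts every codon, whereas some codon usage tools report GC3s and exclude Met, Trp, and stop codons. The function name `gc3` and the toy sequence are assumptions made for the example.

```python
def gc3(cds):
    """Fraction of codons whose third base is G or C.

    Simplified: every complete codon is counted, including ATG, TGG and
    stop codons; trailing bases that do not fill a codon are ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3      # drop an incomplete final codon
    thirds = seq[2:usable:3]              # every third base, starting at index 2
    if not thirds:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in thirds) / len(thirds)


# Toy coding sequence: ATG GCC AAG CTG TAA -> third bases G, C, G, G, A
example = gc3("ATGGCCAAGCTGTAA")
print(example)
```

As with the Nc table, the second value of each pair in the GC3 table appears to be the genomic GC3avg minus the gene GC3 (first row: 0.74 − 0.78 = −0.04).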
Figure 1. A scatter plot depicting the hydrophobicity profile of the gene products encoded by the four housekeeping genes rpoB, atpD, infB, and trpB from the soil bacterial species considered in this study. The y-axis corresponds to the hydrophobicity value, whereas the x-axis corresponds to the bacterial species sorted in alphabetical order as given in Table 1.
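The caption does not state which hydrophobicity measure was plotted. One common choice is the GRAVY score, the mean Kyte-Doolittle hydropathy value over all residues of a protein. The sketch below assumes that measure purely for illustration; the function name `gravy` and the short test peptides are hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(protein):
    """Mean Kyte-Doolittle hydropathy of a protein sequence.

    Residues outside the 20 standard amino acids (e.g. X or *) are skipped.
    """
    residues = [aa for aa in protein.upper() if aa in KYTE_DOOLITTLE]
    if not residues:
        raise ValueError("no standard amino acids in sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


hydrophobic = gravy("AILV")   # positive: hydrophobic residues dominate
hydrophilic = gravy("KDE")    # negative: charged residues dominate
print(hydrophobic, hydrophilic)
```

Positive scores indicate an overall hydrophobic protein and negative scores an overall hydrophilic one, so orthologous proteins with conserved structure would be expected to fall in a narrow band of values.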
Figure 2. A combined genomic Nc plot utilizing all the coding sequences of the whole genomes, depicting the three typical modes of aggregation of coding sequences: left-centric aggregation, represented by Clostridium butyricum JKY6D1 (in purple); mid-centric aggregation, represented by Nitrosomonas communis Nm2 (in green); and right-centric aggregation, represented by Micrococcus luteus NCTC 2665 (in red). The dashed blue line represents the null hypothesis curve, i.e., the Nc expected if codon usage bias were solely due to mutation and not selection (Wright, 1990).
Figure 3. A combined Nc plot of the four housekeeping genes rpoB, atpD, infB, and trpB from the 92 soil bacterial species included in this study, depicting selectional pressure as a major unifying force in shaping codon usage pattern. The dashed blue line represents the null hypothesis curve, i.e., the Nc expected if codon usage bias were solely due to mutation and not selection (Wright, 1990).
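The null hypothesis curve in an Nc plot is usually drawn from Wright's (1990) approximation, Nc = 2 + s + 29 / (s² + (1 − s)²), where s is the GC3 value. Points lying well below the curve are commonly read as evidence of selection in addition to mutational bias. The Python sketch below evaluates the curve and checks one point against it; the function names are hypothetical, and the example point uses the genomic values from the first row of the Ncavg/GC3avg table (46.19, 0.74).

```python
def expected_nc(gc3s):
    """Expected Nc under mutation-only codon bias (Wright, 1990)."""
    if not 0.0 <= gc3s <= 1.0:
        raise ValueError("GC3 must lie between 0 and 1")
    return 2.0 + gc3s + 29.0 / (gc3s ** 2 + (1.0 - gc3s) ** 2)


def below_null_curve(observed_nc, gc3s):
    """True if the observed Nc falls below the mutation-only expectation."""
    return observed_nc < expected_nc(gc3s)


# First table row: genomic Nc = 46.19 at GC3 = 0.74.
print(round(expected_nc(0.74), 2), below_null_curve(46.19, 0.74))
```

The curve peaks at s = 0.5 and falls off symmetrically toward both compositional extremes, which is why strongly AT-rich and GC-rich genomes have low expected Nc even without selection.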
Figure 4. A phylogenetic tree showing the relationship between the soil bacterial species considered in this study based on 16S rRNA gene sequences, along with Gram nature, taxonomic position, and codon usage annotation data. The names of the species are colored according to their Gram nature, with magenta and blue representing Gram-negative and Gram-positive species, respectively. The outermost semicircle with magenta bars represents the genomic GC3, while the innermost semicircle with blue bars represents the genomic Nc. The middle strip with a yellow-to-red color gradient depicts the genomic GC content, with red representing maximum GC content. The evolutionary history was inferred by using the Maximum Likelihood method based on the Kimura 2-parameter model (Kimura, 1980). The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). The tree with the highest log likelihood (−6,331.0306) is shown. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites [five categories (+G, parameter = 0.5623)]. The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 40.6240% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. All positions containing gaps and missing data were eliminated. There were a total of 357 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 (Kumar et al., 2008). The visualization and annotation of the phylogenetic tree were done using iTOL ver. 4.4.2 (Letunic and Bork, 2007).
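The Kimura 2-parameter model named in the caption treats transitions and transversions as occurring at different rates. Its pairwise distance has the closed form d = −½ ln[(1 − 2P − Q) √(1 − 2Q)], where P and Q are the proportions of transitional and transversional differences. The sketch below computes only that pairwise distance for two aligned sequences; it is not the maximum likelihood tree search performed in MEGA6, it omits the Gamma and invariant-site corrections, and the function name and toy sequences are hypothetical.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Columns containing a gap or ambiguous base in either sequence are skipped.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    if 1 - 2 * q <= 0 or 1 - 2 * p - q <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# One transition (A<->G) across ten aligned sites.
d = k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")
print(round(d, 6))
```

Skipping gapped columns pair by pair differs slightly from the complete-deletion treatment described in the caption, where any column with a gap in any sequence is removed before analysis.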
Figure 5. Phylogenetic tree showing the relationship between the soil bacterial species considered in this study based on rpoB gene sequences, along with Gram nature, taxonomic position, and codon usage annotation data. The names of the species are colored according to their Gram nature, with magenta and blue representing Gram-negative and Gram-positive species, respectively. The outermost semicircle with green bars represents the GC3 content of the rpoB sequences, while the innermost semicircle with blue bars represents the Nc of the rpoB coding sequences. The middle strip with a cyan-to-orange color gradient depicts the variation in hydrophobicity of the protein encoded by the rpoB coding sequences. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model (Nei and Kumar, 2000). The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). The tree with the highest log likelihood (−73,689.4674) is shown. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites [five categories (+G, parameter = 0.7988)]. The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 22.2913% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. All positions containing gaps and missing data were eliminated. There were a total of 1,726 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 (Kumar et al., 2008). The visualization and annotation of the phylogenetic tree were done using iTOL ver. 4.4.2 (Letunic and Bork, 2007).
Figure 6. Phylogenetic tree showing the relationship between the soil bacterial species considered in this study based on atpD gene sequences, along with Gram nature, taxonomic position, and codon usage annotation data. The names of the species are colored according to their Gram nature, with magenta and blue representing Gram-negative and Gram-positive species, respectively. The outermost semicircle with green bars represents the GC3 content of the atpD sequences, while the innermost semicircle with blue bars represents the Nc of the atpD coding sequences. The middle strip with a cyan-to-orange color gradient depicts the variation in hydrophobicity of the protein encoded by the atpD coding sequences. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model (Nei and Kumar, 2000). The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites [five categories (+G, parameter = 0.9877)]. The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 6.2680% sites). All positions containing gaps and missing data were eliminated. There were a total of 1,123 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 (Kumar et al., 2008). The visualization and annotation of the phylogenetic tree were done using iTOL ver. 4.4.2 (Letunic and Bork, 2007).
Figure 7. Phylogenetic tree showing the relationship between the soil bacterial species considered in this study based on infB gene sequences, along with Gram nature, taxonomic position, and codon usage annotation data. The names of the species are colored according to their Gram nature, with magenta and blue representing Gram-negative and Gram-positive species, respectively. The outermost semicircle with green bars represents the GC3 content of the infB sequences, while the innermost semicircle with blue bars represents the Nc of the infB coding sequences. The middle strip with a cyan-to-orange color gradient depicts the variation in hydrophobicity of the protein encoded by the infB coding sequences. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model (Nei and Kumar, 2000). The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). The tree with the highest log likelihood (−84,778.7972) is shown. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites [five categories (+G, parameter = 1.1105)]. The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 16.2594% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. All positions containing gaps and missing data were eliminated. There were a total of 1,617 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 (Kumar et al., 2008). The visualization and annotation of the phylogenetic tree were done using iTOL ver. 4.4.2 (Letunic and Bork, 2007).
Figure 8. Phylogenetic tree showing the relationship between the soil bacterial species considered in this study based on trpB gene sequences, along with Gram nature, taxonomic position, and codon usage annotation data. The names of the species are colored according to their Gram nature, with magenta and blue representing Gram-negative and Gram-positive species, respectively. The outermost semicircle with green bars represents the GC3 content of the trpB sequences, while the innermost semicircle with blue bars represents the Nc of the trpB coding sequences. The middle strip with a cyan-to-orange color gradient depicts the variation in hydrophobicity of the protein encoded by the trpB coding sequences. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model (Nei and Kumar, 2000). The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). The tree with the highest log likelihood (−57,296.2790) is shown. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites [five categories (+G, parameter = 1.2054)]. The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 14.7706% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. All positions containing gaps and missing data were eliminated. There were a total of 1,098 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 (Kumar et al., 2008). The visualization and annotation of the phylogenetic tree were done using iTOL ver. 4.4.2 (Letunic and Bork, 2007).