| Literature DB >> 26927064 |
Yan-Ping Ma1,2, Hao Ke3, Zhi-Ling Liang4, Zhen-Xing Liu5, Le Hao6, Jiang-Yao Ma7, Yu-Gu Li8.
Abstract
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.Entities:
Keywords: Streptococcus agalactiae; codon usage bias; evolution; multiple evolutionary selections
Mesh:
Substances:
Year: 2016 PMID: 26927064 PMCID: PMC4813141 DOI: 10.3390/ijms17030277
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The synonymous codon usage pattern in all open reading frames (ORFs) of S. agalactiae.
| Amino Acid | Codon | RSCU a |
|---|---|---|
| Ala | GC |
|
| GCC | 0.44 | |
| GCG | 0.37 | |
| GC |
| |
| Arg | AG |
|
| AGG | 0.56 | |
| CGA | 0.72 | |
| CGC | 0.67 | |
| CGG | 0.18 | |
| CG |
| |
| Asn | AAC | 0.50 |
| AA |
| |
| Asp | GAC | 0.43 |
| GA |
| |
| Cys | UGC | 0.41 |
| UG |
| |
| Gln | CA |
|
| CAG | 0.44 | |
| Glu | GA |
|
| GAG | 0.52 | |
| Gly | GG |
|
| GGC | 0.46 | |
| GGG | 0.47 | |
| GG |
| |
| His | CAC | 0.53 |
| CA |
| |
| Ile | AUA | 0.55 |
| AUC | 0.61 | |
| AU |
| |
| Leu | CUA | 0.73 |
| CUC | 0.35 | |
| CUG | 0.25 | |
| CU |
| |
| UU |
| |
| UUG | 0.98 | |
| Lys | AA |
|
| AAG | 0.48 | |
| Phe | UUC | 0.45 |
| UU |
| |
| Pro | CC |
|
| CCC | 0.32 | |
| CCG | 0.32 | |
| CC |
| |
| Ser | AGC | 0.57 |
| AG |
| |
| UC |
| |
| UCC | 0.32 | |
| UCG | 0.32 | |
| UC |
| |
| Thr | AC |
|
| ACC | 0.48 | |
| ACG | 0.41 | |
| AC |
| |
| Tyr | UAC | 0.51 |
| UA |
| |
| Val | GU |
|
| GUC | 0.51 | |
| GUG | 0.44 | |
| GU |
|
a Average value of RSCU in five S. agalactiae genome; underline represented A/U rich-ending compositional constraints.
Figure 1The plot of the two most dominant axes (axis 1 and axis 2) calculated from PCA according to all open reading frames (ORFs) of S. agalactiae genes. Genes located on the leading strand were denoted as green plus signs, whereas orange plus signs indicated lagging strand genes. Axis 1 represented the first major trend and axis 2 represented the second major trend.
Nucleotide composition statistics for a gene population in five S. agalactiae genomes.
| Skew | Axis 1 | Axis 2 |
|---|---|---|
| GC skew | ||
| AT skew | ||
| GC12 skew | ||
| AT12 skew | ||
| GC3 skew | ||
| AT3 skew |
* Significant difference between them; ** highly significant difference between them.
Figure 2Graphs showing the relationship between effective number of codon (ENC) and the GC3% in functional genes of S. agalactiae genomes. The continuous curve indicated the expected codon usage if GC compositional constraints alone account for codon usage bias.
Figure 3PR2—plot for S. agalactiae genomes. Genes located on the leading strand were denoted as black triangles, whereas blue asterisks indicated lagging strand genes.
Figure 4The correlation between CAI values and ENC, GC%, GC12%, GC3% values. (A) the correlation between CAI values and ENC values, the black dot represented the ENC value against CAI value for each gene, the black line was generated by the correlation analysis; (B) the correlation between CAI values and GC% values, the black dot represented the GC% value against CAI value for each gene, the black line represented linear fitting line of all data; (C) the correlation between CAI values and GC12% values, the black dot represented the GC12% value against CAI value for each gene, the black line represented linear fitting line of all data; (D) the correlation between CAI values and GC3% values, the black dot represented the GC3% value against CAI value for each gene, the black line represented linear fitting line of all data.