| Literature DB >> 22651236 |
Lijing Xu1, Jimmy Kuo2, Jong-Kang Liu3, Tit-Yee Wong1.
Abstract
BACKGROUND: The efficiencies of the stop codons TAA, TAG, and TGA in protein synthesis termination are not the same. These variations could allow many genes to be regulated. There are many similar nucleotide trimers found on the second and third reading-frames of a gene. They are called premature stop codons (PSC). Like stop codons, the PSC in bacterial genomes are also highly bias in terms of their quantities and qualities on the genes. Phylogenetically related species often share a similar PSC profile. We want to know whether the selective forces that influence the stop codons and the PSC usage biases in a genome are related. We also wish to know how strong these trimers in a genome are related to the natural history of the bacterium. Knowing these relations may provide better knowledge in the phylogeny of bacteriaEntities:
Year: 2012 PMID: 22651236 PMCID: PMC3466146 DOI: 10.1186/2042-5783-2-6
Source DB: PubMed Journal: Microb Inform Exp ISSN: 2042-5783
Figure 1Species correlations based on reading-frame-specific translation stop signals. Hierarchical clustering of 61 bacteria (A) correlation based on the genomic translation stop signals ratios on the first reading frames (TSSR-1); (B) correlation based on the genomic translation stop signals ratios on the second reading frames (TSSR-2) and, (C) correlation based on the genomic translation stop signals ratios on third reading frames (TSSR-3). Correlation distance is between zero and one with zero being 100% similar, and one being no correlation.
Figure 216S rRNA alignment reference tree. A phylogenetic reference tree is constructed from the 16SrRNA sequence alignment with 19 type species (see Table 1). This standard tree was used to validate the accuracy of other trees using bacterial translation stop signals profiles.
List of reference type species
| S000004313 | |
| S000139289 | |
| S000414306 | |
| S000003950 | |
| S000414651 | |
| S000414655 | |
| S000500280 | |
| S000436058 | |
| S000414667 | |
| S000437122 | Rickettsia typhi (T); Wilmington; |
| S000006115 | |
| S000926440 | |
| S000926444 | |
| S000000258 | |
| S000013935 | |
| S000392501 | |
| S000392506 | |
| S000392498 | |
| S000413795 |
The 16SrRNA sequences of 18 type species from Ribosomal Database Project were used for the construction of reference phylogenetic tree in Figure 2.
Figure 3Species correlation based on genomic translation stop signals on all three reading-frames. Distance correlation of 61 bacteria based on their Genomic Translation Stop Signals Ratio. A species is represented by the average value of all its Genic-TSSR (Genomic-TSSR). The Genomic-TSSR values of 61 bacterial genomes were clustered by Hierarchical clustering by City-Block Distance, Complete-Linage. Parentheses show the genomic size and GC ratio of that species. Correlation distance is between zero and one with zero being 100% similar, and one being no correlation.
Figure 4Correspondence Analysis of individual genes from four different species. Five hundred randomly selected genes from each of the genomes of four different bacteria were selected for CA analysis. At 95% confidence, four clusters of genes could be recognized: Escherichia coli CFT073 (ECOL, solid line, Turquoise), Salmonella typhimurium LTS (SALM, dashed line, Green), Rickettsia typhi Wilmington (RICK, dotted-dashed line, Dark Blue) and Neisseria meningitidis Mc58 (NEIS, dotted line, Red). Also showed are the centroids of the genes of E. coli (E), S. typhimurium (S), R. typhi (R), and N. meningitidis (N).
Figure 5Kolmogorov-Smirnov test for discrete distributions of Genic-TSSRs on a chromosome. A two-sided Kolmogorov-Smirnov test (KS-test) comparing the average percentage of counts of each translation stop signals among 400 genes from Escherichia coli K12 based on assigning these genes to three categories. (A) genes are assigned based on the location on each replichore (Left (∆) vs. Right (▼)); (B) genes are assigned based on their orientation on the leading or lagging stands of the DNA (Forward (■) vs. Reverse (□)), or (C) genes are assigned based on their proximity to the replication origin or terminus (Ori (●) vs. Ter (○)). The insert shows the standard deviation of the average percentage of counts of all nine types of translation stop signals in each gene assignment.
Organisms used in this study and their corresponding genomic Translation stop signal ratios (Genomic-TSSR)
| α | 357244 | Orientia tsutsugamushi Boryong | 0.010 | 0.003 | 0.003 | 0.186 | 0.157 | 0.063 | 0.281 | 0.102 | 0.195 |
| | 293614 | Rickettsia akari str. Hartford | 0.009 | 0.004 | 0.005 | 0.200 | 0.164 | 0.061 | 0.277 | 0.088 | 0.192 |
| | 391896 | Rickettsia bellii OSU 85-389 | 0.009 | 0.003 | 0.003 | 0.198 | 0.163 | 0.053 | 0.285 | 0.086 | 0.200 |
| | 336407 | Rickettsia bellii RML369-C | 0.009 | 0.003 | 0.002 | 0.198 | 0.164 | 0.053 | 0.286 | 0.086 | 0.199 |
| | 293613 | Rickettsia canadensis str. McKi | 0.010 | 0.003 | 0.003 | 0.205 | 0.163 | 0.057 | 0.280 | 0.090 | 0.189 |
| | 272944 | Rickettsia conorii Malish 7 | 0.010 | 0.004 | 0.003 | 0.206 | 0.165 | 0.058 | 0.279 | 0.089 | 0.185 |
| | 315456 | Rickettsia felis URRWXCal2 | 0.008 | 0.004 | 0.003 | 0.204 | 0.167 | 0.054 | 0.283 | 0.088 | 0.189 |
| | 416276 | Rickettsia massiliae MTU5 | 0.008 | 0.003 | 0.002 | 0.206 | 0.166 | 0.056 | 0.281 | 0.091 | 0.186 |
| | 272947 | Rickettsia prowazekii str. Madrid E | 0.009 | 0.002 | 0.002 | 0.204 | 0.163 | 0.059 | 0.283 | 0.086 | 0.193 |
| | 452659 | Rickettsia rickettsii str. Iowa | 0.011 | 0.004 | 0.004 | 0.205 | 0.165 | 0.059 | 0.279 | 0.088 | 0.185 |
| | 392021 | Rickettsia rickettsii str. Shei | 0.010 | 0.004 | 0.004 | 0.204 | 0.167 | 0.058 | 0.279 | 0.089 | 0.186 |
| | 257363 | Rickettsia typhi str. Wilmington | 0.008 | 0.002 | 0.002 | 0.202 | 0.161 | 0.059 | 0.284 | 0.086 | 0.195 |
| | 80849 | Wolbachia pipientis Drosophila (wBm) | 0.016 | 0.009 | 0.008 | 0.198 | 0.153 | 0.098 | 0.209 | 0.082 | 0.228 |
| | 955 | Wolbachia pipientis Brugia (wMel) | 0.011 | 0.006 | 0.005 | 0.201 | 0.158 | 0.095 | 0.216 | 0.085 | 0.223 |
| β | 218491 | Neisseria flavescens SK114 | 0.027 | 0.004 | 0.010 | 0.078 | 0.074 | 0.329 | 0.176 | 0.025 | 0.278 |
| | 242231 | Neisseria gonorrhoeae FA1090 | 0.026 | 0.006 | 0.020 | 0.098 | 0.066 | 0.402 | 0.141 | 0.021 | 0.222 |
| | 374833 | Neisseria meningitidis 053442 | 0.024 | 0.006 | 0.018 | 0.095 | 0.069 | 0.380 | 0.152 | 0.024 | 0.232 |
| | 272831 | Neisseria meningitidis FAM18 | 0.023 | 0.005 | 0.019 | 0.096 | 0.072 | 0.381 | 0.152 | 0.025 | 0.227 |
| | 122586 | Neisseria meningitidis MC58 | 0.024 | 0.006 | 0.018 | 0.097 | 0.073 | 0.372 | 0.157 | 0.026 | 0.227 |
| | 122587 | Neisseria meningitidis Z2491 | 0.025 | 0.006 | 0.019 | 0.097 | 0.070 | 0.383 | 0.152 | 0.024 | 0.225 |
| γ | 316407 | Escherichia coli W3110 | 0.020 | 0.003 | 0.012 | 0.091 | 0.064 | 0.274 | 0.205 | 0.040 | 0.291 |
| | 362663 | Escherichia coli 536 | 0.018 | 0.002 | 0.009 | 0.092 | 0.066 | 0.272 | 0.207 | 0.040 | 0.296 |
| | 585055 | Escherichia coli 55989 | 0.017 | 0.002 | 0.009 | 0.089 | 0.060 | 0.269 | 0.209 | 0.039 | 0.305 |
| | 405955 | Escherichia coli APEC O1 | 0.017 | 0.002 | 0.008 | 0.092 | 0.064 | 0.270 | 0.209 | 0.039 | 0.301 |
| | 481805 | Escherichia coli ATCC 8739 | 0.018 | 0.002 | 0.008 | 0.091 | 0.065 | 0.275 | 0.206 | 0.040 | 0.295 |
| | 413997 | Escherichia coli B str. REL606 | 0.018 | 0.002 | 0.008 | 0.090 | 0.064 | 0.277 | 0.207 | 0.039 | 0.295 |
| | 469008 | Escherichia coli BL21(DE3) | 0.019 | 0.002 | 0.008 | 0.089 | 0.064 | 0.278 | 0.206 | 0.039 | 0.295 |
| | 574521 | Escherichia coli BW2952 | 0.018 | 0.002 | 0.009 | 0.091 | 0.064 | 0.272 | 0.207 | 0.039 | 0.299 |
| | 199310 | Escherichia coli CFT073 | 0.019 | 0.003 | 0.012 | 0.095 | 0.064 | 0.271 | 0.205 | 0.040 | 0.291 |
| | 331111 | Escherichia coli E24377A | 0.018 | 0.002 | 0.009 | 0.091 | 0.063 | 0.272 | 0.207 | 0.039 | 0.298 |
| | 585397 | Escherichia coli ED1a | 0.018 | 0.002 | 0.010 | 0.090 | 0.062 | 0.277 | 0.205 | 0.037 | 0.301 |
| | 585034 | Escherichia coli IAI1 | 0.018 | 0.002 | 0.008 | 0.090 | 0.064 | 0.273 | 0.208 | 0.040 | 0.296 |
| | 585057 | Escherichia coli IAI39 | 0.018 | 0.002 | 0.009 | 0.092 | 0.062 | 0.272 | 0.209 | 0.038 | 0.297 |
| | 574521 | Escherichia coli O127-H6 str- E2348_69 | 0.018 | 0.002 | 0.008 | 0.088 | 0.064 | 0.279 | 0.205 | 0.039 | 0.295 |
| | 155864 | Escherichia coli O157:H7 EDL933 | 0.017 | 0.002 | 0.010 | 0.093 | 0.062 | 0.267 | 0.209 | 0.040 | 0.300 |
| | 83334 | Escherichia coli O157:H7 str. Sakai | 0.017 | 0.002 | 0.010 | 0.093 | 0.062 | 0.268 | 0.208 | 0.039 | 0.300 |
| | 444450 | Escherichia coli O157-H7 str- Ec4115 | 0.018 | 0.003 | 0.010 | 0.092 | 0.062 | 0.268 | 0.209 | 0.039 | 0.301 |
| | 585035 | Escherichia coli S88 | 0.018 | 0.002 | 0.009 | 0.090 | 0.064 | 0.275 | 0.206 | 0.038 | 0.299 |
| | 409438 | Escherichia coli SE11 | 0.018 | 0.002 | 0.009 | 0.091 | 0.063 | 0.273 | 0.207 | 0.039 | 0.298 |
| | 439855 | Escherichia coli SMS-3-5 | 0.018 | 0.002 | 0.009 | 0.093 | 0.064 | 0.270 | 0.210 | 0.039 | 0.295 |
| | 168927 | Escherichia coli str. K-12 MG1655 | 0.018 | 0.002 | 0.008 | 0.090 | 0.064 | 0.277 | 0.207 | 0.040 | 0.293 |
| | 585056 | Escherichia coli UMN026 | 0.017 | 0.002 | 0.009 | 0.093 | 0.064 | 0.272 | 0.208 | 0.039 | 0.296 |
| | 364106 | Escherichia coli UTI89 | 0.018 | 0.002 | 0.009 | 0.093 | 0.065 | 0.272 | 0.206 | 0.040 | 0.295 |
| | 585054 | Escherichia fergusonii ATCC 35469 | 0.017 | 0.002 | 0.008 | 0.094 | 0.067 | 0.258 | 0.207 | 0.041 | 0.305 |
| | 209261 | Salmonella typhi Ty2 | 0.018 | 0.003 | 0.010 | 0.091 | 0.068 | 0.273 | 0.217 | 0.045 | 0.275 |
| | 321314 | Salmonella choleraesuis str. SC-B67 | 0.020 | 0.004 | 0.013 | 0.094 | 0.069 | 0.270 | 0.216 | 0.045 | 0.269 |
| | 295319 | Salmonella paratyphi A str. ATCC 9150 | 0.018 | 0.003 | 0.009 | 0.091 | 0.069 | 0.272 | 0.219 | 0.046 | 0.272 |
| | 220341 | Salmonella typhi str. CT18 | 0.018 | 0.003 | 0.010 | 0.091 | 0.068 | 0.272 | 0.217 | 0.045 | 0.276 |
| | 99287 | Salmonella typhimurium str. LT2 | 0.018 | 0.003 | 0.009 | 0.092 | 0.068 | 0.272 | 0.219 | 0.045 | 0.274 |
| | 300268 | Shigella boydii Sb227 | 0.021 | 0.004 | 0.014 | 0.088 | 0.060 | 0.280 | 0.201 | 0.036 | 0.297 |
| | 300267 | Shigella dysenteriae Sd197 | 0.024 | 0.004 | 0.013 | 0.084 | 0.062 | 0.279 | 0.203 | 0.038 | 0.294 |
| | 198215 | Shigella flexneri 2a str. 301 | 0.019 | 0.002 | 0.008 | 0.090 | 0.063 | 0.279 | 0.204 | 0.037 | 0.297 |
| | 300269 | Shigella sonnei Ss046 | 0.020 | 0.003 | 0.011 | 0.088 | 0.061 | 0.283 | 0.202 | 0.035 | 0.297 |
| | 393305 | Yersinia enterocolitica 8081 | 0.014 | 0.003 | 0.006 | 0.088 | 0.096 | 0.219 | 0.209 | 0.048 | 0.316 |
| | 349746 | Yersinia pestis Angola | 0.015 | 0.004 | 0.007 | 0.087 | 0.094 | 0.223 | 0.213 | 0.045 | 0.312 |
| | 360102 | Yersinia pestis Antiqua JGI | 0.014 | 0.003 | 0.007 | 0.088 | 0.094 | 0.223 | 0.213 | 0.046 | 0.312 |
| | 229193 | Yersinia pestis biovar Medieval 91001 | 0.013 | 0.004 | 0.007 | 0.090 | 0.093 | 0.225 | 0.211 | 0.046 | 0.311 |
| | 214092 | Yersinia pestis CO92 | 0.013 | 0.003 | 0.007 | 0.088 | 0.094 | 0.224 | 0.212 | 0.046 | 0.312 |
| | 187410 | Yersinia pestis KIM | 0.014 | 0.004 | 0.007 | 0.090 | 0.094 | 0.224 | 0.211 | 0.047 | 0.309 |
| | 349747 | Yersinia pseudotuberculosis IP 31758 | 0.013 | 0.004 | 0.007 | 0.088 | 0.094 | 0.221 | 0.214 | 0.047 | 0.313 |
| 273123 | Yersinia pseudotuberculosis IP 32953 | 0.013 | 0.003 | 0.007 | 0.088 | 0.094 | 0.221 | 0.214 | 0.047 | 0.314 |
Proteobacteria strains used in this study are grouped in the order of α-, β-, and γ- subphyla(S). Their Taxon identifications (ID) and their corresponding Genomic Translation stop signals Ratios (TSSR) are listed. Columns labeled 1–9 represent the rations of TAA: TAG: TGA of the 1st, 2nd, and 3 reading-frames, respectively, of the Genomic Translation Stop Signal Ratios (TSSR) of that species. A more detailed table is posted on our website (http://umdrive.memphis.edu/tywong/public/Table_1jb).