| Literature DB >> 26272053 |
Hao Luo1, Feng Gao2, Yan Lin1.
Abstract
Essential genes are thought to be critical for the survival of the organisms under certain circumstances, and the natural selection acting on essential genes is expected to be stricter than on nonessential ones. Up to now, essential genes have been identified in approximately thirty bacterial organisms by experimental methods. In this paper, we performed a comprehensive comparison between the essential and nonessential genes in the genomes of 23 bacterial species based on the Ka/Ks ratio, and found that essential genes are more evolutionarily conserved than nonessential genes in most of the bacteria examined. Furthermore, we also analyzed the conservation by functional clusters with the clusters of orthologous groups (COGs), and found that the essential genes in the functional categories of G (Carbohydrate transport and metabolism), H (Coenzyme transport and metabolism), I (Transcription), J (Translation, ribosomal structure and biogenesis), K (Lipid transport and metabolism) and L (Replication, recombination and repair) tend to be more evolutionarily conserved than the corresponding nonessential genes in bacteria. The results suggest that the essential genes in these subcategories are subject to stronger selective pressure than the nonessential genes, and therefore, provide more insights of the evolutionary conservation for the essential and nonessential genes in complex biological processes.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26272053 PMCID: PMC4536490 DOI: 10.1038/srep13210
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The workflow of the Ka/Ks estimation.
Flow chart schematically shows the procedure to estimate the Ka/Ks ratios of all the protein-coding genes.
The summary of the dataseta.
| Organism | RefSeq | Essential | Nonessential | PNE | RefSeq of the homologous strains | ||
|---|---|---|---|---|---|---|---|
| Genes | COGs | Genes | COGs | ||||
| NC_005966 | 499 | 474 | 2594 | 1699 | 0.00 | NC_009085 NC_011595 NC_017847 | |
| NC_000964 | 271 | 267 | 3904 | 2483 | 0.15 | NC_014479 NC_016047 NC_017195 | |
| NC_016776 | 547 | – | 3743 | – | 3.37 | NC_003228 NC_006347 NC_009614 | |
| NC_004663 | 325 | 256 | 4453 | 2578 | 0.00 | NC_009614 NC_010831 NC_015164 | |
| NC_006351 NC_006350 | 505 | 453 | 5222 | 3724 | 21.52 | NC_009078 NC_009076 NC_017831 NC_017831 NC_018527 NC_018529 | |
| NC_007651 NC_007650 | 406 | 372 | 5226 | 3706 | 0.00 | NC_021173 NC_017832 | |
| NC_002163 | 228 | 178 | 1395 | 1000 | 0.86 | NC_003912 NC_009707 NC_017280 | |
| NC_011916 | 480 | 436 | 3224 | 1988 | 0.00 | NC_002696 NC_010338 NC_014100 | |
| NC_000913 | 296 | 284 | 4077 | 3144 | 1.84 | NC_008253 NC_009801 NC_013008 | |
| NC_008601 | 392 | 340 | 1329 | 942 | 0.08 | NC_017449 NC_017450 NC_017909 | |
| NC_000907 | 642 | 549 | 512 | 422 | 0.20 | NC_007146 NC_017451 NC_022356 | |
| NC_000915 | 323 | 230 | 1135 | 759 | 0.00 | NC_017359 NC_019563 NC_022886 | |
| NC_000962 | 687 | 589 | 3070 | 1747 | 45.24 | NC_017528 NC_021192 NC_021193 | |
| NC_000908 | 381 | 287 | 94 | 64 | 39.36 | NC_018495 NC_018496 NC_018497 NC_018498 | |
| NC_010729 | 463 | 374 | 1627 | 840 | 1.23 | NC_002950 NC_015571 | |
| NC_002516 | 117 | 98 | 5454 | 4075 | 13.31 | NC_017548 NC_018080 NC_021577 | |
| NC_008463 | 335 | 271 | 960 | 627 | 7.08 | NC_002516 NC_017548 NC_018080 | |
| NC_004631 | 358 | 338 | 3906 | 2707 | 3.84 | NC_003197 NC_021151 NC_022544 | |
| NC_009511 | 535 | 452 | 4315 | 3184 | – | NC_020561 | |
| NC_002745 | 302 | 280 | 2281 | 1471 | 13.42 | NC_002951 NC_007795 NC_018608 | |
| NC_007795 | 351 | 308 | 2541 | 1346 | 7.20 | NC_002745 NC_018608 NC_020533 | |
| NC_009009 | 218 | 211 | 2052 | 1341 | 0.00 | NC_017595 NC_017618 NC_018526 | |
| NC_002506 NC_002505 | 779 | 433 | 2943 | 2173 | 47.54 | NC_012578 NC_012580 NC_012582 NC_012583 NC_017269 NC_017270 | |
aThe organism name, RefSeq, proportion of PNE genes and the RefSeq used to evaluate conservation are provided. The count of the essential and nonessential genes as well as their COGs are also present in this table.
bThe percentage is the proportion of PNE genes in the nonessential genes.
cThe genome of S. wittichii RW1 only have single completely homologous genomes in NCBI, so that the percentage of PNE was not available. And due to the absence of homologous strains in A. baylyi, the three genomes were selected from the Acinetobacter genus.
Figure 2The average of Ka, Ks and Ka/Ks value for essential and nonessential genes in each organism.
The histogram shows the averages for (A) Ka, (B) Ks and (C) Ka/Ks values between the essential and nonessential genes, respectively. The P-values calculated by Mann–Whitney U Test are also displayed at the top of figures.
Figure 3Box-plot diagrams for the differences between the essential and nonessential genes.
The pairs of box-plots, presented in red and blue respectively, compare the distributions for the average Ka/Ks ratios of the essential genes and nonessential genes in each organisms.
Figure 4The heat map analysis for the significant conserved genes based on the COG categories.
The hierarchal cluster diagram was constructed by Ward’s linkage clustering. The P-values of each COG category are calculated with Mann–Whitney U Test by organism, which reflect the significance of the difference for the Ka/Ks value between the essential and nonessential genes. The blue boxes represent that the COG subcategory in which the essential genes are evolutionarily conserved than the nonessential ones, while the red boxes represent the opposite case.