| Literature DB >> 20380691 |
Andreas Dötsch1, Frank Klawonn, Michael Jarek, Maren Scharfe, Helmut Blöcker, Susanne Häussler.
Abstract
BACKGROUND: The constant increase in development and spread of bacterial resistance to antibiotics poses a serious threat to human health. New sequencing technologies are now on the horizon that will yield massive increases in our capacity for DNA sequencing and will revolutionize the drug discovery process. Since essential genes are promising novel antibiotic targets, the prediction of gene essentiality based on genomic information has become a major focus.Entities:
Mesh:
Year: 2010 PMID: 20380691 PMCID: PMC2864246 DOI: 10.1186/1471-2164-11-234
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview of sequencing results.
| genome coverage | sequence variations | |||
|---|---|---|---|---|
| Strain | median read | missing genesb | nonsynonymous | synonymous |
| Psae1152 | 47.6 | 226 | 2.47 | 20.00 |
| Psae1747 | 41.6 | 170 | 1.45 | 10.55 |
| Psae2136 | 63.8 | 170 | 1.53 | 10.93 |
| Psae2162 | 43.8 | 230 | 2.39 | 19.67 |
| PADJ-1 | 36.7 | 61 | 3.26 | 11.40 |
| PADJ-2 | 38.2 | 122 | 3.36 | 11.33 |
| PADJ-3 | 30.8 | 93 | 3.13 | 11.07 |
| meand | 34.6 | 87 | 3.22 | 11.25 |
a Median value over all genes
b Normalized read depth ≤ 0.25
c Mean substitution rate per 1000 sites
d Data set including all 36 strains obtained by merging short reads of the three PADJ sets.
Genes missing in the sequenced clinical isolates for which no transposon mutants are availablea.
| Locus ID | Psae1152/2162 | Psae1747/2136 | PADJ-1 | PADJ-2 | PADJ-3 |
|---|---|---|---|---|---|
| normalized read depthb | |||||
| PA0637 | 0/0 (absent) | 1.20/0.99 (present) | 0.40 | 0.13 | 0.29 |
| PA2387 | 0.18/0.18 (absent) | 0.19/0.11 (absent) | 0.26 | 0.89 | 0.47 |
| PA2565 | 1.30/1.32 (present) | 0/0 (absent) | 0.79 | 0.89 | 0.47 |
a Within the scope of the two comprehensive libraries [13,14].
b Genes were assumed to be absent if the normalized read depth was below 0.25. In the pooled sequence data sets, the read depth gives only a rough estimate for the fraction of strains that harbor the gene.
Figure 1Codon adaptation index is high in . A) Histogram of codon adaptation index (CAI) and B) gene expression in P. aeruginosa. C) Gene expression data is only weakly correlated with the codon adaptation index in PAO1 (R = 0.135). Color indicates the density of data points at the coordinates indicated by the spots ranging from blue (single data point) to deep red (many). D) Core genes in PAO1 show generally high CAI values, while the accessory genome is significantly less adapted. Boxplots indicate 0.25 and 0.75 quantiles and the median (red line) with notches giving an estimate for the variance of the median. The green dashed line indicates the median dN for all genes.
Figure 2Protein evolution in essential and dispensable genes. Boxplots of nonsynonymous substitution rates dN of coding sequences averaged for all 36 strains. Data were grouped by gene essentiality as annotated in the Database of Essential Genes [13]. 87 genes showing a very low read depth (≤ 25% of the average read depth) were neglected in this analysis. Boxplots indicate 0.25 and 0.75 quantiles and the median (red line) with notches giving an estimate for the variance of the median. The green dashed line indicates the median dN for all genes. Values with dN above 20 × 10-3 are not shown.
Figure 3Protein evolution rates in genes of different functional categories. Boxplots of nonsynonymous substitution rates dN of coding sequences averaged for all 36 strains. Data were grouped by functional category according to PseudoCAP functional categories [19]. Numbers in brackets indicate the number of genes and the percentage of essential genes for each functional category. Groups were sorted by descending median dN. 87 genes showing a very low read depth (≤ 25% of the average read depth) were neglected in this analysis. Boxplots indicate 0.25 and 0.75 quantiles and the median (red line) with notches giving an estimate for the variance of the median. The green dashed line indicates the median dN for all genes. Values with dN above 20 × 10-3 are not shown.
Figure 4Gene expression correlates with substitution rate and gene essentiality. A) Nonsynonymous substitution rate dN (coding sequences averaged for all 36 strains) versus the expression level. Color indicates the density of data points ranging from blue (single data point) to deep red. B) Boxplots of gene expression levels grouped by gene essentiality as annotated in the Database of Essential Genes [29]. Boxplots indicate 0.25 and 0.75 quantiles and the median (red line) with notches giving an estimate for the variance of the median. The green dashed line indicates the median dN for all genes.
Figure 5Protein evolution in genes with different subcellular localization. Boxplots of nonsynonymous substitution rates dN of coding sequences averaged for all 36 strains. Data were grouped by subcellular localization as annotated in the Pseudomonas genome database [19]. 87 genes showing a very low read depth (≤ 25% of the average read depth) were neglected in this analysis. Boxplots indicate 0.25 and 0.75 quantiles and the median (red line) with notches giving an estimate for the variance of the median. The green dashed line indicates the median dN for all genes. Values with dN above 20 × 10-3 are not shown.
Genes completely conserved on the protein level that are specific to the genus Pseudomonas
| Locus tag | Gene name | Locus tag | Gene name | Locus tag | Gene name |
|---|---|---|---|---|---|
| PA0033b | PA2381b | PA4035a | |||
| PA0050a | PA2447a | PA4139b | |||
| PA0052b | PA2486b | PA4303a | |||
| PA0061a | PA2747b | PA4360b | |||
| PA0377b | PA2766b | PA4556a | |||
| PA0776b | PA2781b | PA4570a | |||
| PA1000b | PA2808b | PA4608a | |||
| PA1021b | PA2884b | PA4643a | |||
| PA1064a | PA2894b | PA4685a | |||
| PA1103a | PA2896a | PA4697a | |||
| PA1347b | PA2897a | PA4815a | |||
| PA1414b | PA2899b | PA4843a | |||
| PA1442a | PA3062a | PA5052a | |||
| PA1531a | PA3280a | PA5295a | |||
| PA1621a | PA3309b | PA5364a | |||
| PA1753a | PA3332b | PA5424b | |||
| PA1773a | PA3423a | ||||
| PA1921b | PA3486a | ||||
| PA1970a | PA3515b | ||||
| PA2332a | PA4032a |
a no orthologous genes were found outside the genus Pseudomonas.
b no orthologous genes were found outside the species P. aeruginosa