| Literature DB >> 15929794 |
Betsy M Martinez-Vaz1, Yang Xie, Wei Pan, Arkady B Khodursky.
Abstract
BACKGROUND: The distribution and location of insertion elements in a genome is an excellent tool to track the evolution of bacterial strains and a useful molecular marker to distinguish between closely related bacterial isolates. The information about the genomic locations of IS elements is available in public sequence databases. However, the locations of mobile elements may vary from strain to strain and within the population of an individual strain. Tools that allow de novo localization of IS elements and are independent of existing sequence information are essential to map insertion elements and advance our knowledge of the role that such elements play in gene regulation and genome plasticity in bacteria.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15929794 PMCID: PMC1174868 DOI: 10.1186/1471-2164-6-81
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparisons of different methods used to identify IS5 neighbouring genes from microarray experiments. Results obtained when different test statistics (rank product – RP, median rank – MR, Mean, SAM and t-statistics) and normalization methods (global and scale normalization) were applied to experimental data from the control, isolate 'A' and isolate 'B'. These results were also compared to existing genome sequence data. Top 50 to 100 genes were predicted as IS5 neighbours from microarray experiments. The neighbours from sequence data were defined as 5 genes upstream and 5 genes downstream of the IS5 elements and IS5 itself, so there were 121 neighbours based on sequence data. The numbers in this table represent number of genes that overlap between the microarray experiment and the sequence data. The higher the number is, the better consistency between the microarray experiment and the sequence data.
| Method | Top 50 | Top 100 | |||||
| control | A | B | control | A | B | ||
| Rank – based | RP | 13 | 46 | 31 | 15 | 69 | 43 |
| MR | 13 | 45 | 32 | 15 | 70 | 47 | |
| Global Normalization | Mean | 20 | 37 | 20 | 23 | 65 | 29 |
| SAM | 20 | 38 | 24 | 23 | 65 | 29 | |
| t | 20 | 38 | 24 | 23 | 65 | 31 | |
| Scale Normalization | Mean | 20 | 44 | 21 | 23 | 65 | 23 |
| SAM | 20 | 40 | 22 | 23 | 63 | 29 | |
| t | 20 | 16 | 24 | 23 | 30 | 31 | |
Figure 1False discovery rate for median rank statistic. False discovery rate, FDR, as a function of the number of total (claimed) positive genes, TP. A, FDR in the control experiment where fluorescent probe was confined to the IS5 sequence. B, FDR in the experiment where floarescent probe extended into the IS5 neighboring regions.
Figure 2The ranks of the true neighbours of . A, Experiment. B, Control. Rank of fluorescent intensities of target sequences as a function of their positions, relative to the position of an IS5 element. All known 11 locations of IS5 elements are shown as separate panels. The closed circles represent the IS5 elements and open circles represent the neighbours. The lower the rank, the higher probability that the gene is a neighbour of the IS5 element. Experiment and control are described in the legend to Figure 1 and in the text.
The 100 genes predicted as neighbours of IS5 elements in the genome of MG1655 isolate 'A'. False significant genes are the genes without any neighbours (5 up- or down-stream neighbours) on this gene list; possible significant genes are the genes with one neighbor on the list; significant genes are the genes with at least 2 neighbours on the gene list. The names in bold are the known IS5 elements.
| False significant genes | lpxC, yi22_1, b0878, ompF, b1297, flip, ogrK, fruA, b2442, b2639, cpxR, araH |
| Possible significant genes, strings of neighbours | 1. yi22_2, tra8_2 |
| 2. b1578, rspB | |
| 3. yi22_4, yi21_4 | |
| 4. yi21_5, yi22_5 | |
| Significant genes, strings of neighbours | 1. b0255, tra8_1, ykfC, |
| 2. b0546, b0547, b0548, b0550, b0551, | |
| 3. gltL, gltK, gltJ, ybeJ, | |
| 4. b1328, b1329, b1330, | |
| 5. b1361, b1362, trkG, b1368, | |
| 6. motB, motA, flhC, flhD, yecG, otsA, otsB | |
| 7. nac, cobT, cobs, cobU, | |
| 8. b2028, gnd, | |
| 9. yejM, yejO, b2191, | |
| 10. glcD, glcC, b2981, | |
| 11. yhcD, yhcE, | |
| 12. arsB, yhiS, | |
The 100 genes predicted as neighbours of IS5 elements in the genome of MG1655 isolate 'B'. Refer to the legend in Table 2 for details.
| False significant genes | araA, b0607, lipA, ybfA, tolR, b0817, ycfJ, b1388, relB, insA_5, flip, b1971, yeeD, b2059, ogrK, ompC, ptsH, b2505, mltB, rpoS, relA, fucK, exuR, yhbY, crp, rhaA, yjdG, tra8_3, yjhG, yjhL, pssR, smf_1, b1965 |
| Possible significant genes strings of neighbours | 1. insA_1, yaaC |
| 2. yi22_1, b0365 | |
| 3. b1141, b1146 | |
| 4. rspB, rspA | |
| 5. yi22_4, yi21_4 | |
| 6. b3254, yhdT | |
| Significant genes, strings of neighbours | 1. ykfC, |
| 2. b0546, b0547, b0550, b0551, | |
| 3. gltK, gltJ, ybeJ, | |
| 4. pspB, pspC, pspD | |
| 5. b1330, | |
| 6. b1368, | |
| 7. cobs, cobU, | |
| 8. gnd, | |
| 9. yejO, b2191, | |
| 10. glcC, b2981, | |
| 11. yhcD, yhcE, | |
| 12. yhiS, | |
Figure 3A, A diagram of the flhDC-yecG region in 'A' (motile), isolate 'B' (less motile) and in the IS5:kanr mutant. The location of IS5 relative to flhDC transcriptional start site is shown. The position of the primers used for the verification PCR and the length of the expected PCR products are indicated. The wide arrow at the bottom of the diagram indicates the direction of transcription. B, Confirmation of the presence of an IS5 element in the flhDC-yecG region. 1.0% agarose gel shows the products of the PCR reactions confirming the IS5 mapping predictions as well as the replacement and deletion of the IS5 element by the lambda red recombinase system. The PCR products shown were obtained from reactions done with flhDCF3 and yecGR primers. Lane1: isolate 'A' (motile); lane2: isolate 'B' (less motile); lane3: IS5: kanr; lane4: isolate 'AΔIS5'-colony #1; lane5: isolate 'AΔIS5' – colony #2, lane6: no template negative control; lane7: Hi-lo DNA marker ladder. The size of the PCR reaction products is shown in kb.
Figure 4Relative abundance of flagella transcripts in isolate 'A' versus 'B' (open bars), in 'A' versus 'AΔIS5' (solid bars), and in 'AΔIS5' versus isolate 'B' (gray bars). The log2 (Ratio) values of transcript abundances obtained following lowess smoothing and ANOVA normalization in direct comparisons on whole-genome DNA microarrays are plotted along the Y-axis. The names of the genes are shown under the horizontal axis. Error bars indicate the standard error for at least six experimental replicas.
Swarming rates for isolates 'A', 'B', 'A ΔflhDC:kanr' and 'A ΔIS5'. Swarm diameters were measured every hour for 8 hours for overnight colonies inoculated into soft tryptone agar and incubated at 30°C. Average swarm rates and standard errors are shown for at least 15 replicas.
| Strain | |
| 4.1 ± 0.1 | |
| 1.3 ± 0.2 | |
| 0.0 ± 0.0 | |
| 3.2 ± 0.1 |