Abstract
BACKGROUND: Conserved gene clusters are groups of genes that are located close to one another in the genomes of several species. They tend to code for proteins that have a functional interaction. The identification of conserved gene clusters is an important step towards understanding genome evolution and predicting gene function.
Year: 2010 PMID: 20122239 PMCID: PMC3009537 DOI: 10.1186/1471-2105-11-S1-S63
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. Effect of Jaccard score threshold. Plot of the number of identified operons versus Jaccard score threshold for BBH r-window gene clusters, where the maximum window length is 6.
Figure 2. Effect of window length. Plot of the percentage of identified operons and percentage of non-operon clusters versus maximum window length for our BBH r-window gene clusters model.
Figure 3. Effect of maximum gap length. Plot of the percentage of identified operons and percentage of non-operon clusters versus maximum distance between adjacent genes in a team for the gene teams model.
Figure 4. Overlap in operons identified by the two cluster models. Venn diagram showing the overlap between the operons identified based on our BBH r-window gene cluster model and the gene teams model for a single parameter value (r = 6, δ = 3) and over a range of parameter values (r ∈ [1, 30], δ ∈ [1, 32]).
Figure 5. Comparison of precision versus recall curves. Precision versus recall curve for BBH r-window gene clusters (r = 6) and gene teams (δ = 3) for identification of E. coli K-12 operons.
Significant BBH r-window gene clusters and corresponding operons. Nine of the top twelve clusters, ranked by log E value, with their corresponding operons. Numbers in parentheses give the number of genes in the cluster over the number of genes in the operon.
| log E | BBH r-window gene cluster | Operon (cluster genes/operon genes) |
|---|---|---|
| -13 | atpC, atpD, atpG, atpA, atpH, atpF, atpE, atpB | atpIBEFHAGDC (8/8) |
| -12 | secE, nusG, rplK, rplA, rplJ, rplL, rpoB, rpoC | secE-nusG (2/2), rplKAJL-rpoBC (6/6) |
| -11 | hisG, hisD, hisB, hisH, hisA, hisF, hisI | hisLGDCBHAFI (7/8) |
| -10 | fliE, fliF, fliG, fliH, fliI, fliJ, fliK | fliFGHIJK (7/6) |
| -9 | menE, menC, menB, yfbB, menD, menF | menFD-yfbB-menBCE (6/6) |
| -9 | rbsD, rbsA, rbsC, rbsB, rbsK, rbsR | rbsDACBKR (6/6) |
| -8 | pnp, rpsO, truB, rbfA, infB, nusA, yhbC | metY-yhbC-nusA-infB-rbfA-truB-rpsO-pnp (7/7) |
| -8 | dppF, dppD, dppC, dppB, dppA, yhjX | dppABCDF (6/5) |
| -7 | oppA, oppB, oppC, oppD, oppF | oppABCDF (5/5) |
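
The Jaccard score mentioned in Figure 1 can be illustrated with two rows of the table. This record does not say which sets the score compares, so the sketch below assumes it is computed between the gene set of a cluster and the gene set of an operon. The helper names `expand_operon` and `jaccard` are hypothetical, and the expansion rule (a three-letter prefix followed by one capital letter per gene) is an assumption that only fits simple operon names such as oppABCDF.

```python
def expand_operon(name):
    """Expand compact operon notation such as 'oppABCDF' into gene names.

    Assumption: a three-letter prefix followed by one capital letter per gene.
    Hyphenated or mixed operon names are not handled by this sketch.
    """
    prefix, suffixes = name[:3], name[3:]
    return {prefix + letter for letter in suffixes}


def jaccard(a, b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two collections of gene names."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Cluster gene sets copied from the last two rows of the table.
opp_cluster = {"oppA", "oppB", "oppC", "oppD", "oppF"}
dpp_cluster = {"dppF", "dppD", "dppC", "dppB", "dppA", "yhjX"}

# Identical sets give the maximum score.
opp_score = jaccard(opp_cluster, expand_operon("oppABCDF"))
# The cluster holds one gene (yhjX) that is not in the operon.
dpp_score = jaccard(dpp_cluster, expand_operon("dppABCDF"))

print(f"opp: {opp_score:.3f}, dpp: {dpp_score:.3f}")
```

Under this assumed definition, a cluster counts as matching an operon when its score meets the chosen threshold, so raising the threshold would exclude partial matches such as the dpp row first.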