| Literature DB >> 12740027 |
John P Bannantine1, Qing Zhang, Ling-Ling Li, Vivek Kapur.
Abstract
BACKGROUND: Mycobacterium avium subspecies avium (M. avium) is frequently encountered in the environment, but also causes infections in animals and immunocompromised patients. In contrast, Mycobacterium avium subspecies paratuberculosis (M. paratuberculosis) is a slow-growing organism that is the causative agent of Johne's disease in cattle and chronic granulomatous infections in a variety of other ruminant hosts. Yet we show that despite their divergent phenotypes and the diseases they present, the genomes of M. avium and M. paratuberculosis share greater than 97% nucleotide identity over large (25 kb) genomic regions analyzed in this study.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12740027 PMCID: PMC156888 DOI: 10.1186/1471-2180-3-10
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Figure 1Amplification strategy and organization of the M. paratuberculosis chromosomal origin of replication. The locations of primer pairs used for amplification and sequencing are marked with facing arrows above the kilobase (kb) scale. The GC skew is shown beneath the kb scale and has a window size of 500. OriC, right at the point of the GC inflection, designates the origin of replication. An open reading frame map of the ~11 kb fragment is represented by shaded boxes and the two divergent arrows immediately above identify the direction of transcription. The degree of substitution in comparison to the corresponding M. avium gene is indicated below the gene name. π (tau) is the overall substitution rate, ds is the synonymous substitution rate, and dn is the non-synonymous substitution rate. GidB, glucose inhibited division protein B. RnpA, RNAse protein component A. RpmH, ribosomal protein L34. DnaA, replication initiator. DnaN, DNA polymerase subunit III. GyrB, DNA gyrase subunit B.
Sequence analysis of predicted ORFs in the M. paratuberculosis oriC region.
| Protein | Length (amino acids) | Top BLASTP match | Expect Value |
| 1 | 311 | 5e-64 | |
| 2 | 195 | hypothetical protein Rv3920c ( | 1e-49 |
| 3 | 370 | hypothetical protein Rv3921c ( | 1e-142 |
| 4 | 115 | hypothetical protein Rv3922c ( | 2e-35 |
| 5 | 126 | 5e-21 | |
| 6 | 146 | 4e-17 | |
| 7 | 524 | 0.0 | |
| 8 | 409 | 1e-173 | |
| 9 | 385 | 1e-144 | |
| 10 | 280 | hypothetical protein Rv0004 ( | 1e-63 |
| 11 | 685 | 0.0 |
Figure 2Comparative gene order in the oriC region of mycobacteria and other Gram-positive bacteria. The relative gene order in this region of M. paratuberculosis conforms to the highly conserved order found in other gram-positive bacteria. Numbers indicate the length of the ORF or intergenic region. Arrows show the direction of transcription.
Comparison of amino acid identity in the oriC region with the corresponding M. paratuberculosis sequence.
| gidB | 97% (98%) | 66% (75%) | 73% (83%) | Not Found | 50% (64%) | 51% (64%) |
| dnaN | 100% (100%) | 85% (88%) | 86% (91%) | 80% (89%) | 51% (67%) | 48% (69%) |
| rpmH | 97% (100%) | 91 % (95%) | 91% (93%) | 91% (95%) | 81% (87%) | 89% (93%) |
| Unknown (AAF33691) | 100% (100%) | 73% (82%) | 85% (88%) | Not Found | 64% (71%) | Not Found |
| dnaA | 99% (99%) | 87% (89%) | 88% (90%) | 78% (84%) | 68% (78%) | 53% (67%) |
| recF | 97% (98%) | 78% (87%) | 76% (85%) | 73% (84%) | 55% (70%) | 53% (71%) |
| gyrB | 99% (100%) | 90% (95%) | 90% (94%) | 88% (92%) | 65% (79%) | 72% (82%) |
| rnpA | 94% (97%) | 62% (76%) | 60% (74%) | Not Found | 41% (57%) | 38% (56%) |
| unknown (AAF33696) | 100% (100%) | 78% (85%) | 79% (88%) | 70% (82%) | 39% (51%) | 33% (47%) |
| unknown (AAF33697) | 98% (98%) | Not Found | 64% (73%) | Not Found | Not Found | Not Found |
| unknown (AAF33698) | 99% (99%) | 75% (81%) | 82% (88%) | Not Found | 34% (50%) | 42% (62%) |
Figures are reported as percent identity with percent similarity indicated in parenthesis. Blastp was done at the NCBI site except for M. avium, which was done at the TIGR site using tblastn. Not found indicates that the gene sequence is not available in public databases.
Figure 3Non-coding sequences flanking M. paratuberculosis dnaA harbor 35 DnaA boxes. Nucleotide sequence of the rpmH-dnaA intergenic region (A) and dnaA-dnaN intergenic region (B) are shown. Sequences matching the DnaA box consensus(TTGTCCACA) with 1 – 3 mismatches are marked with an arrow. In (B), an A/T-rich region is underlined and the potential ATP-DnaA recognition site is boxed.
Figure 4DNA matrix analysis of a contiguous 2 million nucleotide section of the M. avium (y-axis) and M. paratuberculosis (x-axis) genomes. Four 500,000 nucleotide matrices are shown with the nucleotide segments indicated above each plot. A long unbroken diagonal line from the upper left corner to the lower right corner indicates that the sequences are collinear. The diagonal line (in blue) that runs from the lower left to the upper right at the 350,000 nucleotide region indicates that one sequence is the reverse complement of the other. The arrows (in red) show sequences present in M. avium but absent in M. paratuberculosis and the arrowhead (in green) shows a sequence represented only in M. paratuberculosis. The initial nucleotide in the dnaA coding sequence was defined as number one in both genomes for this analysis. The parameters for this DNA identity matrix include: a window size of 30, a minimum percent score of 80, and a hash value of 4.