| Literature DB >> 22646228 |
Vanya Paralanov1, Jin Lu, Lynn B Duffy, Donna M Crabb, Susmita Shrivastava, Barbara A Methé, Jason Inman, Shibu Yooseph, Li Xiao, Gail H Cassell, Ken B Waites, John I Glass.
Abstract
BACKGROUND: Ureaplasma urealyticum (UUR) and Ureaplasma parvum (UPA) are sexually transmitted bacteria among humans implicated in a variety of disease states including but not limited to: nongonococcal urethritis, infertility, adverse pregnancy outcomes, chorioamnionitis, and bronchopulmonary dysplasia in neonates. There are 10 distinct serotypes of UUR and 4 of UPA. Efforts to determine whether difference in pathogenic potential exists at the ureaplasma serovar level have been hampered by limitations of antibody-based typing methods, multiple cross-reactions and poor discriminating capacity in clinical samples containing two or more serovars.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22646228 PMCID: PMC3511179 DOI: 10.1186/1471-2180-12-88
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Overview of and genomes
| 27813 | NZ_ABES00000000 | 760 | 753,674 | 8 | 604 | 212 | 25% | 14.6X | |
| 27815 | NC_010503 | 760 | 751,679 | 1 | 609 | 219 | 25% | 10.2X | |
| 700970 | NC_002162 | Patient Isolate | 751,719 | 1 | 614 | 154 | 25% | - | |
| 27818 | NZ_AAZQ00000000 | 760 | 772,971 | 5 | 619 | 221 | 25% | 11.4X | |
| 33697 | NZ_ABER00000000 | 760 | 749,965 | 7 | 594 | 199 | 25% | 14.5X | |
| 27814 | NZ_ABFL00000000 | 880 | 861,061 | 1 | 664 | 248 | 26% | 10.7X | |
| 27816 | NZ_AAYO00000000 | 910 | 835,413 | 4 | 654 | 206 | 26% | 7.0X | |
| 27817 | NZ_AAZR00000000 | 1140 | 884,046 | 18 | 677 | 252 | 26% | 8.5X | |
| 27819 | NZ_AAYP00000000 | 880 | 875,530 | 4 | 660 | 246 | 26% | 8.3X | |
| 27618 | NZ_AAYN00000000 | 890 | 874,381 | 1 | 673 | 232 | 26% | 9.9X | |
| 33175 | NZ_AAYQ00000000 | 950 | 947,165 | 10 | 711 | 244 | 26% | 8.6X | |
| 33699 | NC_011374 | 890 | 874,478 | 1 | 657 | 232 | 26% | 12.1X | |
| 33695 | NZ_AAZS00000000 | 840 | 876,474 | 6 | 644 | 236 | 27% | 10.0X | |
| 33696 | NZ_AAZT00000000 | 870 | 873,466 | 2 | 650 | 234 | 25% | 9.0X | |
| 33698 | NZ_ABEV00000000 | 900 | 846,596 | 5 | 655 | 234 | 25% | 11.1X | |
| unknown serovar | AJFX00000000 | Patient Isolate | 804,560 | 16 | 646 | 190 | 26% | 39.0X | |
| unknown serovar | AJFY00000000 | Patient Isolate | 856,546 | 14 | 667 | 258 | 26% | 60.0X | |
| unknown serovar | AJFZ00000000 | Patient Isolate | 858,890 | 18 | 684 | 225 | 26% | 73.0X | |
| unknown serovar | AJGA00000000 | Patient Isolate | 844,630 | 16 | 662 | 214 | 26% | 52.0X |
Figure 1Role Category Breakdown of Genes.
Pan genomes of differentspecies sets
| Pan genome | 1020 | 971 | 938 | 688 |
| Core genome | 515 | 523 | 553 | 538 |
| Singletons | 262 | 246 | 216 | 77 |
| Clusters of Orthologous Genes(COGs) | 758 | 725 | 722 | 688 |
Pan genome represents the number of clusters of orthologous genes and singletons. Singletons are genes found only in one of the genomes. Clusters of Orthologous Genes (COGs) have genes orthologous among at least 2 genomes.
a) ATCC UUR2, UUR4, UUR5, UUR7-13, and the clinical isolates 2033, 2608, 4155, 4318.
b) ATCC UPA1, UPA3 (ATCC 27815), UPA3 (ATCC 700970), UPA6, UPA14.
Figure 2Percent GC Distribution Among Genes of The Ureaplasma Pan Genome (19 Strains). For genes that are present in two or more genomes the average %GC of the orthologs was used. Values were grouped in bins (example: bin 20 contains genes with %GC from 15 to 20%). %GC of singleton genes was also included in the histogram.
Serovar to serovar difference expressed in percent
| | 0.66 | 0.52 | 0.75 | 9.90 | 9.99 | 9.68 | 9.78 | 9.66 | 10.23 | 9.84 | 9.70 | 9.93 | 9.79 | |
| 0.70 | | 0.49 | 0.35 | 9.93 | 9.67 | 9.33 | 9.43 | 9.33 | 10.01 | 9.43 | 9.36 | 9.66 | 9.84 | |
| 0.62 | 0.52 | | 0.50 | 9.82 | 9.82 | 9.40 | 9.49 | 9.38 | 9.95 | 9.53 | 9.42 | 9.76 | 9.75 | |
| 0.83 | 0.33 | 0.45 | | 9.92 | 10.01 | 9.59 | 9.69 | 9.57 | 9.99 | 9.70 | 9.60 | 9.95 | 9.83 | |
| 9.82 | 9.87 | 9.58 | 9.81 | | 0.86 | 0.74 | 0.78 | 0.76 | 1.25 | 0.74 | 0.77 | 0.86 | 0.84 | |
| 9.90 | 9.60 | 9.57 | 9.83 | 0.94 | | 0.69 | 0.64 | 0.69 | 0.82 | 0.88 | 0.66 | 0.07 | 0.80 | |
| 9.72 | 9.31 | 9.25 | 9.52 | 0.72 | 0.60 | | 0.15 | 0.13 | 0.66 | 0.56 | 0.16 | 0.58 | 0.66 | |
| 9.72 | 9.32 | 9.25 | 9.52 | 0.82 | 0.60 | 0.16 | | 0.15 | 0.66 | 0.53 | 0.11 | 0.60 | 0.67 | |
| 9.76 | 9.35 | 9.27 | 9.54 | 0.71 | 0.59 | 0.08 | 0.10 | | 0.61 | 0.51 | 0.11 | 0.59 | 0.65 | |
| 10.90 | 9.83 | 9.60 | 9.71 | 1.21 | 0.72 | 0.63 | 0.62 | 0.60 | | 0.85 | 0.63 | 0.75 | 1.08 | |
| 9.79 | 9.35 | 9.29 | 9.56 | 0.70 | 0.81 | 0.51 | 0.48 | 0.51 | 0.87 | | 0.46 | 0.80 | 0.43 | |
| 9.73 | 9.33 | 9.25 | 9.52 | 0.80 | 0.61 | 0.16 | 0.11 | 0.16 | 0.67 | 0.51 | | 0.60 | 0.64 | |
| 9.85 | 9.58 | 9.52 | 9.79 | 0.93 | 0.06 | 0.67 | 0.64 | 0.69 | 0.85 | 0.87 | 0.65 | | 0.80 | |
| 9.70 | 9.74 | 9.47 | 9.66 | 0.97 | 0.86 | 0.79 | 0.76 | 0.75 | 1.27 | 0.56 | 0.74 | 0.86 |
The percent difference was obtained by whole genome comparison on the nucleotide level.
Number of Clusters of Orthologous Genes (COGs) per functional category present only in UUR or UPA serovars
| hypothetical protein | 83 |
| putative lipoprotein | 8 |
| multiple banded antigen | 7 |
| putative membrane protein | 4 |
| transposase | 4 |
| DNA primase | 3 |
| DNA topoisomerase IV, B subunit | 3 |
| site-specific recombinase | 3 |
| restriction-modification enzyme subunit | 2 |
| AAA domain/DeoR HTH domain protein | 1 |
| AAA family ATPase | 1 |
| ABC transported MDR-type, ATPase component | 1 |
| chromosome partition protein Smc | 1 |
| divergent AAA domain family | 1 |
| ferrichrome ABC transporter, ATP-binding | 1 |
| putative phage head-tail adaptor | 1 |
| relaxase | 1 |
| sigma-70, region 4 family | 1 |
| superfamily II DNA and RNA helicase | 1 |
| TolA homolog | 1 |
| TraG/TraD family | 1 |
| viral A-type inclusion protein, putative | 1 |
| | |
| hypothetical protein | 18 |
| type I restriction modification enzyme protein | 3 |
| integrase-recombinase protein | 2 |
| putative lipoprotein | 2 |
| divergent AAA domain family | 1 |
| nucleoside 2-deoxyribosyltransferase sup | 1 |
Figure 3MBA Based Phylogenetic Tree of 19 Ureaplasmas. The tree is based on the nucleotide sequence of the conserved domain of the mba (1–430 nt).
Figure 4Phylogenetic Tree of 19 Ureaplasma Strains Based on 82 Housekeeping Genes. ATCC type strains are labeled with tree letters (species) followed by a number (serovar). UUR = Ureaplasma urealyticum; UPA = Ureaplasma parvum; ntUPA3 = clinical isolate sequenced in 2000; 2033, 2608, 4155, and 4318 are clinical isolates of Ureaplasma urealyticum that cannot be serotyped. The tree is based on the concatenated alignment of 82 housekeeping genes 16 tRNA ligase genes, 12 DNA and RNA polymerase genes, 47 ribosomal protein genes, and the 7 urease subunit genes). The non- informative positions were removed from the alignments. The removal of the non- nformative positions increased the bootstrap values.
Figure 5Clusters of Orthologous Genes Potentially Involved in the MBA Phase Variable System of Ureaplasmas. This table contains the NCBI locus tags for genes potentially involved in the MBA phase variable system. To form the NCBI locus tag add the serovar id and underscore before the gene number: UPA1_G0402; UUR12_A0163. Genes with tandem repeats are highlighted in green. A red box is drawn around the 4MBA genes expressed in ATCC type strains.
Tandem Repeating Units (TRUs) identified in thelocus
| 1 | 12 | 60.8 | 6 | 6 | 6 | - | |
| 2 | 18 | 36.7–53.7 | 1 | 1 | 1 | - | |
| 3 | 18 | 40.6 | 3 | 3 | 3 | - | |
| 4 | 21 | 29.5–32.0 | 14 | 14 | 14 | - | |
| 5 | 24 | 20.2–33.5 | 2,5,8 | 5 | 5 (2608, 4318) | 2608, 4318, 4155 | |
| 6 | 24 | 34.6 | 10 | 10 | 10 | - | |
| 7 | 30 | 17.2–26.2 | 4,12,13 | 4 | 4 (2033) | 2033 | |
| 8 | 42 | 7.6–11.6 | 7,10,11 | 11 | 11 | - | |
| 9 | 45 | 2.0–10.0 | 2,5,8,9 | 9 | 9 | 4155 | |
| 10 | 213 | 3.0–4.0 | 4,10,12,13 | - | - | 2033 | |
| 11 | 213 | 2.8–3.9 | 2,5,8 | 2 | 2 | 4155 | |
| 12 | 213 | 1.9 | 2 | - | - | - | |
| 13 | 231 | 2.8–3.9 | 7 | 7 | 7 | - | |
| 14 | 252 | 1.9–5.9 | 8,9,11 | 8 | 8 | 4155 | |
| 15 | 252 | 2.1–4.1 | 4,10,12,13 | 12 | 12 | - | |
| 16 | 252 | 2.0–3.0 | 2,5 | - | - | - | |
| 17 | 276 | 2.0–3.8 | 2,8,9 | - | (4155) | 2608, 4318 | |
| 18 | 327 | 2.3–4.0 | 1 | - | 1 | - | |
| 19 | 330 | 4 | 10 | - | - | 2608 | |
| 20 | 333 | 3.0–4.0 | 4,12,13 | - | - | 2033, 4318 | |
| 21 | 336 | 2.9 | 6 | - | - | - | |
| 22 | 579 | 1.9 | 5 | - | - | - |
The name of each TRU consists of the mba gene name followed by the period size (bp) of the repeating unit. Different sequences of the same period size are marked by “.” and a version number (ex. mba18.1 and mba18.2). Observed minimum and maximum copy number of the TRU is shown in the third column. Column 6 shows the serovar in which the conserved domain was associated with each TRU. Note that the conserved region of the UPA1 mba was found linked to two different TRUs (highlighted).
Figure 6Multiple Banded Antigen Locus.
Figure 7Multiple Banded Antigen Locus.
Figure 8The MBA Locus in UUR4, UUR12, and UUR13. Genes in each genome are represented as directional blue or green boxes. Orthologous gene clusters (COGs) are represented by gray or pink bands spanning across the tree genomes. The COG with a pink band represents the first mba gene in the MBA locus. The locus includes the next 4 genes following the gene in the pink labeled COG (all tree genome have 5 mba genes each). The conserved domain of the mba is marked by a red box. Rearrangements of the genes are visible by following the twisting of the connecting bands.