| Literature DB >> 20520734 |
David M Nelson1, Isaac K O Cann, Eric Altermann, Roderick I Mackie.
Abstract
BACKGROUND: Lateral gene transfer (LGT) appears to promote genotypic and phenotypic variation in microbial communities in a range of environments, including the mammalian intestine. However, the extent and mechanisms of LGT in intestinal microbial communities of non-mammalian hosts remains poorly understood. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 20520734 PMCID: PMC2875401 DOI: 10.1371/journal.pone.0010785
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phylogenetic relationships of 16S rRNA gene sequences among the fosmids, clone library, and Clostridium clusters.
Clone library sequences start with “M.” The numbers in parentheses following some of the marine iguana sequences indicate the number of times that a particular sequence was obtained. Only representatives of the major Clostridium clusters, and limited representatives of the Bacteroidetes and Coriobacteriales, are shown. The tree was inferred using the neighbor joining approach. The numbers at the nodes represent bootstrap values. The bar represents 0.02 substitutions per nucleotide position. The outgroup is Aquifex pyrophilus.
|
|
|
|
|
|
|
|
| |||
|
|
|
|
| |||||||
| 7–14 | 1 | 18–920 | + | 53.0 |
| No |
| ZP_02038860 | 2.00E−136 | 92 |
| 7–14 | 2 | 1459–1929 | − | 35.6 | Teicoplanin resistance protein vanZ | Novel |
| YP_001124564 | 7.00E−06 | 31 |
| 7–14 | 3 | 2346–3584 | − | 50.8 |
|
|
| ZP_02205788 | 2.00E−167 | 69 |
| 7–14 | 4 | 4214–5032 | − | 43.8 | Hypothetical protein | Novel |
| ZP_02418355 | 1.00E−03 | 39 |
| 7–14 | 5 | 6252–6962 | − | 51.9 | Outer membrane lipoprotein-sorting protein | Novel |
| YP_001179297 | 8.00E−06 | 31 |
| 7–14 | 6 | 6959–7480 | − | 47.3 | RNA polymerase sigma-54 factor rpoN | Unresolved |
| YP_001512203 | 1.00E−28 | 40 |
| 7–14 | 7 | 7745–7820 | 63.2 | tRNA-Pro (TGG) | Na | |||||
| 7–14 | 8 | 8265–10187 | − | 57.9 | Large exoproteins involved in heme utilization or adhesion | Unresolved |
| YP_001545021 | 2.00E−58 | 35 |
| 7–14 | 9 | 10188–10862 | − | 55.4 | Hypothetical protein | Unresolved |
| ZP_02440985 | 3.00E−38 | 56 |
| 7–14 | 10 | 10855–11580 | − | 54.4 | Hypothetical protein |
|
| YP_517449 | 7.00E−67 | 56 |
| 7–14 | 11 | 11732–12193 | − | 52.8 | Iron-sulfur cluster regulator IscR | Unresolved (No) |
| ZP_02443971 | 2.00E−44 | 65 |
| 7–14 | 12 | 12428–13360 | + | 59.1 | Cysteine synthase | Unresolved |
| ZP_02090920 | 4.00E−123 | 83 |
| 7–14 | 13 | 13706–13840 | − | 46.7 | Sodium/glutamate symporter | Unresolved |
| ZP_02423775 | 5.00E−12 | 80 |
| 7–14 | 14 | 13795–14877 | − | 56.7 | Sodium/glutamate symporter | Unresolved |
| ZP_02423775 | 7.00E−134 | 73 |
| 7–14 | 15 | 14895–15938 | − | 61.3 | Immunogenic protein | Unresolved |
| ZP_02205701 | 6.00E−61 | 44 |
| 7–14 | 16 | 16214–17023 | − | 57.9 | 8-oxoguanine-DNA-glycosylase |
|
| ZP_01966859 | 9.00E−55 | 43 |
| 7–14 | 17 | 17025–18026 | − | 57.6 | L-asparaginase | Unresolved |
| ZP_02087774 | 7.00E−103 | 54 |
| 7–14 | 18 | 18036–19133 | − | 62.6 | Exonuclease SbcD | No |
| ZP_02034908 | 3.00E−98 | 55 |
| 7–14 | 19 | 19130–19684 | − | 56.8 | EBSC protein | Unresolved |
| ZP_02078782 | 2.00E−54 | 68 |
| 7–14 | 20 | 19651–20424 | − | 55.9 | Phosphoesterase family protein | No |
| ZP_02034909 | 4.00E−87 | 61 |
| 7–14 | 21 | 21133–21414 | − | 50.7 | Hypothetical protein | Unresolved |
| ZP_02423392 | 2.00E−08 | 35 |
| 7–14 | 22 | 22289–23032 | + | 54.6 |
| No |
| ZP_02038860 | 1.00E−110 | 87 |
| 7–14 | 23 | 23434–23508 | 58.7 | tRNA-Glu (CTC) | Na | |||||
| 7–14 | 24 | 23684–23759 | 52.6 | tRNA-Lys (CTT) | Na | |||||
| 7–14 | 25 | 23856–26709 | 52.5 | 23S rRNA gene | Na | |||||
| 7–14 | 26 | 27033–27109 | 63.6 | tRNA-Ile (GAT) | Na | |||||
| 7–14 | 27 | 27121–27196 | 53.9 | tRNA-Ala (TGC) | Na | |||||
| 7–14 | 28 | 27324–28847 | 53.5 | 16S rRNA gene | Na | |||||
| 7–14 | 29 | 29448–30365 | − | 57.4 | Germination and sporulation | Yes, from XIVa to IV (No) |
| ZP_02038021 | 1.00E−25 | 38 |
| 7–14 | 30 | 30362–31768 | − | 60.5 | Osmosensitive K+ channel histidine kinase kdpD | No |
| ZP_02038022 | 1.00E−120 | 52 |
| 7–14 | 31 | 31788–32465 | − | 55.8 | Two-component response regulator SA14-24 | No |
| ZP_02036840 | 7.00E−86 | 76 |
| 7–14 | 32 | 32486–32854 | − | 55.3 | Late competence protein comEA, DNA receptor | Unresolved | Cand. | YP_001718193 | 3.00E−15 | 58 |
| 7–14 | 33 | 33191–34558 | + | 57.1 | D-alanyl-D-alanine carboxypeptidase | No |
| ZP_02036838 | 3.00E−7 | 47 |
| 7–14 | 34 | 34607–35155 | − | 59.7 | Nitroreductase family protein | Unresolved (Yes, from XIVa to IV) |
| YP_001393744 | 9.00E−36 | 50 |
| 7–25 | 1 | 1–794 | + | 51.5 |
|
|
| ZP_02083674 | 6.00E−90 | 62 |
| 7–25 | 2 | 910–2469 | − | 57.8 | GMP synthase [glutamine-hydrolyzing] | No |
| ZP_02035344 | 0.00E+00 | 82 |
| 7–25 | 3 | 2447–2962 | − | 56.2 | Xanthine phosphoribosyltransferase | Unresolved |
| ZP_02207218 | 3.00E−43 | 54 |
| 7–25 | 4 | 3418–4239 | − | 60.5 | Nucleotide-binding protein | No |
| ZP_02421312 | 4.00E−91 | 64 |
| 7–25 | 5 | 4308–6572 | − | 61.3 | Chromosome partition protein smc | No |
| ZP_02034907 | 6.00E−103 | 35 |
| 7–25 | 6 | 7160–8682 | 53.6 | 16S rRNA gene | Na | |||||
| 7–25 | 7 | 8810–8885 | 55.3 | tRNA-Ala (TGC) | Na | |||||
| 7–25 | 8 | 8897–8973 | 64.9 | tRNA-Ile (GAT) | Na | |||||
| 7–25 | 9 | 9297–12150 | 52.5 | 23S rRNA gene | Na | |||||
| 7–25 | 10 | 12247–12322 | 53.9 | tRNA-Lys (CTT) | Na | |||||
| 7–25 | 11 | 12498–12572 | 58.7 | tRNA-Glu (CTC) | Na | |||||
| 7–25 | 12 | 12864–12939 | 57.9 | tRNA-Asn (GTT) | Na | |||||
| 7–25 | 13 | 12994–13070 | 61.0 | tRNA-Met (CAT) | Na | |||||
| 7–25 | 14 | 13108–13183 | 59.2 | tRNA-Trp (CCA) | Na | |||||
| 7–25 | 15 | 13246–13322 | 62.3 | tRNA-Asp (GTC) | Na | |||||
| 7–25 | 16 | 13328–13403 | 59.2 | tRNA-Thr (GGT) | Na | |||||
| 7–25 | 17 | 13613–14674 | + | 54.3 |
| No |
| ZP_02038860 | 4.00E−131 | 89 |
| 7–25 | 18 | 15027–15791 | + | 40.4 | Unknown | Unresolved (No) |
| NP_810500 | 1.00E−117 | 78 |
| 7–25 | 19 | 15913–17193 | + | 55.2 | Hypothetical protein | Unresolved |
| ZP_02421339 | 4.00E−86 | 44 |
| 7–25 | 20 | 17144–18088 | + | 56.9 | Hypothetical protein | Unresolved |
| ZP_02027484 | 5.00E−66 | 43 |
| 7–25 | 21 | 18370–19662 | + | 58.0 | Putative stomatin/prohibitin-family membrane protease subunit |
|
| NP_349972 | 2.00E−92 | 48 |
| 7–25 | 22 | 19560–20867 | + | 57.0 | Protein RtcB | No |
| ZP_02038919 | 2.00E−154 | 69 |
| 7–25 | 23 | 20864–21208 | + | 55.1 | Predicted nucleotidyltransferase | Unresolved |
| ZP_03048733 | 5.00E−16 | 59 |
| 7–25 | 24 | 22686–23951 | + | 53.7 | Hypothetical lipoprotein |
|
| ZP_02087827 | 0.00E+00 | 82 |
| 7–25 | 25 | 24010–25620 | + | 48.6 | ABC-type sugar transport system, ATP-binding protein |
|
| ZP_02087828 | 0.00E+00 | 89 |
| 7–25 | 26 | 25634–26713 | + | 54.8 | ABC transporter integral membrane protein |
|
| ZP_02087829 | 2.00E−164 | 88 |
| 7–25 | 27 | 26710–27834 | + | 54.3 | ABC transporter integral membrane protein |
|
| ZP_02087830 | 7.00E−167 | 84 |
| 7–25 | 28 | 27831–28319 | + | 54.6 | Hypothetical protein |
|
| ZP_02087831 | 6.00E−47 | 70 |
Na, not applicable.
CDS number from 5′ to 3′ on cloned insert.
The predicted function of each gene was determined as desribed in the text. Predicted mobile elements are in bold.
The occurrence and direction of LGT was determined using phylogenetic analysis, as described in the text. Genes that likely underwent LGT are in bold.
For CDSs in which the assessment of LGT differed between the neighbor joining and maximum likelihood based approaches the neighbor joining assessment is listed first and the maximum likelihood assessment is listed in parentheses.
As compared to the protein sequence database in GenBank. Top BLASTX hits that are members of Clostridium cluster IV are in bold.
Figure 2BLASTP result distribution across fosmids 7–14 and 7–25.
a) The X-axis indicates genera with at least 10 BLASTP hits throughout the ORFeome of the analyzed fosmids. Using a previously published approach [33] the organism distribution on a genus level was identified for each coding gene, e-values were grouped into ranges, and threshold levels were defined for minimum overall frequency. Genera are phylogenetically sorted. The Y-axis indicates respective e-value ranges. The frequency of hits for each genus in each e-value range (log scale) is shown by color coding and corresponding values are indicated in the figure. All BLASTP hits per genus per ORF were accepted. b) Same as a), except that custom databases of species from phylogenetically defined Clostridium clusters were used.