| Literature DB >> 17668047 |
Sandor E Karpathy1, Xiang Qin, Jason Gioia, Huaiyang Jiang, Yamei Liu, Joseph F Petrosino, Shailaja Yerrapragada, George E Fox, Susan Kinder Haake, George M Weinstock, Sarah K Highlander.
Abstract
Fusobacterium nucleatum is a prominent member of the oral microbiota and is a common cause of human infection. F. nucleatum includes five subspecies: polymorphum, nucleatum, vincentii, fusiforme, and animalis. F. nucleatum subsp. polymorphum ATCC 10953 has been well characterized phenotypically and, in contrast to previously sequenced strains, is amenable to gene transfer. We sequenced and annotated the 2,429,698 bp genome of F. nucleatum subsp. polymorphum ATCC 10953. Plasmid pFN3 from the strain was also sequenced and analyzed. When compared to the other two available fusobacterial genomes (F. nucleatum subsp. nucleatum, and F. nucleatum subsp. vincentii) 627 open reading frames unique to F. nucleatum subsp. polymorphum ATCC 10953 were identified. A large percentage of these mapped within one of 28 regions or islands containing five or more genes. Seventeen percent of the clustered proteins that demonstrated similarity were most similar to proteins from the clostridia, with others being most similar to proteins from other gram-positive organisms such as Bacillus and Streptococcus. A ten kilobase region homologous to the Salmonella typhimurium propanediol utilization locus was identified, as was a prophage and integrated conjugal plasmid. The genome contains five composite ribozyme/transposons, similar to the CdISt IStrons described in Clostridium difficile. IStrons are not present in the other fusobacterial genomes. These findings indicate that F. nucleatum subsp. polymorphum is proficient at horizontal gene transfer and that exchange with the Firmicutes, particularly the Clostridia, is common.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17668047 PMCID: PMC1924603 DOI: 10.1371/journal.pone.0000659
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Map of the FNP ATCC 10953 genome.
The inner circle (orange) shows the percent GC calculated using a sliding window of 5 kb. The triangles in the next circle show the location and directionality of tRNAs (red) and ncRNAs (blue). The next tract shows the coordinate scale; this is surrounded by the ORFs, on both strands. ORFs are colored by category, as follows: tan, cell processes; purple, cell structure; red; DNA replication and recombination; blue, general metabolism; green, regulation; yellow, transcription; orange, translation, cyan, transport; fuchsia, virulence; and black, unknown. The IStrons are indicated by the fuchsia arrowheads on the outside circle; the intact IStron is indicated with the star. Plasmid (green) and phage locations (cyan) also appear on the outside circle.
Figure 2Plasmid pFN3.
a) Map of plasmid pFN3. Replication and recombination ORFs are shown in blue and hypotheticals are colored green. b) Alignments of fusobacterial relaxase protein domains to mobilization class consensus motifs [66]. Consensus sequence abbreviations: uppercase letters, conserved; lowercase letters, present in 50% of sites; U or u, bulky hydrophobic residues (I, L, V, M, F, Y and W); -, no consensus at this site; Y, putative active site tyrosine residue. Asterisks (*) above residues indicate identity with consensus sequence. Alignments were performed using Clustal W [102] and then adjusted to best fit the consensus.
General genome statistics for FNP, FNN and FNV.
| Genome | Length (bp) | %GC content | % Coding | ORFs | Proteins | rRNA | tRNA | ncRNA |
| FNP | 2,429,698 | 26.84 | 94.28 | 2510 | 2391 | ND | 45 | 15 |
| FNN | 2,174,499 | 27.15 | 89.10 | 2129 | 2067 | 15 | 47 | ND |
| FNV | 2,118,259 | 27.56 | ND | 2277 | >2212 | 12 | 43 | ND |
F. nucleatum subsp. polymorphum ATCC 10953, this project, accession number CM000440.
F. nucleatum subsp. nucleatum ATCC 25586, accession number NC_003454.
F. nucleatum subsp. vincentii ATCC 49256, accession number NZ_AABF00000000.
ND, not determined.
Figure 3Intergenic repeats.
The repeats were aligned using ClustalW [102] and conserved bases were shaded using BOXSHADE (www.ch.embnet.org/software/BOX_form.html). Coordinates are shown in the left column.
Conserved gene clusters/operons in FNP.
| Group | Cluster | Conserved gene cluster | FNN |
|
|
|
| I | 1 | 16S, 23S, 5S rRNAs | ND | + | + | + |
| 1 | S10 operon ( | + | + | + | ||
| 2 | Str operon ( | + | + | + | + | |
| 4 | Spc operon ( | - | + | + | + | |
| 5 | L13 operon ( | + | + | + | + | |
| 6 | L11 operon ( | + | + | + | + | |
| 7 | Alpha operon ( | + | + | + | + | |
| 7 | L35 operon ( | + | + | + | + | |
| 8 | No L34 operon | + | - | - | - | |
| 10 | L21/L27 ( | + | + | + | + | |
| 11 | L10 operon ( | + | + | + | + | |
| II | 12 | ATPases ( | + | + | + | + |
| III | 13 | Beta operon ( | + | + | + | + |
| 14 | Initiation factor cluster ( | + | - | - | - | |
| 15 | RF-1 cluster ( | + | + | + | + | |
| 16 | Ribosome release factor cluster ( | + | + | + | + | |
| IV | 17 | Spermidine/putrescine ABC transport cluster ( | + | + | + | + |
| V | 18 | Chaperones cluster ( | + | + | + | + |
The rplX gene is duplicated in this operon in the FNN genome.
The L34 operon is present in the C. difficile, B. anthracis and E. coli genomes.
The ABC1 family protein gene is missing.
An L8A gene maps between nusA and infB.
ND, not determined.
Figure 4Whole genome display of FNP illustrating clustering of genes without hits in FNN or FNV.
Yellow boxes represent FNP genes with either FNN or FNV as top BLASTN hits (1835/2462 or 75%) and blue boxes represent genes whose top BLASTN hits are to genes from other organisms (627/2462 or 25%).
Figure 5Linear map of prophage located between nts 2,024,189 and 2,053,649 in FNP.
Replication and regulatory ORFs are colored blue, ORFs encoding structural proteins are red, ORFs encoding proteins with homologs in the nr database but of unknown function are colored green, and hypotheticals are shaded gray.
Potential virulence factor genes.
| Locus_Tag | Start | Stop | Gene | Definition |
|
| ||||
| FNP_0314 | 693051 | 692284 |
| VacJ family lipoprotein |
| FNP_0972 | 1330393 | 1329281 |
| porin FomA |
| FNP_1118 | 1459567 | 1458752 |
| undecaprenol kinase |
| FNP_1264 | 1619294 | 1618293 | probable microcin C7 self-immunity protein MccF | |
| FNP_1337 | 1692327 | 1693952 |
| fibronectin-binding protein A |
| FNP_1360 | 1721587 | 1723056 | MviN family protein | |
| FNP_1391 | 1750650 | 1752584 | possible autotransporter adhesin | |
| FNP_1446 | 1797366 | 1798136 | probable Bvg family transcriptional regulator | |
| FNP_1762 | 2106322 | 2107530 |
| acetyl-CoA acetyltransferase/immunosuppressive protein FipA |
| FNP_1880 | 2219936 | 2219178 | von Willebrand factor domain protein | |
| FNP_1881 | 2220088 | 2220807 |
| complement resistance protein TraT |
| FNP_1888 | 2226995 | 2228584 | von Willebrand factor domain protein | |
| FNP_1921 | 2263322 | 2265424 |
| ribonuclease R |
|
| ||||
| FNP_2146 | 62092 | 60956 | butyryl-CoA dehydrogenase | |
| FNP_0790 | 1158356 | 1159132 | 3-hydroxybutyryl-CoA dehydratase | |
| FNP_0791 | 1159148 | 1159987 |
| 3-hydroxybutyryl-CoA dehydrogenase |
| FNP_0969 | 1326816 | 1326151 |
| butyrate-acetoacetate CoA-transferase, beta subunit |
| FNP_0970 | 1327487 | 1326834 |
| butyrate–acetoacetate CoA-transferase, alpha subunit |
| FNP_0971 | 1329020 | 1327641 |
| MFS superfamily major facilitator short chain fatty acids symporter |
| FNP_1467 | 1817668 | 1818813 | butyryl-CoA dehydrogenase | |
| FNP_1762 | 2106322 | 2107530 |
| acetyl-CoA acetyltransferase |
|
| ||||
| FNP_2267 | 200871 | 200098 |
| heme ATP binding cassette transporter, ABC protein |
| FNP_2268 | 201893 | 200868 |
| heme ATP binding cassette transporter, membrane protein HmuU |
| FNP_2269 | 202768 | 201896 |
| heme ATP binding cassette transporter, binding protein HmuT |
| FNP_2270 | 205012 | 203039 | possible TonB-dependent iron (Fe) receptor | |
| FNP_2353 | 300274 | 299846 |
| ferric uptake regulator protein |
| FNP_0006 | 386027 | 386275 | possible alpha-hemolysin | |
| FNP_0155 | 520499 | 522103 |
| probable TPS family two-partner secretion family protein TpsB |
| FNP_0156 | 522114 | 530369 |
| probable TPS family two-partner secretion family exoprotein TpsA |
| FNP_0159 | 531328 | 532326 | possible hemolysin | |
| FNP_0338 | 716299 | 716574 | cobalamin/iron ATP binding cassette transporter, ABC protein | |
| FNP_0339 | 716593 | 717216 | cobalamin/iron ATP binding cassette transporter, ABC protein | |
| FNP_0340 | 717349 | 718404 | cobalamin/iron ATP binding cassette transporter, binding protein | |
| FNP_0341 | 718423 | 720105 | cobalamin/iron ATP binding cassette transporter, membrane protein | |
| FNP_0428 | 795363 | 796424 | iron ABC superfamily ATP binding cassette transporter, binding protein | |
| FNP_0429 | 796439 | 797554 | iron ABC superfamily ATP binding cassette transporter, ABC protein | |
| FNP_0430 | 797544 | 799190 | iron ABC superfamily ATP binding cassette transporter, membrane protein | |
| FNP_0531 | 896606 | 895221 | OfeT family oxidase-dependent iron transporter | |
| FNP_0999 | 1350397 | 1351044 | possible hemolysin III | |
| FNP_1246 | 1599267 | 1589866 |
| probable TPS family two-partner secretion exoprotein TpsA |
| FNP_1247 | 1601074 | 1599281 |
| pseudogene of TPS family two-partner secretion protein TpsB |
| FNP_1451 | 1801085 | 1801975 | iron ABC superfamily ATP binding cassette transporter, binding protein | |
| FNP_1452 | 1802030 | 1804210 | iron ABC superfamily ATP binding cassette transporter, binding protein | |
| FNP_1453 | 1805190 | 1805966 | iron ABC superfamily ATP binding cassette transporter, membrane protein | |
| FNP_1454 | 1805980 | 1805966 | iron ABC superfamily ATP binding cassette transporter, ABC protein | |
| FNP_1660 | 2022933 | 2021746 | probable Nramp family metal ion transporter | |
| FNP_1765 | 2114622 | 2112373 | possible TonB-dependent iron (Fe) receptor | |
|
| ||||
| FNP_0174 | 546224 | 547564 | MOP/MATE family multidrug-resistance efflux pump | |
| FNP_0388 | 760374 | 759283 |
| probable MFS superfamily major facilitator transporter, macrolide symporter |
| FNP_0507 | 872530 | 869465 | RND superfamily resistance-nodulation-cell division antiporter | |
| FNP_0508 | 873639 | 872533 | RND superfamily resistance-nodulation-cell division antiporter | |
| FNP_0622 | 986693 | 987907 | probable MFS superfamily major facilitator transporter, multidrug symporter | |
| FNP_0640 | 1009181 | 1010527 |
| MOP/MATE superfamily multidrug-resistance efflux pump NorM |
| FNP_0769 | 1137188 | 1136280 | DMT superfamily drug/metabolite transporter | |
| FNP_0890 | 1260254 | 1261621 |
| MOP/MATE family multidrug-resistance efflux pump NorM |
| FNP_1162 | 1513737 | 1512364 |
| MOP/MATE family multidrug-resistance efflux pump NorM |
| FNP_1207 | 1557921 | 1559300 | MOP/MATE family multidrug-resistance efflux pump | |
| FNP_1299 | 1653389 | 1652052 |
| MOP/MATE family multidrug-resistance efflux pump NorM |
| FNP_1503 | 1864175 | 1865314 | possible MFP membrane fusion protein family transporter | |
| FNP_1504 | 1865341 | 1866003 | antimicrobial peptide ATP binding cassette transporter, ABC protein | |
| FNP_1505 | 1866000 | 1867226 | antimicrobial peptide ATP binding cassette transporter, membrane protein | |
| FNP_1524 | 1881745 | 1882815 | possible DMT superfamily drug/metabolite transporter | |
| FNP_1596 | 1955589 | 1956911 | MOP/MATE family multidrug-resistance efflux pump | |
|
| ||||
| FNP_2175 | 93177 | 92392 | beta-lactamase | |
| FNP_0581 | 941979 | 943874 | beta-lactamase superfamily zinc-dependent hydrolase | |
| FNP_0627 | 993909 | 992767 | probable beta-lactamase superfamily zinc-dependent hydrolase | |
| FNP_0629 | 995488 | 994865 | beta-lactamase superfamily zinc-dependent hydrolase | |
|
| ||||
| FNP_2182 | 98598 | 97345 | probable lipoprotein | |
| FNP_2196 | 112459 | 110366 | outer membrane protein | |
| FNP_2270 | 205012 | 203039 | possible TonB-dependent outer membrane receptor | |
| FNP_2283 | 226990 | 219536 | AT family autotransporter | |
| FNP_2284 | 227596 | 227006 | outer membrane protein | |
| FNP_2361 | 315873 | 308506 | AT family autotransporter | |
| FNP_2362 | 316452 | 315883 | OmpA family outer membrane protein | |
| FNP_0032 | 413132 | 404055 | fusobacterial outer membrane protein | |
| FNP_0217 | 583789 | 591255 | fusobacterial outer membrane protein | |
| FNP_0314 | 693051 | 692284 |
| VacJ family lipoprotein |
| FNP_0378 | 751249 | 751797 | outer membrane protein OmpF | |
| FNP_0436 | 804352 | 805092 | outer membrane protein | |
| FNP_0509 | 874940 | 873669 | TolC family outer membrane protein | |
| FNP_0517 | 882518 | 883156 | probable outer membrane protein | |
| FNP_0668 | 1046781 | 1047941 | OmpA family outer membrane protein | |
| FNP_0820 | 1189564 | 1191015 | possible outer membrane protein P1 | |
| FNP_0972 | 1330393 | 1329281 |
| porin FomA |
| FNP_1046 | 1391420 | 1402057 | AT family autotransporter | |
| FNP_1248 | 1601352 | 1601074 | OmpW | |
| FNP_1784 | 2135004 | 2136413 | outer membrane protein TolC | |
| FNP_1877 | 2216275 | 2217012 | possible outer membrane protein | |
| FNP_1891 | 2231145 | 2236019 | probable outer membrane protein | |
| FNP_1996 | 2339205 | 2338783 | probable lipoprotein | |
|
| ||||
| FNP_2389 | 342089 | 340542 |
| general secretion pathway protein D |
| FNP_2396 | 346381 | 345881 | A24 family prepilin peptidase | |
| FNP_2397 | 346854 | 346266 |
| general secretion protein G |
| FNP_2398 | 348121 | 347081 |
| general secretion protein F |
| FNP_2399 | 349362 | 348118 |
| general secretion protein E |
| FNP_1034 | 1383491 | 1382541 |
| Tfp pilus assembly protein PilT |
| FNP_1868 | 2206819 | 2207523 |
| probable conjugal transfer protein TrbF/VirB8 |
| FNP_1869 | 2207535 | 2208371 |
| probable conjugal transfer protein TrbG/VirB9 |
| FNP_1870 | 2208381 | 2209583 |
| probable conjugal transfer protein TrbI/VirB10 |
| FNP_1871 | 2209586 | 2211616 |
| probable conjugal transfer protein TraG/VirD4 |
| FNP_1873 | 2212045 | 2213010 |
| probable conjugal transfer protein TrbB/VirB11 |
| FNP_1875 | 2213293 | 2216031 |
| probable conjugal transfer protein TrbE/VirB4 |
|
| ||||
| FNP_2152 | 69523 | 69237 | frameshift of AT family transporter | |
| FNP_2283 | 226990 | 219536 | AT family autotransporter | |
| FNP_2361 | 315873 | 308506 | AT family autotransporter | |
| FNP_0035 | 417960 | 415012 | possible autotransporter | |
| FNP_1046 | 1402057 | 1391420 | AT family autotransporter | |
| FNP_1391 | 1750650 | 1752584 | possible autotransporter adhesin | |
| FNP_1637 | 1986896 | 1998694 | AT family autotransporter | |
| FNP_2077 | 2423459 | 2420256 | AT family autotransporter | |
| FNP_0155 | 520499 | 522103 |
| probable TPS family two-partner secretion family protein TpsB |
| FNP_0156 | 522114 | 530369 |
| probable TPS family two-partner secretion family exoprotein TpsA |
| FNP_1246 | 1599267 | 1589866 |
| probable TPS family two-partner secretion exoprotein TpsA |
| FNP_1247 | 1601074 | 1599281 |
| pseudogene of TPS family two-partner secretion protein TpsB |
|
| ||||
| FNP_0461 | 825985 | 827004 | M50A family metalloprotease | |
| FNP_0897 | 1266394 | 1267038 | O-sialoglycoprotein endopeptidase | |
| FNP_1813 | 2167021 | 2168046 | O-sialoglycoprotein endopeptidase | |
|
| ||||
| FNP_2334 | 282271 | 280451 | O-antigen acetylase | |
| FNP_0533 | 898226 | 899365 | possible ADP-heptose:LPS heptosyltransferase | |
| FNP_0534 | 899377 | 900162 |
| lipopolysaccharide cholinephosphotransferase |
| FNP_0537 | 902005 | 903150 |
| glycosyltransferase |
| FNP_0538 | 903150 | 903902 | possible polysaccharide deacetylase | |
| FNP_0539 | 903907 | 904767 | possible glycosyltransferase | |
| FNP_0540 | 904782 | 905864 | possible glycosyltransferase | |
| FNP_0541 | 905879 | 906610 | probable glycosyltransferase | |
| FNP_0544 | 908004 | 909200 |
| possible O-antigen ligase |
| FNP_0830 | 1201261 | 1200185 |
| heptosyltransferase II (inner core) |
| FNP_1103 | 1443645 | 1442839 | possible glycosyltransferase | |
| FNP_1104 | 1444893 | 1443745 | UDP-N-acetylglucosamine 2-epimerase | |
| FNP_1105 | 1446154 | 1444898 |
| CMP-N-acetylneuraminate cytidylyltransferase |
| FNP_1106 | 1447200 | 1446157 |
| possible N-acetyl neuramic acid synthetase |
| FNP_1107 | 1447822 | 1447205 | N-acetylneuraminate synthase | |
| FNP_1108 | 1449226 | 1447934 | oligosaccharidyl-lipid/polysaccharide flippase | |
| FNP_1109 | 1450280 | 1449300 | possible lipooligosaccharide sialyltransferase | |
| FNP_1205 | 1556498 | 1557478 |
| possible ADP-heptose synthase |
| FNP_1807 | 2161658 | 2162707 |
| LPS heptosyltransferase II |
| FNP_1808 | 2162704 | 2163732 |
| LPS heptosyltransferase II |
| FNP_1809 | 2163725 | 2164327 |
| possible lipopolysaccharide core biosynthesis protein WaaY |
| FNP_1810 | 2164329 | 2165336 |
| lipopolysaccharide core biosynthesis glycosyl transferase WaaQ |
| FNP_1907 | 2251044 | 2251837 |
| UDP-3-O-acyl-N-acetylglucosamine deacetylase |
| FNP_1909 | 2252299 | 2253072 |
| acyl-[acyl-carrier-protein]–UDP-N-acetylglucosamine O-acyltransferase |
| FNP_1911 | 2253885 | 2254955 |
| lipid-A-disaccharide synthase |
Gene appears in more than one category within the table.