| Literature DB >> 28989277 |
Nicolas Carels1, Marcial Gumiel2, Fabio Faria da Mota3, Carlos José de Carvalho Moreira4, Patricia Azambuja2,5.
Abstract
The digestive tract of triatomines (DTT) is an ecological niche favored by microbiota whose enzymatic profile is adapted to the specific substrate availability in this medium. This report describes the molecular enzymatic properties that promote bacterial prominence in the DTT. The microbiota composition was assessed previously based on 16S ribosomal DNA, and whole sequenced genomes of bacteria from the same genera were used to calculate the GC level of rare and prominent bacterial species in the DTT. The enzymatic reactions encoded by coding sequences of both rare and common bacterial species were then compared and revealed key functions explaining why some genera outcompete others in the DTT. Representativeness of DTT microbiota was investigated by shotgun sequencing of DNA extracted from bacteria grown in liquid Luria-Bertani broth (LB) medium. Results showed that GC-rich bacteria outcompete GC-poor bacteria and are the dominant components of the DTT microbiota. In addition, oxidoreductases are the main enzymatic components of these bacteria. In particular, nitrate reductases (anaerobic respiration), oxygenases (catabolism of complex substrates), acetate-CoA ligase (tricarboxylic acid cycle and energy metabolism), and kinase (signaling pathway) were the major enzymatic determinants present together with a large group of minor enzymes including hydrogenases involved in energy and amino acid metabolism. In conclusion, despite their slower growth in liquid LB medium, bacteria from GC-rich genera outcompete the GC-poor bacteria because their specific enzymatic abilities impart a selective advantage in the DTT.Entities:
Keywords: EC number; GC content; ecological niche; gene number; genome size; midgut
Year: 2017 PMID: 28989277 PMCID: PMC5624349 DOI: 10.1177/1177932217733422
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Sequence materials for GC and enzymatic characterization.
| Genera | Freq. | Whole genome sequences | GC |
|---|---|---|---|
|
| 2 | NA | |
|
| 1 | NA | |
|
| 1 | NA | |
|
| Endo | GenBank | Poor |
|
| 1 | Bradyrhizobium_japonicum_USDA_6_uid158851/NC_017249.fna | Rich |
|
| 1 | NA | |
|
| 1 | Comamonas_testosteroni_CNB_2_uid62961/NC_013446.fna | Rich |
|
| 2015 | Corynebacterium_terpenotabidum_Y_11_uid210639/NC_021663.fna | Rich |
|
| 2 | NA | |
|
| 5008 | GenBank | Rich |
|
| 6 | Enterococcus_faecalis_D32_uid171261/NC_018221.fna | Poor |
|
| Low | Erwinia_amylovora_ATCC_49946_uid46943/NC_013971.fna | Rich |
|
| 2 | NA | |
|
| 11825 | Gordonia_polyisoprenivorans_VH2_uid86651/NC_016906.fna | Rich |
|
| 1 | Haemophilus_somnus_2336_uid57979/NC_010519.fna | Poor |
|
| 15 | NA | |
|
| 1 | NA | |
|
| 1 | Marinomonas_posidonica_IVIA_Po_181_uid67323/NC_015559.fna | Poor |
|
| 2 | NA | |
|
| 5737 | Mycobacterium_marinum_M_uid59423/NC_010612.fna | Rich |
|
| Low | Pectobacterium_carotovorum_PC1_uid59295/NC_012917.fna | Rich |
|
| 5 | Propionibacterium_propionicum_F0230a_uid170533/NC_018142.fna | Rich |
|
| 4 | Pseudomonas_aeruginosa_RP73_uid209328/NC_021577.fna | Rich |
|
| 1855 | Rhodococcus_opacus_B4_uid13791/NC_012522.fna | Rich |
|
| 4917 | Serratia_marcescens_FGI94_uid185180/NC_020064.fna | Rich |
|
| 2 | NA | |
|
| 1 | Sphingomonas_wittichii_RW1_uid58691/NC_009511.fna | Rich |
|
| 3 | Staphylococcus_saprophyticus_ATCC_15305_uid58411/NC_007350.fna | Poor |
|
| 1 | Stenotrophomonas_maltophilia_D457_uid162199/NC_017671.fna | Rich |
|
| 1 | Streptococcus_oralis_Uo5_uid65449/NC_015291.fna | Poor |
|
| 2 | NA | |
|
| 15 | NA | |
|
| Endo | Wolbachia_endosymbiont_of_Culex_quinquefasciatus_Pel_uid61645/NC_010981.fna | Poor |
|
| 1 | Xanthobacter_autotrophicus_Py2_uid58453/NC_009720.fna | Rich |
Abbreviation: NA, not available.
Bacterial species detected by Gumiel et al.[8]
Bacterial species detected by da Mota et al.[7]
Freq. is for the numbers in Table 2 of Gumiel et al[8] indicating the maximum absolute number of times a genus was detected over each of 4 samples. Low is for the low but uncharacterized level of detection of a genus by DGGE in da Mota et al.[7] Endo is for the high but uncharacterized level of detection of an endosymbiont by DGGE in da Mota et al.[7]
GC content of coding sequences associated with bacterial genera in digestive tract of triatomines.
| Phylum | Class | Order | Suborder | Family | Species | N | GC class | GC, % | σGC | GC1, % | σGC1 | GC2, % | σGC2 | GC3, % | σGC3 | Fr[ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Firmicutes | Bacilli | Bacillales | NA | Staphylococcaceae |
| 1844 | Poor | 33.8 | 4.6 | 46.8 | 5.4 | 31.7 | 4.6 | 22.9 | 3.8 | L[ |
| Proteobacteria | α-Proteobacteria | Rickettsiales | NA | Anaplasmataceae | 1037 | Poor | 34.4 | 4.5 | 44.3 | 4.8 | 32.9 | 4.3 | 26.1 | 4.3 | Endo[ | |
| Proteobacteria | γ-Proteobacteria | Pasteurellales | NA | Pasteurellaceae |
| 1601 | Poor | 37.5 | 5.1 | 49.5 | 5.1 | 34.9 | 4.5 | 28.3 | 5.7 | L |
| Firmicutes | Bacilli | Lactobacillales | NA | Enterococcaceae |
| 2249 | Poor | 37.8 | 4.7 | 49.4 | 5.0 | 34.3 | 4.7 | 29.8 | 4.5 | L |
| Firmicutes | Bacilli | Lactobacillales | NA | Streptococcaceae |
| 1451 | Poor | 41.3 | 5.9 | 52.2 | 5.8 | 34.0 | 4.6 | 37.7 | 7.2 | L |
| Proteobacteria | γ-Proteobacteria | Oceanospirillales | NA | Oceanospirillaceae |
| 2795 | Poor | 44.6 | 4.6 | 54.0 | 4.4 | 37.9 | 3.9 | 41.7 | 5.3 | L |
| Proteobacteria | γ-Proteobacteria | Enterobacteriales | NA | Enterobacteriaceae |
| 306 | Poor | 42.3 | 6.0 | 45.2 | 5.9 | 39.6 | 6.0 | 42.2 | 6.1 | Endo |
| Proteobacteria | γ-Proteobacteria | Enterobacteriales | NA | Enterobacteriaceae |
| 3322 | Rich | 52.2 | 6.4 | 59.0 | 6.0 | 40.9 | 4.7 | 56.7 | 8.6 | L |
| Proteobacteria | γ-Proteobacteria | Enterobacteriales | NA | Enterobacteriaceae |
| 2478 | Rich | 53.9 | 6.7 | 60.2 | 6.4 | 41.7 | 4.9 | 59.6 | 9.0 | L |
| Actinobacteria | Actinobacteria | Actinomycetales | Corynebacterineae | Dietziaceae | 157 | Rich | 68.0 | 4.7 | 69.1 | 5.3 | 66.3 | 4.3 | 68.6 | 4.4 | H[ | |
| Proteobacteria | γ-Proteobacteria | Enterobacteriales | NA | Enterobacteriaceae |
| 3620 | Rich | 59.4 | 7.5 | 63.2 | 6.6 | 42.7 | 5.1 | 72.5 | 10.8 | H |
| Proteobacteria | β-Proteobacteria | Burkholderiales | NA | Comamonadaceae |
| 3941 | Rich | 62.0 | 6.2 | 65.2 | 5.3 | 45.9 | 4.9 | 74.8 | 8.3 | L |
| Actinobacteria | Actinobacteria | Actinomycetales | Corynebacterineae | Mycobacteriaceae |
| 4464 | Rich | 65.6 | 5.4 | 68.0 | 4.9 | 50.6 | 5.5 | 78.0 | 5.8 | H |
| Proteobacteria | α-Proteobacteria | Rhizobiales | NA | Bradyrhizobiaceae |
| 7029 | Rich | 64.0 | 6.1 | 65.1 | 4.9 | 47.9 | 4.9 | 78.9 | 8.6 | L |
| Actinobacteria | Actinobacteria | Actinomycetales | Corynebacterineae | Gordoniaceae |
| 3870 | Rich | 67.1 | 5.4 | 69.0 | 5.0 | 51.1 | 5.2 | 81.3 | 6.0 | H |
| Actinobacteria | Actinobacteria | Actinomycetales | NA | Propionibacteriaceae |
| 2292 | Rich | 66.4 | 6.3 | 68.4 | 5.5 | 49.2 | 5.4 | 81.6 | 8.0 | L |
| Proteobacteria | γ-Proteobacteria | Xanthomonadales | NA | Xanthomonadaceae |
| 3275 | Rich | 67.2 | 5.7 | 69.8 | 5.6 | 48.8 | 5.4 | 83.0 | 6.2 | L |
| Actinobacteria | Actinobacteria | Actinomycetales | Corynebacterineae | Corynebacteriaceae |
| 1861 | Rich | 67.3 | 5.4 | 68.8 | 5.1 | 49.7 | 5.3 | 83.3 | 5.7 | H |
| Actinobacteria | Actinobacteria | Actinomycetales | Corynebacterineae | Nocardiaceae |
| 5908 | Rich | 67.9 | 5.7 | 69.7 | 4.9 | 50.6 | 5.3 | 83.5 | 6.7 | H |
| Proteobacteria | α-Proteobacteria | Rhizobiales | NA | Xanthobacteraceae |
| 3783 | Rich | 67.6 | 6.0 | 69.1 | 5.6 | 49.7 | 5.3 | 84.0 | 7.3 | L |
| Proteobacteria | γ-Proteobacteria | Pseudomonadales | NA | Pseudomonadaceae |
| 4627 | Rich | 66.6 | 6.4 | 68.7 | 5.8 | 46.8 | 5.8 | 84.3 | 7.5 | L |
| Proteobacteria | α-Proteobacteria | Sphingomonadales | NA | Sphingomonadaceae |
| 4069 | Rich | 68.7 | 5.9 | 69.3 | 5.6 | 50.0 | 5.2 | 86.7 | 6.9 | L |
Abbreviation: NA, not available.
Fr is for the frequency of bacterial species reported in Gumiel et al.[8]
L is for low frequency.
Endo is for endosymbiont.
H is for high frequency and is highlighted in gray background.
The shading regions in Table 2 is to improve the contrast between GC-rich (gray) and GC-poor (white) genomes.
Relationships between GC3, genome size, and gene number in the representative bacterial species with complete genome sequence of bacterial genera found in the intestinal tract of triatomines.
| Species | GC3 | Genome, bp | Gene, nb |
|---|---|---|---|
| 22.9 | 2 516 573 | 2445 | |
| 28.3 | 2 263 855 | 1980 | |
| 29.8 | 2 987 449 | 2876 | |
| 37.7 | 1 958 688 | 1905 | |
| 41.7 | 3 899 938 | 3491 | |
| 56.7 | 4 862 911 | 4246 | |
| 59.6 | 3 805 872 | 3437 | |
| 72.5 | 4 858 215 | 4361 | |
| 74.8 | 5 373 642 | 4802 | |
| 78.0 | 6 636 826 | 5423 | |
| 78.9 | 9 207 382 | 8826 | |
| 81.3 | 5 669 804 | 4945 | |
| 81.6 | 3 449 358 | 2938 | |
| 83.0 | 4 769 154 | 4101 | |
| 83.3 | 2 751 232 | 2369 | |
| 83.5 | 7 913 449 | 7246 | |
| 84.0 | 5 308 932 | 4746 | |
| 84.3 | 6 342 033 | 5762 | |
| 86.7 | 5 382 259 | 4850 | |
| Correlations | |||
| | 0.66 | — | — |
|
| — | 0.61 | — |
|
| — | — | 0.99 |
Correlation for GC3 vs genome size.
Correlation for GC3 vs gene number.
Correlation for genome size vs gene number.
The shading regions in Table 3 is to improve the contrast between GC-rich (gray) and GC-poor (white) genomes.
Relative frequency (%) of enzymatic functions in GC-poor and GC-rich bacterial species found in triatomine digestive tract according to the first EC number digit.
| ECNO. | Ss | Hs | Ef | So | Average | SD | Sm | Mm | Gp | Ct | Ro | Average | SD | AvGCr/AvGCp[ | Class of enzyme function |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GC3,% | 22.9 | 28.3 | 29.8 | 37.7 | 72.5 | 78.0 | 81.3 | 83.3 | 83.5 | ||||||
| 1. -.-.- | 17.9 | 14.4 | 12.7 | 12.8 | 14.4 | 2.4 | 21.2 | 31.6 | 30.5 | 20.3 | 34.2 | 27.5 | 6.4 | 1.9 | Oxidoreductases |
| 2. -.-.- | 31.9 | 33.8 | 31.6 | 33.7 | 32.8 | 1.2 | 29.5 | 28.8 | 27.6 | 32.4 | 25.3 | 28.7 | 2.6 | 0.9 | Transferases |
| 3. -.-.- | 29.3 | 27.2 | 35.5 | 33.9 | 31.5 | 3.9 | 28.3 | 21.2 | 19.6 | 25.3 | 19.8 | 22.8 | 3.8 | 0.7 | Hydrolases |
| 4. -.-.- | 8.2 | 10.0 | 6.9 | 7.2 | 8.1 | 1.4 | 10.1 | 8.5 | 9.6 | 9.3 | 8.9 | 9.3 | 0.6 | 1.1 | Lyases |
| 5. -.-.- | 5.3 | 7.5 | 6.7 | 5.1 | 6.1 | 1.1 | 6.2 | 3.3 | 4.5 | 4.3 | 4.3 | 4.5 | 1.0 | 0.7 | Isomerases |
| 6. -.-.- | 7.5 | 7.1 | 6.6 | 7.3 | 7.1 | 0.4 | 4.7 | 6.6 | 8.3 | 8.4 | 7.6 | 7.1 | 1.5 | 1.0 | Ligases |
Abbreviations: Ct, Corynebacterium terpenotabidum; EC no., Enzyme Commission number; Ef, Enterococcus faecalis; Gp, Gordonia polyisoprenivorans; Hs, Haemophilus somnus; Mm, Mycobacterium marinum; Ro, Rhodococcus opacus; Sm, Stenotrophomonas maltophilia; So, Streptococcus oralis; Ss, Staphylococcus saprophyticus.
Factor difference where AvGCr is for average of GC-rich and AvGCp is for average of GC-poor.
The shading regions in Table 4 is to improve the contrast between GC-rich (gray) and GC-poor (white) genomes.
Relative frequency (%) of enzymatic functions in GC-poor and GC-rich bacterial species found in triatomine digestive tract according to the 2 first EC number digits.
| ECNO. | Ss | Hs | Ef | So | Average | SD | Sm | Mm | Gp | Ct | Ro | Average | SD | AvGCr/AvGCp[ | Class of enzyme function |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GC3,% | 22.9 | 28.3 | 29.8 | 37.7 | 72.5 | 78.0 | 81.3 | 83.3 | 83.5 | ||||||
| 1.3.-.- | 1.3 | 1.4 | 1.2 | 1.3 | 1.3 | 0.1 | 1.6 | 5.1 | 4.6 | 2.8 | 5.2 | 3.8 | 1.6 | 2.9 | Acting on the CH-CH group of donors |
| 1.4.-.- | 0.9 | 0.2 | 0.3 | 0.3 | 0.4 | 0.3 | 0.9 | 0.9 | 0.9 | 1.3 | 1.6 | 1.1 | 0.3 | 2.7 | Acting on the CH-NH2 group of donors (amino acid oxidoreductase) |
| 1.10.-.- | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.4 | 0.1 | 0.2 | 0.3 | 0.1 | 0.2 | 0.1 | ∞ | Acting on diphenols and related substances as donors (diphenol oxidoreductases) |
| 1.13.-.- | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | 0.6 | 0.6 | 0.8 | 0.1 | 1.5 | 0.7 | 0.5 | 24.5 | Acting on single donors with incorporation of molecular oxygen (monooxygenases) |
| 1.14.-.- | 0.8 | 0.6 | 0.1 | 0.1 | 0.4 | 0.4 | 1.4 | 4.3 | 4.9 | 2.1 | 3.8 | 3.3 | 1.5 | 8.2 | Acting on paired donors, with incorporation or reduction of molecular oxygen (dioxygenases) |
| 1.18.-.- | 0.1 | 0.0 | 0.2 | 0.0 | 0.1 | 0.1 | 0.3 | 0.2 | 0.4 | 0.4 | 0.5 | 0.4 | 0.1 | 5.0 | Acting on iron-sulfur proteins as donors |
| 3.3.-.- | 0.1 | 0.0 | 0.1 | 0.0 | 0.0 | 0.1 | 0.1 | 0.4 | 0.2 | 0.3 | 0.4 | 0.3 | 0.1 | 6.1 | Acting on ether bonds |
| 3.7.-.- | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1 | 0.2 | 0.0 | 0.2 | 0.1 | 0.1 | 4.4 | Acting on carbon-carbon bonds |
| 5.5.-.- | 0.0 | 0.1 | 0.0 | 0.1 | 0.1 | 0.1 | 0.2 | 0.1 | 0.2 | 0.1 | 0.1 | 0.1 | 0.0 | 2.2 | Intramolecular lyases |
| 6.2.-.- | 0.6 | 0.5 | 0.5 | 0.1 | 0.4 | 0.2 | 0.5 | 2.3 | 1.7 | 0.6 | 3.3 | 1.7 | 1.2 | 4.0 | Forming carbon-sulfur bonds |
Abbreviations: Ct, Corynebacterium terpenotabidum; EC no., Enzyme Commission number; Ef, Enterococcus faecalis; Gp, Gordonia polyisoprenivorans; Hs, Haemophilus somnus; Mm, Mycobacterium marinum; Ro, Rhodococcus opacus; Sm, Stenotrophomonas maltophilia; So, Streptococcus oralis; Ss, Staphylococcus saprophyticus.
Factor difference where AvGCr is for average of GC-rich and AvGCp is for average of GC-poor.
The shading regions in Table 5 is to improve the contrast between GC-rich (gray) and GC-poor (white) genomes and is expected to improve table readability.
Relative frequency (%) of enzymatic functions in GC-poor and GC-rich bacterial species found in the triatomine digestive tract according to the 3 first EC number digits.
| ECNO. | Ss | Hs | Ef | So | Average | SD | Sm | Mm | Gp | Ct | Ro | Average | SD | AvGCr/AvGCp[ | Class of enzyme function |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GC3,% | 22.9 | 28.3 | 29.8 | 37.7 | 72.5 | 78.0 | 81.3 | 83.3 | 83.5 | ||||||
| 1.1.2.- | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.2 | 0.1 | 0.1 | 0.0 | 0.1 | 0.1 | ∞ | With a cytochrome as acceptor |
| 1.1.3.- | 0.0 | 0.0 | 0.1 | 0.1 | 0.1 | 0.1 | 0.0 | 0.2 | 0.3 | 0.0 | 0.4 | 0.2 | 0.2 | 2.8 | With oxygen as acceptor |
| 1.1.99.- | 0.1 | 0.0 | 0.1 | 0.0 | 0.1 | 0.1 | 0.5 | 0.1 | 0.3 | 0.1 | 0.2 | 0.2 | 0.2 | 4.8 | With unknown physiological acceptors |
| 1.2.1.- | 1.5 | 0.9 | 0.9 | 0.3 | 0.9 | 0.5 | 1.5 | 2.1 | 2.6 | 1.8 | 3.5 | 2.3 | 0.8 | 2.6 | With NAD+ or NADP+ as acceptor |
| 1.3.99.- | 0.3 | 0.6 | 0.2 | 0.1 | 0.3 | 0.2 | 0.6 | 4.5 | 3.4 | 1.8 | 4.7 | 3.0 | 1.8 | 9.6 | With unknown physiological acceptors |
| 1.4.1.- | 0.6 | 0.1 | 0.3 | 0.1 | 0.3 | 0.2 | 0.5 | 0.3 | 0.5 | 0.7 | 0.9 | 0.6 | 0.2 | 2.0 | With NAD+ or NADP+ as acceptor |
| 1.4.3.- | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | 0.3 | 0.5 | 0.4 | 0.3 | 0.6 | 0.4 | 0.1 | 14.2 | With oxygen as acceptor |
| 1.4.4.- | 0.2 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1 | 0.2 | 0.0 | 0.3 | 0.1 | 0.1 | 0.1 | 2.8 | With a disulfide as acceptor |
| 1.5.3.- | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.2 | 0.1 | 0.5 | 0.1 | 0.2 | 0.2 | 0.2 | ∞ | With oxygen as acceptor |
| 1.6.1.- | 0.0 | 0.2 | 0.0 | 0.0 | 0.1 | 0.1 | 0.2 | 0.3 | 0.3 | 0.1 | 0.1 | 0.2 | 0.1 | 3.5 | With NAD+ or NADP+ as acceptor |
| 1.7.99.- | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | 0.7 | 0.1 | 0.0 | 0.0 | 0.3 | 0.2 | 0.3 | 6.7 | With unknown physiological acceptors |
| 1.9.3.- | 0.0 | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.2 | 0.3 | 0.1 | 0.1 | 0.1 | 5.3 | With oxygen as acceptor |
| 1.13.11.- | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | 0.5 | 0.4 | 0.6 | 0.0 | 1.2 | 0.5 | 0.4 | 17.5 | With incorporation of 2 atoms of oxygen |
| 1.13.12.- | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.3 | 0.3 | 0.1 | 0.5 | 0.2 | 0.2 | ∞ | With incorporation of 1 atom of oxygen (internal monooxygenases or internal mixed function oxidases) |
| 1.14.11.- | 0.0 | 0.0 | 0.0 | 0.1 | 0.0 | 0.1 | 0.2 | 0.1 | 0.3 | 0.0 | 0.2 | 0.1 | 0.1 | 4.3 | With 2-oxoglutarate as 1 donor, and incorporation of 1 atom each of oxygen into both donors |
| 1.14.12.- | 0.2 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.2 | 0.0 | 0.2 | 0.4 | 0.4 | 0.2 | 0.2 | 4.5 | With NADH or NADPH as 1 donor, and incorporation of 2 atoms of oxygen into 1 donor |
| 1.14.13.- | 0.5 | 0.5 | 0.1 | 0.0 | 0.3 | 0.3 | 0.6 | 1.0 | 3.0 | 0.6 | 1.7 | 1.4 | 1.0 | 4.9 | With NAD or NADH as 1 donor, and incorporation of 1 atom of oxygen |
| 1.14.99.- | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.0 | 0.1 | 0.3 | 0.3 | 0.4 | 0.2 | 0.2 | 9.2 | Miscellaneous |
| 1.17.7.- | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1 | 0.1 | 0.0 | 0.0 | 0.1 | 0.0 | 2.1 | With an iron-sulfur protein as acceptor |
| 1.18.1.- | 0.1 | 0.0 | 0.2 | 0.0 | 0.1 | 0.1 | 0.2 | 0.2 | 0.4 | 0.4 | 0.6 | 0.4 | 0.2 | 5.1 | With NAD+ or NADP+ as acceptor |
| 2.7.11.- | 0.4 | 0.0 | 0.2 | 0.3 | 0.2 | 0.2 | 0.2 | 0.8 | 1.1 | 0.6 | 1.3 | 0.8 | 0.4 | 3.6 | Protein-serine/threonine kinases |
| 2.8.3.- | 0.1 | 0.2 | 0.0 | 0.0 | 0.1 | 0.1 | 0.2 | 0.7 | 0.5 | 0.6 | 1.4 | 0.7 | 0.4 | 7.7 | CoA-transferases |
| 2.10.1.- | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1 | 0.1 | 0.0 | 0.0 | 0.1 | 0.0 | 2.1 | Molybdenumtransferases or tungstentransferases with sulfide groups as acceptors |
| 3.1.6.- | 0.0 | 0.2 | 0.0 | 0.1 | 0.1 | 0.1 | 0.2 | 0.3 | 0.1 | 0.1 | 0.4 | 0.2 | 0.1 | 2.4 | Sulfuric ester hydrolases |
| 3.3.2.- | 0.1 | 0.0 | 0.1 | 0.0 | 0.1 | 0.1 | 0.1 | 0.4 | 0.2 | 0.1 | 0.4 | 0.3 | 0.2 | 5.1 | Ether hydrolases |
| 4.3.3.- | 0.0 | 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.3 | 0.0 | 0.0 | 0.1 | 0.0 | 0.1 | 0.1 | 4.1 | Amine-lyases |
| 5.1.99.- | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.3 | 0.1 | 0.3 | 0.2 | 0.1 | ∞ | Acting on other compounds |
| 5.3.3.- | 0.1 | 0.0 | 0.1 | 0.1 | 0.1 | 0.1 | 0.3 | 0.4 | 0.3 | 0.1 | 0.2 | 0.3 | 0.1 | 3.2 | Transposing C=C bonds |
| 5.5.1.- | 0.0 | 0.1 | 0.0 | 0.1 | 0.1 | 0.1 | 0.2 | 0.1 | 0.2 | 0.1 | 0.1 | 0.2 | 0.0 | 2.4 | Miscellaneous |
| 6.2.1.- | 0.6 | 0.5 | 0.5 | 0.1 | 0.4 | 0.2 | 0.6 | 2.6 | 1.9 | 0.6 | 3.6 | 1.9 | 1.3 | 4.3 | Miscellaneous |
Abbreviations: Ct, Corynebacterium terpenotabidum; EC no., Enzyme Commission number; Ef, Enterococcus faecalis; Gp, Gordonia polyisoprenivorans; Hs, Haemophilus somnus; Mm, Mycobacterium marinum; Ro, Rhodococcus opacus; Sm, Stenotrophomonas maltophilia; So, Streptococcus oralis; Ss, Staphylococcus saprophyticus.
Factor difference where AvGCr is for average of GC-rich and AvGCp is for average of GC-poor.
The shading regions in Table 6 is to improve the contrast between GC-rich (gray) and GC-poor (white) genomes.
Relative frequency (%) of enzymatic functions in GC-poor and GC-rich bacterial species found in the triatomine digestive tract according to the 4 EC number digits.
| ECNO. | Ss | Hs | Ef | So | Average | SD | Sm | Mm | Gp | Ct | Ro | Average | SD | AvGCr/AvGCp[ | Class of enzyme function |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GC3,% | 22.9 | 28.3 | 29.8 | 37.7 | 72.5 | 78.0 | 81.3 | 83.3 | 83.5 | ||||||
| 1.1.99.1 | 1.0 | 0.0 | 0.0 | 0.0 | 0.3 | 0.5 | 2.0 | 1.0 | 1.0 | 1.0 | 2.0 | 1.4 | 0.5 | 5.6 | Choline dehydrogenase |
| 1.2.1.2 | 4.0 | 1.0 | 1.0 | 0.0 | 1.5 | 1.7 | 7.0 | 8.0 | 7.0 | 2.0 | 9.0 | 6.6 | 2.7 | 4.4 | Formate dehydrogenase |
| 1.2.1.3 | 3.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.4 | 3.0 | 4.0 | 5.0 | 2.0 | 14.0 | 5.6 | 4.8 | 5.6 | Aldehyde dehydrogenase (NAD+) |
| 1.2.1.7 | 1.0 | 1.0 | 0.0 | 0.0 | 0.5 | 0.6 | 1.0 | 3.0 | 2.0 | 2.0 | 1.0 | 1.8 | 0.8 | 3.6 | Benzaldehyde dehydrogenase (NADP+) |
| 1.2.1.8 | 1.0 | 0.0 | 0.0 | 0.0 | 0.3 | 0.5 | 5.0 | 0.0 | 1.0 | 1.0 | 2.0 | 1.8 | 1.9 | 7.2 | Betaine-aldehyde dehydrogenase |
| 1.2.1.10 | 0.0 | 0.0 | 3.0 | 0.0 | 0.8 | 1.5 | 1.0 | 1.0 | 1.0 | 0.0 | 5.0 | 1.6 | 1.9 | 2.1 | Acetaldehyde dehydrogenase (acetylating) |
| 1.2.1.16 | 1.0 | 1.0 | 0.0 | 0.0 | 0.5 | 0.6 | 1.0 | 1.0 | 2.0 | 1.0 | 2.0 | 1.4 | 0.5 | 2.8 | Glyceraldehyde-3-phosphate dehydrogenase (NADP+) (phosphorylating) |
| 1.2.1.27 | 0.0 | 1.0 | 0.0 | 0.0 | 0.3 | 0.5 | 1.0 | 1.0 | 3.0 | 0.0 | 5.0 | 2.0 | 2.0 | 8.0 | Methylmalonate-semialdehyde dehydrogenase (acylating) |
| 1.2.1.38 | 1.0 | 0.0 | 0.0 | 0.0 | 0.3 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 1.0 | 0.8 | 0.4 | 3.2 | |
| 1.2.1.70 | 1.0 | 1.0 | 0.0 | 0.0 | 0.5 | 0.6 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 2.0 | Glutamyl-tRNA reductase |
| 1.3.99.1 | 2.0 | 3.0 | 1.0 | 0.0 | 1.5 | 1.3 | 8.0 | 6.0 | 9.0 | 2.0 | 7.0 | 6.4 | 2.7 | 4.3 | Succinate dehydrogenase |
| 1.4.1.1 | 3.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.4 | 3.0 | 4.0 | 6.0 | 4.0 | 8.0 | 5.0 | 2.0 | 5.0 | Alanine dehydrogenase |
| 1.4.1.2 | 2.0 | 0.0 | 0.0 | 0.0 | 0.5 | 1.0 | 2.0 | 1.0 | 0.0 | 0.0 | 6.0 | 1.8 | 2.5 | 3.6 | Glutamate dehydrogenase |
| 1.4.1.13 | 2.0 | 0.0 | 1.0 | 0.0 | 0.8 | 1.0 | 3.0 | 3.0 | 5.0 | 2.0 | 6.0 | 3.8 | 1.6 | 5.1 | Glutamate synthase (NADPH) |
| 1.4.3.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 1.0 | 2.0 | 3.0 | 1.6 | 0.9 | ∞ | |
| 1.4.4.2 | 2.0 | 0.0 | 0.0 | 0.0 | 0.5 | 1.0 | 2.0 | 3.0 | 0.0 | 2.0 | 3.0 | 2.0 | 1.2 | 4.0 | Glycine dehydrogenase (decarboxylating) |
| 1.5.3.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.0 | 1.0 | 5.0 | 1.0 | 5.0 | 3.2 | 2.0 | ∞ | Sarcosine oxidase |
| 1.6.1.2 | 0.0 | 2.0 | 0.0 | 0.0 | 0.5 | 1.0 | 2.0 | 2.0 | 3.0 | 1.0 | 3.0 | 2.2 | 0.8 | 4.4 | NAD(P)+ transhydrogenase (Re/Si-specific) |
| 1.7.99.4 | 0.0 | 1.0 | 0.0 | 0.0 | 0.3 | 0.5 | 12.0 | 1.0 | 0.0 | 0.0 | 6.0 | 3.8 | 5.2 | 15.2 | Nitrate reductase |
| 1.13.11.2 | 0.0 | 1.0 | 0.0 | 0.0 | 0.3 | 0.5 | 3.0 | 4.0 | 2.0 | 0.0 | 11.0 | 4.0 | 4.2 | 16.0 | Catechol 2,3-dioxygenase |
| 1.13.11.24 | 0.0 | 1.0 | 0.0 | 0.0 | 0.3 | 0.5 | 2.0 | 2.0 | 0.0 | 0.0 | 3.0 | 1.4 | 1.3 | 5.6 | Quercetin 2,3-dioxygenase |
| 1.14.12.1 | 2.0 | 0.0 | 0.0 | 0.0 | 0.5 | 1.0 | 3.0 | 0.0 | 1.0 | 1.0 | 3.0 | 1.6 | 1.3 | 3.2 | Anthranilate 1,2-dioxygenase (deaminating, decarboxylating) |
| 1.14.13.1 | 1.0 | 0.0 | 0.0 | 0.0 | 0.3 | 0.5 | 5.0 | 1.0 | 4.0 | 1.0 | 6.0 | 3.4 | 2.3 | 13.6 | Salicylate 1-monooxygenase |
| 1.14.13.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 4.0 | 1.0 | 3.0 | 2.0 | 1.4 | ∞ | Dimethyl aniline monooxygenase (N-oxide-forming) |
| 1.14.99.3 | 1.0 | 0.0 | 0.0 | 0.0 | 0.3 | 0.5 | 0.0 | 1.0 | 2.0 | 2.0 | 2.0 | 1.4 | 0.9 | 5.6 | Heme oxygenase (decyclizing) |
| 1.17.7.1 | 0.0 | 1.0 | 0.0 | 0.0 | 0.3 | 0.5 | 1.0 | 2.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.7 | 4.0 | 4-Hydroxy-3-methylbut-2-en-1-yldiphosphate synthase |
| 1.18.1.2 | 1.0 | 0.0 | 2.0 | 0.0 | 0.8 | 1.0 | 2.0 | 1.0 | 3.0 | 2.0 | 5.0 | 2.6 | 1.5 | 3.5 | Ferredoxin—NADP+ reductase |
| 1.18.1.3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 1.0 | 1.0 | 6.0 | 2.0 | 2.2 | ∞ | Ferredoxin—NAD+ reductase |
| 2.7.11.1 | 1.0 | 0.0 | 1.0 | 2.0 | 1.0 | 0.8 | 2.0 | 12.0 | 12.0 | 4.0 | 29.0 | 11.8 | 10.6 | 11.8 | Nonspecific serine/threonine protein kinase |
| 2.8.3.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 9.0 | 4.0 | 1.0 | 17.0 | 6.4 | 6.8 | ∞ | Propionate CoA-transferase |
| 2.10.1.1 | 0.0 | 1.0 | 0.0 | 0.0 | 0.3 | 0.5 | 1.0 | 2.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.7 | 4.0 | Molybdopterin molybdotransferase |
| 3.1.6.1 | 0.0 | 1.0 | 0.0 | 1.0 | 0.5 | 0.6 | 2.0 | 5.0 | 0.0 | 1.0 | 7.0 | 3.0 | 2.9 | 6.0 | Arylsulfatase |
| 3.3.2.1 | 1.0 | 0.0 | 1.0 | 0.0 | 0.5 | 0.6 | 1.0 | 4.0 | 1.0 | 0.0 | 1.0 | 1.4 | 1.5 | 2.8 | Isochorismatase |
| 4.3.3.7 | 0.0 | 0.0 | 1.0 | 0.0 | 0.3 | 0.5 | 5.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.2 | 2.2 | 4.8 | 4-Hydroxy-tetrahydrodipicolinate synthase |
| 6.2.1.1 | 2.0 | 0.0 | 0.0 | 0.0 | 0.5 | 1.0 | 2.0 | 2.0 | 6.0 | 1.0 | 15.0 | 5.2 | 5.8 | 10.4 | Acetate-CoA ligase |
| 6.2.1.3 | 1.0 | 1.0 | 1.0 | 0.0 | 0.8 | 0.5 | 4.0 | 5.0 | 6.0 | 2.0 | 9.0 | 5.2 | 2.6 | 6.9 | Long-chain-fatty-acid-CoA ligase |
| 6.2.1.26 | 0.0 | 1.0 | 1.0 | 0.0 | 0.5 | 0.6 | 1.0 | 1.0 | 1.0 | 0.0 | 3.0 | 1.2 | 1.1 | 2.4 |
Abbreviations: Ct, Corynebacterium terpenotabidum; EC no., Enzyme Commission number; Ef, Enterococcus faecalis; Gp, Gordonia polyisoprenivorans; Hs, Haemophilus somnus; Mm, Mycobacterium marinum; Ro, Rhodococcus opacus; Sm, Stenotrophomonas maltophilia; So, Streptococcus oralis; Ss, Staphylococcus saprophyticus.
Factor difference where AvGCr is for average of GC-rich and AvGCp is for average of GC-poor.
The shading regions in Table 7 is to improve the contrast between GC-rich (gray) and GC-poor (white) genomes.
Enzyme reactions of Table 7 classified by decreasing AvGCr/AvGCp.
| AvGCr/AvGCp[ | S. no. | EC no. | Enzymatic function |
|---|---|---|---|
| 10 to 16 | 1 | 1.13.11.2 | Catechol 2,3-dioxygenase |
| 2 | 1.7.99.4 | Nitrate reductase | |
| 3 | 1.14.13.1 | Salicylate 1-monooxygenase | |
| 4 | 2.7.11.1 | Nonspecific serine/threonine protein kinase | |
| 5 | 6.2.1.1 | Acetate—CoA ligase | |
| 5 to <10 | 1 | 1.2.1.27 | Methylmalonate-semialdehyde dehydrogenase |
| 2 | 1.2.1.8 | Betaine-aldehyde dehydrogenase | |
| 3 | 6.2.1.3 | Long-chain-fatty-acid-CoA ligase | |
| 4 | 3.1.6.1 | Arylsulfatase | |
| 5 | 1.1.99.1 | Choline dehydrogenase | |
| 6 | 1.2.1.3 | Aldehyde dehydrogenases | |
| 7 | 1.13.11.24 | Quercetin 2,3-dioxygenase | |
| 8 | 1.14.99.3 | Heme oxygenase—biliverdin-producing | |
| 9 | 1.4.1.13 | Glutamate synthase—NADPH | |
| 10 | 1.4.1.1 | Alanine dehydrogenase | |
| 4 to <5 | 1 | 4.3.3.7 | 4-Hydroxy-tetrahydrodipicolinate synthase |
| 2 | 1.2.1.2 | Formate dehydrogenase | |
| 3 | 1.6.1.2 | NAD(P)+ transhydrogenase—Re/Si-specific | |
| 4 | 1.3.99.1 | Succinate dehydrogenase | |
| 5 | 1.4.4.2 | Glycine dehydrogenase—aminomethyl-transferring | |
| 6 | 1.17.7.1 | Cytidine diphosphate-4-dehydro-6-deoxyglucose reductase | |
| 2 to <4 | 1 | 2.10.1.1 | Molybdopterin molybdotransferase |
| 2 | 1.2.1.7 | Benzaldehyde dehydrogenase | |
| 3 | 1.4.1.2 | Glutamate dehydrogenase | |
| 4 | 1.18.1.2 | Ferredoxin—NADP+ reductase | |
| 5 | 1.2.1.38 | ||
| 6 | 3.3.2.1 | Isochorismatase | |
| 7 | 6.2.1.26 | ||
| 8 | 1.2.1.10 | Acetaldehyde dehydrogenase | |
| 9 | 1.2.1.70 | Glutamyl-tRNA reductase | |
| <2 | 1 | 1.4.3.1 | |
| 2 | 1.5.3.1 | Sarcosine oxidase | |
| 3 | 1.14.13.8 | Flavin-containing monooxygenase | |
| 4 | 1.18.1.3 | Ferredoxin—NAD+ reductase | |
| 5 | 2.8.3.1 | Propionate CoA-transferase |
Abbreviation: EC no., Enzyme Commission number.
Factor difference where AvGCr is for average of GC-rich and AvGCp is for average of GC-poor.
Figure 1.Relative frequency of shotgun sequences associated with bacteria found in the digestive tract of triatomines. (A) Size of reads obtained by 454 Titanium technology, (B) contig size after successive read assembling with Velvet and CAP3, (C) size of ORFs extracted from read contigs, and (D) size of coding ORFs after UFM filtering. cORFs indicate coding open reading frames; ORFs, open reading frames.