| Literature DB >> 15059260 |
Ravindra Pushker1, Alex Mira, Francisco Rodríguez-Valera.
Abstract
BACKGROUND: The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appears to have a fundamental role in this, but the factors influencing gene family size are unclear.Entities:
Mesh:
Year: 2004 PMID: 15059260 PMCID: PMC395786 DOI: 10.1186/gb-2004-5-4-r27
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Relationship between percentage of genes belonging to paralogous families plotted versus genome size in 127 eubacterial genomes. Inset shows the average gene family size versus genome size for the same genomes, except Shigella flexneri, Bordetella pertusis, B. parapertussis and B. bronchiseptica, which contain a high number of IS elements. Some genomes with atypical values are identified: Mpn, Mycoplasma pneumoniae; Mpt, Mycoplasma penetrans; Mga, Mycoplasma gallisepticum; Mlp, Mycobacterium leprae; Pir, Pirellula sp.
Figure 2Gene family sizes in genomes undergoing reductive evolution compared to a phylogenetically related larger sequenced genome. (a) Mycobacterium leprae (reductive) vs Mycobacterium tuberculosis H37Rv; (b) Shigella flexneri (reductive) vs Escherichia coli K12. Orthologous genes in the genome pairs (identified by amino-acid sequence similarity) are displayed in arbitrary order and plotted against the number of homologs in their own genome (that is, paralogs). Only protein-coding genes are included. IS elements from S. flexneri 2a are excluded.
Figure 3The number of members in E. coli K12 gene families plotted versus mean sequence identity of pairwise comparisons among the members of each family.
Figure 4Gene family sizes for homologous genes in groups of strains belonging to the same species, represented as in Figure 2. (a) Chlamydophila pneumoniae strains; (b) Streptococcus pyogenes strains; (c) Escherichia coli strains; (d) Staphylococcus aureus strains. Strain denomination and graph code displayed in the top right-hand corner. Only protein-coding genes are included. Zero on the y-axis indicates single-copy genes; 1 indicates a gene family formed of two members.
Figure 5Gene family sizes for homologous protein-coding genes in different species of the same genus. (a) Pseudomonas spp; (b) Bacillus spp. (c) Difference in the size of equivalent gene families between E. coli K12 and S. typhimurium LT2. Positive values indicate larger families in E. coli; negative values indicate larger families in S. typhymurium. The potG gene family is indicated.
Figure 6Proportions of assigned functions among genes belonging to families and singletons in B. subtilis and E. coli K12. Gene functions were assigned according to the Cluster of Orthologous Genes (COGs) classification [41]. Extended gene families are considered, in which a gene belongs to a single family only (see Materials and methods).
Species used in the current work and their accession numbers
| Species | Accession number | Genome size (bp) |
| NC_003062 | 2,841,581 | |
| NC_003304 | 2,841,490 | |
| NC_000918 | 1,551,335 | |
| NC_003997 | 5,227,293 | |
| NC_004722 | 5,411,809 | |
| NC_002570 | 4,202,353 | |
| NC_000964 | 4,214,814 | |
| NC_004663 | 6,260,361 | |
| NC_004307 | 2,256,646 | |
| NC_002927 | 5,339,179 | |
| NC_002928 | 4,773,551 | |
| NC_002929 | 4,086,189 | |
| NC_001318 | 910,724 | |
| NC_004463 | 9,105,828 | |
| NC_003317 | 2,117,144 | |
| NC_004310 | 2,107,792 | |
| NC_002528 | 640,681 | |
| NC_004545 | 615,980 | |
| NC_004061 | 641,454 | |
| NC_002163 | 1,641,481 | |
| NC_005061 | 705,557 | |
| NC_002696 | 4,016,947 | |
| NC_002620 | 1,072,950 | |
| NC_000117 | 1,042,519 | |
| NC_003361 | 1,173,390 | |
| NC_002179 | 1,229,858 | |
| NC_000922 | 1,230,230 | |
| NC_002491 | 1,226,565 | |
| NC_005043 | 1,225,935 | |
| NC_002932 | 2,154,946 | |
| NC_005085 | 4,751,080 | |
| NC_003030 | 3,940,880 | |
| NC_003366 | 3,031,430 | |
| NC_004557 | 2,799,251 | |
| NC_002935 | 2,488,635 | |
| NC_004369 | 3,147,090 | |
| NC_003450 | 3,309,401 | |
| NC_002971 | 1,995,275 | |
| NC_001263 | 2,648,638 | |
| NC_004668 | 3,218,031 | |
| NC_004431 | 5,231,428 | |
| NC_000913 | 4,639,221 | |
| NC_002695 | 5,498,450 | |
| NC_002655 | 5,528,445 | |
| NC_003454 | 2,174,500 | |
| NC_005125 | 4,659,019 | |
| NC_002940 | 1,698,955 | |
| NC_000907 | 1,830,138 | |
| NC_004917 | 1,799,146 | |
| NC_000915 | 1,667,867 | |
| NC_000921 | 1,643,831 | |
| NC_004567 | 3,308,274 | |
| NC_002662 | 2,365,589 | |
| NC_004342 | 4,332,241 | |
| NC_003212 | 3,011,208 | |
| NC_003210 | 2,944,528 | |
| NC_002678 | 7,036,074 | |
| NC_002945 | 4,345,492 | |
| NC_002677 | 3,268,203 | |
| NC_002755 | 4,403,836 | |
| NC_000962 | 4,411,529 | |
| NC_004829 | 996,422 | |
| NC_000908 | 580,074 | |
| NC_004432 | 1,358,633 | |
| NC_000912 | 816,394 | |
| NC_002771 | 963,879 | |
| NC_003112 | 2,272,351 | |
| NC_003116 | 2,184,406 | |
| NC_004757 | 2,812,094 | |
| NC_003272 | 6,413,771 | |
| NC_004193 | 3,630,528 | |
| NC_002663 | 2,257,487 | |
| NC_005126 | 5,688,987 | |
| NC_005027 | 7,145,576 | |
| NC_002950 | 2,343,476 | |
| NC_005071 | 2,410,873 | |
| NC_005042 | 1,751,080 | |
| NC_005072 | 1,657,990 | |
| NC_002516 | 6,264,403 | |
| NC_002947 | 6,181,863 | |
| NC_004578 | 6,397,126 | |
| NC_003295 | 3,716,413 | |
| NC_003103 | 1,268,755 | |
| NC_000963 | 1,111,523 | |
| NC_003198 | 4,809,037 | |
| NC_004631 | 4,791,961 | |
| NC_003197 | 4,857,432 | |
| NC_004347 | 4,969,803 | |
| NC_004741 | 4,599,354 | |
| NC_004337 | 4,607,203 | |
| NC_003047 | 3,654,135 | |
| NC_003923 | 2,820,462 | |
| NC_002758 | 2,878,040 | |
| NC_002745 | 2,814,816 | |
| NC_004461 | 2,499,279 | |
| NC_004116 | 2,160,267 | |
| NC_004368 | 211,485 | |
| NC_004350 | 203,0921 | |
| NC_003098 | 2,038,615 | |
| NC_003028 | 2,160,837 | |
| NC_002737 | 1,852,441 | |
| NC_004070 | 1,900,521 | |
| NC_003485 | 1,895,017 | |
| NC_004606 | 1,894,275 | |
| NC_003155 | 9,025,608 | |
| NC_003888 | 8,667,507 | |
| NC_005070 | 2,434,428 | |
| NC_000911 | 3,573,470 | |
| NC_003869 | 2,689,445 | |
| NC_004113 | 2,593,857 | |
| NC_000853 | 1,860,725 | |
| NC_000919 | 1,138,011 | |
| NC_004551 | 925,938 | |
| NC_004572 | 927,303 | |
| NC_002162 | 751,719 | |
| NC_002505 | 2,961,149 | |
| NC_004603 | 3,288,558 | |
| NC_004459 | 3,281,945 | |
| NC_005139 | 3,354,505 | |
| NC_004344 | 697,724 | |
| NC_005090 | 2,110,355 | |
| NC_003919 | 5,175,554 | |
| NC_003902 | 5,076,188 | |
| NC_002488 | 2,679,306 | |
| NC_004556 | 2,519,802 | |
| NC_003143 | 4,653,728 | |
| NC_004088 | 4,600,755 |