| Literature DB >> 24625962 |
Vincent P Richards1, Sara R Palmer, Paulina D Pavinski Bitar, Xiang Qin, George M Weinstock, Sarah K Highlander, Christopher D Town, Robert A Burne, Michael J Stanhope.
Abstract
The genus Streptococcus comprises important pathogens that have a severe impact on human health and are responsible for substantial economic losses to agriculture. Here, we utilize 46 Streptococcus genome sequences (44 species), including eight species sequenced here, to provide the first genomic level insight into the evolutionary history and genetic basis underlying the functional diversity of all major groups of this genus. Gene gain/loss analysis revealed a dynamic pattern of genome evolution characterized by an initial period of gene gain followed by a period of loss, as the major groups within the genus diversified. This was followed by a period of genome expansion associated with the origins of the present extant species. The pattern is concordant with an emerging view that genomes evolve through a dynamic process of expansion and streamlining. A large proportion of the pan-genome has experienced lateral gene transfer (LGT) with causative factors, such as relatedness and shared environment, operating over different evolutionary scales. Multiple gene ontology terms were significantly enriched for each group, and mapping terms onto the phylogeny showed that those corresponding to genes born on branches leading to the major groups represented approximately one-fifth of those enriched. Furthermore, despite the extensive LGT, several biochemical characteristics have been retained since group formation, suggesting genomic cohesiveness through time, and that these characteristics may be fundamental to each group. For example, proteolysis: mitis group; urea metabolism: salivarius group; carbohydrate metabolism: pyogenic group; and transcription regulation: bovis group.Entities:
Keywords: comparative genomics; enrichment; gene gain and loss; lateral gene transfer; phylogenetics
Mesh:
Year: 2014 PMID: 24625962 PMCID: PMC4007547 DOI: 10.1093/gbe/evu048
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Genome Sequence Details
| Species | Source | Tissue/Presentation | Accession No. |
|---|---|---|---|
| Human | Neonate | NC_007432 | |
| Bovine | Mastitis | AEXT01 | |
| Human | Rectal biopsy | ADME01 | |
| Human | Saliva | AEQR01 | |
| Human | Synovial fluid | AEEL01 | |
| Bovine | Mastitis | AIDX01 | |
| Human | Throat | AFUP01 | |
| Hamster | Caries lesion | AEUV02 | |
| Human | Periodontal abscess | AEVC01 | |
| Human | Oral cavity | AEKN01 | |
| Human | Toxic shock syndrome | NC_012891 | |
| Bovine | Mastitis | AEGO01 | |
| Horse | Strangles | NC_012471 | |
| Sheep | Mastitis | AWEX01 | |
| Human | Nephritis | NC_011134 | |
| Human | Gastrointestinal tract | AEVB01 | |
| Human | Blood | NC_013798 | |
| Human? | Blood/endocarditis valve? | NC_009785 | |
| Catfish | AEUX02 | ||
| Human | Feces | ABJK01 | |
| Human | Tooth surface and pharynx | AEVD01 | |
| Human | Blood | AMOO01 | |
| Human | Dental plaque | AFXO01 | |
| Macacae | Dental plaque | AEUW02 | |
| Cheese | NC_016749 | ||
| Human | Blood | NC_013853 | |
| Human | Oral cavity | NC_004350 | |
| Human | Oral cavity | NC_015291 | |
| Human | Oral cavity | NC_015678 | |
| Flounder | NC_015558 | ||
| Bovine | Mastitis | AEUT01 | |
| Human | Blood | NC_015600 | |
| Human | Tooth surface and pharynx | AEVF01 | |
| Human | Nasopharyngeal | NC_014498 | |
| Swine | Hemorrhagic lymph nodes | AEUU01 | |
| Human | Sputum; HIV | NC_015875 | |
| Human | Female genitourinary tract | AEUY02 | |
| Human | Throat | NC_006086 | |
| Rat | Oral cavity | AJTZ01 | |
| Human | Blood | NC_015760 | |
| Human | Dental plaque | NC_009009 | |
| Human | Toxic shock syndrome | NC_009442 | |
| Yogurt | NC_006449 | ||
| Bovine | Mastitis | NC_012004 | |
| Human | Urine | AEUZ02 | |
| Human | Oral cavity | AEVI01 | |
| Chicken | Crop | NC_014106 |
FPhylogenies derived from a core set of 136 genes. Left: Consensus of the phylogenetic signal from each gene (numbers on branches show the proportion of genes that support a particular grouping). Right: ML phylogeny derived from a concatenation of the genes (numbers on branches show bootstrap support). Each of the major eight groups is color shaded. The putative downei group is shown with a dashed line. Previous nomenclature for species within the bovis group is shown within parentheses.
Genome Characteristics for the Eight Streptococcus Species Sequenced as Part of This Study
| Species | Contigs | Base Pairs | CDS | rRNA | tRNA | %GC |
|---|---|---|---|---|---|---|
| 1 | 2,025,881 | 1,956 | 15 | 59 | 36.8 | |
| 1 | 1,916,985 | 1,871 | 15 | 65 | 41 | |
| 1 | 2,130,431 | 2,207 | 14 | 60 | 37.8 | |
| 2 | 2,417,851 | 2,282 | 15 | 61 | 42.2 | |
| 3 | 1,246,519 | 2,025 | 10 | 55 | 36.8 | |
| 5 | 1,816,306 | 2,027 | 15 | 57 | 37.1 | |
| 8 | 2,234,402 | 2,473 | 9 | 45 | 38.2 | |
| 133 | 2,140,742 | 2,127 | 4 | 43 | 41.4 |
FResult of gene gain/loss analysis. Boxes on nodes and tips of the phylogeny show genome size (genome size calculations exclude one MCL gene cluster that was excluded from the analysis—see Materials and Methods). Numbers on branches show the number of gains, losses, and the overall balance (gain–loss: balance), with red indicating an overall gain and blue an overall loss. Color shading for each major Streptococcus group follows figure 1.
Gene Gain/Loss Summary
| Group | Gains | Losses | Balance |
|---|---|---|---|
| Mitis | 1,059 | 1,208 | −149 |
| Sanguinis | 343 | 228 | 115 |
| Anginosus | 187 | 458 | −271 |
| Salivarius | 397 | 540 | −143 |
| Downei | 236 | 233 | 3 |
| Mutans | 268 | 364 | −96 |
| Pyogenic | 2,476 | 2,951 | −475 |
| Bovis | 657 | 851 | −194 |
| Mitis | 3,375 | 1,504 | 1,871 |
| Sanguinis | 1,532 | 1,007 | 525 |
| Anginosus | 2,202 | 774 | 1,428 |
| Salivarius | 1,604 | 454 | 1,150 |
| Downei | 1,384 | 268 | 1,116 |
| Mutans | 1,376 | 700 | 676 |
| Pyogenic | 7,935 | 3,866 | 4,069 |
| Bovis | 2,587 | 1,157 | 1,430 |
aGain/loss for each group including the branches leading to each group and excluding the terminal branches.
bGain/loss for terminal branches for each group.
FHierarchal clustering among genomes using presence/absence of MCL gene clusters. Approximately unbiased P values are shown on branches. Color shading for each major Streptococcus group follows figure 1.