| Literature DB >> 22264329 |
Emmanuel Buschiazzo1, Carol Ritland, Jörg Bohlmann, Kermit Ritland.
Abstract
BACKGROUND: Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set.Entities:
Mesh:
Year: 2012 PMID: 22264329 PMCID: PMC3328258 DOI: 10.1186/1471-2148-12-8
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Substitution rates in conifer protein-coding genes compared to angiosperm genes
| Pairwise comparison | Gene number | dS | d4 | dN | dN/dS | |||
|---|---|---|---|---|---|---|---|---|
| Gymnosperms: | 3,723 | 0.1908 | 0.1769 | 0.0492 | 0.68 | 0.64 | 0.18 | 0.3137 |
| Sitka spruce | ||||||||
| Loblolly pine | ||||||||
| Angiosperms: | 4,080 | 2.1846 | 2.0057 | 0.2019 | 9.93 | 9.12 | 0.92 | 0.0924 |
| 17.02 | 15.63 | 1.57 | ||||||
| Poplar | 2.84 | 2.61 | 0.26 | |||||
| Angiosperm:conifers | 11.4:1 | 11.4:1 | 4.1:1 | 14.6:1 | 14.4:1 | 5.2:1 | 1:3.4 | |
| 25.0:1 | 24.7:1 | 9.0:1 | ||||||
| Poplar:conifers | 4.2:1 | 4.1:1 | 1.5:1 | |||||
Mean genetic distances at synonymous (dS), 4-fold degenerate (d4) and non-synonymous (dN) sites are expressed as a number of substitutions per site. Absolute substitution rates are expressed in substitutions per synonymous (μS), four-fold degenerate (μ4D) and non-synonymous (μN ) site per year. Species-specific rates for angiosperms were estimated based on the 1:6 difference in evolutionary rate between poplar and Arabidopsis [57,58].
Conifer genes involved in defense, resistance and response against insects with dN/dS > 1
| Spruce clone ID | Pine UniGene ID | dN/dS | UniProt ID | Species | Putative function |
|---|---|---|---|---|---|
| WS02821_B21 | DT625383 | 7.3061 | A7P5L0 | Protein phosphatase/Serine/threonine phosphatases | |
| WS0297_D22 | CX645632 | 7.0185 | A7QNM9 | leucine-rich repeat family protein/binding protein | |
| WS02725_C02 | DR097823 | 6.5839 | A7P656 | Protein phosphatase 2C/hydrolase/metal-binding | |
| WS02757_H19 | DR165429 | 4.4902 | Q9SE11 | Chloroplast-localized small heat shock protein (HSP20) family | |
| WS02758_N18 | DR160912 | 4.4589 | Q0DTD2 | Heat shock protein DnaJ | |
| WS02741_E07 | DT634060 | 3.3785 | Q588B8 | Glycoside Hydrolase Family 17 | |
| WS02761 N01 | CO365391 | 3.0817 | A7PWA7 | Heat shock protein DnaJ | |
| WS02817_M06 | DR093347 | 2.9656 | A7NWZ2 | serine/threonine-specific protein kinase | |
| WS0272_J12 | DR015390 | 2.3311 | A7QFY4 | Heat shock protein DnaJ | |
| WS0454_E20 | DR049906 | 2.2728 | A0MMD5 | Xyloglucan endotransglycosylase (Glycoside hydrolase family) | |
| WS02774_M01 | DR060506 | 2.1005 | Q6VAA9 | UDP-glycosyltransferase | |
| WS02749_F04 | CO164226 | 1.7421 | Q9MA24 | Glycosyltransferase | |
| WS0288_C08 | DR022129 | 1.5822 | A7QTB5 | Glycoside hydrolase | |
| WS0292_O15 | DR681862 | 1.294 | A7P0R3 | heat shock protein (hsp70) | |
| WS02729_N15 | DR689530 | 1.1356 | Q8LHS7 | Histone deacetylase | |
| WS02716_E18 | AI784893 | 1.1314 | A5AWM3 | Pathogenesis-related transcriptional activator PTI6 | |
| WS0298_F15 | DT638459 | 1.0692 | A7QTU5 | Glycosyltransferase | |
| WS02725_E03 | U39301 | 1.0481 | A0ERF9 | Caffeic acid ortho-methyltransferase | |
Figure 1Distribution of evolutionary estimates for conifer and angiosperm protein-coding genes. A. Smoothed density plots of dS and dN estimates. B. Histogram plots of dN/dS estimates. Conifer genes with dN/dS > 2 (n = 38) are not shown. Breaks = 200.
Figure 2dN/dS estimates in conifer and angiosperm genes across . Mean dN/dS values for conifer (full circle) and angiosperm (open circle) protein-coding genes. Conifer genes were BLASTed against Arabidopsis gene transcripts, whose GO Slim annotations were used for homologous conifer genes. Brackets represent the standard error of the mean. A: Biological processes; B: Molecular functions; C: Cellular component.