| Literature DB >> 23734211 |
Lei Zhao1, Ning Zhang, Peng-Fei Ma, Qi Liu, De-Zhu Li, Zhen-Hua Guo.
Abstract
BEP clade of the grass family (Poaceae) is composed of three subfamilies, i.e. Bambusoideae, Ehrhartoideae, and Pooideae. Controversies on the phylogenetic relationships among three subfamilies still persist in spite of great efforts. However, previous evidence was mainly provided from plastid genes with only a few nuclear genes utilized. Given different evolutionary histories recorded by plastid and nuclear genes, it is indispensable to uncover their relationships based on nuclear genes. Here, eleven species with whole-sequenced genome and six species with transcriptomic data were included in this study. A total of 121 one-to-one orthologous groups (OGs) were identified and phylogenetic trees were reconstructed by different tree-building methods. Genes which might have undergone positive selection and played important roles in adaptive evolution were also investigated from 314 and 173 one-to-one OGs in two bamboo species and 14 grass species, respectively. Our results support the ((B, P) E) topology with high supporting values. Besides, our findings also indicate that 24 and nine orthologs with statistically significant evidence of positive selection are mainly involved in abiotic and biotic stress response, reproduction and development, plant metabolism and enzyme etc. from two bamboo species and 14 grass species, respectively. In summary, this study demonstrates the power of phylogenomic approach to shed lights on the evolutionary relationships within the BEP clade, and offers valuable insights into adaptive evolution of the grass family.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23734211 PMCID: PMC3667173 DOI: 10.1371/journal.pone.0064642
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistical summary of contigs and final transcripts assembled by Trinity and TGICL.
| Contigs | Final Transcripts | |
| Number | 111,937 | 39,075 |
| Min length | 100 | 100 |
| Max length | 10,203 | 13,370 |
| Mean | 564 | 747 |
| Median | 339 | 523 |
| N50 | 956 | 1,192 |
| N90 | 235 | 363 |
N50 and N90 are defined as the length of the smallest contig N in the sorted list of all contigs where the cumulative length from the largest contig to contig N is at least 50% and 90% of the total length, respectively.
Figure 1Phylogenetic relationships of the BEP Clade.
Phylogenomic trees were inferred by the concatenation analyses, PAUP, RAxML and MrBayes. Species trees were also estimated by the coalescent method, MP-EST. The bootstrap values above the horizontal are based on protein, while the values below are based on nucleotide data. “*” indicates support values of posterior probabilities (PP) = 1.0 and bootstrap (BP) = 100. “#” indicates all support values of PP = 1.0 and BP = 100. Support values are shown for nodes as maximum parsimony bootstrap/maximum likelihood bootstrap/Bayesian inference posterior probability/maximum pseudo-likelihood model bootstrap. Branch lengths were estimated through protein super-matrix using Bayesian analysis, and scale bar denotes substitutions per site.
Statistical confidence (P values) for alternative phylogenomic hypothesis of the BEP Clade from the concatenation analyses.
| P Values | |||||
| Data Type | Method | Hypothesis | AU | SH | WSH |
| amino acid | MP | ((BP)E) | 0.687 | 0.869 | 0.869 |
| ML | ((BP)E) | 0.101 | 0.762 | 0.760 | |
| BI | ((BP)E) | 0.474 | 0.722 | 0.716 | |
| NJ | ((BE)P) | 0.010 | 0.011 | 0.011 | |
| nucleotide | MP | ((BP)E) | 0.581 | 0.843 | 0.837 |
| ML | ((BP)E) | 0.604 | 0.507 | 0.615 | |
| BI | ((BP)E) | 0.517 | 0.547 | 0.640 | |
| NJ | ((BP)E) | 0.442 | 0.598 | 0.574 | |
Figure 2Distributions of Ka and Ks in 314 D. latiflorus – P. edulis OGs.
The threshold of Ka/Ks = 0.5 was used to detect candidate genes that may have been subjected to positive selection.
24 OGs with evidence for strong and weak positive selection in two bamboos.
| OGs ID |
|
| ω | Protein Function |
| OG8_14182 | Unigene22484 | ped_9008 | 2.5442 | expressed protein |
| OG8_14199 | Unigene818 | ped_1049 | 1.2837 | expressed protein |
| OG8_12558 | Unigene8135 | ped_6068 | 1.1362 | nodal modulator 1 precursor (PM5) |
| OG8_12263 | Unigene12950 | ped_5482 | 0.9245 | SNARE associated Golgi protein |
| OG8_12252 | Unigene6857 | ped_10366 | 0.8831 | transporter family protein |
| OG8_13258 | Unigene2231 | ped_10635 | 0.7864 | ENTH domain containing protein |
| OG8_13720 | Unigene7866 | ped_11081 | 0.7650 | TCP family transcription factor |
| OG8_13530 | Unigene16621 | ped_201 | 0.7525 | SNF7 domain containing protein |
| OG8_14766 | Unigene5371 | ped_3344 | 0.7216 | expressed protein |
| OG8_12173 | Unigene16907 | ped_5672 | 0.6769 | Aminotransferase domain containing protein |
| OG8_12874 | Unigene17978 | ped_10636 | 0.6263 | cold acclimation protein WCOR413 |
| OG8_15107 | Unigene14640 | ped_9017 | 0.6055 | protein kinase |
| OG8_14741 | Unigene14419 | ped_5994 | 0.6014 | expressed protein |
| OG8_13892 | Unigene32406 | ped_1256 | 0.5802 | expressed protein |
| OG8_14278 | Unigene5954 | ped_11333 | 0.5774 | expressed protein |
| OG8_15094 | Unigene11753 | ped_9836 | 0.5754 | 40S ribosomal protein S15a |
| OG8_13355 | Unigene16515 | ped_3263 | 0.5726 | glycosyltransferase family 43 protein |
| OG8_14698 | Unigene34057 | ped_6097 | 0.5388 | RNA recognition motif containing protein |
| OG8_13653 | Unigene14322 | ped_9421 | 0.5334 | expressed protein |
| OG8_13330 | Unigene15784 | ped_6343 | 0.5333 | RNA recognition motif containing protein |
| OG8_13031 | Unigene18530 | ped_5910 | 0.5327 | ras-related protein |
| OG8_15048 | Unigene14261 | ped_1500 | 0.5207 | eukaryotic translation initiation factor |
| OG8_12853 | Unigene3778 | ped_7614 | 0.5055 | zinc finger, C3HC4 type domain containing protein |
| OG8_13856 | Unigene16340 | ped_939 | 0.5054 | expressed protein |
Site Models used for detecting positively selected sites.
| OGs_ID | M1a vs M2a | M7 vs M8 | Protein Function | ||||
| LRT |
| Sites | LRT |
| Sites | ||
| OG8_13653 | 6.9040 | 0.0317 |
| 8.0162 | 0.0182 |
| homologous-pairing protein meu13 |
| OG8_12939 | 6.5458 | 0.0379 |
| 7.8069 | 0.0202 |
| expressed protein |
| OG8_13485 | 6.2762 | 0.0434 |
| 6.3493 | 0.0418 |
| expressed protein |
| OG8_14174 | 9.9498 | 0.0069 |
| glyoxalase family protein | |||
| OG8_14288 | 7.6284 | 0.0221 | 10E 12V 59A | gamma-glutamyl hydrolase precursor | |||
| OG8_14337 | 6.5172 | 0.0384 |
| OsClp8 - Putative Clp protease homologue | |||
| OG8_14931 | 7.1406 | 0.0281 | 86Q | DNA-directed RNA polymerase subunit | |||
| OG8_14221 | 7.3265 | 0.0256 | 60R 87E | transcriptional regulator | |||
| OG8_14202 | 6.7371 | 0.0344 | 5S 205T 206E | L1P family of ribosomal proteins domain protein | |||
P value of LRT between models M1a vs M2a or M7 vs M8.
When P value of LRT is lower 0.05, positively selected sites (PSS) estimated by BEB (BEB Value >85%), and sites in bold have BEB Value >95%.
Figure 3GO classification for 314 orthologs of D. latiflorus – P. edulis.
Figure 4GO classification for 173 orthologs of Poaceae.