| Literature DB >> 26151849 |
Fengqi Li1, Depan Cao2, Yang Liu3, Ting Yang4, Guirong Wang5.
Abstract
The identification of genes under positive selection is a central goal of evolutionary biology. Many legume species, including Phaseolus vulgaris (common bean) and Phaseolus lunatus (lima bean), have important ecological and economic value. In this study, we sequenced and assembled the transcriptome of one Phaseolus species, lima bean. A comparison with the genomes of six other legume species, including the common bean, Medicago, lotus, soybean, chickpea, and pigeonpea, revealed 15 and 4 orthologous groups with signatures of positive selection among the two Phaseolus species and among the seven legume species, respectively. Characterization of these positively selected genes using Non redundant (nr) annotation, gene ontology (GO) classification, GO term enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses revealed that these genes are mostly involved in thylakoids, photosynthesis and metabolism. This study identified genes that may be related to the divergence of the Phaseolus and legume species. These detected genes are particularly good candidates for subsequent functional studies.Entities:
Keywords: legume; lima bean; positive selection; transcriptome
Mesh:
Year: 2015 PMID: 26151849 PMCID: PMC4519893 DOI: 10.3390/ijms160715172
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Statistics of unigenes of lima bean transcripts obtained from assembly
| Assembly | Statistics |
|---|---|
| Number of unigenes | 96,248 |
| Large unigenes (≥1000 bp) | 29,556 |
| Max unigene length (bp) | 20,020 |
| Mean unigene length (bp) | 949 |
| N50 length (bp) | 1795 |
| Total bases (MB) | 87.12 |
Figure 1Distribution of ka and ks in 550 P. vulgaris–P. lunatus OGs. The threshold of ka/ks = 1 indicates positive selection. The analysis was performed using the Yang and Nielsen method [24].
Site Models for positively selected genes in seven legume species.
| OGs_ID | M1a | Sites a | M7 | Sites a | Protein Function | ||
|---|---|---|---|---|---|---|---|
| LRT | LRT | ||||||
| OG_9805 | 9.03 × 10−8 | 1.72 × 10−6 | 62N 89I 96L 128F 138Q | 6.00 × 10−12 | 1.49 × 10−10 | 62N 89I 96L 128F 138Q | Alanine-tRNA ligase |
| OG_9947 | 4.66 × 10−4 | 0.004 | 320V 483Q | 1.62 × 10−4 | 0.0013 | 320V 480S 483Q | Ceramide synthase lag1 |
| OG_10941 | 0.005 | 0.021 | 21G | 0.001 | 0.004 | 21G | Acyl-CoA |
| OG_10618 | 0.011 | 0.035 | 75W 106F | 0.003 | 0.0074 | 75W 106F | Pollen Ole e 1 allergen and extension family protein |
| OG_9713 | 0.006 | 0.021 | NS b | 2.28 × 10−4 | 0.0014 | 92I 177V | Mitochondrial inner membrane translocase, subunit tim44-related protein |
| OG_10037 | 0.003 | 0.019 | NS | 7.34 × 10−4 | 0.0028 | 17P 127M | Bromodomain transcription factor |
| OG_10186 | 0.022 | 0.058 | 379 A | 8.80 × 10−5 | 0.0011 | 310V 348K 379A 516S | 2,3-Bisphosphoglycerate-independent, phosphoglycerate mutase |
| OG_10467 | 0.024 | 0.058 | 32 G | 0.01 | 0.016 | 32G | Uncharacterized protein |
| OG_10095 | 0.212 | 0.287 | NS | 0.021 | 0.0298 | 100S | Hnh endonuclease |
| OG_10235 | 0.030 | 0.062 | NS | 6.88 × 10−4 | 0.0028 | 243D 291H | Golgin candidate 2 |
| OG_10350 | 0.054 | 0.085 | NS | 0.009 | 0.0155 | 180F | Pentatricopeptide repeat-containing protein |
| OG_10623 | 0.270 | 0.342 | NS | 0.006 | 0.0112 | 98P | Dessication-induced 1voc-like protein |
| OG_10631 | 1.000 | 1.000 | NS | 0.013 | 0.0191 | 323A | Endoribonuclease dicer-like 2 |
| OG_10744 | 1.000 | 1.000 | NS | 0.004 | 0.008 | 489L | Transmembrane amino acid transporter family protein |
| OG_9360 | 1.000 | 1.000 | NS | 0.005 | 0.0106 | 772N | Pseudouridine synthase family protein |
| OG_9471 | 0.033 | 0.062 | NS | 0.002 | 0.004 | 28R | DnaJ domain-containing protein |
| OG_9474 | 0.432 | 0.513 | NS | 0.026 | 0.0352 | 309I 327I | Cation calcium exchanger 5 |
| OG_9589 | 0.042 | 0.073 | NS | 8.14 × 10−4 | 0.0028 | 797D | Histone-lysine |
| OG_9889 | 0.100 | 0.147 | NS | 0.039 | 0.0492 | 53M | Uncharacterized protein |
a When p values of LRT and FDR were below 0.05, positively selected sites were estimated by BEB (BEB value > 95%).; b NS, not significant.
Figure 2GO classifications for 550 OGs of P. vulgaris–P. lunatus
The Over-Represented GO terms enriched in the test dataset compared with the reference dataset.
| Gene Class | GO ID | GO Term | FWER | Frequency a in Test Set | Frequency in Reference Set | |
|---|---|---|---|---|---|---|
| Positive OGs between | GO:0009579 | Thylakoid | 0.012326 | 0.001462 | 3/15 | 11/422 |
| Positive OGs between | GO:0015979 | Photosynthesis | 0.174733 | 0.018151 | 2/15 | 10/422 |
| Positive OGs between | GO:0006091 | Generation of precursor metabolites and energy | 0.272852 | 0.0280331 | 2/15 | 13/422 |
| Positive OGs among legumes | GO:0016746 | Transferring acyl groups | 0.0350915 | 0.00329338 | 2/4 | 8/400 |
a The number of times the term was found/the number of OGs that were in the set.
Figure 3GO classification for 413 OGs of legumes.