| Literature DB >> 24950957 |
Fengming Han, Yong Peng, Lijia Xu, Peigen Xiao1.
Abstract
BACKGROUND: Single copy genes are common across angiosperm genomes. With the sufficiently high quality sequenced genomes, the identification of large-scale single copy genes among multiple species is possible. Although some characteristics have been reported, our study provides novel insights into single copy genes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24950957 PMCID: PMC4092219 DOI: 10.1186/1471-2164-15-504
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Identification of single copy genes and the predicted replicate blocks within each lineage
| Species | Total number of genes | Number of single copy genes | Singletons | Number of replicate blocks |
|---|---|---|---|---|
|
| 24823 | 4962 | 1886 | 65 |
|
| 32670 | 4863 | 1710 | 89 |
|
| 27382 | 4285 | 919 | 211 |
|
| 26552 | 3987 | 733 | 151 |
|
| 41019 | 3098 | 1366 | 745 |
|
| 26521 | 3613 | 393 | 179 |
|
| 27584 | 8292 | 3377 | 60 |
|
| 24533 | 4550 | 876 | 131 |
|
| 25379 | 4883 | 946 | 111 |
|
| 21458 | 4778 | 1309 | 149 |
|
| 36376 | 6897 | 3930 | 316 |
|
| 32831 | 6132 | 3547 | 122 |
|
| 45272 | 3687 | 2348 | 777 |
|
| 37505 | 3823 | 1456 | 801 |
|
| 30666 | 3788 | 672 | 476 |
|
| 26718 | 4081 | 724 | 266 |
|
| 55565 | 9995 | 6829 | 160 |
|
| 27197 | 3760 | 554 | 342 |
|
| 40668 | 4435 | 2502 | 544 |
|
| 27864 | 3922 | 689 | 122 |
|
| 31221 | 11095 | 7206 | 115 |
|
| 35471 | 7608 | 4130 | 154 |
|
| 34727 | 6274 | 2687 | 317 |
|
| 35119 | 5665 | 2388 | 238 |
|
| 33032 | 6761 | 3399 | 152 |
|
| 26351 | 3676 | 511 | 177 |
|
| 29408 | 7226 | 3434 | 139 |
|
| 26346 | 5684 | 2501 | 143 |
|
| 39656 | 6184 | 3307 | 383 |
The singleton is referred as a single copy gene which no orthologs are found in other lineages.
Figure 1GO term enrichments of united all single copy genes across 29 angiosperm genomes. The vertical axis shows the percentage of annotated sequences within each GO categories. Only the top six GO terms are showed. CC: Cellular Component; BP: Biological Process; MF: Molecular Function.
Figure 2Distribution of alternative splicing (AS) events between single copy genes and other family genes. The vertical axis represents the proportion of genes for which AS is found. The horizontal axis represents the different numbers of AS. The significance of AS differences between single copy genes and family genes is calculated by the use of the analysis of variance (ANOVA) with subsequent post-hoc tests (Tukey’s HSD).
Figure 3Maximum likelihood (ML) phylogenetic tree bases on the 12 shared single copy genes. Numbers above nodes are bootstrap proportions from 1000 pseudoreplicates. O. lucimarinus, M. pusilla, V. carteri and P. patens were used as outgroups. The letters embedded below nodes: A = Grass; B = Eudicots; C = Asterids; D = Rosids; E = Malvidae; F = Fabidae; G = Brassicaceae. The double slashes indicate outgroups.