| Literature DB >> 21470968 |
A N Massa1, H Wanjugi, K R Deal, K O'Brien, F M You, R Maiti, A P Chan, Y Q Gu, M C Luo, O D Anderson, P D Rabinowicz, J Dvorak, K M Devos.
Abstract
Nine different regions totaling 9.7 Mb of the 4.02 Gb Aegilops tauschii genome were sequenced using the Sanger sequencing technology and compared with orthologous Brachypodium distachyon, Oryza sativa (rice), and Sorghum bicolor (sorghum) genomic sequences. The ancestral gene content in these regions was inferred and used to estimate gene deletion and gene duplication rates along each branch of the phylogenetic tree relating the four species. The total gene number in the extant Ae. tauschii genome was estimated to be 36,371. The gene deletion and gene duplication rates and total gene numbers in the four genomes were used to estimate the total gene number in each node of the phylogenetic tree. The common ancestor of the Brachypodieae and Triticeae lineages was estimated to have had 28,558 genes, and the common ancestor of the Panicoideae, Ehrhartoideae, and Pooideae subfamilies was estimated to have had 27,152 or 28,350 genes, depending on the ancestral gene scenario. Relative to the Brachypodieae and Triticeae common ancestor, the gene number was reduced in B. distachyon by 3,026 genes and increased in Ae. tauschii by 7,813 genes. The sum of gene deletion and gene duplication rates, which reflects the rate of gene synteny loss, was correlated with the rate of structural chromosome rearrangements and was highest in the Ae. tauschii lineage and lowest in the rice lineage. The high rate of gene space evolution in the Ae. tauschii lineage accounts for the fact that, contrary to the expectations, the level of synteny between the phylogenetically more related Ae. tauschii and B. distachyon genomes is similar to the level of synteny between the Ae. tauschii genome and the genomes of the less related rice and sorghum. The ratio of gene duplication to gene deletion rates in these four grass species closely parallels both the total number of genes in a species and the overall genome size. Because the overall genome size is to a large extent a function of the repeated sequence content in a genome, we suggest that the amount and activity of repeated sequences are important factors determining the number of genes in a genome.Entities:
Mesh:
Year: 2011 PMID: 21470968 PMCID: PMC3163431 DOI: 10.1093/molbev/msr080
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FStructural organization of two genomic regions in four current-day grasses and their ancestor. (A) ctg4985. (B) ctg5622. Ancestral genes are represented by red circles, deleted genes by white diamonds, and inserted genes by blue diamonds. Orthologous genes are connected by full lines or, in cases of gene deletions or insertions, by dashed lines.
FPhylogenetic tree showing relationships among the analyzed species and dynamics of gene content changes. The number of gene deletions (numbers preceded by − sign) and insertions (numbers preceded by + sign) that occurred in each branch of the phylogenetic tree are given along the branches. The nodes of the phylogenetic tree that were analyzed are indicated with closed circles. The total numbers of annotated (sorghum, rice, and B. distachyon), inferred (Ae. tauschii), or computed genes (ancestor of Brachypodieae and Triticeae, ancestor of Ehrhartoideae and Pooideae, and ancestor or BEP and PACCAD clades) are given in red. The range for the computed genes is listed in parenthesis. The numbers of genes identified or inferred in the nine investigated regions are in blue. The length of branches based on divergence times are given in million years. (A) A tree based on an ancestral gene number of 71; (B) A tree based on an ancestral gene number of 68.
Number of Genes Deleted and Duplicated Relative to the Ancestral State of 71 or 68 Genes Over the Past 45–60 My.
| Lineage | Ancestral Gene Number = 71 | Ancestral Gene Number = 68 | ||
| No. of Inserted and Duplicated Genes | No. of Deleted Genes | No. of Inserted and Duplicated Genes | No. of Deleted Genes | |
| 4b | 13a | 7b | 13a | |
| 2b | 6a | 5b | 6a | |
| 9b | 8a | 9b | 5a | |
| 26a | 7a | 29a | 7a | |
NOTE.—Numbers in columns sharing the same letter are not significantly different at the 5% probability level (2 × 2 contingency table and Fisher's exact test).
Information About the Gene Space in Four Current-Day Species and Their Ancestors.
| Lineage | Genome Size (Mb) | Time | No. of Genes | Inserted and Duplicated Genes | Deleted Genes | |||||
| Ancestral gene number = 71 ( | ||||||||||
| 271 | 35.8 | 62 | 3 | 11 | 0.001385 | 0.004775 | 0.290 | 25,532 | 28,826 | |
| 4020 | 35.8 | 90 | 25 | 5 | 0.009090 | 0.002070 | 4.391 | 36,371 | 28,289 | |
| Internode 2 | ? | 11.5 | 70 | 1 | 2 | 0.001251 | 0.002485 | 0.504 | 28,558 | 28,965 |
| 380 | 47.3 | 67 | 2 | 6 | 0.000641 | 0.001867 | 0.343 | 28,236 | 29,922 | |
| Internode 1 | ? | 5.2 | 71 | 0 | 0 | 0.000000 | 0.000000 | — | 29,444 | 29,444 |
| 730 | 52.5 | 72 | 9 | 8 | 0.002543 | 0.002277 | 1.117 | 27,640 | 27,256 | |
| Ancestral gene number = 68 ( | ||||||||||
| 271 | 35.8 | 62 | 3 | 11 | 0.001385 | 0.004775 | 0.290 | 25,532 | 28,826 | |
| 4020 | 35.8 | 90 | 25 | 5 | 0.009090 | 0.002070 | 4.391 | 36,371 | 28,289 | |
| Internode 2 | ? | 11.5 | 70 | 1 | 2 | 0.001251 | 0.002485 | 0.504 | 28,558 | 28,965 |
| 380 | 47.3 | 67 | 2 | 6 | 0.000641 | 0.001867 | 0.343 | 28,236 | 29,922 | |
| Internode 1 | ? | 5.2 | 71 | 3 | 0 | 0.008302 | 0.000000 | — | 29,444 | 28,200 |
| 730 | 52.5 | 72 | 9 | 5 | 0.002543 | 0.001455 | 1.748 | 27,640 | 26,104 | |
NOTE.— t is the divergence time in million years, g( is the total number of genes in the investigated regions, k is the duplication rate, m is the deletion rate, G( is the total number of genes in the genome at time t, and G(0)total is the estimated total number of genes in the ancestral genome.
From Vogel et al. (2010).
In the ancestor to the Brachypodieae and Triticeae.
An estimate generated by averaging the G(0)total of B. distachyon and Ae. tauschii.
In the ancestor to the Ehrhartoideae and Pooideae.
An estimate generated by averaging the G(0)total of O. sativa and Internode 2.
In the BEP clade and PACCAD clade ancestor.
FRelationships between rates of gene space evolution and overall genome size. The left-hand y axis is the gene duplication rate (k) and gene deletion rate (m) in gene−1 My−1. The right-hand y axis is the k/m ratio.
Comparison of Rates of Subchromosomal and Total Structural Changes and Single Gene Deletion and Insertion Rates.
| Lineage | Rate of Subchromosomal Structural Changes (change My−1) | Rate of All Structural Changes (change My−1) | |
| 0.14 | 0.34 | 0.006161 | |
| 1.01 | 1.15 | 0.011160 | |
| Internode 2 | 0.09 | 0.13 | 0.003736 |
| 0.08 | 0.08 | 0.002507 | |
| 0.09 | 0.09 | 0.004821 | |
| 0.09 | 0.09 | 0.003998 | |
| 0.934 ( | 0.966 ( | ||
| 0.942 ( | 0.981 ( |
NOTE.—Rates of subchromosomal structural changes, principally inversions and translocations, are given in changes per million years (My) (Vogel et al. 2010). The total rate of single gene deletions and duplications (k + m) in the B. distachyon, rice, sorghum, Ae. tauschii lineages, and internode 2 of the phylogenetic tree is calculated from table 2.
For 71 gene scenario.
For 68 gene scenario.