| Literature DB >> 26353816 |
Natasha M Glover1,2,3, Josquin Daron1,2, Lise Pingault1,2, Klaas Vandepoele4, Etienne Paux1,2, Catherine Feuillet1,2,5, Frédéric Choulet6,7.
Abstract
BACKGROUND: Bread wheat is not only an important crop, but its large (17 Gb), highly repetitive, and hexaploid genome makes it a good model to study the organization and evolution of complex genomes. Recently, we produced a high quality reference sequence of wheat chromosome 3B (774 Mb), which provides an excellent opportunity to study the evolutionary dynamics of a large and polyploid genome, specifically the impact of single gene duplications.Entities:
Mesh:
Year: 2015 PMID: 26353816 PMCID: PMC4563886 DOI: 10.1186/s13059-015-0754-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Phylogeny of the model grass species used in this study. Dating information (in MYA) was taken from [12, 13, 19]
Fig. 2Methodology applied for classifying syntenic and non-syntenic genes
Filtration results for the four species compared in this study
| Species | Chr. | Number of genes (no ASVs or TEs) | Number of genes with at least 1 homolog | Number of genes after removing pseudogenes |
|---|---|---|---|---|
|
| 3B | 7,703 | 6,254 | 5,125 |
|
| 2 | 4,293 | 3,804 | 3,804 |
|
| 1 | 5,070 | 3,882 | 3,582 |
|
| 3 | 4,555 | 4,023 | 4,023 |
Sequential process of filtration, starting with the total numbers of genes in the syntenic regions of each chromosome after removing transposon related genes and alternative splice variants (ASVs). The numbers of genes with at least 1 homolog are those with a significant BLAST hit (>35 % amino acid identity and >35 % gene overlap) in at least one of the other species
Proportion of non-syntenic genes and their fixation rate for each species
| Species | No. syntenic genes (%) | No. non-syntenic genes (%) | Divergence time (MYA) | non-syntenic gene fixation rate (locus−1 MY−1) |
|---|---|---|---|---|
| Ta3B | 3,728 (72.7 %) | 1,397(27.3 %) | 39 | 7.0 × 10−3 |
| Bd2 | 3,509 (92.2 %) | 295 (7.8 %) | 39 | 2.0 × 10−3 |
| Os1 | 3,257 (90.9 %) | 325 (9.1 %) | 54 | 1.7 × 10−3 |
| Sb3 | 3,472 (86.3 %) | 551 (13.7 %) | 60 | 2.3 × 10−3 |
Percentages are out of the total number of genes. Divergence times were taken from [12]
Fig. 3Chromosomal distributions of non-syntenic genes. Proportion of non-syntenic genes per total number of genes for (a) wheat chromosome 3B, (b) rice chromosome 1, and (c) sorghum chromosome 3. For wheat, the proportion of non-syntenic genes per total gene count was plotted in a sliding window of 10 Mb with a step of 1 Mb; in rice, 1 Mb sliding window with a step of 0.1 Mb, and for sorghum a 5 Mb sliding window with a step of 0.5 Mb. Pseudogenes were removed
Fig. 4Comparison of the distributions of older versus more recent wheat non-syntenic genes along chromosome 3B. a “Recent” non-syntenic genes that moved after the divergence of the wheat progenitors (carried by 3B but without homolog on 3A, 3D, or 3H). b “Older” non-syntenic genes that have moved in the Triticeae lineage after divergence with Brachypodium and before the divergence of the wheat progenitors. These are 3B non-syntenic genes with a homolog on 3A, 3D, or 3H. The proportion of older and more recent non-syntenic genes out of the total number of genes was calculated within a 10 Mb sliding window with a step of 1 Mb. The centromeric/pericentromeric region is highlighted in the gray box
Structural and functional features of the 3B syntenic and non-syntenic genes
| Syntenic | Non-syntenic | Significance | |
|---|---|---|---|
| Mean genomic size | 3,299 bp | 3,008 bp |
|
| Median genomic size | 2,259 bp | 1,907 bp | |
| Mean CDS size | 1,252 bp | 1,150 bp |
|
| Median CDS size | 1,070 bp | 1,044 bp | |
| Mean number exons | 4.8 | 3.6 |
|
| Median number exons | 3 | 2 | |
| Expressed | 83 % | 74 % |
|
| Mean number conditions expressedc | 12.0 | 9.2 |
|
| Median number conditions expressedc | 15 | 10 | |
| Mean fpkmc | 260.8 | 142.0 |
|
| Median fpkmc | 130.8 | 46.3 | |
| Mean number alternative splice variants | 5.3 | 3.6 |
|
| Median number alternative splice variants | 2 | 1 |
Percentages are out of the total filtered gene count
a Mann–Whitney-Wilcoxon test
b Chi-squared test
c Of those genes expressed
Fig. 5Frequency distributions of Ka/Ks rates of syntenic (blue) and non-syntenic genes (red). Coding sequences of 3,877 3B genes were aligned with their closest homolog in Brachypodium
GO enrichment analysis of Biological Process terms for syntenic genes and non-syntenic genes
|
| ||||||
| GO ID | Term | Annotated | Significant | Expected |
| Adjusted |
| GO:0019219 | Regulation of nucleobase-containing compound metabolic process | 372 | 229 | 176.14 | 4.3e-09 | 8.2e-07 |
| GO:0019222 | Regulation of metabolic process | 437 | 263 | 206.92 | 6.2e-09 | 9.9e-07 |
| GO:0031323 | Regulation of cellular metabolic process | 382 | 232 | 180.88 | 1.9e-08 | 2.7e-06 |
| GO:0060255 | Regulation of macromolecule metabolic process | 404 | 243 | 191.3 | 2.9e-08 | 3.67e-06 |
| GO:0010468 | Regulation of gene expression | 378 | 228 | 178.98 | 6.0e-08 | 5.97e-06 |
| GO:0009889 | Regulation of biosynthetic process | 373 | 225 | 176.62 | 7.4e-08 | 5.97e-06 |
| GO:0010556 | Regulation of macromolecule biosynthetic process | 373 | 225 | 176.62 | 7.4e-08 | 5.97e-06 |
| GO:0031326 | Regulation of cellular biosynthetic process | 373 | 225 | 176.62 | 7.4e-08 | 5.97e-06 |
| GO:0006139 | Nucleobase-containing compound metabolic process | 695 | 391 | 329.09 | 9.7e-08 | 6.8e-06 |
| GO:0010467 | Gene expression | 679 | 378 | 321.51 | 8.5e-07 | 5.0e-05 |
| GO:0044248 | Cellular catabolic process | 163 | 100 | 77.18 | 0.00017 | 0.0090 |
| GO:1901576 | Organic substance catabolic process | 178 | 106 | 84.28 | 0.00054 | 2.1639e-02 |
| GO:0044249 | Cellular biosynthetic process | 742 | 393 | 351.34 | 0.00033 | 1.5428e-02 |
| GO:0016485 | Protein processing | 100 | 64 | 47.35 | 0.00050 | 2.0778e-02 |
| GO:0071704 | Organic substance metabolic process | 2000 | 998 | 947.01 | 0.00029 | 1.4790e-02 |
| GO:0009058 | Biosynthetic process | 750 | 396 | 355.13 | 0.00044 | 1.9747e-02 |
| GO:0015672 | Monovalent inorganic cation transport | 41 | 30 | 19.41 | 0.00067 | 2.3120e-02 |
| GO:0009057 | Macromolecule catabolic process | 146 | 87 | 69.13 | 0.00163 | 4.6894e-02 |
| GO:0006812 | Cation transport | 67 | 44 | 31.72 | 0.00175 | 4.9087e-02 |
| GO:0019941 | Modification-dependent protein catabolic process | 97 | 62 | 45.93 | 0.00064 | 2.3120e-02 |
| GO:0070647 | Protein modification by small protein conjugation or removal | 99 | 62 | 46.88 | 0.00139 | 4.1042e-02 |
|
| ||||||
| GO ID | Term | Annotated | Significant | Expected |
| Adjusted |
| GO:0012501 | Programmed cell death | 172 | 53 | 30.12 | 8.5e-06 | 0.00374 |
| GO:0043412 | Macromolecule modification | 663 | 150 | 116.1 | 0.00011 | 0.030855 |