| Literature DB >> 25033750 |
Jungeun Kim, Jeongyeo Lee, Jae-Pil Choi, Inkyu Park, Kyungbong Yang, Min Keun Kim, Young Han Lee, Ill-Sup Nou, Dae-Soo Kim, Sung Ran Min, Sang Un Park, HyeRan Kim1.
Abstract
BACKGROUND: The Brassicaceae family is an exemplary model for studying plant polyploidy. The Brassicaceae knowledge-base includes the well-annotated Arabidopsis thaliana reference sequence; well-established evidence for three rounds of whole genome duplication (WGD); and the conservation of genomic structure, with 24 conserved genomic blocks (GBs). The recently released Brassica rapa draft genome provides an ideal opportunity to update our knowledge of the conserved genomic structures in Brassica, and to study evolutionary innovations of the mesohexaploid plant, B. rapa.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25033750 PMCID: PMC4117954 DOI: 10.1186/1471-2164-15-606
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The distribution of the average K values of the collinear gene pairs between A. thaliana and B. rapa genome.
Classification of three chronological genomes by sequence divergence of the collinear gene pairs *
| Recent | Young | Old | |
|---|---|---|---|
| Avg. | 0.53 | 1.17 | 2.19 |
|
| |||
| No. of syntenic segments | 302 | 366 | 15 |
| No. of collinear protein pairs | 29,239 | 7,335 | 109 |
|
| |||
| Size of genome segment | 250,327,645 | 203,058,429 | 5,49,677 |
| (%) | (97.69) | (79.24) | (2.14) |
| No. of integrated GBs | 172 | 223 | 14 |
| Average collinear gene pairs | 1,217 | 318 | 5 |
| No. of distinct | 18,701 | 5,209 | 109 |
| in synteny | (53.48%) | (14.90%) | (0.31%) |
| Collinear-pairs/Mb | 107 | 37 | 10 |
*Detailed information about this table is represented in Additional file 2.
Reconstructed genomic blocks based on the synteny between A. thaliana and B. rapa
| AK | GB | Current GB* | Reconstructed GB | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Interval (Mb) | No of genes (%) | Intervals (Mb) | No of genes (%) | ||||||
| 1 | A | At1g01560-At1g19330 | (6.48) | 2,528 | (7.19) | At1g01010-At1g19840 | (6.87) | 2,690 | (7.65) |
| 1 | B | At1g19850-At1g36240 | (6.73) | 1,949 | (5.54) | At1g19850-At1g37130 | (7.27) | 1,986 | (5.65) |
| 1 | C | At1g43600-At1g56120 | (4.84) | 1,404 | (3.99) | At1g43020-At1g56190 | (4.88) | 1,418 | (4.03) |
| 2 | D | At1g63770-At1g56530 | (2.49) | 744 | (2.12) | At1g64670-At1g56210 | (3.00) | 913 | (2.60) |
| 2 | E | At1g65040-At1g80420 | (6.08) | 2,086 | (5.93) | At1g64960-At1g80950 | (6.29) | 2,163 | (6.15) |
| 3 | F | At3g01040-At3g25520 | (9.26) | 3,423 | (9.73) | At3g01015-At3g25520 | (9.27) | 3,427 | (9.74) |
| 3 | G | At2g05170-At2g07690 | (1.66) | 229 | (0.65) | At2g05170-At2g07690 | (1.66) | 229 | (0.65) |
| 3 | H | At2g15670-At2g20900 | (2.16) | 669 | (1.90) | At2g10940-At2g20900 | (4.68) | 940 | (2.67) |
| 4 | I | At2g20920-At2g28910 | (3.42) | 1,091 | (3.10) | At2g20920-At2g31035 | (4.21) | 1,385 | (3.94) |
| 4 | J | At2g31040-At2g47730 | (6.35) | 2,400 | (6.82) | At2g31040-At2g48150 | (6.48) | 2,465 | (7.01) |
| 5 | K | At2g01250-At2g03750 | (1.02) | 326 | (0.93) | At2g01060-At2g05160 | (1.79) | 508 | (1.44) |
| 5 | L | At3g25855-At3g29770 | (2.19) | 531 | (1.51) | At3g25540-At3g32960 | (4.23) | 684 | (1.94) |
| 5 | M | At3g43740-At3g49970 | (2.88) | 789 | (2.24) | At3g42180-At3g50940 | (4.61) | 1,001 | (2.85) |
| 5 | N | At3g50950-At3g62790 | (4.31) | 1,678 | (4.77) | At3g50950-At3g63530 | (4.52) | 1,771 | (5.03) |
| 6 | O | At4g00030-At4g04955 | (2.51) | 633 | (1.80) | At4g00026-At4g05450 | (2.75) | 708 | (2.01) |
| 6 | P | At4g12070-At4g08690 | (1.72) | 434 | (1.23) | At4g07390-At4g12620 | (3.27) | 595 | (1.69) |
| 6 | Q | At5g28885-At5g23010 | (3.20) | 741 | (2.11) | At5g30510-At5g23010 | (3.92) | 771 | (2.19) |
| 6 | R | At5g23000-At5g01010 | (7.70) | 2,825 | (8.03) | At5g23000-At5g01010 | (7.70) | 2,825 | (8.03) |
| 7 | S | At5g41900-At5g33210 | (4.31) | 853 | (2.42) | At5g42110- At5g32470 | (4.75) | 886 | (2.52) |
| 7 | T | At4g12750-At4g16143 | (1.64) | 526 | (1.50) | At4g12700-At4g16240 | (1.71) | 556 | (1.58) |
| 7 | U | At4g16250-At4g38770 | (8.90) | 3,252 | (9.24) | At4g16250-At4g40100 | (9.39) | 3,446 | (9.80) |
| 8 | V | At5g42130-At5g42810 | (0.33) | 97 | (0.28) | At5g42130-At5g47810 | (2.52) | 805 | (2.29) |
| 8 | W | At5g47820-At5g60800 | (5.10) | 1,805 | (5.13) | At5g47820-At5g60800 | (5.10) | 1,805 | (5.13) |
| 8 | X | At5g60805-At5g67385 | (2.42) | 957 | (2.72) | At5g60805-At5g67640 | (2.51) | 991 | (2.82) |
| Total | (97.69) | 31,971 | (90.89) | (113.34) | 34,968 | (99.41) | |||
*Genomic blocks updated by Cheng et al. (2013) [18]. AK: ancestral karyotype defined by Lysak et al. (2006) [15], GB: genomic block.
Figure 2Three reconstructed periodical B. rapa genomes based on refined 24 GBs. The left bar represents 0.1 Mb scale of the B. rapa genome and A-X rectangles in the upper part represent 24 genomic blocks (GB). Each chromosome represents three periodical genomic segments distinguished by average synonymous substitution rates of genomic segments (from left, recent, young, and old). The red triangles (►) and arrows (↕) in the left side of the GB map represent newly identified GBs in this study and an inverted order of GBs compared to Cheng et al. (2013) [18], respectively. The structural difference between Cheng et al. (2013) [18] and our map may be arisen by using different version of the B. rapa genome.
Figure 3Distribution of fast evolving genes in chromosomes. The graphs represent the number of fast-evolving genes in recent and young genomes distributed in the B. rapa chromosomes and the frequencies of the genes in the chromosome are represented in each bar.
Figure 4GO slim analysis of fast evolving genes. A. Frequency of GO-slim terms. The x-axis represents GO-slim terms classified into the biological process (BP), cellular component (CC) and molecular function (MF) while the y-axis represents % of fast evolving genes assigned to specific GO-slim categories. B. GO-slim enrichment analysed by fisher’s exact test and p-values estimating fisher’s exact test under the description. The x-axis represents fold ratio between the frequency of fast evolving gene in recent or young genome categorized in certain GO-slim terms and background frequency (total B. rapa gene are not detected in fast evolving genes) in that term. The y-axis represents GO-slim terms. Different levels of p-value (< 0.05, < 0.01, < 0.001) are represented by *, **, ***, respectively.
Classification of divergence time of B. rapa gene based on A. thaliana collinear pairs
| Index | Retention patterns a | Standard deviation of | No. of | ||||
|---|---|---|---|---|---|---|---|
| Recent | Young | Old | Recent | Young | Old | ||
| 1 | O | X | X | 0.31 | - | - | 22,371 (54.54) |
| 2 | O | O | X | 0.98 | 1.31 | - | 6,740 (16.43) |
| 3 | O | X | O | 0.29 | - | 0.86 | 84 (0.20) |
| 4 | O | O | O | 0.28 | 0.81 | 0.83 | 17 (0.04) |
| 5 | X | X | X | - | - | - | 11,235 (27.39) |
| 6 | X | O | X | - | 0.62 | - | 564 (1.37) |
| 7 | X | X | O | - | - | 0.85 | 4 (0.01) |
| 8 | X | O | O | - | 0.78 | 0.30 | 4 (0.01) |
aPresence (O) and/or absence (X) of collinear gene pairs in recent, young and old genomic segments. ‘-‘ denotes absence of collinear gene pairs.
Figure 5Functional bias according to gene birth and death patterns. A. Frequency of enriched GO terms re-categorized into the GO-slim terms. X-axis represents GO-slim description and y-axis represents percentage of enriched GO-slim terms divided by total number of GO-slim terms in specific GO-slim categories. B. Organ distributions of development processes enriched in index 1 and/or index 2. C. Stress or biological process related genes which are specifically enriched in Index 1 and/or Index 2.
Figure 6Enriched GO-slim terms analysed with single and multi-retention homoeologous genes in recent genome. Overrepresented GO-slim terms for single (A) and multi (B) retention genes. The x-axis represents X-fold GO-slim enrichment, calculated as the ratio percentages of the cluster frequency of the tested gene set and the cluster frequency of the genomic background. The y-axis represents GO-slim terms. The p-values for fisher’s exact test are indicated on the bar.
Functional differentiation of the B. rapa homoeologues
| Retention numbers | |||
|---|---|---|---|
| Two | Tree | Four | |
| Nonfunctionalization | 507 | 183 | 0 |
| Subfunctionalization | 2,026 | 913 | 3 |
| Neofunctionalization | 8 | 1 | 0 |
| Total | 2,541 | 1,097 | 3 |
Subfunctionalization of B. rapa genes involved in root and leaf developments
|
|
|
| Leaf a | Root a | Stem a | Pool a | Organ b | Absolute expression in | Description |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
|
| AT1G31930 | 0.10 | 1 |
| 0.08 | 1 | root | seed, shoot | Extra-large G protein 3 |
|
| AT5G55480 | 0.21 | 0.9 |
| 8.1e-03 | 1 | root | seed, shoot | Glycerophosphoryl diester phosphodiesterase-like protein (GDPD) |
|
| AT5G57090 | 0.07 | 1 |
| 1 | 1 | stamen, root | seed, root | Auxin efflux carrier, root specific role |
|
| AT1G09560 | 0.18 | 1 |
| 1 |
| root | root | Germin-like protein 5 (GLP5) |
|
| AT1G70560 | 0.10 | 0.99 |
| 0.92 | 0.83 | flower, primary root, root, embryo, leaf, gynoecium, cotyledon, | carpel | Tryptophan aminotransferase of Arabidopsis 1 (TAA1) |
|
| AT2G39830 | 0.10 | 0.99 |
| 0.93 | 1 | root, phloem | shoot, petal | DA1-related protein |
|
| AT2G24260 | 0.27 | 1 |
| 1 | 0.96 | root hair cell | seed | Basic helix-loop-helix (bHLH) |
|
| AT2G35610 | 0.10 | 0.97 |
| 0.21 | 1 | root hair cell | seed, root, silique | Arabinosyltransferase |
|
| AT5G19320 | 0.14 | 1 |
|
| 1 | lateral root | root, seed, shoot, flower | RAN GTPase activating protein 2 (RANGAP2) |
|
| AT5G12330 | 0.13 | 1 |
| 0.41 | 1 | root | shoot, root | Lateral root primordium 1 (LRP1) |
|
| AT4G29040 | 0.02 | 1 |
| 0.01 | 1 | root, root cap, phloem, seedling | sepal, petal, senescent leaf | Regulatory particle AAA-ATPase 2A (RPT2A) |
|
| AT3G16857 | 0.09 | 1 |
| 0.43 | 0.91 | primary root, root | seed | Response regulator 1 (RR1) |
|
| AT1G07630 | 0.19 | 1 |
| 1 | 1 | lateral root, leaf | sepal, senescent leaf | Protein phosphatase 2C like gene |
|
| AT3G61440 | 0.16 | 0.98 |
|
| 1 | root hair cell | cotyledons | Cysteine synthase isomer (CysC1) |
|
| AT1G70940 | 0.20 | 1 |
| 0.16 | 1 | root | petal, stamen | PIN3, regulator of auxin efflux |
|
| AT4G34390 | 0.26 | 0.88 |
| 0.98 | 0.89 | lateral root | leaf, sepal | Extra-large GTP-binding protein 2 (XLG2) |
|
| AT1G13260 | 0.11 | 0.81 |
|
| 1 | flower, lateral root, leaf | root, leaf | EDF4 |
|
| AT3G19820 | 0.05 | 1 |
|
| 1 | anther, root | shoot, root flower, | Enhanced very-low-fluence responses 1 (EVE1) |
|
| AT1G31880 | 0.09 | 0.98 |
| 0.87 | 1 | lateral root, root | shoot, seed | BRX, cell proliferation and elongation in the root |
|
| AT1G35580 | 0.04 | 1 |
| 0.97 | 0.27 | root | root | Cytosolic invertase 1 |
|
| AT3G62680 | 0.35 | 1 |
| 1 | 1 | lateral root | root | Proline-rich protein 3 (PRR3) |
|
| AT1G74500 | 0.07 | 1 |
| 1 | 1 | root | root | bHLH, root initiation |
|
| AT1G04550 | 0.19 | 1 |
| 0.98 | 1 | root | shoot, carpel, hypocotyl, flower | BDL, auxin-mediated processes |
|
| AT4G38630 | 0.05 | 0.91 |
|
| 1 | leaf, post-embryonic root, pollen | seed | Multiubiquitin chain binding protein 1 (MCB1) |
|
| AT3G15540 | 0.16 | 1 |
| 0.61 | 0.96 | lateral root, stamen filament | seed, stamen | IAA19, primary auxin-response genes |
|
| AT5G62340 | 0.49 |
|
| 1 | 1 | lateral root | root | Plant invertase/pectin methylesterase inhibitor superfamily |
|
| |||||||||
|
| AT4G18390 | 0.09 |
|
| 0.95 | 1 | leaf | vegetative rosette | Cycloidea and PCF transcription factor 2 (TCP2) |
|
| AT3G15030 | 0.20 |
| 1 | 0.01 | 1 | embryo, leaf | petal | TCP4 |
aDigital expression patterns by Audic’s test, bold characters represent over-expressed tissues with p-value < 0.001. bspecific organs, sub-functionalized B. rapa genes involved in the developmental process, cexpression patterns of A. thaliana in eFP web site (http://bbc.botany.utoronto.ca/efp/cgi-bin/efpWeb.cgi) in developmental map.