| Literature DB >> 29728068 |
Tatsuhiro Nomaguchi1, Yoshiaki Maeda2, Tomoko Yoshino2, Toru Asahi1, Leila Tirichine3, Chris Bowler3, Tsuyoshi Tanaka4.
Abstract
BACKGROUND: Allopolyploidy is a genomic structure wherein two or more sets of chromosomes derived from divergent parental species coexist within an organism. It is a prevalent genomic configuration in plants, as an important source of genetic variation, and also frequently confers environmental adaptability and increased crop productivity. We previously reported the oleaginous marine diatom Fistulifera solaris JPCC DA0580 to be a promising host for biofuel production and that its genome is allopolyploid, which had never previously been reported in eukaryotic microalgae. However, the study of allopolyploidy in F. solaris was hindered by the difficulty in classifying the homoeologous genes based on their progenitor origins, owing to the shortage of diatom genomic references.Entities:
Keywords: Allopolyploidy; Diatom; Fistulifera solaris JPCC DA0580; Homoeolog expression bias; Pseudo-parental subgenome
Mesh:
Substances:
Year: 2018 PMID: 29728068 PMCID: PMC5935921 DOI: 10.1186/s12864-018-4691-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Comparison of GC content between pairs of homoeologous genes in each homoeologous chromosome. Vertical axis represents GC content ratio (dGC) value of each homoeologous gene, and horizontal axis indicates the relative number of homoeologous genes on each chromosome. The dGC value indicates the ratio of GC content in one homoeologous gene with respect to its corresponding pair. As such, if the dGC value is over 1 (above the horizontal axis), its GC content is higher than the other pair. The dGC value of homoeologous genes along the homoeologous chromosome with higher global GC content (Chromosome X) and lower global GC content (Chromosome X’) were indicated as red and green line, respectively
Fig. 2PCA based on codon usage frequency of homoeologous genes in each chromosome. The figure shows the two dimensional principal component analysis (PCA) results based on first principal component (PC1) and second principal component (PC2). Each plotted number represents a chromosome number and the colour corresponds to the subgenome to which they belong
Chromosome characteristics of allopolyploid diatom F. solaris
| Chromosome | Global GC content | Number of predicted genes | Number of homoeologous genes | Number of non-homoeologous genes | ||||
|---|---|---|---|---|---|---|---|---|
| Ha | La | H | L | H | L | H | L | |
| Chr 1 | 45.27 | 45.31 | 692 | 649 | 617 (89.16)b | 617 (95.07) | 75 (10.84) | 32 (4.93) |
| Chr 2 | 45.34 | 45.58 | 585 | 519 | 463 (79.15) | 463 (89.21) | 122 (20.85) | 56 (10.79) |
| Chr 3 | 45.93 | 45.77 | 595 | 562 | 531 (89.24) | 531 (94.48) | 64 (10.76) | 31 (5.52) |
| Chr 4 | 45.24 | 45.30 | 561 | 547 | 499 (88.95) | 499 (91.22) | 62 (11.05) | 48 (8.78) |
| Chr 5 | 44.85 | 45.04 | 497 | 454 | 421 (84.71) | 421 (92.73) | 76 (15.29) | 33 (7.27) |
| Chr 6 | 45.82 | 45.81 | 452 | 422 | 408 (90.27) | 408 (96.68) | 44 (9.73) | 14 (3.32) |
| Chr 7 | 45.82 | 45.68 | 384 | 345 | 322 (83.85) | 322 (93.33) | 62 (16.15) | 23 (6.67) |
| Chr 8 | 45.59 | 45.65 | 387 | 382 | 354 (91.47) | 354 (92.67) | 33 (8.53) | 28 (7.33) |
| Chr 9 | 45.75 | 45.36 | 364 | 340 | 299 (82.14) | 299 (87.94) | 65 (17.86) | 41 (12.06) |
| Chr 10 | 45.21 | 44.83 | 358 | 364 | 291 (81.28) | 291 (79.95) | 67 (18.72) | 73 (20.05) |
| Chr10_Ac | / | / | 308 | 285 | 256 (83.11) | 256 (89.82) | 52 (16.88) | 29 (10.18) |
| Chr10_Bc | / | / | 79 | 50 | 35 (44.30) | 35 (70.00) | 44 (55.70) | 15 (30.00) |
| Chr 11 | 45.89 | 45.83 | 343 | 314 | 297 (86.59) | 297 (94.59) | 46 (13.41) | 17 (5.41) |
| Chr 12 | 45.75 | 46.12 | 381 | 343 | 320 (83.99) | 320 (93.29) | 61 (16.01) | 23 (6.71) |
| Chr 13 | 45.17 | 45.39 | 356 | 332 | 301 (84.55) | 301 (90.66) | 55 (15.45) | 31 (9.34) |
| Chr 14 | 45.91 | 45.64 | 360 | 332 | 293 (81.39) | 293 (88.25) | 67 (18.61) | 39 (11.75) |
| Chr 15 | 45.50 | 45.51 | 338 | 326 | 303 (89.64) | 303 (92.94) | 35 (10.36) | 23 (7.06) |
| Chr 16 | 45.81 | 46.14 | 307 | 271 | 255 (83.06) | 255 (94.10) | 52 (16.94) | 16 (5.90) |
| Chr 17 | 46.41 | 45.81 | 294 | 288 | 260 (88.44) | 260 (90.28) | 34 (11.56) | 28 (9.72) |
| Chr 18 | 45.73 | 45.48 | 291 | 263 | 242 (83.16) | 242 (92.02) | 49 (16.84) | 21 (7.98) |
| Chr 19 | 46.22 | 45.78 | 310 | 297 | 282 (90.97) | 282 (94.95) | 28 (9.03) | 15 (5.05) |
| Chr 20 | 45.95 | 45.73 | 263 | 234 | 214 (81.37) | 214 (91.45) | 49 (18.63) | 20 (8.55) |
| Chr 21 | 45.17 | 45.40 | 273 | 265 | 252 (92.31) | 252 (95.09) | 21 (7.69) | 13 (4.91) |
| Chr 22 | 46.02 | 45.94 | 247 | 238 | 218 (88.26) | 218 (91.60) | 29 (11.74) | 20 (8.40) |
| Chr 23 | 45.07 | 45.42 | 250 | 231 | 215 (86.00) | 215 (93.07) | 35 (14.00) | 16 (6.93) |
| Chr 24 | 45.81 | 45.82 | 250 | 212 | 199 (79.60) | 199 (93.87) | 51 (20.40) | 13 (6.13) |
| Chr 25 | 45.38 | 45.21 | 216 | 203 | 191 (88.43) | 191 (94.09) | 25 (11.57) | 12 (5.91) |
| Chr 26 | 45.65 | 44.56 | 165 | 130 | 118 (71.52) | 118 (90.77) | 47 (28.48) | 12 (9.23) |
| Chr 27 | 46.11 | 45.32 | 107 | 118 | 96 (89.72) | 96 (81.36) | 11 (10.28) | 22 (18.64) |
| Chr 28 | 44.58 | 45.34 | 119 | 101 | 92 (77.31) | 92 (91.09) | 27 (22.69) | 9 (8.91) |
| Chr 29 | 46.38 | 45.38 | 129 | 114 | 103 (79.84) | 103 (90.35) | 26 (20.16) | 11 (9.65) |
| Chr 30 | 45.60 | 45.45 | 121 | 117 | 102 (84.30) | 102 (87.18) | 19 (15.70) | 15 (12.82) |
| Chr 31 | 45.36 | 45.63 | 89 | 78 | 74 (83.15) | 74 (94.87) | 15 (16.85) | 4 (5.13) |
| Chr 32 | 45.88 | 45.71 | 79 | 73 | 67 (84.81) | 67 (91.78) | 12 (15.19) | 6 (8.22) |
| Chr 33 | 44.92 | 46.05 | 63 | 57 | 54 (85.71) | 54 (94.74) | 9 (14.29) | 3 (5.26) |
| Chr 34 | 46.24 | 46.36 | 54 | 50 | 47 (87.04) | 47 (94.00) | 7 (12.96) | 3 (6.00) |
| Chr 35 | 44.42 | 44.99 | 43 | 40 | 35 (81.40) | 35 (87.50) | 8 (18.60) | 5 (12.50) |
| Chr 36 | 46.20 | 46.16 | 50 | 47 | 42 (84.00) | 42 (89.36) | 8 (16.00) | 5 (10.64) |
| Chr 37 | 46.48 | 46.42 | 45 | 34 | 34 (75.56) | 34 (100.00) | 11 (24.44) | 0 (0.00) |
| Chr 38 | 46.93 | 45.68 | 34 | 37 | 33 (97.06) | 33 (89.19) | 1 (2.94) | 4 (10.81) |
| Chr 39 | 45.55 | 45.80 | 33 | 27 | 27 (81.82) | 27 (100.00) | 6 (18.18) | 0 (0.00) |
| Chr 40 | 45.58 | 45.42 | 25 | 21 | 19 (76.00) | 19 (90.48) | 6 (24.00) | 2 (9.52) |
| Chr 41 | 44.86 | 44.34 | 10 | 16 | 8 (80.00) | 8 (50.00) | 2 (20.00) | 8 (50.00) |
| Chr 42 | 46.05 | 43.60 | 9 | 11 | 9 (100.00) | 9 (81.82) | 0 (0.00) | 2 (18.18) |
aEach element is separated into two columns, indicated as H and L, representing the data from chromosomes containing homoeologous genes with relatively high and low dGC, respectively
bThe values in parentheses indicates percentages of homoeologous/non-homoeologous genes within each homoeologous chromosome
cFor chromosome 10, corresponding values for upstream region (Chr10_A) and downstream region (Chr10_B) of the dGC flip over are written separately
Fig. 3Numbers of homoeologous genes expressing with a bias towards one of the specific subgenomes. a Histograms of genome-wide homoeologous expression bias of homoeologous gene pairs at three different stages during oil accumulation. N values represent the number of genes expressing towards the subgenome A or B. S values indicate the number of genes counted as biased expression due to silencing of the other homoeologous pair (not included in histograms). b Comparison of number of synchronized and unsynchronized homoeologous gene pairs throughout 3 time points
Fig. 4Range of homoeolog expression bias in different metabolic pathways. Box plot of log2 showing the expression ratio of homoeologous gene pairs in each metabolic pathway. Red dotted line indicates the equal ratio log2. Black dotted line indicates the threshold of expression ratio where the homoeologous gene pair is defined as biased (log2FC (fold change) > 1). Orange and blue colours of the boxes indicate the direction of homoeolog expression bias, towards Fso_h and Fso_l, respectively
Fig. 5Homoeolog expression bias in fatty acid and lipid metabolism pathway. The 48 h (a), 96 h (b), and 144 h (c) culture. Homoeologous genes encoding enzymes that have been predicted in F. solaris are written inside the ellipses. The ellipses in orange and blue represent the genes with homoeolog expression bias, towards Fso_h and Fso_l, respectively