| Literature DB >> 34078291 |
J S Eriksson1,2, C D Bacon3,4, D J Bennett3,4, B E Pfeil3,4, B Oxelman3,4, A Antonelli3,4,5,6.
Abstract
BACKGROUND: The great diversity in plant genome size and chromosome number is partly due to polyploidization (i.e. genome doubling events). The differences in genome size and chromosome number among diploid plant species can be a window into the intriguing phenomenon of past genome doubling that may be obscured through time by the process of diploidization. The genus Hibiscus L. (Malvaceae) has a wide diversity of chromosome numbers and a complex genomic history. Hibiscus is ideal for exploring past genomic events because although two ancient genome duplication events have been identified, more are likely to be found due to its diversity of chromosome numbers. To reappraise the history of whole-genome duplication events in Hibiscus, we tested three alternative scenarios describing different polyploidization events.Entities:
Keywords: Ancient genome duplication; Gene copy; Haplotype; Hibiscus; Malvaceae; Paralogy; Polyploidy
Year: 2021 PMID: 34078291 PMCID: PMC8170824 DOI: 10.1186/s12862-021-01751-7
Source DB: PubMed Journal: BMC Ecol Evol ISSN: 2730-7182
Fig. 1Three hypothetical genome evolution scenarios in Hibiscus with Gossypium raimondii (G) as outgroup. S is short for for H. syriacus; T for H. trionum; P for Pavonia triloba; C for H. cannabinus; and M for H. mechowii. The different colors represent three different genome duplication scenarios, a where red = S1 (two independent WGD in S), b blue = S2 (one WGD shared by all species of Hibiscus and one independent in S) and c green = S3 (two WGD events shared by all species of Hibiscus). Numbers above branches leading to each species are known chromosome counts
Mean read depth across all base pairs per species per gene
| Mean read depth (coverage across all assemblies) per locus | ||||||
|---|---|---|---|---|---|---|
| Gene | ||||||
| ABC-C2 | 206 | 120 | 180 | 324 | 66 | 53 |
| ACCS | 231 | 179 | 172 | 284 | 42 | 38 |
| Acylamino | 480 | 300 | 330 | 823 | 122 | 76 |
| AglucanP | 411 | 638 | 597 | 399 | 171 | 112 |
| Ankyrin | 239 | 151 | 145 | 315 | 91 | 77 |
| Bgalactosidase8 | 256 | 193 | 178 | 322 | 57 | 70 |
| CAD | 406 | 295 | 270 | 381 | 60 | 57 |
| Calcium11 | 494 | 293 | 315 | 600 | 75 | 158 |
| Calcium-ATPase1 | 215 | 195 | 196 | 287 | 73 | 72 |
| Calcium-ATPase3* | 338 | 217 | 243 | 460 | 87 | 76 |
| Callose* | 93 | 84 | 222 | 91 | 42 | 45 |
| CesA1 | 200 | 209 | 147 | 255 | 79 | 116 |
| DEAD-ATP* | 324 | 308 | 284 | 150 | 87 | 90 |
| EIF-2B | 133 | 96 | 98 | 160 | 23 | 35 |
| F5H | 232 | 200 | 172 | 268 | 66 | 52 |
| Formin2 | 141 | 117 | 109 | 151 | 31 | 27 |
| Glutamine | 425 | 302 | 349 | 744 | 158 | 85 |
| GPDH | 297 | 264 | 258 | 459 | 136 | 100 |
| Importin4* | 413 | 271 | 328 | 538 | 55 | 109 |
| Kinesin-KCA2* | 268 | 233 | 276 | 291 | 76 | 87 |
| Kinesin-Kp1 | 333 | 410 | 398 | 473 | 106 | 44 |
| LOC105792102 | 187 | 153 | 137 | 221 | 58 | 58 |
| MAP3K | 385 | 280 | 274 | 441 | 102 | 78 |
| Mechanosensitive* | 481 | 467 | 481 | 622 | 195 | 143 |
| MNS4* | 299 | 276 | 220 | 353 | 91 | 83 |
| NF-X1-zinc* | 274 | 282 | 271 | 429 | 100 | 85 |
| Oxysterol-1D* | 1103 | 1224 | 1095 | 2002 | 323 | 316 |
| Phospholipase | 212 | 178 | 178 | 299 | 62 | 65 |
| Plasma-ATPase | 216 | 189 | 174 | 238 | 73 | 56 |
| Polysub2* | 428 | 562 | 220 | 798 | 170 | 161 |
| RRP5* | 404 | 307 | 311 | 410 | 103 | 114 |
| SBT3-5* | 591 | 467 | 451 | − | 136 | 102 |
| SD1-1* | 628 | 495 | 524 | 685 | 127 | 116 |
| TGH* | 743 | 560 | 570 | 405 | 172 | 162 |
| Mean coverage across all loci | 312 | 261 | 259 | 371 | 87 | 80 |
The average read coverage is calculated across all contigs and per exons for each gene. Asterisks (*) refer to genes that have only one sequence copy per species (e.g. single copy genes)
Gene count data used for likelihood scenario testing
| Number of copies | ||||||
|---|---|---|---|---|---|---|
| Gene name | G | S | C | M | T | P |
| ABC-c2 | 1 | 2 | 1 | 1 | 0 | 0 |
| ABC-c2 | 1 | 1 | 1 | 0 | 1 | 3 |
| ACCS | 1 | 0 | 2 | 1 | 0 | 2 |
| ACCS | 1 | 3 | 2 | 2 | 2 | 1 |
| Acylamino | 1 | 3 | 3 | 2 | 2 | 5 |
| AglucanP | 1 | 9 | 2 | 2 | 2 | 12 |
| Ankyrin | 3 | 6 | 2 | 2 | 2 | 4 |
| Ankyrin | 1 | 1 | 2 | 0 | 0 | 0 |
| Bgalactosidase8 | 1 | 3 | 2 | 2 | 6 | 4 |
| CAD | 4 | 2 | 3 | 2 | 2 | 3 |
| CAD | 1 | 1 | 1 | 1 | 1 | 1 |
| Calcium11 | 1 | 4 | 2 | 2 | 0 | 3 |
| Calcium11 | 2 | 5 | 2 | 3 | 6 | 2 |
| Calcium-atpase1 | 1 | 3 | 1 | 1 | 1 | 2 |
| Calcium-atpase1 | 1 | 0 | 1 | 1 | 1 | 2 |
| CesA1 | 3 | 4 | 2 | 2 | 2 | 4 |
| EIF-2B | 2 | 2 | 1 | 1 | 1 | 2 |
| EIF-2B | 1 | 2 | 1 | 1 | 1 | 2 |
| F5H | 1 | 1 | 1 | 1 | 1 | 2 |
| F5H | 1 | 4 | 2 | 2 | 2 | 4 |
| F5H | 1 | 0 | 2 | 2 | 1 | 2 |
| Formin2 | 1 | 5 | 1 | 1 | 0 | 0 |
| Formin2 | 1 | 3 | 2 | 2 | 2 | 2 |
| Formin2 | 4 | 7 | 2 | 3 | 2 | 2 |
| Glutamine | 1 | 5 | 3 | 3 | 2 | 1 |
| GPDH | 1 | 2 | 1 | 0 | 0 | 2 |
| GPDH | 1 | 1 | 2 | 1 | 1 | 1 |
| GPDH | 1 | 2 | 2 | 2 | 2 | 3 |
| GPDH | 1 | 6 | 2 | 2 | 2 | 2 |
| GPDH | 2 | 10 | 1 | 2 | 2 | 4 |
| Kinesin-KP1 | 1 | 2 | 1 | 1 | 1 | 2 |
| Kinesin-KP1 | 3 | 2 | 1 | 1 | 1 | 2 |
| LOC105792102 | 2 | 2 | 1 | 1 | 1 | 2 |
| LOC105792102 | 1 | 3 | 2 | 2 | 2 | 2 |
| LOC105792102 | 1 | 2 | 2 | 2 | 2 | 4 |
| LOC105792102 | 1 | 7 | 2 | 2 | 1 | 0 |
| LOC105792102 | 1 | 1 | 1 | 1 | 1 | 3 |
| MAP3K | 1 | 4 | 1 | 1 | 1 | 2 |
| MAP3K | 1 | 3 | 2 | 2 | 2 | 4 |
| Phospolipase | 1 | 1 | 1 | 1 | 0 | 0 |
| Phospolipase | 1 | 4 | 2 | 2 | 0 | 0 |
| Phospolipase | 1 | 2 | 2 | 0 | 0 | 0 |
| Plasma-ATPase | 1 | 2 | 1 | 1 | 0 | 0 |
| Plasma-ATPase | 1 | 7 | 2 | 3 | 2 | 0 |
Duplicated gene name represents separate paralogous clades. The number of copies were counted for each gene as the number of sequences from one individual in a clade that had Gossypium (G) as an outgroup. The abbreviations are short for Hibiscus syriacus (S), H. cannabinus (C), H. mechowii (W), H. trionum (T) and Pavonia triloba (P)
Fig. 2Species tree inferred by StarBEAST using 10 single-copy genes. The blue color indicates the most probable placement of the whole-genome duplication events
Log-likelihood scores, AIC and weighted AIC from gene count data for each scenario
| loglikelihood | AIC | deltaAIC | rel.LL | AIC weights | |
|---|---|---|---|---|---|
| S1 | − 294.4646 | 596.9292 | 8.2118 | 0.01647 | 0.0162 |
| S2 | − 290.3586 | 588.7173 | 0 | 1 | 0.9833 |
| S3 | − 297.9675 | 603.9350 | 15.2176 | 0.0005 | 0.0005 |
| Null | − 303.7357 | 611.4714 | 22.7540 | 1.1455e−05 | 1.1264e−05 |
| S1 | − 276.8404 | 563.6808 | 13.5126 | 0.0012 | 0.0012 |
| S2 | − 270.0841 | 550.1682 | 0 | 1 | 0.9988 |
| S3 | − 285.7683 | 581.5366 | 31.3684 | 1.5432e−07 | 1.5414e−07 |
| Null | − 303.7357 | 611.4714 | 61.3032 | 4.8771e−14 | 4.8714e−14 |