| Literature DB >> 30740572 |
Ruisen Lu1, Wuqin Xu1, Qixiang Lu1, Pan Li1, Jocelyn Losh1, Faiza Hina1, Enxiang Li2, Yingxiong Qiu1.
Abstract
The genus Croomia (Stemonaceae) is an excellent model for studying the evolution of the Eastern Asia (EA)-Eastern North America (ENA) floristic disjunction and the genetic mechanisms of floral zygomorphy formation. In addition to the presence of both actinomorphic and zygomorphic flowers within the genus, species are disjunctively distributed between EA and ENA. However, due to the limited availability of genomic resources, few studies of Croomia have examined these questions. In this study, we sequenced the floral and leaf transcriptomes of the zygomorphic flowered C roomia heterosepala and the actinomorphic flowered Croomia japonica, and used comparative genomic approaches to investigate the transcriptome evolution of the two closely related species. The sequencing and de novo assembly of transcriptomes from flowers of C. heterosepala (ChFlower), flowers of C. japonica (CjFlower), and leaves of C. japonica (CjLeaf) yielded 57,193, 62,131 and 64,448 unigenes, respectively. In addition, estimation of Ka/Ks ratios for 11,566 potential orthologous groups between ChFlower and CjFlower revealed that only six pairs had Ka/Ks ratios significantly greater than 1 and are likely under positive selection. A total of 429 single copy nuclear genes (SCNGs) and 21,460 expression sequence tags-simple sequence repeats (EST-SSRs) were identified in this study. Specifically, we identified seven CYC/TB1-like genes from Stemonaceae. Phylogenetic and molecular evolution analyses indicated that these CYC/TB1-like genes formed a monophyletic clade (SteTBL1) and were subject to strong purifying selection. The shifts of floral symmetry in Stemonaceae do not appear to be correlated with TBL copy number.Entities:
Keywords: CYC/TB1; Croomia; EST-SSRs; Flower symmetry; SCNGs; Transcriptome
Year: 2018 PMID: 30740572 PMCID: PMC6317509 DOI: 10.1016/j.pld.2018.11.006
Source DB: PubMed Journal: Plant Divers ISSN: 2468-2659
Fig. 3Bayesian inference (BI) phylogeny of monocot CYC/TB1 genes rooted with basal eudicot CYC/TB1 genes. Posterior probabilities (pp, first) and bootstrap values (BS, second) > 50% are indicated (-- < 0.5). Where a particular relationship was not shown in ML analysis, only the Posterior probability is presented.
Summary statistics of the assembly results for C. heterosepala flowers (ChFlower), C. japonica flowers, (CjFlower), C. japonica leaves (CjLeaf) and C. japonica flowers + leaves (CjTranscriptome).
| Characteristics | ||||
|---|---|---|---|---|
| ChFlower | CjFlower | CjLeaf | CjTranscriptome | |
| Raw reads | 49 881 224 | 50 481 150 | 60 582 220 | 111 063 370 |
| Clean reads | 43 683 202 | 44 355 666 | 54 416 848 | – |
| GC percentage (%) | 50.46 | 49.68 | 50.24 | – |
| Q20 percentage | 97.60 | 97.60 | 98.55 | – |
| Total number of contigs | 76 976 | 77 344 | 79 538 | – |
| N50 length of contigs (bp) | 1452 | 1461 | 1641 | – |
| Mean length of contigs (bp) | 916 | 951 | 1027 | – |
| Total number of unigenes (bp) | 57 193 | 62 131 | 64 448 | 86 457 |
| 200–500 bp (% of total unigenes) | 21 357 (37.34%) | 22 099 (35.57%) | 23 801 (36.93%) | 30 335 (35.09%) |
| 500–1000 bp (% of total unigenes) | 13 622 (23.82%) | 14 847 (23.90%) | 14 830 (23.01%) | 19 700 (22.79%) |
| 1000–1500 bp (% of total unigenes) | 8655 (15.13%) | 10 000 (16.10%) | 9412 (14.60%) | 13 244 (15.32%) |
| 1500–2000 bp (% of total unigenes) | 6076 (10.62%) | 7053 (11.35%) | 6469 (10.04%) | 9521 (11.01%) |
| 2000–2500 bp (% of total unigenes) | 3301 (5.77%) | 3936 (6.34%) | 3907 (6.06%) | 5786 (6.69%) |
| 2500–3000 bp (% of total unigenes) | 1871 (3.27%) | 2076 (3.34%) | 2303 (3.57%) | 3313 (3.83%) |
| ≥3000 bp (% of total unigenes) | 2311 (4.04%) | 2120 (3.41%) | 3726 (5.78%) | 4558 (5.27%) |
| N50 length of unigenes (bp) | 1603 | 1575 | 1737 | 1732 |
| Mean length of unigens (bp) | 1042 | 1044 | 1106 | 1121 |
Annotation of unigenes from the transcriptomes of C. heterosepala flowers (ChFlower), C. japonica flowers (CjFlower), C. japonica leaves (CjLeaf), and C. japonica flowers + leaves (CjTranscriptome).
| Characteristics | |||||
|---|---|---|---|---|---|
| ChFlower | CjFlower | CjLeaf | CjTranscriptome | ||
| Functional annotations | Nr | 37 970 (66.39%) | 42 032 (67.65%) | 40 760 (63.24%) | 55 405 (64.08%) |
| Swiss-Prot | 26 629 (46.56%) | 29 573 (47.60%) | 29 008 (45.01%) | 42 708 (49.40%) | |
| KEGG | 25 382 (44.38%) | 27 938 (44.97%) | 27 652 (42.91%) | 37 106 (42.92%) | |
| COG | 17 633 (30.83%) | 19 404 (31.23%) | 19 412 (30.12%) | 26 073 (30.16%) | |
| GO | 18 436 (32.23%) | 20 376 (32.80%) | 26 640 (41.34%) | 34 595 (40.01%) | |
| Nt | 32 432 (56.71%) | 35 556 (57.22%) | 35 794 (55.54%) | 48 039 (55.56%) | |
| ALL | 39 919 (69.80%) | 44 209 (71.15%) | 43 375 (67.30%) | 58 895 (68.12%) | |
| CDS annotations | Homolog | 37 111 (64.89%) | 41 059 (66.08%) | 40 029 (62.11%) | 54 155 (62.64%) |
| ESTscan | 1248 (2.18%) | 1390 (2.24%) | 1187 (1.84%) | 1607 (1.86%) | |
| ALL | 38 359 (67.07%) | 42 449 (68.32%) | 41 216 (63.95%) | 55 762 (64.50%) | |
Fig. 1Gene ontology categories of unigenes in flowers of Croomia heterosepala (ChFlower) and C. japonica (CjFlower).
Candidate orthologs likely under positive selection in the transcriptomes C. heterosepala flowers (ChFlower) and C. japonica flowers (CjFlower), respectively.
| Gene ID of orthologous genes | Ka/Ks value | Description | ||
|---|---|---|---|---|
| ChFlower | CjFlower | |||
| CL3372.Contig2 | CL7786.Contig2 | 1.904 | 0.0027 | uncharacterized protein |
| CL7139.Contig1 | CL4557.Contig4 | 667.825 | 0.0142 | RING-H2 finger protein ATL58-like |
| Unigene30940 | Unigene7233 | 5.860 | 0.0142 | hypothetical protein Csa_7G372310 |
| Unigene132 | CL5006.Contig1 | 2.262 | 0.0263 | extensin-2-like, partial |
| Unigene9837 | Unigene3498 | 4.382 | 0.0230 | protein RALF-like 19 |
| CL3076.Contig2 | CL1809.Contig7 | 1.647 | 0.0301 | probable disease resistance protein At1g12280 |
| CL428.Contig4 | CL1228.Contig3 | 0.503 | 0.0082 | uncharacterized protein LOC105055788 |
| CL1969.Contig4 | CL159.Contig3 | 0.556 | 0.0130 | uncharacterized ATP-dependent helicase C17A2.12 isoform X1 |
| CL3529.Contig5 | CL1483.Contig4 | 0.527 | 0.0171 | hypothetical protein VITISV_015618 |
| Unigene31095 | CL3236.Contig1 | 0.511 | 0.0204 | uncharacterized protein LOC105040955 |
| Unigene766 | CL1582.Contig5 | 0.522 | 0.0275 | polypyrimidine tract-binding protein homolog 2 |
| CL6003.Contig1 | CL4260.Contig1 | 0.635 | 0.0328 | pentatricopeptide repeat-containing protein At5g62370 |
| CL3248.Contig4 | CL7151.Contig1 | 0.571 | 0.0329 | protein STICHEL-like 2 |
| Unigene22720 | CL1751.Contig4 | 0.501 | 0.0350 | la-related protein 6 A |
| CL2431.Contig1 | CL6743.Contig2 | 0.682 | 0.0362 | protein LONGIFOLIA 1 isoform X1 |
| CL6630.Contig10 | CL6598.Contig2 | 0.529 | 0.0366 | uncharacterized protein LOC103701275 |
| CL4997.Contig1 | Unigene3839 | 0.657 | 0.0397 | phytochrome A isoform X1 |
| Unigene24430 | Unigene8493 | 0.507 | 0.0407 | ethylene-responsive transcription factor RAP2-13-like |
| Unigene11165 | Unigene22312 | 0.513 | 0.0495 | RNA-binding protein 47-like |
Fig. 2Total numbers of different EST-SSR motifs in the transcriptomes of C. japonica flowers + leaves (CjTranscriptome).
Characteristics of 10 polymorphic EST-SSRs.
| Locus | Primer pair | Repeat motif | Size range (bp) | PIC | HWE | |||
|---|---|---|---|---|---|---|---|---|
| Cro_10 | F: CATCAATATCCGAAACTCTCCAG | AG (2*12) | 118–146 | 16 | 0.733 | 0.879 | 0.866 | 0.0001 |
| R: CGCCATAAATACTCTCACGAGTC | ||||||||
| Cro_21 | F: GGGACATTATTGGATCACAACTT | GAC (3*5) | 125–143 | 5 | 0.454 | 0.695 | 0.637 | 0.0232 |
| R: GTGTATGGGTGGATAGTTTGTGG | ||||||||
| Cro_22 | F: GGGTGGAAGGAGATAGATGAGAT | GTG (3*6) | 104–116 | 8 | 0.271 | 0.303 | 0.289 | 0.1184 |
| R: CCACCTCCACTCTTAACACACTC | ||||||||
| Cro_30 | F: CTCTTCCCCATCTTGTTCACTC | GCT (3*6) | 89–110 | 8 | 0.433 | 0.81 | 0.784 | 0.0001 |
| R: GGATAATAATAACGCGAGAAGCC | ||||||||
| Cro_38 | F: GTCACAGAGCACCCATCTCC | GGC (3*8) | 104–146 | 10 | 0.421 | 0.669 | 0.638 | 0.0000 |
| R: GACAGAGACTCCGATCATCTCAT | ||||||||
| Cro_50 | F: AACGAAAAACAGAAAGCCAAAAG | GCT (3*5) | 95–155 | 12 | 0.758 | 0.725 | 0.694 | 0.0000 |
| R: GATCCCCAATTCTCGATCTATTC | ||||||||
| Cro_56 | F: CGACCCTCCTCTTCTCCTATTTA | CCG (3*5) | 86–116 | 8 | 0.275 | 0.632 | 0.581 | 0.0003 |
| R: GTGAGTAAAAGAACGACGCCAT | ||||||||
| Cro_61 | F: CAGCAGCAGCAGCAGCAGCAG | CAG (3*7) | 89–122 | 12 | 0.646 | 0.834 | 0.813 | 0.0000 |
| R: GCTGAGGTTGAGAGAGTTGGATA | ||||||||
| Cro_77 | F: ATCGCTCCACCAACAACAG | CAG (3*8) | 98–119 | 7 | 0.254 | 0.749 | 0.71 | 0.2675 |
| R: CTGCTAGGGTTTGTCACCATC | ||||||||
| Cro_78 | F: ATCCAAAACCGACTACCAAGATT | ACC (3*5) | 95–119 | 9 | 0.392 | 0.802 | 0.772 | 0.0068 |
| R: GGACTTCCGAAATGAAAAACTCT | ||||||||
| average | 9.5 | 0.464 | 0.710 | 0.678 | 0.0416 |
Note: N number of alleles per locus, H observed heterozygosity, H expected heterozygosity, PIC polymorphism information content, HWE Hardy–Weinberg equilibrium.
Tests of hypotheses of ⍵ variation among TBL gene clades.
| ⍵Z2 | ⍵C1a | ⍵p2 | ⍵Z1a | ⍵Z1b | ⍵A1 | ⍵SteTBL | ⍵P1 | ⍵AcoTBL | LnL | df | P | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M0 | 0.1463 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | −2207.81 | ||
| M1 | 0.0013 | 0.0001 | 0.0011 | 999 | 0.0001 | 0.1473 | 999 | 0.9159 | 0.0748 | −2137.08 | 131 | 0.2511 |
| M2 H1: Z2 | 2.0201 | 0.1463 | =⍵C1a | =⍵C1a | =⍵C1a | =⍵C1a | =⍵C1a | =⍵C1a | =⍵C1a | −2207.81 | 1 | 0.9936 |
| M2 H2: C1a | 0.1460 | 0.0657 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | −2207.81 | 1 | 0.9335 |
| M2 H3: P2 | 0.1486 | =⍵Z2 | 0.0690 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | −2207.66 | 1 | 0.5907 |
| = | = | = | = | = | = | = | ||||||
| M2 H5: Z1b | 0.14630 | =⍵Z2 | =⍵Z2 | =⍵Z2 | 2.3813 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | −2207.81 | 1 | 0.9935 |
| M2 H6: A1 | 0.1474 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | 0.0514 | =⍵Z2 | =⍵Z2 | =⍵Z2 | −2207.51 | 1 | 0.4394 |
| M2 H7: SteTBL | 0.1460 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | 0.1613 | =⍵Z2 | =⍵Z2 | −2207.81 | 1 | 0.9111 |
| = | = | = | = | = | = | = | ||||||
| M2 H9: AcoTBL | 0.1447 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | 0.2714 | −2207.68 | 1 | 0.6093 |
| M2 H10: Z1a&Z1b | 0.1463 | =⍵Z2 | =⍵Z2 | 1.8274 | =⍵Z1a | =⍵Z2 | =⍵Z2 | =⍵Z2 | =⍵Z2 | −2207.81 | 1 | 0.9906 |
| M2 H11: Z2&C1a&P2 | 999 | =⍵Z2 | =⍵Z2 | 0.1444 | 0.1444 | 0.1444 | 0.1444 | 0.1444 | 0.1444 | −2207.63 | 1 | 0.4455 |
Note: Z2 = ZinTBL2, C1a = ComTB1a, P2 = PoaTBL2, Z1a = ZinTBL1a, Z1b = ZinTBL1b, A1 = AlsTCP1 and P1=PoaTBL1.