| Literature DB >> 33552749 |
Tatiana Arias1,2,3, Diego Mauricio Riaño-Pachón4, Verónica S Di Stilio2.
Abstract
PREMISE: Multiple transitions from insect to wind pollination are associated with polyploidy and unisexual flowers in Thalictrum (Ranunculaceae), yet the underlying genetics remains unknown. We generated a draft genome of Thalictrum thalictroides, a representative of a clade with ancestral floral traits (diploid, hermaphrodite, and insect pollinated) and a model for functional studies. Floral transcriptomes of T. thalictroides and of wind-pollinated, andromonoecious T. hernandezii are presented as a resource to facilitate candidate gene discovery in flowers with different sexual and pollination systems.Entities:
Keywords: Ranunculaceae; Thalictrum hernandezii; Thalictrum thalictroides; draft genome; floral transcriptome; pollination syndrome; sexual system
Year: 2021 PMID: 33552749 PMCID: PMC7845765 DOI: 10.1002/aps3.11407
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Figure 1Floral phenotypes of Thalictrum study species. Inflorescence of hermaphroditic T. thalictroides with hermaphrodite open flowers (A) and andromonoecious T. hernandezii with staminate buds () and hermaphrodite open flowers () occurring together in an inflorescence (B); inset, detail of young staminate flower. Scale bar = 1 cm.
Summary statistics for de novo genome assemblies of Thalictrum thalictroides.
| Metric | Consensus | WT964 | WT478 |
|---|---|---|---|
| No. of contigs | 44,860 | 41,892 | 41,384 |
| Total length (bp) | 243,091,176 | 205,112,749 | 167,308,766 |
| Largest contig (bp) | 119,890 | 80,531 | 57,549 |
| GC (%) | 36.23 | 36.29 | 36.61 |
| N50 (bp) | 12,761 | 10,189 | 8017 |
| L50 | 4281 | 4739 | 4875 |
Assembly statistics were computed using Quast version 5.0.2 (Gurevich et al., 2013).
Sequenced accessions from two different individuals.
Figure 2Proportion of Benchmarking Universal Single‐Copy Orthologs (BUSCOs) for the Thalictrum thalictroides genome (total number of BUSCOs = 1440 genes in Embryophyta from OrthoDB version 9.0). Data are shown for the two sequenced accessions (WT964 and WT478) and the consensus.
Simple sequence repeat (SSR) motif distribution in the Thalictrum thalictroides genome and the T. thalictroides and T. hernandezii (hermaphrodite and staminate combined) floral transcriptomes.
| SSR properties |
|
|
|
|---|---|---|---|
| Total no. of identified SSRs | 65,651 | 11,826 | 18,631 |
| No. of SSRs present in tandem | 3582 | 381 | 538 |
| No. of contigs with SSR | 22,867 | 9844 | 16,512 |
| No. of contigs with >1 SSR | 12,596 | 1634 | 1898 |
SSR sequences are available at https://doi.org/10.6084/m9.figshare.11984370.v8.
With abundance ≥5% of total SSRs in the genome.
Figure 3The relative abundance of different families of transcription‐associated proteins (TAPs) in the floral transcriptomes of Thalictrum thalictroides (Tt) and T. hernandezii (Th_H, hermaphrodite and Th_S, staminate). Transcripts >0.1 TPM (transcripts per million) are included. Statistically significant differences between samples (chi‐square test) are marked with black (P < 0.01) or red (P < 0.05) asterisks. TFF: transcription factor family, OTR: other transcriptional regulator family.
Properties of de novo floral transcriptome assemblies of Thalictrum thalictroides and T. hernandezii.
| Metrics | Parameter |
|
|
|---|---|---|---|
| Contigs | No. of sequences (transcripts) | 54,104 | 124,707 |
| Largest sequence (bp) | 14,923 | 11,578 | |
| No. of bases | 72,336,777 | 150,432,265 | |
| Mean sequence length | 1337.0 | 1206.3 | |
| No. of sequences >1 Kbp | 29,920 | 60,919 | |
| N50 (bp) | 1817 | 1703 | |
| % GC | 40 | 40 | |
| BUSCO | Complete | 80.1% | 82.9% |
| Single copy | 42.3% | 16.5% | |
| Duplicated | 37.8% | 66.4% | |
| Fragmented | 7.8% | 7.6% | |
| Missing | 12.1% | 9.5% | |
| Read mapping | No. of paired‐ends (fragments) | 40,504,757 | 65,712,208 |
| Proportion of paired‐end fragments mapping back to the assembly |
87.92% (Salmon) 88.85% overall mapping 84.25% concordantly mapping (Bowtie2) |
89.64% (Salmon) 90.72% overall mapping 87.03% concordantly mapping (Bowtie2) |
Comparative metrics of de novo floral transcriptome assemblies of Thalictrum thalictroides and T. hernandezii.
|
|
| |||
|---|---|---|---|---|
| Parameter | Reference | Reference | Reference | Reference |
| No. of CRBB | 40,771 | 46,744 | 97,550 | 90,591 |
| Proportion of CRBB contigs | 0.75357 | 0.86397 | 0.78223 | 0.72643 |
| No. of reciprocal best hits per reference | 0.99289 | 1.31451 | 2.74325 | 2.20615 |
| No. of CRBB reference transcripts | 18,877 | 19,387 | 18,323 | 21,224 |
| Proportion of CRBB reference transcripts | 0.45971 | 0.54519 | 0.51527 | 0.51686 |
Conditional reciprocal best BLAST algorithm, implemented in TransRate (Smith‐Unna et al., 2016).
Figure 4The number of shared and unique orthogroups or transcripts among the Thalictrum data sets in this study, with examples of candidate genes (in red based on Arabidopsis, see text for details). The total number of orthogroups per data set is indicated in parentheses. Representative flower pictures are shown for T. thalictroides (TTHA), insect‐pollinated with hermaphrodite flowers, and T. hernandezii (THER), wind‐pollinated with hermaphrodite (THER_HERM) and staminate (THER_MALE) flowers. (A) Interspecies comparison of floral transcriptomes (TTHA_RNA and THER_RNA) and validation against the T. thalictroides draft genome (TTHA). Area “a” represents T. thalictroides‐exclusive orthogroups found in the draft genome; “b” represents T. hernandezii‐exclusive orthogroups (both floral types combined) found in the draft genome. (B) Intraspecies comparison of transcripts of two floral types in T. hernandezii (male and hermaphrodite). Area “c” represents male flower–specific transcripts; “d” represents hermaphrodite flower–specific transcripts.
| Species | DNA code | Usage | Voucher accession no. (Herbarium) | Locality | SRA accession no. |
|---|---|---|---|---|---|
|
| ThWT441 | RNA‐seq analysis | V. Di Stilio 119 (WTU) | Cultivated from wild seed, Huitzila, Mexico Liston1215 | SRR6869420, SRR6869419 |
|
| TtWT478 |
Whole genome assembly RNA‐seq analysis | V. Di Stilio 124 (WTU) | Cultivated from nursery material (MI, USA) |
SRR6869426, SRR6869425, SRR6869418 Transcriptome: SRR6869422 GBVZ |
|
| TtWT964 | Whole genome assembly | Unvouchered | Cultivated from nursery material (NC, USA) | SRR6869424, SRR6869423, SRR6869421 |
| Sample | Library insert size | No. of sequenced fragments | Clean reads (Gbp) | SRA accession no. | ||||
|---|---|---|---|---|---|---|---|---|
| Raw reads | Trimmed reads | Percent (from raw) after Trimming | Clean reads | Percent (from raw) after cleaning | ||||
| WT964 | 170 | 11,429,928 | 10,146,420 | 88.8 | 9,758,045 | 85.4 | 1.684 | SRX3823744 |
| WT964 | 500 | 11,887,778 | 9,155,669 | 77.0 | 8,910,174 | 75.0 | 1.497 | SRX3823741 |
| WT964 | 800 | 11,826,149 | 9,253,079 | 78.2 | 9,130,960 | 77.2 | 1.524 | SRX3823742 |
| WT478 | 170 | 11,484,802 | 10,191,821 | 88.7 | 6,601,641 | 57.5 | 1.290 | SRX3823739 |
| WT478 | 500 | 11,477,882 | 10,414,934 | 90.7 | 7,727,941 | 67.3 | 1.502 | SRX3823740 |
| WT478 | 800 | 11,721,131 | 9,012,622 | 76.9 | 6,977,136 | 59.5 | 1.350 | SRX3823747 |
Clean reads are the number of reads remaining after quality‐trimming and contaminant removal (see Appendices S1 and S2 for a list of contaminants).