| Literature DB >> 32912303 |
Stefanie Hartmann1, Michaela Preick2, Silke Abelt2, André Scheffel3, Michael Hofreiter2.
Abstract
OBJECTIVE: Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira's lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.Entities:
Keywords: Carnivorous plant; Clethra arborea; Genome assembly; Orthologous Matrix (OMA) Project; Phylogenomics; Roridula gorgonias; Transcriptome assembly
Mesh:
Year: 2020 PMID: 32912303 PMCID: PMC7488092 DOI: 10.1186/s13104-020-05254-4
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Summary statistics for gene total numbers and lengths of the full data sets used for the inference of gene families
| Min | Median | 3rd Qu | Max | Total | Type | Reference | |
|---|---|---|---|---|---|---|---|
| 4 | 353 | 535 | 5453 | 34,015 | g | [ | |
| 2 | 268 | 438 | 5498 | 42,988 | g | [ | |
| 14 | 324 | 520 | 4973 | 31,129 | g | This study | |
| 29 | 325 | 515 | 5786 | 76,698 | g | [ | |
| 29 | 399 | 601 | 5453 | 44,655 | g | [ | |
| 23 | 366 | 544 | 4732 | 18,301 | g | [ | |
| 49 | 375 | 605 | 5347 | 28,441 | g | [ | |
| 21 | 325 | 509 | 5314 | 22,655 | g | This study | |
| 39 | 108 | 155 | 447 | 22,690 | t | [ | |
| 41 | 111 | 158 | 831 | 18,748 | t | [ | |
| 40 | 118 | 212 | 2061 | 34,789 | t | [ | |
| 40 | 74 | 107 | 4354 | 219,698 | t | [ |
Summary statistics for scaffolds and predicted genes. Metrics are listed for scaffolds of at least 1 kbp as determined using the quast software
| metric | ||
|---|---|---|
| Total length (>= 10 kbp) | 235,721,577 | 437,604,713 |
| Total length (>= 25 kbp) | 200,375,750 | 384,820,916 |
| Total length (>= 50 kbp) | 125,205,191 | 312,317,250 |
| # contigs | 20,623 | 29,265 |
| Largest contig | 191,047 | 616,539 |
| Total length | 284,273,507 | 511,026,369 |
| GC (%) | 36.60 | 38.50 |
| N50 | 46,982 | 67,174 |
| # N’s per 100 kbp | 734.67 | 2,082.52 |
| Total BUSCO groups searched | 2121 | 2121 |
| Complete BUSCOs | 1787 (84.2%) | 1899 (89.5%) |
| Complete & single-copy BUSCOs | 1712 (80.7%) | 1744 (82.2%) |
| Complete & duplicated BUSCOs | 75 (3.5%) | 155 (7.3%) |
| Fragmented BUSCOs | 203 (9.6%) | 135 (6.4%) |
| Missing BUSCOs | 131 (6.2%) | 87 (4.1%) |
BUSCO statistics are based on 2,121 single-copy orthologs of eudicots for the predicted protein sequences of R. gorgonias and C. arborea
Fig. 1The most frequently observed ML topologies. Of 2,434 selected OMA families, 814, 218, and 176 trees resulted in the topologies shown in A, B, and C, respectively