| Literature DB >> 36040059 |
Daniel J Leite1,2, Laura Piovani2, Maximilian J Telford2.
Abstract
Polyclad flatworms are widely thought to be one of the least derived of the flatworm classes and, as such, are well placed to investigate evolutionary and developmental features such as spiral cleavage and larval diversification lost in other platyhelminths. Prostheceraeus crozieri, (formerly Maritigrella crozieri), is an emerging model polyclad flatworm that already has some useful transcriptome data but, to date, no sequenced genome. We have used high molecular weight DNA extraction and long-read PacBio sequencing to assemble the highly repetitive (67.9%) P. crozieri genome (2.07 Gb). We have annotated 43,325 genes, with 89.7% BUSCO completeness. Perhaps reflecting its large genome, introns were considerably larger than other free-living flatworms, but evidence of abundant transposable elements suggests genome expansion has been principally via transposable elements activity. This genome resource will be of great use for future developmental and phylogenomic research.Entities:
Keywords: zzm321990 Prostheceraeus crozierizzm321990 ; homeobox; polyclad; tiger flatworm
Mesh:
Substances:
Year: 2022 PMID: 36040059 PMCID: PMC9469890 DOI: 10.1093/gbe/evac133
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 4.065
Fig. 1.Genome stats, gene annotation characteristics, gene ortholog, and Pfam comparisons to other free-living flatworms. (A) Scaffold size frequency of initial (red) and final assembly (blue) and the scaffold sizes removed (green) during duplicate scaffold removal. (B) Kmer frequency coverage reveals two peaks, suggesting diploidy. (C) Repeat sizes in the soft-masked genome show many short and long repeats (>10 kb = red dash line). (D) Exon and (E) intron sizes and GC% distribution reveal large intron sizes but comparable GC% to other free-living flatworms. Exons/introns were sorted by GC %, split into bins of 1,000 genes, and the average length of each bin was measured. (F) Orthofinder detected 23,378 orthogroups of which 4,590 (19.6%) were shared between all four flatworm species. (G) Of the total 5,428 Pfams, 3,233 (59.6%) were shared between all four species. (H) The most abundant Pfam domains ordered by the total of all four species. Mlig in blue shows different distribution relating to possible high gene duplication. (I) The top 20 families in (B) reveal that Prostheceraeus crozieri has a high occurrence of retroviral/transposable element functioning Pfams. Pcro, P. crozieri (blue); Smed, Schmidtea mediterranea (purple); Djap, Dugesia japonica (blue); and Mlig, Macrostomum lignano (green).
Genome Assembly, Repeat Content, Annotation and BUSCO Metrics
| Assembly size (bp) | 2,065,465,794 |
| Scaffolds | 17,074 |
| N50 (bp) | 292,050 |
| Largest scaffold (bp) | 2,612,272 |
|
| 12,175 |
| GC (%) | 37.64 |
| Protein-coding genes | 43,325 |
| BUSCO (%) | C:89.7 (S:87.1, D:2.6), F:5.2, M:5.1 |
| Total repeats (%) | 67.9 |