| Literature DB >> 36016991 |
Rebecca M Varney1, Meghan K Yap-Chiongco2, Nina T Mikkelsen3, Kevin M Kocot2,4.
Abstract
Mollusca is the second most species-rich phylum and includes animals as disparate as octopuses, clams, and chitons. Dozens of molluscan genomes are available, but only one representative of the subphylum Aculifera, the sister taxon to all other molluscs, has been sequenced to date, hindering comparative and evolutionary studies. To facilitate evolutionary studies across Mollusca, we sequenced the genome of a second aculiferan mollusc, the lepidopleurid chiton Hanleya hanleyi (Bean 1844), using a hybrid approach combining Oxford Nanopore and Illumina reads. After purging redundant haplotigs and removing contamination from this 1.3% heterozygous genome, we produced a 2.5 Gbp haploid assembly (>4X the size of the other chiton genome sequenced to date) with an N50 of 65.0 Kbp. Despite a fragmented assembly, the genome is rather complete (92.0% of BUSCOs detected; 79.4% complete plus 12.6% fragmented). Remarkably, the genome has the highest repeat content of any molluscan genome reported to date (>66%). Our gene annotation pipeline predicted 69,284 gene models (92.9% of BUSCOs detected; 81.8% complete plus 11.1% fragmented) of which 35,362 were supported by transcriptome and/or protein evidence. Phylogenomic analysis recovered Polyplacophora sister to all other sampled molluscs with maximal support. The Hanleya genome will be a valuable resource for studies of molluscan biology with diverse potential applications ranging from evolutionary and comparative genomics to molecular ecology. Copyright:Entities:
Keywords: Aculifera; Lepidopleurida; genome; repetitive DNA
Mesh:
Year: 2022 PMID: 36016991 PMCID: PMC9366239 DOI: 10.12688/f1000research.121706.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. A. Specimen of Hanleya hanleyi used for genome sequencing (ZMBN 146951). Scale bar = 8 mm. B. GenomeScope analysis of the paired-end Illumina data. The presence of two peaks indicates that Hanleya has a diploid genome, as expected. Heterozygosity is measured via k-mer distribution (presented at top of graph as “het”). C. Phylogenetic analysis of 2,331 nuclear protein-coding genes. Bootstrap support values below 100 are displayed at each node. Scale bar = 0.2 substitutions per site.
The number of repetitive elements of various types across several molluscan genomes as indicated by RepeatModeler.
| Repetitive element |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| buffer | 0 | 4 | 1 | 3 | 0 | 0 | 1 | 2 |
| DNA | 65 | 76 | 154 | 132 | 322 | 224 | 186 | 108 |
| LINE | 151 | 44 | 119 | 161 | 78 | 156 | 81 | 49 |
| SINE | 7 | 23 | 13 | 16 | 13 | 7 | 32 | 26 |
| LTR | 100 | 22 | 31 | 21 | 47 | 38 | 19 | 21 |
| RC | 4 | 10 | 13 | 0 | 85 | 52 | 60 | 3 |
| Satellite | 0 | 4 | 10 | 0 | 2 | 7 | 7 | 1 |