| Literature DB >> 35501689 |
Ana Corrochano-Fraile1, Andrew Davie1, Stefano Carboni2,3, Michaël Bekaert1.
Abstract
BACKGROUND: Molluscs remain one significantly under-represented taxa amongst available genomic resources, despite being the second-largest animal phylum and the recent advances in genomes sequencing technologies and genome assembly techniques. With the present work, we want to contribute to the growing efforts by filling this gap, presenting a new high-quality reference genome for Mytilus edulis and investigating the evolutionary history within the Mytilidae family, in relation to other species in the class Bivalvia.Entities:
Keywords: Evolution; Mytilus edulis; Paleogenomics; Positive selection; Whole-genome duplication
Mesh:
Year: 2022 PMID: 35501689 PMCID: PMC9063065 DOI: 10.1186/s12864-022-08575-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Statistics of the genome assembly of M. edulis
| Category | Number/length |
|---|---|
| K-mer = 17 | |
| Estimated genome size | 1,010,184,781 nt |
| Estimated repeats | 688,190,885 nt |
| Estimated hetorizygosity | 3.69% |
| K-mer = 23 | |
| Estimated genome size | 1,096,306,163 nt |
| Estimated repeats | 437,569,400 nt |
| Estimated hetorizygosity | 4.84% |
| Number of contigs | 3339 |
| Total length | 1,827,085,763 nt |
| Total repeats | 1,029,206,554 nt |
| Observed hetorizygosity | 0.48% |
| Largest contig | 10,529,124 nt |
| N50 | 1,097,279 nt |
| GC | 32.17% |
| Read Mapped | 91.35% |
| Avg. coverage depth | 152x |
| Coverage over 10x | 99.99% |
| N’s per 100 kbp | 13.73 |
| BUSCO recovered | 98.9% |
| Predicted rRNA genes | 132 |
| Predicted protein coding genes | 69,246 |
Fig. 1Gene composition and annotation estimations. A BUSCO evaluation (Metazoa database; number of framework genes 954), 98.9% of the gene were recovered; B A five-way Venn diagram. The figure shows the unique and common genes displaying predicted protein sequence similarity with one or more databases (details in Supplementary Table S3); C Level 2 GO annotations using the gene ontology of assembled transcripts
Fig. 2Genome assembly. A M. edulis annotated mitochondrial genome; B Phylogenetic tree inferred from the mitochondrial gene
Fig. 3Ka and Ks analysis. A Distribution of the Ks values of the duplicate pairs in M. coruscus, M. edulis and M. galloprovincialis; B Bivalvia (class) ortholog Ks distribution and multiple WGDs. Combined Ks plot of the gene age distributions of seven species (see Tables S5 and S6). The median peaks for these plots are highlighted. Analyses of ortholog divergence indicated that these taxa diverged after their most recent WGDs
Fig. 4Mean Ka/Ks ratios for each orthologs cluster in the A M. edulis and M. galloprovincialis; B M. edulis and M. coruscus; C M. coruscus and M. galloprovincialis