| Literature DB >> 34956332 |
Theodoros Danis1,2, Vasileios Papadogiannis2, Alexandros Tsakogiannis2, Jon B Kristoffersen2, Daniel Golani3, Dimitris Tsaparis2, Aspasia Sterioti2, Panagiotis Kasapidis2, Georgios Kotoulas2, Antonios Magoulas2, Costas S Tsigenopoulos2, Tereza Manousaki2.
Abstract
The Tetraodontidae family encompasses several species which attract scientific interest in terms of their ecology and evolution. The silver-cheeked toadfish (Lagocephalus sceleratus) is a well-known "invasive sprinter" that has invaded and spread, in less than a decade, throughout the Eastern and part of the Western Mediterranean Sea from the Red Sea through the Suez Canal. In this study, we built and analysed the first near-chromosome level genome assembly of L. sceleratus and explored its evolutionary landscape. Through a phylogenomic analysis, we positioned L. sceleratus closer to T. nigroviridis, compared to other members of the family, while gene family evolution analysis revealed that genes associated with the immune response have experienced rapid expansion, providing a genetic basis for studying how L. sceleratus is able to achieve highly successful colonisation. Moreover, we found that voltage-gated sodium channel (NaV 1.4) mutations previously connected to tetrodotoxin resistance in other pufferfishes are not found in L. sceleratus, highlighting the complex evolution of this trait. The high-quality genome assembly built here is expected to set the ground for future studies on the species biology.Entities:
Keywords: gene families; genome evolution; phylogenomic analyses; silver-cheeked toadfish; synteny; tetrodotoxin
Year: 2021 PMID: 34956332 PMCID: PMC8692874 DOI: 10.3389/fgene.2021.790850
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Lagocephalus sceleratus genome assembly pipeline.
Summary of sequencing results.
| Sequencing technology | Raw reads | Quality-controlled reads | Coverage |
| Illumina | 57,303,140 | 44,475,382 | 38 x |
| MinION | 552,476 | 484,152 | 20 x |
Polished genome assembly statistics and completeness.
| Total contigs | 235 |
| Total contig sequence | 373,851,781 bp |
| GC (%) | 46.7 |
| Contig N50 | 11,297,640 bp |
| Contig N75 | 6,386,954 bp |
| Longest contig | 17,085,954 bp |
| Contig L50 | 14 |
| Contig L75 | 25 |
| BUSCO completeness score | |
| Complete | 96.60% |
| Single | 95.70% |
| Duplicated | 0.90% |
| Fragmented | 1.00% |
| Missing | 2.40% |
| Total number of Actinopterygii orthologs | 3,640 (97.60%) |
Repeat elements annotation statistics.
| Repetitive elements | Number of elements | Length occupied (bp) | Percentage of sequence (%) |
| Retroelements | 204,205 | 29,198,482 | 7.81 |
| SINEs | 5,447 | 743,745 | 0.20 |
| Penelope | 3,627 | 2,572,446 | 0.69 |
| LINEs | 171,108 | 20,706,770 | 5.54 |
| CRE/SLACS | 0 | 0 | 0.00 |
| L2/CR1/Rex | 45,521 | 7,208,493 | 1.93 |
| R1/LOA/Jockey | 573 | 179,409 | 0.05 |
| R2/R4/NeSL | 48,675 | 3,708,881 | 0.99 |
| RTE/Bov-B | 47,510 | 3,856,263 | 1.03 |
| L1/CIN4 | 16,080 | 2,144,919 | 0.57 |
| LTR elements | 27,650 | 7,747,967 | 2.07 |
| BEL/Pao | 410 | 259,661 | 0.07 |
| Ty1/Copia | 235 | 96,868 | 0.03 |
| Gypsy/DIRS1 | 13,595 | 3,917,10 | 1.05 |
| Retroviral | 4,582 | 1,298,868 | 0.35 |
| DNA transposons | 59,679 | 8,587,997 | 2.30 |
| hobo-Activator | 25,182 | 2,911,788 | 0.78 |
| Tc1-IS630-Pogo | 14,227 | 2,885,165 | 0.77 |
| En-Spm | 0 | 0 | 0.00 |
| MuDR-IS905 | 0 | 0 | 0.00 |
| PiggyBac | 644 | 131,052 | 0.04 |
| Tourist/Harbinger | 2,399 | 392,537 | 0.10 |
| Other (Mirage, P-element, Transib) | 99 | 5,804 | 0.00 |
| Unclassified | 113,777 | 1,126,996 | 5.97 |
| Small RNA | 0 | 0 | 0.00 |
| Satellites | 20 | 85,955 | 0.02 |
| Simple repeats | 9,305 | 1,126,996 | 0.30 |
Summary statistics of functional annotated protein-coding genes.
| Type | Number | Percent (%) |
| Blast | 18,805 | 88 |
| InterProScan | 20,347 | 96 |
| EggNog-Mapper | 17,849 | 84 |
| Predicted genes | 20,578 | 97 |
| Total Genes | 21,251 | |
FIGURE 2Maximum likelihood tree using JTT + I + G4 + F model and 100 bootstrap replicates. The spotted gar L. oculatus was used as outgroup.
FIGURE 3Gene family gain-and-loss analysis, including the number of gained gene families (green) and lost gene families (red). The stacked barplot on the right indicates the gains and losses per species.
FIGURE 4Synteny plot of pairwise whole genome alignment of L. sceleratus against T. nigroviridis. The plot illustrates the contigs of the L. sceleratus aligned to T. nigroviridis chromosomes (y-axis), grouped by a specific color which are represented on the legend right to the plot.
FIGURE 5Circos plot illustrating syntenic relationships between L. sceleratus contigs (right) against T. nigroviridis chromosomes (left), based on one-to-one orthologous genes. Ribbons link orthologous genes between the two species and colors represent the different contigs of L. sceleratus.
FIGURE 6Multiple alignment plot of the four Ion Transport domain sequences previously studied by Venkatesh et al. (2005), from Latimeria chalumnae (NaV1.4), Lepisosteus oculatus (NaV1.4), Callorhinchus milii (NaV1.4), T. pardalis (NaV1.4a), T. rubripes (NaV1.4a, NaV1.4b), T. nigroviridis (NaV1.4a, NaV1.4b), L. sceleratus (NaV1.4a, NaV1.4b), M. mola (NaV1.4a, NaV1.4b), S. aurata (NaV1.4a, NaV1.4b), O. latipes (NaV1.4a, NaV1.4b), M. morhua (NaV1.4a, NaV1.4b), and D. rerio (NaV1.4a, NaV1.4b). Residues outlined in red, have been previously associated with TTX resistance in T. rubripes and T. nigroviridis by Venkatesh et al. (2005).