| Literature DB >> 35866575 |
Melissa K Drown1, Amanda N DeLiberto1, Nicole Flack2, Meghan Doyle1, Alexander G Westover1, John C Proefrock1, Sandra Heilshorn1, Evan D'Alessandro1, Douglas L Crawford1, Christopher Faulk3, Marjorie F Oleksiak1.
Abstract
Genetic data from nonmodel species can inform ecology and physiology, giving insight into a species' distribution and abundance as well as their responses to changing environments, all of which are important for species conservation and management. Moreover, reduced sequencing costs and improved long-read sequencing technology allows researchers to readily generate genomic resources for nonmodel species. Here, we apply Oxford Nanopore long-read sequencing and low-coverage (∼1x) whole genome short-read sequencing technology (Illumina) to assemble a genome and examine population genetics of an abundant tropical and subtropical fish, the hardhead silverside (Atherinomorus stipes). These fish are found in shallow coastal waters and are frequently included in ecological models because they serve as abundant prey for commercially and ecologically important species. Despite their importance in sub-tropical and tropical ecosystems, little is known about their population connectivity and genetic diversity. Our A. stipes genome assembly is about 1.2 Gb with comparable repetitive element content (∼47%), number of protein duplication events, and DNA methylation patterns to other teleost fish species. Among five sampled populations spanning 43 km of South Florida and the Florida Keys, we find little population structure suggesting high population connectivity.Entities:
Keywords: DNA methylation; genome assembly; mitogenome; population connectivity; population genetics; repetitive elements
Mesh:
Year: 2022 PMID: 35866575 PMCID: PMC9348626 DOI: 10.1093/gbe/evac111
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 4.065
Fig. 1.Atherinomorus stipes genome assembly. Snail plot summary of genome assembly statistics. From inside to outside: the central purple spiral shows log scaled scaffold count with white scale lines marking changes in order of magnitude; dark gray segments represent scaffold length distribution with plot radius scaled to the longest scaffold (red line); the orange segment represents N50 scaffold length; the light orange segment represents N90 scaffold length; outer blue and light blue rings show GC and AT percentages along the genome. Benchmarking Universal Single Copy Ortholog (BUSCO) score for the Actinopterygii database is in the upper right corner. To summarize and visualize genome assembly statistics, we used the software ‘assembly-stats’ (https://github.com/rjchallis/assembly-stats).
Fig. 2.Genome size and completeness among fishes. Genome size and completeness among fishes. The Atherinomorus stipes genome (orange, labeled Hardhead silverside) is comparable in size and completeness to other available fish genomes (black). All additional genomes were retrieved from the National Center for Biotechnology Information (NCBI), and BUSCO scores were calculated in command line with the Actinopterygii marker database. All species from supplementary table S1, Supplementary Material online are labeled.
Repetitive Content
| Element Classification | Number of Elements[ | Length Occupied | Percentage of Sequence |
|---|---|---|---|
| Retroelements | 295,824 | 95108093 | 7.86 |
| SINEs | 30,094 | 4597836 | 0.38 |
| Penelope | 5,670 | 1152119 | 0.1 |
| LINEs | 208,643 | 68631278 | 5.67 |
| CRE/SLACS | 0 | 0 | 0 |
| L2/CR1/Rex | 154,641 | 44523177 | 3.68 |
| R1/LOA/Jockey | 5,604 | 1176974 | 0.1 |
| R2/R4/NeSL | 3,571 | 1236956 | 0.1 |
| RTE/Bov-B | 16,904 | 6676757 | 0.55 |
| L1/CIN4 | 17,007 | 11548094 | 0.95 |
| LTR elements | 57,087 | 21878979 | 1.81 |
| BEL/Pao | 4,367 | 3723483 | 0.31 |
| Ty1/Copia | 193 | 172536 | 0.01 |
| Gypsy/DIRS1 | 21,734 | 10451783 | 0.86 |
| Retroviral | 6,851 | 3002280 | 0.25 |
| DNA transposons | 917,813 | 235992925 | 19.5 |
| hobo-Activator | 380,130 | 81335664 | 6.72 |
| Tc1-IS630-Pogo | 235,263 | 73509315 | 6.07 |
| En-Spm | 0 | 0 | 0 |
| MuDR-IS905 | 0 | 0 | 0 |
| PiggyBac | 11,725 | 2044062 | 0.17 |
| Tourist/Harbinger | 85,041 | 22637991 | 1.87 |
| Other (Mirage, P-element, Transib) | 26,034 | 8063098 | 0.67 |
| Rolling-circles | 13,505 | 4833905 | 0.4 |
| Unclassified | 903,970 | 216288773 | 17.87 |
|
| 547389791 | 45.22 | |
| Small RNA | 2,919 | 533198 | 0.04 |
| Satellites | 3,101 | 1397956 | 0.12 |
| Simple repeats | 385,531 | 20518992 | 1.7 |
| Low complexity | 62,906 | 4427812 | 0.37 |
Note.— Repetitive content in the Atherinomorus stipes genome identified using RepeatModler2 and classified using RepeatClassifier. In total, 3,164 families were identified covering 47.78% of the genome.
aMost repeats fragmented by insertions or deletions have been counted as one element.
Fig. 3.Atherinomorus stipes mitogenome. Total mitogenome size is 16,553 bp and contains 22 tRNAs, seven NADH subunits, four COX subunits, two ATP synthase subunits, and one D-loop control region. Mitogenome was annotated through the MITOchondrial genome annotation server (MITOS2) website with the start site manually set to COX1 and visualized with open vector editor.
Fig. 4.Atherinomorus stipes population map and principal component analysis. (A) Five sites were sampled in South Florida and the Florida Keys. Sites include (North to South) Crandon Park (green), Cutler Bay (teal), Homestead Park (blue), Turkey Point (purple), and Broad Key (red). Map generated in R. (B) Based on 266,731 nuclear SNPs among 75 individuals from five coastal Atherinomorus stipes sites, the sampled sites show no population structure. PCA was generated using PCAngsd and plotted in R.