| Literature DB >> 31112613 |
Bruno Louro1, Gianluca De Moro1, Carlos Garcia1, Cymon J Cox1, Ana Veríssimo2, Stephen J Sabatino2, António M Santos2, Adelino V M Canário1.
Abstract
BACKGROUND: The European sardine (Sardina pilchardus Walbaum, 1792) is culturally and economically important throughout its distribution. Monitoring studies of sardine populations report an alarming decrease in stocks due to overfishing and environmental change, which has resulted in historically low captures along the Iberian Atlantic coast. Important biological and ecological features such as population diversity, structure, and migratory patterns can be addressed with the development and use of genomics resources.Entities:
Keywords: European sardine; Sardina; genome; haplotype; single-nucleotide polymorphism; transcriptome
Mesh:
Substances:
Year: 2019 PMID: 31112613 PMCID: PMC6528745 DOI: 10.1093/gigascience/giz059
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:The European sardine, Sardina pilchardus (photo credit ©Eduardo Soares, IPMA)
Figure 2:The histogram of the 23-mer depth distribution was plotted in GenomeScope [17] to estimate genome size (907 Mb), repeat content (40.7%), and heterozygosity level (1.43%). Two k-mer coverage peaks are observed at 28× and 50×.
Descriptive metrics, estimated by Supernova, of the input sequence data for the de novo genome assembly
| Metric | Value |
| Number of paired reads used | 607.36 million |
| Mean read length after trimming | 138.50 bp |
| Median insert size | 345 bp |
| Weighted mean DNA molecule size | 46.41 kb |
| N50 reads per barcode | 612 |
| Raw coverage | 78.35× |
| Effective read coverage | 52.91× |
| Mean distance between heterozygous SNPs | 197 bp |
Descriptive metrics of sardine genome assemblies
| Scaffolds | Spil_haploid1 | Spil_haploid2 | SP_G | UP_Spi |
|---|---|---|---|---|
| Largest | 6.835 Mb | 6.850 Mb | 6.843 Mb | 0.285 Mb |
| Number | ||||
| ≥100 kb | 874 | 872 | 890 | 309 |
| ≥10 kb | 8,301 | 8,298 | 8,760 | 18,863 |
| ≥1 kb (total) | 117,698 | 117,698 | 117,259 | 44,627 |
| L50/N50 | ||||
| ≥100 kb | 135/906.0 kb | 134/925.2 kb | 137/899.1 kb | 130/122.5 kb |
| ≥10 kb | 242/572.7 kb | 242/568.2 kb | 254/552.2 kb | 4,594/32.9 kb |
| ≥1 kb ( | 859/102.9 kb | 860/102.7 kb | 903/96.6 kb | 6,797/25.6 kb |
| LG50/NG50 | 935/87.7 kb | 939/87.1 kb | 905/96.6 kb | 15,422/12.6 kb |
| Assembly size | ||||
| ≥100 kb | 469.371 Mb | 468.838 Mb | 473.550 Mb | 39.274 Mb |
| ≥10 kb | 622.165 Mb | 621.688 Mb | 636.491 Mb | 513.719 Mb |
| ≥1 kb ( | 935.548 Mb | 935.082 Mb | 949.618 Mb | 641.169 Mb |
| Guanine-cytosine content | 43.9% | 43.9% | 43.9% | 44.5% |
| Nucleotides per 100 kb | 12,955 | 12,961 | 12,834 | 169 |
SP_haploid1/SP_haploid2: haploid genomes (UOTT01000000 and UOTU01000000). SP_G: consensus genome (NCBI representative genome assembly, GCA_900 499 035.1). UP_Spi: Illumina paired-end assembled genome from [20] (GCA_0 036 04335.1). Values for scaffolds ≥1, 10, and 100 kb are presented in separate rows.
Figure 3:Venn diagram representing paralogous and orthologous groups between sardine (S. pilchardus), blind cave fish (A. mexicanus), zebrafish (D. rerio), and herring (C. harengus) obtained with OrthoFinder and plotted with Jvenn [40]. Orthogroups of singleton genes are showed in parentheses.
Summary statistics of transcriptome data for the 11 tissues
| Tissue | Paired raw reads | Contigs | CDS deduced (%) | SwissProt annotated (%) | Accession No. |
|---|---|---|---|---|---|
| Gill/branchial arch | 29,783,994 | 62,526 | 29.3 | 38.6 | ERS2629269 |
| Liver | 33,479,471 | 53,104 | 29.7 | 40.1 | ERS2629273 |
| Spleen | 25,634,530 | 66,419 | 31.6 | 40.4 | ERS2629276 |
| Ovary | 22 241,327 | 42,521 | 38.1 | 42.5 | ERS2629270 |
| Midgut | 28,016,117 | 75,782 | 31.0 | 39.5 | ERS2629274 |
| White muscle | 24,409,160 | 49,266 | 35.4 | 44.8 | ERS2629277 |
| Red muscle | 30,653,774 | 55,873 | 30.3 | 42.1 | ERS2629275 |
| Kidney | 27,861,879 | 59,495 | 30.8 | 37.3 | ERS2629272 |
| Head kidney | 25,280,960 | 65,888 | 32.2 | 38.4 | ERS2629271 |
| Brain/pituitary | 24,467,352 | 75,620 | 24.5 | 37.1 | ERS2629267 |
| Caudal fin (skin/cartilage/bone) | 26,342,097 | 64,832 | 23.9 | 38.0 | ERS2629268 |
| All tissues | 298,170,661 | 170,478 | 15.9 | 25.5 | ERS2629362 |
Figure 4:Optimal maximum likelihood tree (−ln likelihood: 146,565.6438) under a best-fitting JTT+Г4+F substitution model of 97 concatenated proteins. Maximum likelihood bootstrap support values are given below or to the right of nodes. Scale bar represents mean numbers of substitutions per site. The Actinopterygii ingroup was rooted to 2 outgroup taxa, namely, Petromyzon marinus (lamprey) and Latimeria chalumnae (coelacanth) (not shown).