| Literature DB >> 27624472 |
Nathaniel K Jue1,2, Paola G Batta-Lona3,4, Sarah Trusiak1, Craig Obergfell1, Ann Bucklin3, Michael J O'Neill1, Rachel J O'Neill5.
Abstract
A preliminary genome sequence has been assembled for the Southern Ocean salp, Salpa thompsoni (Urochordata, Thaliacea). Despite the ecological importance of this species in Antarctic pelagic food webs and its potential role as an indicator of changing Southern Ocean ecosystems in response to climate change, no genomic resources are available for S. thompsoni or any closely related urochordate species. Using a multiple-platform, multiple-individual approach, we have produced a 318,767,936-bp genome sequence, covering >50% of the estimated 602 Mb (±173 Mb) genome size for S. thompsoni Using a nonredundant set of predicted proteins, >50% (16,823) of sequences showed significant homology to known proteins and ∼38% (12,151) of the total protein predictions were associated with Gene Ontology functional information. We have generated 109,958 SNP variant and 9,782 indel predictions for this species, serving as a resource for future phylogenomic and population genetic studies. Comparing the salp genome to available assemblies for four other urochordates, Botryllus schlosseri, Ciona intestinalis, Ciona savignyi and Oikopleura dioica, we found that S. thompsoni shares the previously estimated rapid rates of evolution for these species. High mutation rates are thus independent of genome size, suggesting that rates of evolution >1.5 times that observed for vertebrates are a broad taxonomic characteristic of urochordates. Tests for positive selection implemented in PAML revealed a small number of genes with sites undergoing rapid evolution, including genes involved in ribosome biogenesis and metabolic and immune process that may be reflective of both adaptation to polar, planktonic environments as well as the complex life history of the salps. Finally, we performed an initial survey of small RNAs, revealing the presence of known, conserved miRNAs, as well as novel miRNA genes; unique piRNAs; and mature miRNA signatures for varying developmental stages. Collectively, these resources provide a genomic foundation supporting S. thompsoni as a model species for further examination of the exceptional rates and patterns of genomic evolution shown by urochordates. Additionally, genomic data will allow for the development of molecular indicators of key life history events and processes and afford new understandings and predictions of impacts of climate change on this key species of Antarctic pelagic ecosystems.Entities:
Keywords: Antarctic ecosystem; Salpa thompsoni; Thaliacean genome; miRNA; urochordate
Mesh:
Substances:
Year: 2016 PMID: 27624472 PMCID: PMC5174732 DOI: 10.1093/gbe/evw215
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
F(A) Evolutionary position of Salpa thompsoni with respect to other groups within the Subphylum Tunicata, including Ciona spp. and Botryllus schlosseri within Class Ascidiacea, Oikopleura dioica within Class Appendicularia, as well as Subphyla Cephalochrodata and Vertebrata. Overall relationships among Eumetazoa are indicated to left. (B) Map of Antarctica indicating the collection area for S. thompsoni samples (boxed inset, red diamond) in the Southern Indian Ocean. (C) Reproductive life cycle of S. thompsoni.
General Assembly Information for Genomic and Transcriptomic Assemblies for Salpa thompsoni
| Genome | |
|---|---|
| Number of individuals | 4 |
| Total amount of 454 sequence | 1,052,340,564 |
| Total amount of ion proton sequence | 10,774,831,593 |
| Number of contigs (or predicted transcripts) | 478,293 |
| N50 contig size (bp) | 934 |
| Total length of contigs (bp) | 318,767,936 |
| Average depth of coverage per base | 28.6 |
| Total number of bases sequenced (Mb) | 11,825 |
| Number of predicted genes (strictly ab initio predicted genes) | 5,467 (26,415) |
| Number of BLAST hits of predicted genes/transcipts | 16,823 |
| Trinity de novo predicted transcripts | 217,849 |
Repetitive Element Counts and Base-Pair Coverage for the Salp Genome
| Type of element | Count | Total base pairs | Percent of genome |
|---|---|---|---|
| SINEs | 45,324 | 2,935,478 | 0.92 |
| ALUs | 39 | 3,429 | 0 |
| LINEs | 136,014 | 14,682,786 | 4.61 |
| LINE1 | 230 | 32,570 | 0.01 |
| LINE2 | 6,222 | 818,015 | 0.26 |
| L3/CR1 | 13,793 | 1,676,529 | 0.53 |
| LTR elements | 113,301 | 8,988,469 | 2.82 |
| ERV_class I | 53 | 5,784 | 0 |
| DNA elements | 649,419 | 56,060,847 | 17.59 |
| hAT-Charlie | 8,128 | 889,239 | 0.28 |
| TcMar-Tigger | 215 | 30,504 | 0.01 |
| Unclassified | 280,392 | 20,865,094 | 6.55 |
| Small RNAs | 511 | 49,308 | 0.02 |
| Satellites | 404 | 72,054 | 0.02 |
| Simple repeats | 64,031 | 2,454,620 | 0.77 |
| Low complexity | 10,916 | 502,724 | 0.16 |
Estimated Percentages of CEGMA Core Eukaryotic Genes Sequenced Either Completely or Partial in Each Urochorate Species Genome (Gray Rows) As Well As Other Select Species
| Species | Complete (%) | Partial (%) | Total (%) | Estimated genome size (Mb) | Number of predicted genes per Mb |
|---|---|---|---|---|---|
| 87.90 | 6.45 | 94.35 | 160 | 108 | |
| 83.06 | 12.50 | 95.56 | 190 | 106 | |
| 85.48 | 6.05 | 91.53 | 70 | 257 | |
| 78.63 | 16.53 | 95.16 | 600 | 52 | |
| 35.48 | 40.73 | 76.21 | 602 | 53 | |
| 76.61 | 21.78 | 98.39 | 450 | 61 | |
| 97.58 | 0.40 | 97.98 | 98 | 117 | |
| 37.50 | 35.48 | 72.98 | 2,100 | 10 | |
| 49.60 | 39.11 | 88.71 | 800 | 29 | |
| 16.53 | 4.44 | 20.97 | 530 | 21 |
FSalpa thompsoni gene annotations. (A) Annotation status of S. thompsoni predicted proteins delineated by validation method. (B) The number of ambiguous sequences, codon shifts (left) and total orthologues (right) with S. thompsoni are shown for each species (bottom). Phylogenetic relationships for each species are indicated. (C and D) Pie chart summary of level 2 gene ontology terms associated with biological process (C) and molecular function (D) categories for all annotated Annotation status of S. thompsoni predicted proteins.
F(A) Kernel density plots of ortholog sequence divergence from Cephalochordate Branchiostoma floridae for each Urchordate species (see legend) reveal similar evolutionary rates among Urochordate species, with the exception of Oikopleura dioica which appears to evolve more rapidly than the others. Orthology groups (n = 967) investigated for positive selection were used in this analysis. (B) Schematic of cyclophilin A protein sequence with percent sequence conservation at each site shown for other Urochordate and Cephalochorate species analyzed. Yellow box indicates site under positive selection; purple boxes indicate metal ion-binding site; blue arrow indicates propyl isomerase functional domain.
F(A) Size distribution of small RNAs post-trimming across embryo and adult solitary salp and adult female samples. (B) miRNA precursor fold predictions for each novel miRNA. Nucleotide is color coded by probability, the mature miRNA is indicated by a black line for each hairpin, and sample distribution for each is indicated. (C) Classifiable piRNA content for aggregate female stage, solitary adult and embryo samples, excluding tRNA, rRNA and unmasked (nonrepeat) sequences within the piRNA size range.
Mapping Statistics for Predicted Small RNAs to the Genome (Top) and Transcriptome (Bottom) with Different Mismatched Nucleotide Frequencies Allowed (0–2) (miRNAs)
| miRNA sample | Mismatches | ||
|---|---|---|---|
| 0 | 1 | 2 | |
| Genome | |||
| Aggregate embryo | 81.18% | 89.33% | 98.30% |
| Solitary | 70.77% | 80.02% | 93.57% |
| Aggregate female | 71.79% | 82.57% | 93.86% |
| Transcriptome | |||
| Aggregate embryo | 74.77% | 81.86% | 90.51% |
| Solitary | 71.68% | 79.96% | 89.13% |
| Aggregate female | 69.60% | 79.12% | 87.31% |
Mapping Statistics for Predicted Small RNAs to the Genome (Top) and Transcriptome (Bottom) with Different Mismatched Nucleotide Frequencies Allowed (0–3) (piRNAs)
| piRNA sample | Mismatches | |||
|---|---|---|---|---|
| 0 | 1 | 2 | 3 | |
| Genome | ||||
| Aggregate embryo | 73.89% | 95.06% | 96.85% | 97.28% |
| Solitary | 53.36% | 78.49% | 82.29% | 83.69% |
| Aggregate female | 59.20% | 84.75% | 88.95% | 90.65% |
| Transcriptome | ||||
| Aggregate Embryo | 73.71% | 94.70% | 96.24% | 96.59% |
| Solitary | 61.37% | 86.58% | 89.13% | 89.98% |
| Aggregate Female | 63.83% | 89.10% | 92.11% | 92.99% |