| Literature DB >> 28132643 |
Nicholas J Marra1,2, Vincent P Richards3, Angela Early4, Steve M Bogdanowicz4, Paulina D Pavinski Bitar1, Michael J Stanhope5, Mahmood S Shivji6.
Abstract
BACKGROUND: Comparative genomic and/or transcriptomic analyses involving elasmobranchs remain limited, with genome level comparisons of the elasmobranch immune system to that of higher vertebrates, non-existent. This paper reports a comparative RNA-seq analysis of heart tissue from seven species, including four elasmobranchs and three teleosts, focusing on immunity, but concomitantly seeking to identify genetic similarities shared by the two lamnid sharks and the single billfish in our study, which could be linked to convergent evolution of regional endothermy.Entities:
Keywords: Adaptive immunity; Elasmobranchs; Gene ontology; Positive selection; RNA-seq; Regional endothermy
Mesh:
Year: 2017 PMID: 28132643 PMCID: PMC5278576 DOI: 10.1186/s12864-016-3411-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Descriptive statistics of quality filtered reads and the subsequent assembly/annotation of 7 heart transcriptomes
| Species | Filtered sequence reads | Trinity transcripts | n50 | ORFs | Annotated ORFs | Swiss-Prot Proteins | MCL Clusters | Complete coverage of CEGs | Total Coverage of CEGs |
|---|---|---|---|---|---|---|---|---|---|
| White shark | 23,288,212 | 174,288 | 2,294 | 46,825 | 34,986 | 11,705 | 7,021 | 95.2% | 98.0% |
| Great hammerhead | 22,977,042 | 179,367 | 2,649 | 49,992 | 37,700 | 10,991 | 6,535 | 96.4% | 98.4% |
| Mako | 10,802,234 | 101,534 | 779 | 30,354 | 22,491 | 11,135 | 6,980 | 80.2% | 90.7% |
| Ray | 12,921,908 | 104,957 | 2,004 | 31,672 | 23,265 | 10,651 | 6,437 | 98.0% | 99.2% |
| Swordfish | 16,524,630 | 144,990 | 2,778 | 63,771 | 50,494 | 11,717 | 6,788 | 98.0% | 100% |
| Surgeonfish | 8,983,274 | 81,222 | 2,864 | 42,302 | 34,302 | 9,530 | 5,999 | 91.1% | 94.4% |
| Hogfish | 7,665,032 | 64,264 | 2,168 | 28,403 | 24,077 | 10,412 | 6,184 | 87.9% | 94.0% |
Trinity transcripts refer to the initial number of transcripts in the Trinity assembly, which were then filtered for those containing the longest open reading frames. The translation of these transcripts was then annotated with BLASTP against the Swiss-Prot database and the number of hits to unique Swiss-Prot entries was recorded; if more than one transcript matched the same Swiss-Prot entry then the longest and most significant match was retained. A CEGMA analysis was conducted to evaluate the coverage of Core Eukaryotic Genes with complete coverage representing the proportion of CEGs with “complete” matches and the total coverage representing the percentage of CEGs that had complete or partial matches in the transcriptome
Fig. 1a Venn diagram of the MCLBlastLine sequence clusters present in elasmobranchs and how they are distributed among the four elasmobranch species. b Venn diagram of the MCLBlastLINE sequence clusters present in teleosts and how they are distributed among the three teleost species
Fig. 2Venn diagram of the MCLBlastLINE sequence clusters shared between teleosts and elasmobranchs (intersection of the diagram) as well as those unique to each of the groups
Fig. 3a Histogram of the most specific Biological Process GO terms that were found to be significantly enriched in elasmobranchs. Enrichment was judged significant by a Fisher’s exact test and a FDR < .05 after filtering for the most specific terms. b Cellular Component GO terms enriched in elasmobranchs after filtering for the most specific terms
Fig. 4a Histogram of the most specific Biological Process GO terms that were found to be significantly enriched in teleosts by a Fisher’s exact test at an FDR < .05 and after filtering for the most specific terms. b Cellular Component GO terms enriched in teleosts after filtering for the most specific terms. c Molecular Function GO terms enriched in teleosts after filtering for the most specific terms
Fig. 5Results of Fisher’s exact test showing the enrichment of GO terms in endotherms. This includes all terms (BP, CC, and MF categories) and at an FDR < .05
Fig. 6Histogram of the number of candidate immune genes present in each of the species transcriptomes. Candidates were identified as genes having immune functions in model species and listed on InnateDB or the Immunome knowledge base