| Literature DB >> 30351380 |
María Torres-Sánchez1, Christopher J Creevey2, Etienne Kornobis3, David J Gower4, Mark Wilkinson4, Diego San Mauro1.
Abstract
RNA sequencing (RNA-seq) has become one of the most powerful tools to unravel the genomic basis of biological adaptation and diversity. Although challenging, RNA-seq is particularly promising for research on non-model, secretive species that cannot be observed in nature easily and therefore remain comparatively understudied. Among such animals, the caecilians (order Gymnophiona) likely constitute the least known group of vertebrates, despite being an old and remarkably distinct lineage of amphibians. Here, we characterize multi-tissue transcriptomes for five species of caecilians that represent a broad level of diversity across the order. We identified vertebrate homologous elements of caecilian functional genes of varying tissue specificity that reveal a great number of unclassified gene families, especially for the skin. We annotated several protein domains for those unknown candidate gene families to investigate their function. We also conducted supertree analyses of a phylogenomic dataset of 1,955 candidate orthologous genes among five caecilian species and other major lineages of vertebrates, with the inferred tree being in agreement with current views of vertebrate evolution and systematics. Our study provides insights into the evolution of vertebrate protein-coding genes, and a basis for future research on the molecular elements underlying the particular biology and adaptations of caecilian amphibians.Entities:
Keywords: Gymnophiona; RNA-seq; gene families; phylogenomics; skin-specific genes
Mesh:
Substances:
Year: 2019 PMID: 30351380 PMCID: PMC6379020 DOI: 10.1093/dnares/dsy034
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Information on the species-specific caecilian transcriptome assemblies and their annotation
| Species | N | T | Contigs | % CEGs | Protein-coding genes | veNOG annotation | KVGF annotation |
|---|---|---|---|---|---|---|---|
| 1 | 10 | 142,502 | 97.18 | 27,384 | 18,368 | 12,937 | |
| 1 | 4 | 106,298 | 97.18 | 22,058 | 17,099 | 11,670 | |
| 2 | 9 | 146,348 | 97.58 | 26,302 | 18,487 | 12,719 | |
| 2 | 10 | 201,584 | 97.58 | 34,654 | 19,863 | 13,429 | |
| 1 | 7 | 134,394 | 97.58 | 27,603 | 18,302 | 12,293 |
N: number of specimens; T: number of tissues; % CEGs: percentage completeness core eukaryotic genes; veNOG annotation: number of genes with similarity match in veNOG database; KVGF annotation: number of known vertebrate gene families with caecilian genes.
Figure 1Numbers and tissue presence of the annotated genes found in caecilian transcriptomes. Genes were pooled for the five sampled species-specific transcriptomes and annotated in the 51 vertebrate species available on the EggNOG database, and mapped onto a vertebrate phylogeny inferred from the NCBI’s taxids (using phyloT and ITOL). For each vertebrate taxon, the number of caecilian annotated genes is subdivided to show the number of caecilian tissue types in which those genes are expressed.
Novel tissue-specific genes in caecilians
| Foregut | Heart | Kidney | Liver | Lung | Muscle | Skin | Spleen | Testis | Total | |
|---|---|---|---|---|---|---|---|---|---|---|
| Number of transcriptomes analysed | 4 | 2 | 5 | 7 | 4 | 3 | 11 | 2 | 2 | 40 |
| Known vertebrate gene families | 19 | 4 | 21 | 18 | 3 | 6 | 15 | 11 | 80 | 177 |
| Gene families shared with the other sampled non-caecilian amphibians | 6 | 2 | 6 | 15 | 2 | 3 | 22 | — | 25 | 81 |
| Candidate novel caecilian gene families | 32 | 12 | 40 | 44 | 9 | 27 | 108 | 8 | 142 | 422 |
| 0. 2671 | 0.7887 | 0.4639 | 1 | 1 | 0.2355 | 0.07605 | — |
The number of transcriptomes determined for each tissue, and the tissue-specific gene families (caecilian gene families that are already known vertebrate gene families, caecilian gene families shared with the other four sampled non-caecilian amphibians, and candidate caecilian-specific gene families) are shown. The last row shows the P-value (significant values in bold font) for Fisher’s exact test of the difference between the abundance of known vertebrate gene families and those of uncharacterized candidate novel caecilian gene families. Skin tissue includes skin samples from different parts of the body: skin and posterior skin samples, see Supplementary Table S1.