| Literature DB >> 30498627 |
Roberto Carlos Frias-Soler1,2, Lilian Villarín Pildaín1, Agnes Hotz-Wagenblatt3, Jonas Kolibius1, Franz Bairlein2, Michael Wink1.
Abstract
We have sequenced a partial transcriptome of the Northern Wheatear (Oenanthe oenanthe), a species with one of the longest migrations on Earth. The transcriptome was constructed de novo using RNA-Seq sequence data from the pooled mRNA of six different tissues: brain, muscle, intestine, liver, adipose tissue and skin. The samples came from nine captive-bred wheatears collected at three different stages of the endogenous autumn migratory period: (1) lean birds prior the onset of migration, (2) during the fattening stage and (3) individuals at their migratory body mass plateau, when they have almost doubled their lean body mass. The sample structure used to build up the transcriptome of the Northern Wheatears concerning tissue composition and time guarantees the future survey of the regulatory genes involved in the development of the migratory phenotype. Through the pre-migratory period, birds accomplish outstanding physical and behavioural changes that involve all organ systems. Nevertheless, the molecular mechanisms through which birds synchronize and control hyperphagia, fattening, restlessness increase, immunity boosting and tuning the muscles for such endurance flight are still largely unknown. The use of RNA-Seq has emerged as a powerful tool to analyse complex traits on a broad scale, and we believe it can help to characterize the migratory phenotype of wheatears at an unprecedented level. The primary challenge to conduct quantitative transcriptomic studies in non-model species is the availability of a reference transcriptome, which we have constructed and described in this paper. The cDNA was sequenced by pyrosequencing using the Genome Sequencer Roche GS FLX System; with single paired-end reads of about 400 bp. We estimate the total number of genes at 15,640, of which 67% could be annotated using Turkey and Zebra Finch genomes, or protein sequence information from SwissProt and NCBI databases. With our study, we have made a first step towards understanding the migratory phenotype regarding gene expression of a species that has become a model to study birds long-distance migrations.Entities:
Keywords: Migratory birds; Migratory phenotype; Migratory syndrome; Northern wheatears; Oenanthe oenanthe; RNASeq; Transcriptome
Year: 2018 PMID: 30498627 PMCID: PMC6251345 DOI: 10.7717/peerj.5860
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
O. oenanthe transcriptome assembly report from Newbler 2.8.
| amount | % | |
|---|---|---|
| Total of reads | 1,358,490 | |
| Aligned reads | 1,102,879 | 81.18% |
| Assembled reads | 1,028,291 | 75.69% |
| Partial reads | 74,000 | 5.45% |
| Singleton reads | 139,611 | 10.28% |
| Repeat reads | 68,301 | 5.03% |
| Outlier reads | 18,759 | 1.38% |
| Too short reads | 29,262 | 2.15% |
Figure 1Transcript length distribution of O. oenanthe, M. gallopavo and T. gutatta, references used for the wheatear’s transcriptome annotation.
Summary of the O. oenanthe transcriptome annotation.
Around 6.0% of the genes are “uncharacterized”, ∼49% have biological functions associated (GO) and ∼37% could not be annotated. The 396 ambiguous isogroups are not considered here.
| Isogroups | 1,442 | 8,355 (49%) | 1,055 | 6,291 (37%) | 17,143 |
| Isotigs | 1,757 | 10,449 | 1,198 | 7,016 | 20,420 |
Figure 2Length distribution of different subcategories of transcripts.
Total, all transcripts; Ambg.novel, ambiguous and novel isogroups; Ambg.non-novel, ambiguous but annotated isogroups; non.Ambg.novel, novel unambiguous isogroups; non.Ambg.non-novel, annotated and unambiguous isogroups.
Orthologous completeness of the wheatear transcriptome.
| Aves | 4,915 | 2,364 (48.1%) | 2,000 (40.7%) | 364 (7.4%) | 616 (12.5%) | 1,935 (39.4%) |
| Metazoans | 978 | 668 (68.3%) | 606 (62.0%) | 62 (6.3%) | 107 (10.9%) | 203 (20.8%) |
Figure 3Gene ontology terms shared by at least 100 genes, identified by their Uniprot IDs in the wheatear transcriptome.
Figure 4Number of protein-coding genes estimated for different vertebrates.
The numbers below refer to the number of species used to construct the figure. Data source: www.ncbi.nlm.nih.gov/genome (O’Leary Nuala et al., 2016); www.ensembl.org/info/about/species.html ( Zerbino et al., 2018).