| Literature DB >> 30561435 |
Claire Morandin1, Unni Pulliainen1,2, Nick Bos1, Eva Schultner3.
Abstract
Communication and nutrition are major drivers of fitness in ants. While communication is paramount to colony cohesion, nutrition is decisive in regulating reproductive division of labor among colony members. However, neither of these has been studied from a molecular perspective in developing individuals. Here, we report the availability of the first transcriptome resources for larvae of the ant Formica fusca, a species with excellent discrimination abilities and thus the potential to become a model system for studying molecular mechanisms of communication. We generated a comprehensive, high-coverage RNA-seq data set using Illumina RNA-seq technology by sequencing 24 individual 1st - 2nd instar larvae collected from four experimental groups (6 samples per treatment, 49 million mean reads per sample, coverage between 194-253×). A total of 24,765 unigenes were generated using a combination of genome-guided and de novo transcriptome assembly. A comprehensive assembly pipeline and annotation lists are provided. This dataset adds valuable transcriptomic resources for further study of developmental gene expression, transcriptional regulation and functional gene activity in ant larvae.Entities:
Mesh:
Year: 2018 PMID: 30561435 PMCID: PMC6298252 DOI: 10.1038/sdata.2018.282
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Flowchart of the experimental setup used to collect F. fusca larvae for sequencing showing treatment names and number of colonies.
Figure 2Flowchart of the RNA-sequencing setup and de novo transcriptome data analysis steps.
Properties and statistics of the final assembly of the Formica fusca larval transcriptome.
| Percent GC | 40.33 |
| N50 (bp) | 2868 |
| Median contig length | 1093 |
| Average contig length | 1723.45 |
| Total assembled bases | 42681228 |
| No unigenes annotated in nr | 19688 |
| No unigenes annotated in Swissprot | 13129 |
| No unigene annotated in GO | 11898 |
Figure 3Length distribution of the final transcripts.
The x-axis represents the length, the y-axis represents the number of transcripts.
Summary of the sequenced samples, number of reads and length, and sequencing depth.
| Sample | Treatment | Combined number of reads (paired-end) | Combined length (bp) | Sequencing depth |
|---|---|---|---|---|
| UP1 | Social isolation | 88472546 | 8847254600 | 204 |
| UP2 | Nutritional stimulation – fed | 96215472 | 9621547200 | 222 |
| UP3 | Social isolation | 101706160 | 10170616000 | 235 |
| UP4 | Social isolation | 101230178 | 10123017800 | 234 |
| UP5 | Nutritional stimulation – fed | 102274664 | 10227466400 | 236 |
| UP6 | Nutritional stimulation – starved | 98643690 | 9864369000 | 228 |
| UP7 | Social stimulation | 101484884 | 10148488400 | 234 |
| UP8 | Nutritional stimulation – starved | 105960956 | 10596095600 | 245 |
| UP9 | Nutritional stimulation – starved | 99181338 | 9918133800 | 229 |
| UP10 | Nutritional stimulation – fed | 100721650 | 10072165000 | 233 |
| UP11 | Social isolation | 96545086 | 9654508600 | 223 |
| UP12 | Social stimulation | 96197568 | 9619756800 | 222 |
| UP13 | Nutritional stimulation – starved | 96320760 | 9632076000 | 222 |
| UP14 | Nutritional stimulation – fed | 84069620 | 8406962000 | 194 |
| UP15 | Social isolation | 99083672 | 9908367200 | 229 |
| UP16 | Social stimulation | 93716720 | 9371672000 | 216 |
| UP17 | Social stimulation | 102649030 | 10264903000 | 237 |
| UP18 | Nutritional stimulation – starved | 96378826 | 9637882600 | 223 |
| UP19 | Nutritional stimulation – fed | 100431728 | 10043172800 | 232 |
| UP20 | Social isolation | 103424390 | 10342439000 | 239 |
| UP21 | Social stimulation | 107013904 | 10701390400 | 247 |
| UP22 | Social stimulation | 109412840 | 10941284000 | 253 |
| UP23 | Nutritional stimulation – fed | 102714942 | 10271494200 | 237 |
| UP24 | Nutritional stimulation – starved | 100043534 | 10004353400 | 231 |
Figure 4The cleaned reads from all 24 samples were assessed with FastQC and MultiQC.
(a) Read count distribution for mean sequence quality. (b) Mean quality scores distribution. (c) Read length distribution. (d) Mean quality scores distribution.