| Literature DB >> 26075903 |
A Faddeeva1, R A Studer2, K Kraaijeveld1, D Sie3, B Ylstra3, J Mariën1, H J M op den Camp4, E Datema5, J T den Dunnen6, N M van Straalen1, D Roelofs1.
Abstract
BACKGROUND: Collembola (springtails) represent a soil-living lineage of hexapods in between insects and crustaceans. Consequently, their genomes may hold key information on the early processes leading to evolution of Hexapoda from a crustacean ancestor.Entities:
Mesh:
Year: 2015 PMID: 26075903 PMCID: PMC4468109 DOI: 10.1371/journal.pone.0130600
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Phylogenetic relationship among F. candida, O. cincta, L. vannamei, P. monodon, D. pulex, T. castaneum, P. humanus and A. pisum.
The Collembola lineage is represented by F. candida and O. cincta. The Crustacea lineage is represented by two decapods (L. vannamei and P. monodon) and a cladoceran (D. pulex). The Insecta clade is represented by T. castaneum, P. humanusa and A. pisum. The red branches indicate terrestrialization.
Descriptive metrics of the F. candida and O. cincta de novo assembled transcriptomes.
|
|
| |
|---|---|---|
| Number of contigs | 38,015 | 32,432 |
| N50 (bp) | 1,161 | 1,105 |
| Maximum contig size (bp) | 6,069 | 8,299 |
| Total (bp) | 29,540,912 | 22,484,011 |
| % ESTs with BlastN hit in dataset | 99.4 | 96.6 |
| Accuracy (%) | 99.9 | 99.4 |
| % EST covered more than 70% by best-hit transcript | 99.2 | 95.5 |
| % Core eukaryotic proteins with tBlastN hit in dataset | 99.2 | 97.6 |
| % Transcripts with predicted ORF | 99.3 | 99.1 |
Fig 2Gene Ontology (GO) distribution in F. candida and O. cincta transcriptomes.
The bar chart describes the number of genes assigned to the GO biological processes and molecular functions on level 2 with sequence cut-off of 2 sequences.
Fig 3Term cloud of over-represented Gene Ontology terms among positively selected genes (PSGs).
(A) GO biological processes over-represented among PSGs in the Hexapoda lineage and (B) in the Collembola lineage. The size of the GO terms is proportional to the p-value obtained in the enrichment test [75], enriched terms were summarized and redundancy was removed with the REVIGO tool [76] using a semantic similarity threshold of 0.7 and D. melanogaster GO Database as reference. The complete dataset of GO enriched terms is presented in the S4 Table.