Mark F Richardson, William B Sherwin, Lee A Rollins.
Abstract
The European starling, Sturnus vulgaris, is a prolific, globally invasive species that has also served as an important model for avian ecological and invasion research. Although the genome sequence has recently become available, no transcriptome data have been published for this species. Here, we have sequenced and assembled the S. vulgaris liver transcriptome, which will provide a foundational resource for further annotation and validation of the draft genome. Moreover, it will be important for ecological and evolutionary studies investigating the genetic factors underlying rapid evolution and invasion success in this global invader.
Keywords: European starling; RNA-seq; Sturnus vulgaris; de novo transcriptome assembly; invasive species
Year: 2017 PMID: 28529652 PMCID: PMC5436464 DOI: 10.7150/jgen.19504
Source DB: PubMed Journal: J Genomics
Transcriptome assembly and annotation statistics compared to other passerine transcriptomes.
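| Metric | This study (a) | Meitern et al. 2014 (b) | NCBI GBBC00000000.1 (c) |
| --- | --- | --- | --- |
| Raw sequencing reads | 230403632 | nr | - |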
| Reads used in assembly | 45309889 | ~500000000 | - |
| Number of unigenes | 48279 | nr | - |
| Number of transcripts | 59557 | 66072 | 313060 |
| N50 transcript length (bp) | 1765 | 803 | 3979 |
| Sum transcript length (bp) | 64993660 | 39395826 | 334636954 |
| Median transcript length (bp) | 626 | 367 | 345 |
| Mean transcript length (bp) | 1091 | 596 | 1069 |
| GC % | 48.28 | 46.34 | 45.43 |
| Number of unigenes | 18678 | - | - |
| N50 longest unigene/transcript (bp) | 2232 | - | - |
| Sum longest unigene/transcript (bp) | 26825376 | - | - |
| Median longest unigene/transcript length (bp) | 979 | - | - |
| Mean longest unigene/transcript length (bp) | 1436 | - | - |
| Number of transcripts | 23945 | - | - |
| N50 transcript length (bp) | 2328 | - | - |
| Sum transcript length (bp) | 37637538 | - | - |
| Median transcript length (bp) | 1178 | - | - |
| Mean transcript length (bp) | 1572 | - | - |
| Transcripts with Blastx match | 33041 (55%) | nr | - |
| Transcripts with Blastp match | 24715 (41%) | 23151 (35%) | - |
| Transcripts with GO terms | 27576 (46%) | nr | - |
| Transcripts with Blastx match | 19701 (82%) | - | - |
| Transcripts with Blastp match | 17898 (75%) | - | - |
| Transcripts with GO terms | 17462 (73%) | - | - |
a, this study; b, data from Meitern et al. (2014); c, data calculated from NCBI GBBC00000000.1; nr, not reported; percentages in parentheses.
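The N50 values in the table above summarize how contiguous an assembly is: N50 is the length such that transcripts of that length or longer account for at least half of all assembled bases. A minimal Python sketch of that calculation is given here, assuming only that a list of transcript lengths in bp is available. The function name `n50` and the toy lengths are hypothetical and are not data from this study.

```python
from statistics import mean, median


def n50(lengths):
    """Return the N50 of a collection of sequence lengths (bp).

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total number of bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical toy lengths in bp, used only to illustrate the metrics.
toy_lengths = [200, 300, 500, 800, 1200, 2000]

summary = {
    "n50": n50(toy_lengths),
    "sum": sum(toy_lengths),
    "median": median(toy_lengths),
    "mean": mean(toy_lengths),
}
print(summary)
```

Because a few long transcripts contribute a large share of the bases, N50 is typically larger than both the median and the mean, which is the pattern seen in every column of the table.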
BUSCO evaluations of completeness against the vertebrate gene set compared to other passerine transcriptomes.
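| BUSCO category | This study | Meitern et al. (2014) | NCBI GBBC00000000.1 |
| --- | --- | --- | --- |
| Complete | 1523 (50%) | 904 (30%) | 1860 (62%) |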
| Single | 1409 (47%) | 900 (30%) | 1563 (52%) |
| Multi | 114 (4%) | 4 (~0%) | 297 (10%) |
| Fragment | 346 (11%) | 361 (12%) | 249 (8%) |
| Missing | 1154 (38%) | 1758 (58%) | 914 (30%) |
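The BUSCO counts above can be checked for internal consistency. The sketch below assumes that complete BUSCOs are the sum of single-copy and multi-copy hits, and that complete, fragmented and missing BUSCOs together make up the whole searched gene set. The keys `column_1` to `column_3` are placeholder labels for the three data columns in table order, and the helper `summarize` is hypothetical.

```python
# BUSCO counts transcribed from the table above; column_1..column_3 are
# placeholder labels for the three data columns, in table order.
busco_counts = {
    "column_1": {"complete": 1523, "single": 1409, "multi": 114,
                 "fragment": 346, "missing": 1154},
    "column_2": {"complete": 904, "single": 900, "multi": 4,
                 "fragment": 361, "missing": 1758},
    "column_3": {"complete": 1860, "single": 1563, "multi": 297,
                 "fragment": 249, "missing": 914},
}


def summarize(counts):
    """Validate one column and return (gene set size, rounded percentages)."""
    # Assumption: every complete BUSCO is either single-copy or multi-copy.
    if counts["single"] + counts["multi"] != counts["complete"]:
        raise ValueError("single + multi does not equal complete")
    # Assumption: complete, fragmented and missing partition the gene set.
    total = counts["complete"] + counts["fragment"] + counts["missing"]
    percentages = {key: round(100 * value / total)
                   for key, value in counts.items()}
    return total, percentages


results = {name: summarize(counts) for name, counts in busco_counts.items()}
for name, (total, percentages) in results.items():
    print(name, total, percentages)
```

Under these assumptions each column sums to the same gene set size, and the rounded percentages can be compared directly with the values in parentheses in the table.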