| Literature DB >> 24615268 |
Yu-Juan Zhang1, Youjin Hao1, Fengling Si1, Shuang Ren1, Ganyu Hu1, Li Shen1, Bin Chen2.
Abstract
The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua.Entities:
Keywords: codon usage bias; de novo assembly; high-throughput RNA sequencing; onion maggot; simple sequence repeat
Mesh:
Substances:
Year: 2014 PMID: 24615268 PMCID: PMC4025484 DOI: 10.1534/g3.113.009779
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Statistics from RNA-seq–based sequencing, assembling, and functional annotation for D. antiqua
| Sequencing results | N of total raw reads | 53,814,782 |
| N of total clean reads | 51,497,228 | |
| N of total clean nucleotides (nt) | 4,634,750,520 | |
| Q20 percentage of total clean reads | 98.24% | |
| GC percentage of total clean nucleotides | 39.03% | |
| N percentage of total clean nucleotides | 0.00% | |
| N of unigenes | 29,659 (4507 into distinct clusters; 25,152 singletons) | |
| Total length (nt) of total unigenes | 18,008,540 | |
| Mean length (nt) of total unigenes | 607 | |
| N50 (nt) of total unigenes | 818 | |
| Unigenes with Nr database | 21,050 (71.0%) | |
| (E-value≤1e−5) | Unigenes with Nt database | 10,058 (33.9%) |
| Unigenes with Swiss-Prot database | 15,886 (53.6%) | |
| Unigenes with KEGG database | 14,147 (47.7%), 257 pathways | |
| Unigenes with COG database | 6600 (22.3%), 25 functional categories | |
| Unigenes with GO database | 13,238 (44.6%), 60 subcategories belonging to 3 main categories | |
| Biological process | 25 subcategories | |
| Cellular component | 18 subcategories | |
| Molecular function | 17 subcategories | |
| Total unigenes annotated | 21,605 (72.8% of 29,659 unigenes) |
Figure 1NR classification of D. antiqua unigenes. (A) E-value distribution. (B) Similarity distribution. (C) Species distribution.
Figure 2GO function classification of D. antiqua unigenes.
Figure 3COG classification of D. antiqua unigenes.
Comparative analysis between D. antiqua and other Dipteran genomes using BLASTX with a cut-off E-value of 1E−5
| N of a.a. sequences | 27,538 | 14,324 |
| N of sequences with a.a. > 50 | 27,410 | 14,296 |
| Genome sequence version | r5.47 | AgamP3.6 |
| Source of genome sequence | Flybase | Vectorbase |
| With hits to | 17,922 (60.43%) | 14,969 (50.47%) |
| Only with hits to | 3135 (10.57%) | 182 (0.61%) |
| With hits to both | 14,787 (49.86%) | |
| With no hits to both | 11,555 (38.96%) | |
Figure 4D. antiqua unigenes similarity comparison with Dr. melanogaster and An. gambiae and functional classification by GO analysis. (A) Similarity search of D. antiqua sequences against Dr. melanogaster and An. Gambiae. (B) Functional classification of D. antiqua unigenes with and without homologs with Dr. melanogaster and An. gambiae.
Features of SSRs identified in the D. antiqua transcriptome
| Total n of examined unigenes | 29,659 |
| N of unigenes longer than 1 kb | 4637 |
| Total nucleotides screened (knt) | 7423 |
| N of unigenes containing SSRs | 315 |
| N of identified SSRs | 352 |
| Kinds of identified SSRs | 94 |
| N of unigenes containing more than 1 SSRs | 37 |
| Frequency of SSR in transcriptome | 1/14.7 kb |
Frequency of SSRs in D. antiqua transcriptome
| N of Nucleotides | N of Motif Repeats | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 4 | 5 | 6 | 7 | 8 | 9 | 10 | >10 | Total | % | |
| Di | — | — | 40 | 10 | 1 | 1 | — | — | 52 | 14.7 |
| Tri | — | 245 | 34 | 9 | 3 | — | — | — | 291 | 82.7 |
| Tetra | 6 | 1 | 2 | — | — | — | — | — | 9 | 2.5 |
| Penta | — | — | — | — | — | — | — | — | 0 | 0 |
| Hexa | — | — | — | — | — | — | — | — | 0 | 0 |
| Total | 6 | 246 | 76 | 19 | 4 | 1 | 0 | 0 | 352 | |
| % | 1.7 | 69.9 | 21.6 | 5.4 | 1.1 | 0.3 | 0 | 0 | ||