| Literature DB >> 27768153 |
Fan Xiaojun1, Yang Chun1, Liu Jianhong1, Zhang Chang1, Li Yao1.
Abstract
The Asian gypsy moth (Lymantria dispar) is a serious pest of forest and shade trees in many Asian and some European countries. However, there have been few studies of L. dispar genetic information and comprehensive genetic analyses of this species are needed in order to understand its genetic and metabolic sensitivities, such as the molting mechanism during larval development. In this study, high-throughput sequencing technology was used to sequence the transcriptome of the Asian subspecies of the gyspy moth, after which a comprehensive analysis of chitin metabolism was undertaken. We generated 37,750,380 high-quality reads and assembled them into contigs. A total of 37,098 unigenes were identified, of which 15,901 were annotated in the NCBI non-redundant protein database and 9,613 were annotated in the Swiss-Prot database. We mapped 4,329 unigenes onto 317 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database. Chitin metabolism unigenes were found in the transcriptome and the data indicated that a variety of enzymes was involved in chitin catabolic and biosynthetic pathways.Entities:
Year: 2016 PMID: 27768153 PMCID: PMC5409764 DOI: 10.1590/1678-4685-GMB-2015-0257
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Output statistics for the L. dispar transcriptome sequencing process.
| Total number of raw reads | 75,500,760 |
| Average raw read length (bp) | 100 |
| Total number of clean reads | 71,801,515 |
| Average clean read length (bp) | 95.67 |
| Q20 bases ratio (%) | 89.37 |
Statistics of assembled sequences.
| All numbers | All length (bp) | Mean length (bp) | N50 (bp) | |
|---|---|---|---|---|
| Transcript | 52538 | 52,894,995 | 1006.79 | 1891 |
| Unigene | 37098 | 29,231,915 | 787.96 | 1391 |
Figure 1Length distribution of Lymantria dispar transcripts.
Figure 2Length distribution of Lymantria dispar unigenes.
Functional annotation of the L. dispar transcriptome.
| Unigene no. (%) | NR | SWISS-PROT | TREMBL | PFAM | KOG | KEGG |
|---|---|---|---|---|---|---|
| 37098 | 15901 | 9613 | 15826 | 5750 | 7303 | 4329 |
| (100) | (42.86) | (25.91) | (42.66) | (15.50) | (19.69) | (11.67) |
Functional annotation of assembled sequences based on gene ontology (GO) categorization.
| Gene Ontology | Categories | % unigenes |
|---|---|---|
| Biological process | Metabolic processes | 18.75 |
| Cellular processes | 21.42 | |
| Single-organism process | 13.97 | |
| Biological regulation | 9.90 | |
| Regulation of biological process | 9.17 | |
| Response to stimulus | 7.61 | |
| Multicellular organismal process | 6.48 | |
| Cellular component | Cell parts | 20.67 |
| Cell | 20.67 | |
| Organelle | 15.26 | |
| Membrane | 9.63 | |
| Organelle part | 8.34 | |
| Molecular function | Binding | 20.24 |
| Catalytic activities | 15.78 |
Figure 3Clusters of orthologous group (KOG) classification.
Figure 4KEGG biochemical maps for Lymantria dispar.
Figure 5E-value distribution of the top Blastx hits against the non-redundant (Nr) protein database for each unigene.
Similarity distribution of L. dispar unigenes with sequences in the Nr database.
| Unigenes | ||
|---|---|---|
| Similarity range (%) | Number (Nr) | Percent (%) |
| 20–40 | 731 | 4.62 |
| 40–60 | 3809 | 24.08 |
| 60–80 | 6058 | 38.29 |
| 80–100 | 4985 | 31.51 |
| 100 | 238 | 1.50 |
Figure 6Similarity analysis based on the BLAST best hit.