| Literature DB >> 24058501 |
Li-Tao Li1, Yan-Bin Zhu, Ji-Fang Ma, Zhi-Yong Li, Zhi-Ping Dong.
Abstract
Athetis lepigone Möschler (Lepidoptera: Noctuidae) has recently become an important insect pest of maize (Zea mays) crops in China. In order to understand the characteristics of the different developmental stages of this pest, we used Illumina short-read sequences to perform de novo transcriptome assembly and gene expression analysis for egg, larva, pupa and adult developmental stages. We obtained 10.08 Gb of raw data from Illumina sequencing and recovered 81,356 unigenes longer than 100 bp through a de novo assembly. The total sequence length reached 49.75 Mb with 858 bp of N50 and an average unigene length of 612 bp. Annotation analysis of predicted proteins indicate that 33,736 unigenes (41.47% of total unigenes) are matches to genes in the Genbank Nr database. The unigene sequences were subjected to GO, COG and KEGG functional classification. A large number of differentially expressed genes were recovered by pairwise comparison of the four developmental stages. The most dramatic differences in gene expression were found in the transitions from one stage to another stage. Some of these differentially expressed genes are related to cuticle and wing formation as well as the growth and development. We identified more than 2,500 microsatellite markers that may be used for population studies of A. lepigone. This study lays the foundation for further research on population genetics and gene function analysis in A. lepigone.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24058501 PMCID: PMC3772797 DOI: 10.1371/journal.pone.0073911
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary for raw reads of four samples.
| Sample | Data (Mb) | GC% | CycleQ 20% |
| Egg | 2608.97 | 48.12 | 99.5 |
| Larva | 2265.42 | 48.09 | 99.5 |
| Pupa | 2857.8 | 45.99 | 100 |
| Adult | 2597.59 | 46.99 | 99.5 |
Note:
GC base content.
The proportion of cycle, of which the average quality value is ≥20.
Summary for A. lepigone transcriptome.
| Gene length | Egg | Larva | Pupa | Adult |
| ||||
| Transcriptnumber | Unigenenumber | Transcriptnumber | Unigenenumber | Transcriptnumber | Unigenenumber | Transcriptnumber | Unigenenumber | Unigenenumber | |
| 200–300 | 23619 | 12537 | 16052 | 10967 | 20584 | 16022 | 18808 | 11558 | 28165 |
| 300–500 | 25624 | 12825 | 17798 | 11841 | 19209 | 13520 | 21826 | 12820 | 25257 |
| 500–1000 | 22622 | 9624 | 13885 | 7738 | 16843 | 10179 | 19196 | 9105 | 15864 |
| 1000–2000 | 13260 | 5562 | 6859 | 3486 | 9311 | 4994 | 10581 | 4877 | 8556 |
| 2000+ | 5515 | 2166 | 1673 | 765 | 3642 | 1655 | 3407 | 1409 | 3514 |
| Total Length | 66340657 | 28834461 | 34397106 | 19348688 | 47956499 | 27815088 | 50511500 | 24683128 | 49749348 |
| Count | 90640 | 42714 | 56267 | 34797 | 69589 | 46370 | 73818 | 39769 | 81356 |
| N50_Length | 1064 | 971 | 806 | 689 | 986 | 805 | 943 | 838 | 858 |
| Mean Length | 731.91 | 675.06 | 611.32 | 556.04 | 689.14 | 599.85 | 684.27 | 620.66 | 611.50 |
Functional annotation of A. lepigone unigenes.
| Annotated database | Annotated Number | 300≥length<1000 | length≥1000 |
| Nr Annotation | 33736 | 17425 | 9829 |
| Nt Annotation | 17762 | 7894 | 7133 |
| Swissprot Annotation | 27909 | 13661 | 8847 |
| TrEMBL Annotation | 37754 | 19769 | 10121 |
| COG Annotation | 11518 | 5210 | 3792 |
| GO Annotation | 15585 | 7446 | 5412 |
| KEGG Annotation | 11824 | 5557 | 3573 |
| All Annotated | 43189 | 22409 | 10522 |
Figure 1Species distribution of the BLASTX results.
Note: This figure shows the species distribution of unigene BLASTX results against the NCBI-Nr protein database with a cutoff E value <10−5. Different colors represent different species. Species with proportions of more than 1% are shown.
Summary of simple sequence repeat (SSR) types in the A. lepigone transcriptome.
| SSR type | Repeat motif | Numbers | All/frequency |
| Mono-nucleotide | (A) n/(T) n/(C) n/(G) n | 1383/779/43/32 | 2123/75.31% |
| Di-nucleotide | GC/TG/AT/TA/CA/CG/AC/GT/AG/TC/CT/GA | 33/23/23/22/17/16/14/13/9/9/8/8 | 184/6.53% |
| Tri-nucleotide | CGC/GCC/GAA/GGC/AAT/AAG/CAA/TGA/ATA/ATT/AGA/TAT/GCG/TTC/CCG/CAC/TCA/GAT/ACA/CGG/CCA/ACC/TTA/TTG/TCT/GAG/CTT/ATG/CAG/TAA/GAC/CAT/CGA/AAC/ATC/AGC/GGT/GCA/GCT/CTG/AGT/ACG/TGT/TGC/TCG/GTT/GTG/CTC/AGG/ACT/TAC/TGG/TCC/GTC/GGA/CTA/CGT/CCT | 22/22/20/19/19/18/17/15/14/14/11/11/11/9/9/8/8/8/7/7/7/6/6/6/6/6/6/5/5/4/4/4/4/3/3/3/3/3/3/3/2/2/2/2/2/2/2/2/1/1/1/1/1/1/1/1/1/1 | 385/13.66% |
| Tetra-nucleotide | ATTT/AATT/GTAC/TGCG/ACTC/AAGA/AAAT/ATTC/TGAT/TGTA/TGTT/TTAT/TTAG | 3/1/1/1/1/1/1/1/1/1/1/1/1 | 15/0.53% |
| Penta-nucleotide | TGGTA/TATTC/GAGAA/TCAAA | 1/1/1/1 | 4/0.14% |
| Compound SSR | 108/3.83% | ||
| Total | 2819 |
Figure 2COG categories of the unigenes.
Note: Class definition, Number of this class, Percent of this class (%). A: RNA processing and modification, 60, 0.40. B: Chromatin structure and dynamics, 100, 0.66. C: Energy production and conversion, 717, 4.74. D: Cell cycle control, cell division, chromosome partitioning, 244, 1.61. E: Amino acid transport and metabolism, 1234, 8.16. F: Nucleotide transport and metabolism, 273, 1.81. G: Carbohydrate transport and metabolism, 1032, 6.83. H: Coenzyme transport and metabolism, 312, 2.06. I: Lipid transport and metabolism, 560, 3.70. J: Translation, ribosomal structure and biogenesis, 994, 6.58. K: Transcription, 938, 6.20. L: Replication, recombination and repair, 1632, 10.80. M: Cell wall/memberance/envelope biogenesis, 378, 2.50. N: Cell motility, 78, 0.52. O: Posttranslational modification, protein turnover, chaperones, 778, 5.15. P: Inorganic ion transport and metabolism, 746, 4.93. Q: Secondary metabolites biosynthesis, transport and catabolism, 432, 2.86. R: General function prediction only, 2811, 18.59. S: Function unknown, 384, 2.54. T: Signal transduction mechanisms, 736, 4.87. U: Intracellular trafficking, secretion, and vesicular transport, 273, 1.81. V: Defense mechanisms, 186, 1.23. W: Extracellular structures, 0, 0. Y: Nuclear structure, 6, 0.04. Z: Cytoskeleton, 212, 1.40.
Figure 3GO categories of the unigenes.
Note: The unigenes are annotated in three main categories: biological process, cellular component and molecular function. The right y-axis indicates the number of genes in a category. The left y-axis indicates the percentage of a specific category of genes in that main category.
GO categories contain few genes.
| Function class | Number of genes | Percent of genes(%) | |
| Cellular component | nucleoid | 1 | 0.006 |
| virion | 2 | 0.013 | |
| virion part | 2 | 0.013 | |
| Molecular function | channel regulator activity | 6 | 0.038 |
| metallochaperone activity | 6 | 0.038 | |
| morphogen activity | 6 | 0.038 | |
| nutrient reservoir activity | 2 | 0.013 | |
| protein tag | 2 | 0.013 | |
| receptor regulator activity | 3 | 0.019 | |
| Biological process | carbon utilization | 2 | 0.013 |
| cell killing | 4 | 0.026 | |
| viral reproduction | 1 | 0.006 | |
Figure 4Numbers of DGE unigenes in each comparison.