| Literature DB >> 27142308 |
Yongling Jin1, Bin Cong2, Liyan Wang3, Yugang Gao3, Haiyan Zhang3, Hui Dong2, Zhiwei Lin3.
Abstract
Epacromius coerulipes (Ivanov) is one of the most widely distributed locusts. To date, the main methods to kill locusts still rely on chemical controls, which can result in the selection of locusts with resistance to chemical pesticides. Butene-fipronil is a new pesticide that was discovered by the structural modification of fipronil. This pesticide has been used to control various agricultural pests and has become an important pesticide product to control pests that exhibit resistance to other pesticides, including locusts. To extend its useful half-life, studies of the initiation and progression of resistance to this pesticide are needed. Herein, two E. coerulipes strains, a pesticide-sensitive (PS) and a pesticide-resistant (PR) strain, were chosen to undergo de novo assembly by paired-end transcriptome Illumina sequencing. Overall, 63,033 unigenes were detected; the average gene length was 772 bp and the N50 was 1,589 bp. Among these unigenes, ∼ 25,132 (39.87% of the total) could be identified as known proteins in bioinformatic databases from national centers. A comparison of the PR and PS strains revealed that 2,568 genes were differentially expressed, including 1,646 and 922 genes that were up- and down-regulated, respectively. According to the Gene Ontology (GO) database, among biological processes the metabolic process group was the largest group (6,900 genes, 22.47%) and contained a high frequency of differentially expressed genes (544 genes, 27.54%). According to the Clusters of Orthologous Groups (COG) categories, 28 genes, representing 2.98% of all genes, belonged to the group of genes involved in the biosynthesis, transportation, and catabolism of secondary metabolites. The differentially expressed genes that we identified are involved in 50 metabolic pathways. Among these pathways, the metabolism pathway was the most represented. After enrichment analysis of differential gene expression pathways, six pathways--ribosome; starch, and sucrose metabolism; ascorbate and aldarate metabolism; drug metabolism-cytochrome P450; metabolism of xenobiotics by cytochrome P450; and glutathione metabolism--showed a high degree of enrichment. Among these pathways, drug metabolism-cytochrome P450, metabolism of xenobiotics by cytochrome P450, and glutathione metabolism have been associated with pesticide metabolism. Furthermore, 316 unigenes in the E. coerulipes transcriptome encode detoxifying enzymes and 76 unigenes encode target proteins of pesticides. Among these genes, 23 genes that encode detoxifying enzymes in the resistance group were found to be up-regulated. The transcriptome sequencing results of E. coerulipes established a genomics database of E. coerulipes for the first time. This study also establishes a molecular basis for gene function analysis of E. coerulipes Moreover, it provides a theoretical resource for mechanistic studies on pesticide resistance through the screening and investigation of resistance genes.Entities:
Keywords: Epacromius coerulipes (Ivanov); differential gene expression analysis; pesticide resistance; transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27142308 PMCID: PMC4864582 DOI: 10.1093/jisesa/iew014
Source DB: PubMed Journal: J Insect Sci ISSN: 1536-2442 Impact factor: 1.857
Summary of the raw reads for the two samples
| Samples | Read number | Clean Data | GC content | Cycle Q20% | %≥Q30 |
|---|---|---|---|---|---|
| PR (T01) | 26,869,615 | 6,766,002,460 | 50.71% | 100.00 | 90.63% |
| PS (T02) | 26,428,337 | 6,653,605,161 | 50.81% | 100.00 | 90.09% |
Note: GC content: Clean Data G and C percentage of the total bases; Cycle Q20: Average quality score is greater than or equal to 20% of the cycle, Q30: Quality Score of base is greater than or equal to 30% of the total bases.
Summary of Illumina transcriptome assembly for E. coerulipes
| Length range | Contig | Transcript | Unigene |
|---|---|---|---|
| 200–300 | 6,232,095 (99.35%) | 30,249 (31.46%) | 26,218 (41.59%) |
| 300–500 | 15,971 (0.25%) | 18,617 (19.36%) | 14,150 (22.45%) |
| 500–1,000 | 10,968 (0.17%) | 16,364 (17.02%) | 9,772 (15.50%) |
| 1,000–2,000 | 7,402 (0.12%) | 14,974 (15.57%) | 6,855 (10.88%) |
| 2,000+ | 6,195 (0.10%) | 15,947 (16.59%) | 6,038 (9.58%) |
| Total number | 6,272,631 | 96,151 | 63,033 |
| Total length | 306,171,970 | 105,999,072 | 48,663,063 |
| N50 length | 47 | 2,269 | 1,589 |
| Mean length | 48.81 | 1,102.42 | 772.03 |
Note: The sequencing assembly obtained 6,272,631 Contigs and the N50 length was 47.96, 151 transcripts were obtained. The total length was 105.999 Mb, the average length was 1,102 bp, and the N50 length was 2,269. Among them, the number of transcripts with a length over 1 kb was 30,921, which was 32.16% of the total number. Additionally, 15,947 of transcripts with a length over 2 kb were identified, which was 16.59% of all transcripts. After clustering and assembly analysis of the transcripts, 63,033 unigenes were obtained, with a total length of 48.663 Mb and an average length of 772 bp, and a N50 length of 1,589 bp. Among them, 12,893 unigenes with a length over 1 kb were identified, which represents 20.45% of the total, and 6,083 unigenes with a length over 2 kb were identified, which represents 9.58% of the total.
Functional annotation of the E. coerulipes transcriptome
| Annotated database | Annotated unigenes number | 300 ≤ length < 1,000 | Length ≥ 1,000 |
|---|---|---|---|
| COG_Annotation | 8,013 (12.71) | 2,603 | 3,955 |
| GO_Annotation | 11,558 (18.34%) | 3,863 | 5,064 |
| KEGG_Annotation | 7,218 (11.46%) | 2,260 | 3,694 |
| Swissprot_Annotation | 16,490 (26.16%) | 5,676 | 7,889 |
| nr_Annotation | 24,841 (39.41%) | 9,079 | 9,881 |
| All_Annotated | 25,132 (39.87%) | 9,185 | 9,895 |
Note: Nr database annotated 24,841 unigenes, which represents 39.41% of the total. The Swissprot database annotated 16,490 unigenes, which represents 26.16% of the total. The COG database annotated 8,031 unigenes, which represents 12.71% of the total. The GO database annotated 11,558 unigenes, which represents 18.34% of the total. The KEGG database annotated 7,218 unigenes, which represents 11.46% of the total.
Number of differentially expressed genes and gene annotation
| DEGs number | Annotated | COG | GO | KEGG | Swiss-Prot | nr |
|---|---|---|---|---|---|---|
| All DEG 2,568 | 1,798 | 760 | 815 | 584 | 1,424 | 1,784 |
| Up-regulated 1,646 | 1,177 | 537 | 579 | 453 | 956 | 1,168 |
| Down-regulated 922 | 621 | 223 | 236 | 131 | 468 | 616 |
Note: The 2,568 differentially expressed genes were obtained and 1,798 of them were annotated to each database. Up-regulated genes included 1,646 and 1,177 genes from those that were annotated; down-regulated genes included 922 and 621 genes from those that were annotated.
Fig. 2.GO classification. Annotation statistics of differentially expressed genes in the secondary node of GO. Note: The horizontal axis shows secondary nodes of three categories in GO. The vertical axis displays the percentage of annotated genes versus the total gene number. The red columns display annotation information of the total genes and the blue columns represent annotation information of the differentially expressed genes.
Fig. 3.COG function classification of consensus sequences. Note: The categories of the COG are shown on the horizontal axis and gene numbers are plotted on the vertical axis. The number of differentially expressed genes involved in general function prediction is 133; in carbohydrate transport and metabolism is 121; in translation, ribosomal structure, and biogenesis is 114; in amino acid transport and metabolism is 84; in posttranslational modification, protein turnover, and chaperones is 79; in replication, recombination, and repair is 61; in inorganic ion transport and metabolism is 56; in energy production and conversion is 5; in lipid transport and metabolism is 47; and in cell wall/membrane/envelope biogenesis is 30; the number of differentially expressed genes in other subgroups is <30.
Fig. 4.KEGG categories of differentially expressed genes. Note: The vertical axis lists the names of the metabolic pathways in the KEGG, and the horizontal axis shows the proportion of annotated genes in each pathway versus the total number of annotated genes. The differentially expressed genes are distributed in 50 metabolic pathways, the pathway of metabolism holds the highest all number of genes.
Fig. 5.Scatterplot of differentially expressed genes from the KEGG pathway enrichment analysis. Note: Every graph in the figure represents one KEGG pathway and the name of the pathway is shown on the right side of the legend. The horizontal axis shows the enrichment factor, which represents the proportion of the annotated differentially expressed genes in each pathway compared to the total annotated gene number in that pathway. The smaller enrichment factor indicates a higher enrichment score of differentially expressed genes in the pathway. The vertical axis displays an absolute Q value, which is the p-value tested by multiple hypothesis and calibrations. Thus, a larger value on the vertical axis indicates a more reliable enrichment score of differentially expressed genes in a pathway.
Unique transcripts associated with insecticide resistance in E. coerulipes
| Enzymes or target | Annotated unigenes number | Up-regulated number (PS-vs.-PR) |
|---|---|---|
| GSTs | 43 | 5 |
| CarEs | 90 | 8 |
| CYP450 | 183 | 10 |
| GABA | 10 | – |
| AchR | 15 | – |
| RyR | 3 | – |
| AchE | 38 | – |
| CGSC | 10 | – |
Note: GSTs, glutathione S-transferases; CarEs, carboxylesterase; CYP450, cytochrome P450; GABA, γ-aminobutyric acid receptor; nAchR, nicotinic acetylcholine receptor; RyR, ryanodine receptor; AchE, acetylcholinesterase; VGSC, voltage-gated sodium channel; Up-regulated number (PS-vs.-PR), GSTs: c8747.graph_c0, c34411.graph_c0, c33067.graph_c0, c40846. graph_c0, c31380.graph_c0; CarEs : c33871.graph_c0, c30386.graph_c0, c33452.graph_c0, c22727.graph_c0, c27991.graph_c0, c34612.graph_c0, c35381.graph_c0, c30559.graph_c0; CYP450: c23816.graph_c0, c37409. graph_c0, c29735.graph_c0, c34866.graph_c0, c34963.graph_c0, c34617. graph_c0, c39389.graph_c0, c37409.graph_c0, c22350.graph_c0, c39389. graph_c0.