| Literature DB >> 26385554 |
Bao-Jian Ding1, Christer Löfstedt2.
Abstract
BACKGROUND: Moths rely heavily on pheromone communication for mate finding. The pheromone components of most moths are modified from the products of normal fatty acid metabolism by a set of tissue-specific enzymes. The turnip moth, Agrotis segetum uses a series of homologous fatty-alcohol acetate esters ((Z)-5-decenyl, (Z)-7-dodecenyl, and (Z)-9 tetradecenyl acetate) as its sex pheromone components. The ratio of the components differs between populations, making this species an interesting subject for studies of the enzymes involved in the biosynthetic pathway and their influence on sex pheromone variation.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26385554 PMCID: PMC4575462 DOI: 10.1186/s12864-015-1909-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Output statistics of sequencing
| Sample | Total raw reads | Total clean reads | Total clean nucleotides | Q20 % |
| GC % |
|---|---|---|---|---|---|---|
| As_PG | 63,086,832 | 53,235,252 | 4,791,172,680 | 96.61 % | 0.01 % | 47.54 % |
| As_AB | 62,193,108 | 54,156,050 | 4,874,044,500 | 96.62 % | 0.01 % | 47.44 % |
Q20 percentage is proportion of nucleotides with quality value larger than 20 in reads (sequencing error rate = 1 %); N percentage is proportion of unknown nucleotides in clean reads. GC percentage is proportion of guanidine and cytosine nucleotides among total nucleotides
Statistics of assembly quality
| Sample | Total length (nt) | Mean length (nt) | N50 | Total consensus sequences | Distinct contigs | Distinct singletons |
|---|---|---|---|---|---|---|
| As_PG | 37,824,354 | 574 | 907 | 65,840 | 17,534 | 48,306 |
| As_AB | 36,392,640 | 618 | 1007 | 58,847 | 13,055 | 45,792 |
| All | 45,545,001 | 733 | 1150 | 62,165 | 22,633 | 39,532 |
N50 value defined as the length for which the collection of all sequences of that length or longer contains at least half of the sum of the lengths of all sequences
Fig. 1Length distribution of unigenes. The consensus sequence lengths ranging from 200 bp to more than 3,000 bp, and above each column is indicated the number of genes of each length range. The most abundant unigenes are 300 bp (19205) and the least abundant unigenes were 3000 bp (139); sequences over 3000 bp were grouped together. The number of sequences decreased as the length increased
Fig. 2Differentially expressed unigenes displayed by FPKM in As_PG versus As_AB, in log10 scale. X-axis (As_AB) and Y-axis (As_PG) shows the logarithm value of normalized expression of each gene in FPKM (Fragments per kb per Million fragments). There are 21,965 unigenes that are up regulated (red dots), measured by As_PG(FPKM)/As_AB(FPKM) > 2. On the contrary, 14,292 unigenes are down regulated (green dots), since their As_PG(FPKM)/As_AB(FPKM) < 0.5. Most of the unigenes, 25,895, were equally expressed (blue dots) in both tissue (0.5 < As_PG(FPKM)/As_AB(FPKM) < 2)
Summary of annotation results
| Nr | Nt | SwissProt | KEGG | COG | GO |
|---|---|---|---|---|---|
| 27,344 | 20,403 | 19,143 | 17,351 | 8,502 | 12,049 |
Unigenes were annotated with the databases of Nr, Nt, Swiss-Prot, KEGG, COG and GO. Then counted the number of unigenes annotated with each database. The result is summarized as the follow table
Nr annotations of assembled A. segetum consensus sequences
| Species | Gene numbers | Percentage |
|---|---|---|
|
| 14,108 | 50.56 % |
|
| 7,807 | 27.98 % |
|
| 642 | 2.3 % |
|
| 569 | 2.04 % |
|
| 420 | 1.51 % |
|
| 293 | 1.05 % |
| Other | 4063 | 14.56 % |
The first column shows the species with the highest number of similar genes in descending order, the second column indicates the number of these annotated genes, and the last column shows the percentage of genes with respect to the total annotated genes sequences
Fig. 3COG classification of unigenes. Histogram of COG classifications of assembled consensus sequences. Results are presented for the 25 main COG categories. The number above the column indicates number of unigenes in each category
Fig. 4GO classification of unigenes. Histogram of GO classifications of consensus sequences. Results are summarized for the three main GO categories: biological process, cellular component and molecular function. The number on the bars represents the total number of unigenes in each category
Fig. 5Biosynthetic pathway leading to the sex pheromone of Agrotis segetum, modified from [37]. It starts with carboxylation of acetyl-CoA to malonyl-CoA, and then they are entered to a cycle of fatty acid synthesis and end up with common fatty acids stearate and palmitate. The ∆11 desaturase inserts a double bond in the acyl chain and then the unsaturated fatty acid is subjected to three rounds of chain-shortening by β-oxidation, forming three acyl-chains different by two carbon atoms. These acyl-chains are then reduced by fatty-acyl CoA reductase (FAR) to make fatty alcohols, which are then acetylated to acetate esters, the final A. segetum pheromones. Thick arrows represent steps are functionally assayed in heterologous systems
Fig. 6The neighbor-joining tree of selected lepidopteran desaturase genes, constructed using amino-acid sequences. Desaturases described in this study are indicated by different shapes (with signature motif displayed for the First Desaturase), followed by unigene expression levels in the gland and abdomen library, respectively (As_PG_FPKM /As_AB_FPKM). Desaturases in previous studies are named as follows: biochemical activities (if known) are indicated in connection to the species name, followed by accession number in parenthesis. Most of the desaturases used in here are First Desaturases that introduce double bond into saturated fatty acids. Among the First Desaturase, four distinctive groups formed that separate their biological functions. The ∆9 desaturases are usually used for normal fatty acid metabolism, with the “KPSE” group having preference on C16 and “NPVE” group mainly modifying C18. The ∆11,∆10 and bifunctional desaturases with the “xxxQ” motif (with a few exceptions having “xxxE” motif) exclusively involved in pheromone biosynthesis. The ∆5,∆6,∆14 group contain a mixture of different signature motifs derived from the ∆9 and ∆11 groups, and their biological function are also diverged. The tree was rooted on the ∆9-desaturase-KPSE (C16 > C18) functional class
Fig. 7Functional characterization of the highest expressed desaturase gene, Ase_5534, in yeast expression system. GC-MS analyses of methanolyzed lipid extracts from yeast transformed with empty plasmid pYEX-CHT (a) and pYEX-CHT-Ase_5534 (b). Double bond position of the unsaturated palmitate was confirmed by DMDS derivatization (c, d)
The gene candidates found in As_PG that may be involved in p-oxidation processes
| genelD | GenBank accession | As PG FPKM | As AB FPKM | Swissprot-annotation | ko_definition |
|---|---|---|---|---|---|
| acyl-CoA dehydrogenase | |||||
| Unigene5337_All | KJ622076 | 2.49 | 0.93 | Short-chain specific acyl-CoA dehydrogenase | butyryl-CoA dehydrogenase[EC:1.3.8.1] |
| Unigenel8315_All | KJ622077 | 4.35 | 3.26 | Short-chain specific acyl-CoA dehydrogenase | butyryl-CoA dehydrogenase[EC:1.3.8.1] |
| CL3917.Contigl_All | KJ622078 | 8.02 | 14.14 | Short-chain specific acyl-CoA dehydrogenase | butyryl-CoA dehydrogenase[EC:1.3.8.1] |
| Unigenel5571_All | KJ622079 | 24.39 | 67.70 | Short/branched chain specific acyl-CoA dehydrogenase | short/branched chain acyl-CoA dehydrogenase [EC:1.3.99.12] |
| CL 1315 .Contig2_All | KJ622080 | 29.09 | 40.69 | Probable medium-chain specific acyl-CoA dehydrogenase | acyl-CoAdehydrogenase [EC: 1.3.99.3] |
| Unigene4168_All | KJ622081 | 4.04 | 3.43 | Acyl-CoA dehydrogenase family member 9 | Acyl-CoA dehydrogenase family member 9 [EC:1.3.99.-] |
| CL252.Contigl_All | KJ622082 | 10.03 | 7.93 | Very long-chain specific acyl-CoA dehydrogenase | Very long chain acyl-CoA dehydrogenase [EC:1.3.99.-] |
| CL622.Contig4_All | KJ579211 | 77.52 | 107.57 | Trifunctional enzyme subunit beta | acetyl-CoAacyltransferase [EC:2.3.1.16] |
| Unigene7769_All | KJ622083 | 3.29 | 2.51 | Isovaleryl-CoA dehydrogenase | Isovaleryl-CoA dehydrogenase [EC:1.3.8.4] |
| Unigene9593_All | KJ622084 | 8.77 | 9.31 | Isovaleryl-CoA dehydrogenase | Isovaleryl-CoA dehydrogenase [EC:1.3.8.4] |
| acyl-CoA oxidase | |||||
| CL3960.Contigl_All | KJ622085 | 16.52 | 17.33 | Probable peroxisomal acyl-CoA oxidase | acyl-CoA oxidase [EC:1.3.3.6] |
| CL4327.Contigl_All | KJ622086 | 3.85 | 4.30 | Peroxisomal acyl-CoA oxidase | acyl-CoA oxidase [EC:1.3.3.6] |
| CL6898.Contigl_All | KJ622087 | 5.75 | 11.79 | Probable peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| CL7062.Contigl_All | KJ622088 | 14.54 | 18.14 | Peroxisomal acyl-CoA oxidase 3 | acyl-CoA oxidase [EC:1.3.3.6] |
| Unigenel 9521_All | KJ622089 | 29.25 | 23.07 | Probable peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| Unigene26088_All | KJ622090 | 2.29 | 0.68 | Peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| Unigene33961_All | KJ622091 | 3.46 | 2.66 | Probable peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| Unigene4386_All | KJ622092 | 1.74 | 1.78 | Probable peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| Unigene5059_All | KJ622093 | 1.92 | 1.13 | Peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| Unigene6715_A11 | KJ622094 | 386.08 | 4.64 | Probable peroxisomal acyl-CoA oxidase 1 | acyl-CoA oxidase [EC:1.3.3.6] |
| enoyl-CoA hydratase | |||||
| CL1656.Contigl_All | KJ622095 | 16.69 | 23.69 | Methylglutaconyl-CoA hydratase | Methylglutaconyl-CoA hydratase [EC:4.2.1.18] |
| CL2503.Contigl_All | KJ622096 | 8.85 | 1.87 | Probable enoyl-CoA hydratase | enoyl-CoA hydratase [EC:4.2.1.17] |
| CL2595.Contigl_All | KJ622097 | 122.91 | 142.29 | Trifunctional enzyme subunit alpha | enoyl-CoA hydratase [EC:4.2.1.17] |
| Unigenel4514_All | KJ622098 | 4.15 | 4.35 | 3-hydroxyisobutyryl-CoA hydrolase | 3-hydroxyisobutyryl-CoA hydrolase [EC:3.1.2.4] |
| Unigenel7624_All | KJ622099 | 73.15 | 87.01 | Enoyl-CoA hydratase | enoyl-CoA hydratase [EC:4.2.1.17] |
| Unigene6308_All | KJ622100 | 5.00 | 5.89 | Enoyl-CoA hydratase domain-containing protein 3 | enoyl-CoA hydratase [EC:4.2.1.17] |
| L-3-hydroxyacyl-CoA dehydrogenase | |||||
| Unigenel2153_All | KJ622101 | 11.64 | 1.60 | 3-hydroxyacyl-CoA dehydrogenase | 3-hydroxyacyl-CoA dehydrogenase [EC1.1.1.35] |
| Unigenel5615_All | KJ622102 | 8.46 | 8.38 | 3-hydroxyacyl-CoA dehydrogenase | 3-hydroxyacyl-CoA dehydrogenase [EC1.1.1.35] |
| Unigenel7583_All | KJ622103 | 174.80 | 83.79 | Peroxisomal multifunctional enzyme | 3-hydroxyacyl-CoA dehydrogenase [EC1.1.1.35] |
| Unigenel9554_All | KJ622104 | 103.82 | 78.74 | Flydroxyacyl-CoA dehydrogenase | 3-hydroxyacyl-CoA dehydrogenase [EC1.1.1.35] |
| Unigene8215_All | KJ622105 | 8.18 | 4.22 | Probable 3-hydroxyacyl-CoA dehydrogenase | 3-hydroxyacyl-CoA dehydrogenase [EC1.1.1.35] |
| 3-ketoacyl-CoA thiolase | |||||
| CL2371.Contigl_All | KJ579207 | 10.01 | 5.43 | 3-ketoacyl-CoA thiolase | acetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| Unigene21478_A11 | KJ622106 | 4.47 | 0.05 | 3-ketoacyl-CoA thiolase | acetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| Unigene26739_All | KJ622107 | 3.32 | 0.00 | 3-ketoacyl-CoA thiolase | acetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| Unigene28250_All | KJ622108 | 122.89 | 42.24 | Trifunctional enzyme subunit beta | acetyl-CoA acyltransferase [EC:2.3.1.16] |
| Unigene30699_All | KJ622109 | 1.19 | 0.76 | 3-ketoacyl-CoA thiolase | acetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| Unigene5444_All | KJ622110 | 2.35 | 15.28 | 3-ketoacyl-CoA thiolase | acetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| A3,A2-trans-enoyl-CoA isomerase | |||||
| CL3311.Contigl_All | KJ622111 | 24.40 | 35.52 | Enoyl-CoA delta isomerase 1 | 3,2-trans-enoyl-CoAisomerase, mitochondrial [EC:5.3.3.8] |
| Unigene2565_All | KJ622112 | 3.56 | 1.88 | Enoyl-CoA delta isomerase 2 | peroxisomal 3,2-trans-enoyl-CoA isomerase [EC:5.3.3.8] |
| Δ3,5 Δ2,4-dienoyl-CoA isomerase | |||||
| Unigenel6713_All | KJ622113 | 11.43 | 18.26 | Delta(3,5)-Delta(2,4)-dienoy 1-Co A is omerase | delta(3,5)-Delta(2,4)-dienoyl-CoA isomerase [EC:5.3.3.-] |
Fig. 8Phylogenetic relationship of FARs from arthropods, mammals and lepidoptera constructed using amino acid sequences. The pgFAR clade is marked by a black bracket, which contains previously studied functional FARs involved in moth pheromone biosynthesis. FARs identified in this study are displayed by black dots, with As_PG_FPKM and As_AB_FPKM indicated
Fig. 9Functional assay of the highest expressed FAR in As_PG, Ase_1929 identified in this study. GC trace of hexane extract of yeast transformed by empty plasmid (a) and pYES2_CL1929 supplemented with Z5-10:Me (b), Z7-12:Me (c), Z9-14:Me (d). The control yeast produced no fatty alcohols whereas the yeast expressing Ase_1929 convert a series of fatty acids into their corresponding fatty alcohols (b-d)
List of tested acetyltransferases that are generated by annotations and by Blastx of As_PG library with previously published ([54, 55] and references therein) promising candidates as queries
| Gene object | Accession code | bp | As PG FPKM | As AB FPKM | Nr annotation | KO/COG/GO/Swissprot annotation | Species | Accession number | % Identity |
|---|---|---|---|---|---|---|---|---|---|
| CL1935 | KJ579206 | 1542 | 53.53 | 70.39 | Acetyl-CoA acetyltransferase | aeetyl-CoA C-aeetyltransferase [EC:2.3.1.9] |
| EHJ68573 | 82 |
| CL2371 | KJ579207 | 1203 | 14.63 | 8.51 | Thiolase 4 | aeetyl-CoAacyltransferase 2 [EC:2.3.1.16] |
| AGG55002 | 92 |
| CL2825 | KJ579208 | 771 | 6.38 | 14.16 | Hypothetical protein KGM 16,501 | Predicted acetyltransferases and hydrolases |
| EHJ72951 | 87 |
| CL3492 | KJ579209 | 2094 | 16.48 | 10.62 | Crooked neck protein | peptide alpha-N-acetyltransferase [EC:2.3.1.88] |
| XP001653815 | 83 |
| CL5827 | KJ579210 | 1608 | 682.84 | 198.28 | Sterol carrier protein 2/3- oxoacyl-CoA thiolase | Sterol carrier protein 2 [EC:2.3.1.176] |
| AAT72922 | 94 |
| CL622 | KJ579211 | 1425 | 146.13 | 113.81 | Fatty acid beta-oxidation complex subunit beta | acetyl-CoA acyltransferase [EC:2.3.1.16] |
| ADB57045 | 95 |
| CL7064 | KJ579212 | 543 | 76.93 | 186.14 | Acetyltransferase | Peptide alpha-N-acetyltransferase [EC:2.3.1.88] |
| AGQ45625 | 100 |
| Unigenel520 | KJ579213 | 1191 | 136.8 | 89.85 | Thiolase 1 | aeetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| AGG54999 | 93 |
| Unigene5444 | KJ579214 | 1113 | 2.35 | 15.28 | Thiolase 3 | aeetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| AGG55001 | 80 |
| Unigene7635 | KJ579215 | 891 | 23.37 | 46.32 | Palmitoyl-protein thioesterase 1 like isoform XI | Predicted acetyltransferases and hydrolases |
| XP004931556 | 73 |
| Unigene8748 | KJ579216 | 549 | 13.51 | 9.31 | Dynactin 4 protein | Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
| NP001040117 | 99 |
| CL197 | KJ579217 | 1080 | 20.88 | 20.02 | Monoacylglycerol acyl transferase | 2-acylglycerol O-acyltransferase 2 [EC:2.3.1.22] |
| AHH25136 | 72 |
| CL2038 | KJ579218 | 1443 | 5.54 | 30.82 | Transmembrane protein nessy | lysophospholipid acyltransferase 5 [EC:2.3.1.23 2.3.1.-] |
| XP004933932 | 76 |
| CL2800 | KJ579219 | 1638 | 20.71 | 27.68 | Dihydroxyacetone phosphate acyltransferase | glyceronephosphate O-acyltransferase [EC:2.3.1.42] |
| XP004921835 | 64 |
| Unigene21478 | KJ579220 | 540 | 4.47 | 0.05 | Thiolase 2 | aeetyl-CoA acyltransferase 2 [EC:2.3.1.16] |
| AGG55000 | 73 |
| Unigene2251 | KJ579221 | 819 | 1.54 | 0.17 | Acetyltransferase | lysophosphatidate acyltransferase [EC:2.3.1.51] |
| AGQ45623 | 97 |
| Unigene4319 | KJ579222 | 1152 | 7.58 | 11.83 | Acyltransferase AGPAT6 | lysophosphatidic acid acyltransferase/[EC:2.3.1.51] |
| AGG54998 | 96 |
| CL3297 | KJ579223 | 1296 | 12.72 | 12.03 | Endophilin-A-like isoform XI | Endophilin-A |
| XP004929694 | 86 |
| CL3384 | KJ579224 | 1269 | 69.16 | 84.67 | Ipase 1-like | Predicted hydrolases or acyltransferases |
| XP004927335 | 57 |
| CL3414 | KJ579225 | 1428 | 7.08 | 16.56 | Sterol O-acyltransferase 1 | Sterol O-aeyltransferase [EC:2.3.1.26] |
| EHJ66395 | 66 |
| CL3666 | KJ579226 | 1272 | 38.77 | 53.18 | Putative glycerol-3-phosphate acyltransferase | type I keratin, acidic |
| EHJ77802 | 56 |
| CL3797 | KJ579227 | 1164 | 10.06 | 4.91 | Hypothetical protein KGM19212 | [EC:2.3.1.-] |
| EHJ67359 | 73 |
| CL5775 | KJ579228 | 1179 | 22.36 | 14.42 | l-acylglycerol-3-phosphate O- acyltransferase ABHD5 | abhydrolase domain-containing protein 5 [EC:2.3.1.51] |
| XP004927229 | 85 |
| CL6732 | KJ579229 | 1071 | 7.31 | 8.66 | Epoxide hydrolase 4-like protein | Soluble epoxide hydrolase [EC:3.3.2.10] |
| AGC92732 | 66 |
| CL8534 | KJ579230 | 1410 | 157.51 | 175.16 | Dihydrolipoamide succinyltransferase component of 2-oxoglutarate dehydrogenase, partial | 2-oxoglutarate dehydrogenase E2 component (dihydrolipoamide succinyltransferase) [EC:2.3.1.61] |
| BAM20532 | 93 |
| CL8651 | KJ579231 | 675 | 99.26 | 87.58 | Hypothetical protein KGM 17,353 | Pyruvate dehydrogenase E2 component (dihydrolipoamide acetyltransferase) [EC:2.3.1.12] |
| EHJ75541 | 75 |
| CL914 | KJ579232 | 1962 | 19.4 | 17.94 | Carnitine o-acyltransferase | Carnitine O-palmitoyltransferase 2 [EC:2.3.1.21] |
| XP004929482 | 75 |
| CL9375 | KJ579233 | 2073 | 16.36 | 9.35 | Juvenile hormone epoxide hydrolase-like protein 3 | Predicted hydrolases or acyltransferases |
| NP001159619 | 60 |
| Unigenel0654 | KJ579234 | 1440 | 10.41 | 5.11 | Lipoamide acyltransferase | 2-oxoisovalerate dehydrogenase E2 component (dihydrolipoyl transacylase) [EC:2.3.1.168] |
| XP004926652 | 71 |
| Unigenel5439 | KJ579235 | 924 | 0.58 | 6.57 | Serine hydrolase-like protein 2- like | [EC:3.1.-.-] |
| XP004926488 | 55 |
| Unigene6166 | KJ579236 | 1446 | 5.56 | 27.08 | Glycerol-3-phosphate acyltransferase 4 | Glycerol-3-phosphate O-acyltransferase 3/4 [EC:2.3.1.15] |
| XP004925117 | 84 |
| Unigene7548 | KJ579237 | 996 | 15.04 | 13.97 | Probable serine hydrolase-like | Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
| XP004924867 | 70 |
| Unigene7854 | KJ579238 | 522 | 3.92 | 1.09 | acyltransferase AGPAT5 | Lysophosphatidate acyltransferase [EC:2.3.1.51] |
| AGG54997 | 83 |
| Unigene8703 | KJ579239 | 1200 | 3.71 | 3.53 | Lysophospholipid acyltransferase 1-like | Lysophospholipid acyltransferase 1/2 [EC:2.3.1.51 2.3.1.-] |
| XP004927037 | 74 |
Fig. 10Functional assay of putative acetyltransferase genes. Y-axis represents the total amount of acetate esters (sum of Z5-10:OAc, Z7-12:OAc, and Z9-14:OAc) produced by the yeast cells transformed with candidate genes (±95 % confidence interval, n = 3). Negative control is yeast cell (∆ATF1) transformed with empty vector and the yeast strain overexpressing ATF1 gene serves as positive control. None of the 34 candidate genes produces significantly higher amount of acetate esters compared to the negative control (overlapping 95 % confidence intervals), whereas the ATF1 produces a 45-fold increase in acetate production