| Literature DB >> 32183717 |
Qi Chen1,2, Hanbo Zhao1,2, Ming Wen1,2, Jiaxin Li1,2, Haifeng Zhou1,2, Jiatong Wang1,2, Yuxin Zhou1,2, Yulin Liu1,2, Lixin Du1,2, Hui Kang1,2, Jian Zhang3, Rui Cao4, Xiaoming Xu5, Jing-Jiang Zhou1,2,6, Bingzhong Ren1,2, Yinliang Wang7,8.
Abstract
BACKGROUND: The fall webworm Hyphantria cunea is an invasive and polyphagous defoliator pest that feeds on nearly any type of deciduous tree worldwide. The silk web of H. cunea aids its aggregating behavior, provides thermal regulation and is regarded as one of causes for its rapid spread. In addition, both chemosensory and detoxification genes are vital for host adaptation in insects.Entities:
Keywords: Adaptation; Fall webworm; Gene expansion; Genetics; Genome; Metagenome; Molecular evolution
Mesh:
Year: 2020 PMID: 32183717 PMCID: PMC7079503 DOI: 10.1186/s12864-020-6629-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview of sequenced lepidopteran genomes
| Species | Assembly size (MB) | Protein-coding | Contig | GC | Intron (%) | Repeat (%) | Protein number | Pseudogenes |
|---|---|---|---|---|---|---|---|---|
| 393.47 | 19,340 | 1.85 | 39.8 | 30.70 | 34 | 21,661 | 145 | |
| 227.02 | 13,301 | 4.78 | 34 | 24.8 | n.a. | 16,620 | 66 | |
| 278.42 | 14,850 | 0.08 | 34.4 | n.a. | n.a. | 17,745 | 102 | |
| 243.23 | 15,322 | 0.49 | 34.9 | 45.5 | n.a. | 21,602 | 232 | |
| 245.87 | 13,152 | 1.42 | 33 | 33.3 | 22.7 | 18,966 | 70 | |
| 559.30 | 15,319 | 3.09 | 36.57 | 29.1 | 35.71 | 18,207 | 300 | |
| 337.07 | 15,081 | 2.35 | 37.5 | 39.3 | 14.6 | 21,035 | 65 | |
| 638.21 | 16,912 | 2.88 | 37.8 | 17.7 | 53.5 | 16,912 | n.a. | |
| 481.82 | 16,166 | 1.55 | 38.8 | 16.3 | 43.6 | 22,571 | 64 |
Fig. 1Overview of the H. cunea genome. a Types and numbers of homologous gene families among twelve species. b Maximum likelihood phylogenetic analysis among twelve insect species based on genomic data. The twelve species are Apis mellifera, Bombyx mori, Drosophila melanogaster, Helicoverpa armigera, Hyphantria cunea, Operophtera brumata, Papilio machaon, Papilio polytes, Papilio xuthus, Pieris rapae, Tribolium castaneum and Plutella xylostella. The numbers next to the nodes are the estimated node ages in million year (scale is 50.0 million years), and the colored box below indicates the geochronologic scale from Permian to Neogene
The number of chemosensory and detoxification genes of H. cunea and other insect
| CSPs | 20 | 23 | 13 | 22 | 21 |
| OBPs | 66 | 29 | 15 | 30 | 43 |
| ORs | 72 | 51 | 29 | 74 | 73 |
| GRs | 46 | 10 | 9 | 19 | 76 |
| IRs | 21 | 33 | 31 | 52 | 31 |
| P450s | 109 | 112 | 133 | 112 | 83 |
| GSTs | 25 | 11 | 11 | 11 | 26 |
| CCEs | 74 | 53 | 42 | 53 | 73 |
| APNs | 24 | 17 | 28 | 17 | 14 |
| ABCs | 78 | 120 | 90 | 124 | 51 |
| UGTs | 32 | 39 | 11 | 39 | 45 |
Gene families expanded in H. cunea as calculated by CAFE
| Divergence size | Species size | Gene ID | Annotation | |
|---|---|---|---|---|
| 2 | 4 | 0.006 | EVM0007968 EVM0014260 EVM0001423. | Adenosine deaminase-related growth factor A |
| 2 | 4 | <1e-7 | EVM0003438 EVM0001582 EVM0008984 EVM0011535 | ATP-dependent helicase YHR031C |
| 2 | 3 | 0.002 | EVM0003192 EVM0011563 EVM0002688 | Cytoskeleton |
| 1 | 2 | <1e-7 | EVM0000776 | Kazal-type serine proteinase inhibitor 1; gag-like protein |
| 2 | 3 | 0.001 | EVM0015065 EVM0001703 EVM0008164 | Uncharacterized protein |
| 1 | 2 | <1e-7 | EVM0000473 EVM0004248 | Retroelement polyprotein |
| 2 | 3 | 0.021 | EVM0008576 EVM0007758 EVM0001530 | Carboxyl/choline esterase CCE033a |
| 1 | 2 | 0.047 | EVM0012638 EVM0014428 | PREDICTED: serine/threonine-protein kinase SMG1-like |
| 1 | 2 | 0.041 | EVM0011073 EVM0014934 | Uncharacterized protein |
| 1 | 2 | <1e-7 | EVM0014320 EVM0009912 | Uncharacterized protein LOC103572275 |
| 1 | 2 | 0.021 | EVM0001502 EVM0011366 | Hypothetical protein KGM_05165 |
| 1 | 2 | <1e-7 | EVM0009792 EVM0013551 | Uncharacterized protein LOC101742343 |
| 1 | 3 | <1e-7 | EVM0002515 EVM0008850 | Uncharacterized protein |
| 1 | 2 | 0.021 | EVM0014968 EVM0000652 | Lysosomal-trafficking regulator-like |
| 1 | 2 | 0.001 | EVM0013380 EVM0003562 | Chemosensory protein precursor |
| 1 | 2 | 0.024 | EVM0008361 EVM0005241 | Insulin-like growth factor 2 mRNA-binding protein |
| 1 | 2 | <1e-7 | EVM0014740 EVM0000171 | GTPase-activating protein pac-1-like |
| 1 | 2 | <1e-7 | EVM0012402 EVM0000677 | Hemolymph protein 14 |
| 1 | 2 | 0.001 | EVM0013787 EVM0001757 | Glutathione S-transferase |
| 1 | 2 | 0.005 | EVM0003715 EVM0006371 | Hypothetical protein 3 - cabbage looper transposon TED |
| 1 | 2 | 0.001 | EVM0009126 EVM0008595 | Retinol dehydrogenase 11-like |
| 1 | 2 | 0.037 | EVM0011934 EVM0013760 | Apolipophorins |
| 1 | 2 | 0.027 | EVM0012995 EVM0001222 | S-antigen protein |
| 1 | 2 | <1e-7 | EVM0007537 EVM0012615 | Cecropin A |
| 1 | 2 | <1e-7 | EVM0004477 EVM0000882. | UDP-glycosyltransferase |
| 1 | 2 | <1e-7 | EVM0007937 EVM0005030 | Endonuclease and reverse transcriptase-like protein |
| 1 | 2 | 0.023 | EVM0014113 EVM0014355 | Amino acid transport and metabolism; Serine protease 24 |
| 1 | 3 | 0.012 | EVM0012395 EVM0001096 | ATP-dependent DNA helicase MER |
| 1 | 2 | 0.044 | EVM0003209 EVM0014358 | Proline-rich protein; lebocin-like protein |
| 1 | 3 | 0.001 | EVM0010172 EVM0008560 EVM0012239 | Uncharacterized protein |
| 1 | 2 | <1e-7 | EVM0001934 EVM0003728 | Hypothetical protein 2 - cabbage looper transposon TED |
| 1 | 2 | <1e-7 | EVM0009953 EVM0015050 | Piggybac transposable element-derived |
| 1 | 2 | <1e-7 | EVM0010118 EVM0011622 | Receptor guanylate cyclase |
| 1 | 2 | 0.014 | EVM0006212 EVM0007033 | Yolk protein 2 |
| 1 | 2 | 0.021 | EVM0001362 EVM0000363 | Uncharacterized protein |
| 1 | 2 | 0.021 | EVM0011788 EVM0007752 | Pickpocket protein 28-like |
| 1 | 3 | <1e-7 | EVM0000920 EVM0007834 EVM0004965 | Uncharacterized protein |
Fig. 2Expression heatmap of the expanded gene families in different tissues and stages. The colors represent the level of gene expression from low (purple) to high (yellow) as shown by log10 (FPKM + 0.000001). E (eggs), L2 (second-instar larvae), L4 (fourth-instar larvae), P (pupae), F (female adults), M (male adults); An (antenna), H (head), T (thorax), Ab (abdomen), L (leg), and Sg (female sexual gland). a Expression heatmap of the CSP gene family. The transcripts that were grouped together and specifically expressed in the antennae are EVM0003477, EVM0000271, EVM0002854, EVM0002317, EVM0009785, EVM0004106, EVM0004147, EVM0005331 and EVM0008758. The transcripts that were highly expressed in pupae are EVM0014103, EVM0014851, EVM0003329 and EVM0014642. The transcripts that were specifically expressed in the sex gland are EVM0010490 and EVM0014431. b Expression heatmap of the CCE gene family. c Expression heatmap of the GST gene family. d Expression heatmap of the UGT gene family. The heatmaps were constructed with HemI (windows_1_0_win32_x64) using the DEGs of each expanding family. The expression levels are presented by the color bar from low expression as green with negative numbers and high expression level as yellow with positive numbers
Genes under significant positive selection (LRT, p < 0.05)
| Gene ID | Annotation | |
|---|---|---|
| EVM0013226 | 2.79E-11 | ADP-ribosylation factor 6 |
| EVM0009669 | 2.58E-09 | RNA-binding protein |
| EVM0012652 | 4.88E-09 | Proteasome non-ATPase regulatory subunit |
| EVM0001215 | 2.12E-07 | WD repeat domain-containing protein 83 |
| EVM0000300 | 3.76E-05 | Adenylosuccinate lyase |
| EVM0012350 | 0.000269395 | Calcium uptake protein 1 |
| EVM0013607 | 0.000594247 | SNF-related serine/threonine-protein kinase |
| EVM0005265 | 0.000734493 | Glycerophosphocholine phosphodiesterase GPCPD1 |
| EVM0001021 | 0.0008973 | Flavin reductase (NADPH) |
| EVM0012819 | 0.001467642 | D-aspartate oxidase |
| EVM0014230 | 0.001971819 | Cyclin-Y-like protein 1 |
| EVM0010560 | 0.002186741 | Ras-related protein Rab-32 |
| EVM0008629 | 0.00253065 | Pyridine nucleotide-disulfide oxidoreductase domain-containing protein 1 |
| EVM0004149 | 0.005305157 | Pentatricopeptide repeat-containing protein 1, mitochondrial |
| EVM0004370 | 0.006201581 | Replication factor C subunit 3 |
| EVM0001595 | 0.00637425 | WW domain-binding protein 2 |
| EVM0004769 | 0.00712973 | Heparan-sulfate 6-O-sulfotransferase 2 |
| EVM0004055 | 0.00764959 | Transmembrane protein 147 |
| EVM0000132 | 0.01233362 | Carbonic anhydrase 1 |
| EVM0004959 | 0.01293744 | Phosphatidylinositol glycan |
| EVM0000082 | 0.01403601 | Disco-interacting protein 2 |
| EVM0015138 | 0.05027551 | Activin receptor type-1 |
| EVM0003028 | 0.15142045 | Cyclin A |
| EVM0013889 | 0015549715 | Phosphoserine aminotransferase |
| EVM0012220 | 0.015868726 | Glycogen-binding subunit 76A |
| EVM0002611 | 0.016694366 | E3 ubiquitin-protein ligase synoviolin A |
| EVM0005111 | 0.016757707 | Prefoldin subunit 5 |
| EVM0010778 | 0.018825687 | Metaxin-1 |
| EVM0006647 | 0.021983019 | Vacuolar ATP synthase subunit E; V-type H + -transporting ATPase subunit E (A) |
| EVM0008673 | 0.024327862 | Lysosome-associated membrane glycoprotein 1 |
| EVM0000028 | 0.024646682 | Transmembrane protein 183 |
| EVM0010281 | 0.029467391 | Secretion-regulating guanine nucleotide exchange factor-like |
| EVM0004884 | 0.030539908 | Formin-binding protein 1-like |
| EVM0013117 | 0.034084621 | Golgi apparatus protein 1-like |
| EVM0013726 | 0.03547635 | Chitin binding domain protein |
| EVM0010443 | 0.036822765 | 3′(2′),5′-bisphosphate nucleotidase 1-like |
| EVM0010054 | 0.041270885 | Very-long-chain enoyl-CoA reductase |
| EVM0009687 | 0.046760334 | Cytochrome P450 306a1 |
| EVM0002230 | 0.047306291 | Ribosomal protein L27 |
Fig. 3Taxonomic-category-based microbiota composition in the H. cunea larval midgut. a Microbiota proportion at the phylum level. b Microbiota proportion at the order level. c Microbiota proportion at the class level. d Microbiota proportion at the family level. e Microbiota proportion at the genus level. f Microbiota proportion at the species level
Fig. 4Metagenomic genes in the midgut of H. cunea larvae. a KEGG pathway functional annotations of metagenomic genes. b Functional enrichments of metagenomic genes. The X-axis indicates the proportion of genes annotated in each category
Fig. 5Silencing results of HcunFib-H and phenotype analysis. a Anatomy of the larval silk gland of H. cunea. ASG indicates anterior silk gland, MSG indicates middle silk gland and PSG indicates posterior silk gland. b RT-qPCR results 4 days after dsRNA injection for RNAi. Statistical differences were evaluated by t-tests. *** p < 0.001, n.s., not significant. Data indicate the means + SEM, N = 5. The expression levels were normalised by the expression of β-actin gene in different treatment samples. c & d Diameter of the silk ball after RNAi; statistical differences were evaluated by t-tests. *** p < 0.001, n.s., not significant, scale bar = 0.5 mm. e & f Comparison of silk spinning between wild-type and dsHcunFib-H-injected insects
Identification of silk-web-related genes
| Functional classification | Gene name | Gene ID |
|---|---|---|
| Silk protein genes | Fibroin light chain gene ( | EVM0009358 |
| Fibroin heavy chain gene ( | EVM0005282 | |
| 25 kDa silk protein ( | EVM0009847 | |
| Sericin gene | None | |
| Silk regulate genes | Fibroinase | EVM0000430 |
| Fibroin-modulator-binding protein-1 | EVM0012647 | |
| Silk gland factor | EVM0004972; EVM0012444 | |
| Protease inhibitor | Kazal-type serine protease inhibitor ( | EVM0003147;EVM0009039;EVM0001538;EVM0000158; EVM0015003;EVM0006856; EVM0006205;EVM0013167; EVM0000098 |
Pacifastin-related serine protease inhibitor (pacifastin) | EVM0013021 | |
| Phosphatidylethanolamine-binding protein ( | EVM0007560;EVM0006444; EVM0011908;EVM0008732; EVM0002652; | |
| Alpha-2-macroglobulin ( | EVM0001565; EVM0002258 | |
| Cysteine proteinase inhibitor | EVM0003884;EVM0006961; EVM0010793;EVM0002065; EVM0006398 | |
| Carboxypeptidase inhibitor | EVM0003948 | |
| Cystatin | EVM0010793; EVM0012410 | |
| Proteasome inhibitor | EVM0011839 |