| Literature DB >> 35323562 |
Amali H Thrimawithana1, Chen Wu1, John T Christeller2, Robert M Simpson2, Elena Hilario1, Leah K Tooman1, Doreen Begum1,3, Melissa D Jordan1, Ross Crowhurst1, Richard D Newcomb1,3, Alessandro Grapputo4.
Abstract
The light brown apple moth, Epiphyas postvittana is an invasive, polyphagous pest of horticultural systems around the world. With origins in Australia, the pest has subsequently spread to New Zealand, Hawaii, California and Europe, where it has been found on over 500 plants, including many horticultural crops. We have produced a genomic resource, to understand the biological basis of the polyphagous and invasive nature of this and other lepidopteran pests. The assembled genome sequence encompassed 598 Mb and has an N50 of 301.17 kb, with a BUSCO completion rate of 97.9%. Epiphyas postvittana has 34% of its assembled genome represented as repetitive sequences, with the majority of the known elements made up of longer DNA transposable elements (14.07 Mb) and retrotransposons (LINE 17.83 Mb). Of the 31,389 predicted genes, 28,714 (91.5%) were assigned to 11,438 orthogroups across the Lepidoptera, of which 945 were specific to E. postvittana. Twenty gene families showed significant expansions in E. postvittana, including some likely to have a role in its pest status, such as cytochrome p450s, glutathione-S-transferases and UDP-glucuronosyltransferases. Finally, using a RAD-tag approach, we investigated the population genomics of this pest, looking at its likely patterns of invasion.Entities:
Keywords: Epiphyas postvittana; Lepidoptera genome; invasive species; light brown apple moth; pest species; polyphagy; population genomics
Year: 2022 PMID: 35323562 PMCID: PMC8951345 DOI: 10.3390/insects13030264
Source DB: PubMed Journal: Insects ISSN: 2075-4450 Impact factor: 2.769
Figure 1GenomeScope k-mer profile plots for short-read PE libraries illustrating the distribution of 21-mers in (A) 169 bp insert size PE data and (B) 260 bp insert size PE data.
Summary statistics of the assembly, features and quality assessment of the Epiphyas postvittana genome in comparison with those of other lepidopteran species.
| Species |
|
| |||||
|---|---|---|---|---|---|---|---|
| Genome assembly: | |||||||
| Assembly size (Mb) | 598.1 | 772.9 | 333.0 | 337.0 | 394.1 | 438.3 | 431.7 |
| Number of scaffolds | 14,077 | 1717 | 1916 | 997 | 1819 | 3597 | 43,622 |
| Max scaffold length (Mb) | 3.70 | 34.60 | 8.92 | 6.15 | 3.94 | 4.40 | 16.12 |
| N50 scaffold size (kb) | 301.17 | 8915.45 | 4648.10 | 1000.40 | 737.18 | 915.40 | 3717.00 |
| Number of contigs | 46,180 | 2221 | 7885 | 24,228 | 14,357 | 13,636 | 88,842 |
| N50 contig length (kb) | 26.2 | 862.5 | 140.0 | 18.3 | 49.4 | 68.4 | 15.5 |
| Genomic_features: | |||||||
| Protein coding gene | 31,389 | 17,184 | 14,384 | 17,086 | 18,071 | 15,317 | 14,623 |
| Repeats (%) | 33.8 | 42.9 | 16.7 | 14.6 | 34.0 | 31.8 | 43.6 |
| GC (%) | 38.3 | 37.4 | 35.5 | 36.1 | 38.4 | 36.6 | 38.8 |
| Mean CDS length (bp) | 1204.0 | 1461.0 | 1512.3 | 1330.9 | 1385.0 | 1565.2 | 1212.7 |
| Mean exon length (bp) | 230.0 | 256.9 | 316.7 | 309.2 | 214.0 | 235.8 | 222.2 |
| Mean intron length (bp) | 1254.0 | 1205.3 | 1603.3 | 1379.4 | 1224.0 | 1938.4 | 1084.1 |
| Mean number of exons per gene | 5.20 | 5.68 | 7.04 | 6.20 | 6.47 | 6.64 | 5.45 |
| Quality_assessment: | |||||||
| Genome | 98.5 (97.9) | 98.5 (97.8) | 98.8 (98.4) | 98.5 (97.1) | 92.7 (89.9) | 99.0 (98.4) | 94.5 (92.1) |
| Protein (OGS) | 95.7 (92.0) | 93.8 (90.6) | 97.8 (97.3) | 98.4 (96.0) | 80.1 (75.0) | 99.0 (97.9) | 96.3 (91.5) |
* Present = complete + fragmented.
Repeat classes in the Epiphyas postvittana genome compared with those of other Lepidoptera.
| Species |
|
|
|
|
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Repeat Types | Length (Mb) | P% | Length (Mb) | P% | Length (Mb) | P% | Length (Mb) | P% | Length (Mb) | P% | Length (Mb) | P% | Length (Mb) | P% |
| DNA elements | 14.07 | 2.35 | 26.95 | 3.49 | 21.92 | 6.58 | 0.22 | 0.45 | 7.50 | 1.90 | 8.4 | 1.96 | 69.07 | 10.44 |
| LINE | 17.83 | 2.98 | 68.51 | 8.86 | 6.66 | 2.00 | 0.13 | 0.27 | 20.62 | 5.23 | 46.27 | 10.81 | 101.25 | 15.31 |
| LTR | 4.59 | 0.77 | 11.39 | 1.47 | 1.51 | 0.45 | 0.06 | 0.12 | 9.86 | 2.5 | 2.41 | 0.56 | 4.55 | 0.69 |
| SINE | 6.12 | 1.02 | 21.03 | 2.72 | 5.47 | 1.64 | - | - | 2.01 | 0.51 | 8.51 | 1.99 | 8.62 | 1.30 |
| Simple repeat | 5.11 | 0.85 | 5.08 | 0.66 | 3.19 | 1.92 | - | - | - | - | 4.52 | 1.06 | 6.26 | 0.95 |
| Other | 0.51 | 0.09 | 1.81 | 1.39 | 3.19 | 1.92 | - | - | 0.000057 | 0 | 0.67 | 0.16 | 0.93 | 0.14 |
| Unclassified | 153.74 | 25.71 | - | - | 16.79 | 5.04 | 0.08 | 0.16 | 111.25 | 28.23 | 73.89 | 17.26 | 54.55 | 8.25 |
| Total | 201.97 | 33.77 | 341.54 | 42.87 | 55.54 | 16.68 | 49.00 | 14.60 | 151.24 | 38.37 | 136.17 | 31.8 | 245.23 | 37.33 |
Figure 2A phylogenetic tree of 22 insect species with Folsomia candida as an outgroup, obtained using the aligned protein sequences of 207 single-copy orthologs present in all 23 species, constructed using the ML method implemented in IQ-TREE. Branch lengths represent substitutions per site. All nodes have a bootstrap support of 100%. Numbers at the nodes and after each species name represent the number of gene families with an increase/decrease in gene number, respectively, estimated by CAFE. On the right, each bar shows the total gene counts per species grouped in the following categories: single-copy genes in all 23 species and in all except one species, gene duplicated in all or except one species, lineage-specific ortholog genes (in Lepidoptera, Diptera, Coleoptera, Hymenoptera, Hemiptera, Blattodea and Entomobryomorpha), species-specific ortholog genes, genes with other ortholog relationships and unassigned genes.
Number of SNP types, percentage and distribution in genic and intergenic regions of the Epiphyas postvittana genome.
| SNP Type | Genic | Intergenic | Total |
|---|---|---|---|
| Biallelic | 10,803 (57.56) | 5986 (31.9) | 16,789 (89.46) |
| Triallelic | 1261 (6.72) | 585 (3.12) | 1846 (9.84) |
| Tetraallelic | 96 (0.51) | 36 (0.19) | 132 (0.70) |
| Total | 12,160 (64.79) | 6607 (35.21) | 18,767 (100.0) |
Among brackets % calculated on the total number of SNPs.
Epiphyas postvittana population genetic parameters per population.
| Country | Population | N | pi | Theta | D |
|---|---|---|---|---|---|
| AUS | Gumeracha (GUM) | 7052 | 0.0012 | 0.0015 | −0.0325 |
| Wagga Wagga (NSW) | 5998 | 0.0012 | 0.0014 | −0.0423 | |
| Tasmania (TAS) | 5006 | 0.0009 | 0.0010 | −0.0291 | |
| Dalkeith (WAA) | 5890 | 0.0009 | 0.0012 | −0.0352 | |
| USA | California (CA) | 7934 | 0.0009 | 0.0011 | −0.0282 |
| NZ | Awatere (AWA) | 4325 | 0.0009 | 0.0010 | −0.0339 |
| Clyde (CLY) | 4005 | 0.0007 | 0.0008 | −0.0259 | |
| Hawke’s Bay (HB) | 4523 | 0.0009 | 0.0011 | −0.0373 | |
| Motueka (MOT) | 5625 | 0.0009 | 0.0011 | −0.0299 | |
| Te Puke (TPU) | 3669 | 0.0008 | 0.0009 | −0.0422 |
Average FST across the whole genome of Epiphyas postvittana (calculated as average of all gene and intergenic FSTs). For population abbreviation, see Table 4.
| GUM | NSW | TAS | WAA | CA | AWA | CLY | HB | MOT | TPU | |
|---|---|---|---|---|---|---|---|---|---|---|
| GUM | - | 0.10611 | 0.1199 | 0.14067 | 0.12127 | 0.10816 | 0.15075 | 0.11949 | 0.11523 | 0.14398 |
| NSW | - | 0.14548 | 0.1359 | 0.13567 | 0.12461 | 0.16092 | 0.12251 | 0.13378 | 0.14405 | |
| TAS | - | 0.1885 | 0.16099 | 0.12862 | 0.18155 | 0.15695 | 0.13225 | 0.19199 | ||
| WAA | - | 0.17158 | 0.16146 | 0.1973 | 0.1635 | 0.1703 | 0.18416 | |||
| CA | - | 0.14844 | 0.19214 | 0.1566 | 0.15068 | 0.18032 | ||||
| AWA | - | 0.14505 | 0.11311 | 0.09855 | 0.14918 | |||||
| CLY | - | 0.12674 | 0.14836 | 0.15415 | ||||||
| HB | - | 0.13135 | 0.08647 | |||||||
| MOT | - | 0.16317 | ||||||||
| TPU | - |
Figure 3Distribution of average Fst values among populations of Epiphyas postvittana calculated from the SNPs within gene models in the genome.
Figure 4Historical relationships and inferred migration events (indicated by the coloured arrows) among Epiphyas postvittana populations obtained with TreeMix.