| Literature DB >> 35775933 |
Shuai Ye1, Chen Zeng2, Jian-Feng Liu1, Chen Wu3, Yan-Fei Song1, Yao-Guo Qin4, Mao-Fa Yang1,5.
Abstract
Neotoxoptera formosana (Takahashi), the onion aphid, is an oligophagous pest that mainly feeds on plants from the Allium genus. It sucks nutrients from the plants and indirectly acts as a vector for plant viruses. This aphid causes severe economic losses to Allium tuberosum agriculture in China. To better understand the host plant specificity of N. formosana on Allium plants and provide essential information for the control of this pest, we generated the entire genome using Pacific Biosciences long-read sequencing and Hi-C data. Six chromosomes were assembled to give a final size of 372.470 Mb, with an N50 scaffold of 66.911 Mb. The final draft genome assembly, from 192 Gb of raw data, was approximately 371.791 Mb in size, with an N50 contig of 24.99 Kb and an N50 scaffold of 2.637 Mb. The average GC content was 30.96%. We identified 73 Mb (31.22%) of repetitive sequences, 14,175 protein-coding genes, and 719 noncoding RNAs. The phylogenetic analysis showed that N. formosana and Pentalonia nigronervosa are sister groups. We found significantly expanded gene families that were involved in the THAP domain, the DDE superfamily endonuclease, zinc finger, immunity (ankyrin repeats), digestive enzyme (serine carboxypeptidase) and chemosensory receptor. This genome assembly could provide a solid foundation for future studies on the host specificity of N. formosana and pesticide-resistant aphid management.Entities:
Keywords: zzm321990 Neotoxoptera formosanazzm321990 ; Hi-C; chromosome-level genome; gene family evolution; genome assembly
Mesh:
Year: 2022 PMID: 35775933 PMCID: PMC9339310 DOI: 10.1093/g3journal/jkac164
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Fig. 1.Neotoxoptera formosana. a) N. formosana damaging Allium tuberosum; b) N. formosana damaging yellow A. tuberosum; c) N. formosana female and nymph.
Fig. 2.The K-mer frequency distribution analysis of Neotoxoptera formosana.
Genome assembly statistics of Neotoxoptera formosana.
| Assembly | Total length (Mb) | Number of scaffolds | N50 length (Mb) | Longest scaffold (Mb) | GC (%) | BUSCO ( | |||
|---|---|---|---|---|---|---|---|---|---|
| C | D | F | M | ||||||
| wtdbg | 393.678 | 2,806 | 2.637 | 24.999 | 31.55 | 92.1 | 2.6 | 2.0 | 5.9 |
| NextPolish | 392.017 | 2,806 | 2.632 | 24.933 | 31.59 | 94.0 | 2.6 | 0.9 | 5.1 |
| Purge_dups | 376.840 | 1,911 | 2.800 | 24.933 | 31.15 | 94.0 | 2.6 | 0.9 | 5.1 |
| 3D-DNA | 372.470 | 461 | 66.911 | 97.256 | 30.98 | 93.9 | 2.3 | 0.9 | 5.2 |
| Final | 371.791 | 357 | 66.908 | 97.223 | 30.96 | 93.9 | 2.3 | 0.9 | 5.2 |
Genome assembly and annotation statistics for Neotoxoptera formosana.
|
| |
|---|---|
| Genome assembly | |
| Assembly size (Mb) | 371.791 |
| Number of scaffolds/contigs | 357/1,259 |
| Longest scaffold/contig (Mb) | 97.223/24.933 |
| N50 scaffold/contig length (Mb) | 66.908/2.772 |
| GC (%) | 30.96 |
| Gaps (%) | 0.024 |
| BUSCO completeness (%) | 93.9% |
| Gene annotation | |
| Protein-coding genes | 14,175 |
| Mean protein length (aa) | 504.5 |
| Mean gene length (bp) | 6,552.6 |
| Exons/introns per gene | 9.3/8.0 |
| Exon (%) | 9.64 |
| Mean exon length | 271.4 |
| Intron (%) | 15.34 |
| Mean intron length | 501.4 |
| BUSCO completeness (%) | 93.9 |
Fig. 3.Hi-C contact map of the Neotoxoptera formosana genome.
Fig. 4.Circos plot that indicates chromosome length, GC content, and protein-coding gene/repeat sequence density.
Fig. 5.Chromosome collinearity analysis graph. Ap: Acyrthosiphon pisum; Nf: Neotoxoptera formosana; Rm: Rhopalosiphum maidis.
Fig. 6.Phylogenetic tree of Neotoxoptera formosana: The branch length represents evolution time, numbers represent the number of expanded, contracted and rapidly evolving (statistically significant, labeled as red) gene families in that branch.
Fig. 7.Gene family evolution of Neotoxoptera formosana. a) Bubble plot of GO enrichment analysis of rapidly expanding gene families; b) bubble plot of KEGG enrichment analysis of rapidly expanding gene families.