Xin Jiang, Qian Zhang, Yaoguo Qin, Hang Yin, Siyu Zhang, Qian Li, Yong Zhang, Jia Fan, Julian Chen.
Abstract
BACKGROUND: Sitobion miscanthi is an ideal model for studying host plant specificity, parthenogenesis-based phenotypic plasticity, and interactions between insects and other species at various trophic levels, such as viruses, bacteria, plants, and natural enemies. However, the genome of this species has not yet been sequenced and published. Here, we analyzed the entire genome of a parthenogenetic female aphid colony using Pacific Biosciences long-read sequencing and Hi-C data to generate chromosome-length scaffolds and a highly contiguous genome assembly.
Keywords: Sitobion avenae; Sitobion miscanthi; Hi-C assembly; annotation; aphid; genome; long-read sequencing
Year: 2019 PMID: 31430367 PMCID: PMC6701489 DOI: 10.1093/gigascience/giz101
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1. Winged and wingless S. miscanthi. Top, winged adult; bottom, wingless adult.
Figure 2. 19-mer distribution for the genome size prediction of S. miscanthi.
Assessment results based on 2 strategies
| Genome feature/assessment strategy | 19-mer analysis | PacBio |
|---|---|---|
| Genome size (Mb) | 393.12 | 397.90 |
| Guanine-cytosine content (%) | 31.70 | 30.25 |
| Repeat sequence content (%) | 35.07 | 24.14 |
| Heterozygosity (%) | 0.98 | 0.57 |
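
The 19-mer column above comes from a k-mer depth distribution. A common way to turn such a distribution into a genome size is to divide the total number of k-mer occurrences by the depth of the main peak. The sketch below illustrates that relationship only; it is not the authors' pipeline. The function name, the `min_depth` error cutoff, and the toy histogram are assumptions made for illustration, and the sketch ignores the heterozygous and repeat peaks that real tools model explicitly.

```python
def estimate_genome_size(histogram, min_depth=3):
    """Estimate genome size from a k-mer depth histogram.

    histogram: dict mapping k-mer depth -> number of distinct k-mers seen
               at that depth.
    min_depth: depths below this are discarded as likely sequencing errors
               (an assumed cutoff, not a value from the study).
    Returns (estimated_size, peak_depth).
    """
    usable = {depth: count for depth, count in histogram.items()
              if depth >= min_depth}
    if not usable:
        raise ValueError("no k-mers at or above min_depth")

    # Main peak: the depth shared by the largest number of distinct k-mers.
    peak_depth = max(usable, key=usable.get)

    # Total k-mer occurrences = sum over depths of depth * distinct k-mers.
    total_kmers = sum(depth * count for depth, count in usable.items())

    return total_kmers / peak_depth, peak_depth


# Toy histogram (hypothetical values, not data from the paper).
toy_histogram = {1: 5000, 2: 800, 18: 100, 19: 300,
                 20: 500, 21: 300, 22: 100, 40: 50}
size, peak = estimate_genome_size(toy_histogram)
print(f"peak depth = {peak}, estimated size = {size:.0f} k-mer positions")
```

Low-depth k-mers are dropped because they mostly reflect sequencing errors; leaving them in would inflate the estimate.
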
Assembly statistics of the S. miscanthi genome and 7 other aphid genomes based mainly on NGS
| Genome assembly/species | S. miscanthi | | | | | | | |
|---|---|---|---|---|---|---|---|---|
| Assembly size (Mb) | 397.9 | 319.4 | 393.0 | 541.6 | 302.9 | 347.3 | 405.7 | 294.0 |
| Contig count | 1,148 | 16,689 | 49,357 | 60,623 | 66,000 | 8,249 | 56,508 | 22,569 |
| Contig N50 (bp) | 1,638,329 | 96,831 | 12,578 | 28,192 | 15,844 | 71,400 | 17,908 | 45,572 |
| Scaffold count | 656 | 15,587 | 5,641 | 23,924 | 8,397 | 4,018 | 49,286 | 4,724 |
| Scaffold N50 (bp) | 36,263,045 | 116,185 | 397,774 | 518,546 | 174,505 | 435,781 | 23,273 | 437,960 |
| Genome annotation | | | | | | | | |
| Gene count | 16,006 | 26,286 | 19,097 | 36,195 | 17,558 | 18,529 | 28,688 | 14,694 |
| Mean gene length (kb) | 7.805 | 1.543 | 1.316 | 1.964 | 1.520 | 1.839 | 1.222 | 1.964 |
| Mean exon count per gene | 6.7 | 5.20 | 3.0 | 5.0 | 6.2 | 6.1 | 3.7 | 10.1 |
| Mean exon length (bp) | 288 | 162 | 249.0 | 394.7 | 246 | 299 | 178 | 218 |
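
The contig and scaffold N50 values in this table follow the usual definition: sort the sequences from longest to shortest and report the length at which the running total first reaches half of the assembly size. A minimal sketch of that calculation is given below. The function name `n_stat` and the toy lengths are illustrative assumptions, not values or code from the study.

```python
def n_stat(lengths, fraction=0.5):
    """Return the Nx statistic (N50 by default) for a list of sequence lengths.

    The sequences are sorted from longest to shortest, and the length of the
    sequence at which the cumulative sum first reaches `fraction` of the
    total assembly length is returned.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")

    threshold = sum(lengths) * fraction
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= threshold:
            return length


# Toy contig lengths in bp (hypothetical, for illustration only).
toy_contigs = [10, 8, 6, 4, 2]
print("N50:", n_stat(toy_contigs))        # half of 30 bp is reached at the 8 bp contig
print("N90:", n_stat(toy_contigs, 0.9))   # 90% of 30 bp is reached at the 4 bp contig
```

Because N50 depends only on the length distribution, an assembly with a few very long scaffolds can have a much larger scaffold N50 than contig N50 even when the total assembly size is unchanged.
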
Figure 3. Hi-C contact heat map of the S. miscanthi genome.
Summary of S. miscanthi genome assembly
| Statistics | Draft scaffolds | Corrected by Hi-C |
|---|---|---|
| Contig number | 1,039 | 1,167 |
| Contig length | 397,907,165 | 397,907,165 |
| Contig N50 (bp) | 2,049,770 | 1,565,814 |
| Contig N90 (bp) | 256,083 | 185,510 |
| Contig max (bp) | 11,219,273 | 10,100,000 |
| Gap number/gap total length (bp) | 0 | 0 |
Detailed classification of repeats in the S. miscanthi genome assembly
| Type | Number | Length (bp) | Rate (%) |
|---|---|---|---|
| Class I (Retrotransposons) | 194,093 | 51,169,345 | 12.86 |
| DIRS (Dictyostelium intermediate repeat sequence) | 1,289 | 695,762 | 0.17 |
| LINE (Long interspersed nuclear element) | 40,230 | 10,832,765 | 2.72 |
| LTR (Long terminal repeats) /Copia | 2,438 | 742,051 | 0.19 |
| LTR/Gypsy | 18,807 | 6,949,790 | 1.75 |
| LTR/Unknown | 7,534 | 3,195,404 | 0.8 |
| PLE (Penelope-like elements)\|LARD (Large retrotransposon derivatives) | 115,765 | 28,920,417 | 7.27 |
| SINE (Short interspersed nuclear element) | 6,665 | 1,075,456 | 0.27 |
| SINE\|TRIM | 15 | 5,478 | 0 |
| TRIM (Terminal repeat retrotransposons in miniature) | 1,116 | 1,281,655 | 0.32 |
| Class I Unknown | 234 | 26,384 | 0.01 |
| Class II (DNA transposons) | 188,820 | 44,184,063 | 11.1 |
| Crypton | 299 | 20,282 | 0.01 |
| Helitron | 5,688 | 1,871,785 | 0.47 |
| MITE (Miniature inverted repeat transposable elements) | 7,972 | 1,434,924 | 0.36 |
| Maverick | 7,888 | 3,289,168 | 0.83 |
| TIR (Terminal inverted repeat) | 89,268 | 22,913,523 | 5.76 |
| Class II unknown | 77,705 | 15,793,696 | 3.97 |
| Potential host gene | 926 | 251,812 | 0.06 |
| SSR (Simple sequence repeats) | 2,611 | 381,142 | 0.1 |
| Unknown | 74,204 | 18,832,522 | 4.73 |
| Identified | 386,450 | 105,110,753 | 26.42 |
| Total | 460,654 | 123,943,275 | 31.15 |
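
The Rate (%) column is consistent with each repeat length divided by the total assembly length of 397,907,165 bp reported in the assembly summary. The sketch below reproduces a few rows under that assumption. The denominator is inferred from the tables rather than stated in the table itself, and the names `ASSEMBLY_LENGTH_BP` and `rate_percent` are illustrative.

```python
# Total contig length from the assembly summary table (bp).
# Using it as the denominator is an assumption inferred from the tables.
ASSEMBLY_LENGTH_BP = 397_907_165

# Selected rows from the repeat classification table (length in bp).
repeat_lengths_bp = {
    "Class I (Retrotransposons)": 51_169_345,
    "Class II (DNA transposons)": 44_184_063,
    "Unknown": 18_832_522,
    "Identified": 105_110_753,
    "Total": 123_943_275,
}


def rate_percent(length_bp, assembly_bp=ASSEMBLY_LENGTH_BP):
    """Share of the assembly covered by a repeat class, in percent (2 d.p.)."""
    return round(100 * length_bp / assembly_bp, 2)


rates = {name: rate_percent(bp) for name, bp in repeat_lengths_bp.items()}
for name, rate in rates.items():
    print(f"{name}: {rate:.2f}%")

# The "Total" row is the sum of the "Identified" and "Unknown" rows.
identified_plus_unknown = (repeat_lengths_bp["Identified"]
                           + repeat_lengths_bp["Unknown"])
print(identified_plus_unknown == repeat_lengths_bp["Total"])
```

The lengths of the individual classes do not add up to the "Identified" length, so the check above is limited to the "Identified" plus "Unknown" rows.
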
Figure 4. The phylogenetic relationships of S. miscanthi with other arthropods.