| Literature DB >> 31682256 |
Sihan Lu1,2, Jie Yang1, Xuelei Dai3, Feiang Xie4, Jinwu He1, Zhiwei Dong2, Junlai Mao4, Guichun Liu1,2, Zhou Chang2, Ruoping Zhao2, Wenting Wan1, Ru Zhang1, Yuan Li5, Wen Wang1,2,6, Xueyan Li2.
Abstract
BACKGROUND: Papilio bianor Cramer, 1777 (commonly known as the Chinese peacock butterfly) (Insecta, Lepidoptera, Papilionidae) is a widely distributed swallowtail butterfly with a wide number of geographic populations ranging from the southeast of Russia to China, Japan, India, Vietnam, Myanmar, and Thailand. Its wing color consists of both pigmentary colored scales (black, reddish) and structural colored scales (iridescent blue or green dust). A high-quality reference genome of P. bianor is an important foundation for investigating iridescent color evolution, phylogeography, and the evolution of swallowtail butterflies.Entities:
Keywords: zzm321990 Papilio bianorzzm321990 ; butterfly; chromosome-level reference genome; high-throughput chromosome conformation capture map; single-molecule real-time (SMRT) sequencing
Mesh:
Year: 2019 PMID: 31682256 PMCID: PMC6827417 DOI: 10.1093/gigascience/giz128
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Characterization of Papilio bianor. (a) Female adult. Left, dorsal view; right, ventral view (scale bar = 20.0 mm; photo by Zhiwei Dong). (b) Heat map of chromosomal interactions. Each chromosome is framed with a blue block, and each scaffold is framed with a green block. (c) Circos plot of P. bianor chromosome-level reference genome with the previously released Papilio xuthus genome (obtained from a Chinese group) [9]. Shown from outermost to innermost are (1) gene density, (2) repeat element density, (3) GC content, and (4) syntenic regions with P. xuthus (left).
Comparison of quality and composition of different butterfly genomes
| Family | Species | Genome size (Mb) | Genome size without gap (Mb) | Heterozygosity[ | Scaffold N50 (kb) | BUSCO[ |
| GC content (%) | Repeat (%) | Exon (%) | Intron (%) | Number of proteins (k) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Papilionidae |
| 421 | 421 | 1.8 | 13,120 | 96.3 | NA | 36.6 | 55.3 | 5.05 | 27.44 | 15.4 |
|
| 244 | 238 | 1.0 | 6,199 | 97.6 | NA | 33.8 | 22.4 | 8.59 | 45.50 | 13.1 | |
|
| 281 | 266 | 1.2 | 1,150 | 95.5 | 98 | 32.3 | 22.3 | 7.37 | 30.36 | 15.5 | |
|
| 227 | 218 | NA | 3,672 | 91.8 | NA | 34.0 | 23.8 | 12.97 | 48.58 | 12.2 | |
|
| 233 | 219 | NA | 5,457 | 96.6 | NA | 32.8 | 22.5 | 11.31 | 43.17 | 12.4 | |
|
| 375 | 361 | 2.3 | 231 | 95.5 | 98 | 35.4 | 22.0 | 5.07 | 25.60 | 15.7 | |
| Hesperiidae |
| 567 | 536 | 1.5 | 558 | 97.3 | 98 | 35.3 | 25.0 | 3.57 | 28.40 | 15.9 |
|
| 298 | 290 | 1.5 | 525 | 95.1 | 98 | 34.4 | 15.5 | 6.96 | 31.60 | 17.4 | |
|
| 429 | 427 | 0.1 | 4,153 | 98.3 | 99 | 34.7 | 25.8 | 4.59 | 30.90 | 14.1 | |
| Pieridae |
| 246 | 243 | 1.5 | 617 | 98.0 | 99 | 32.7 | 22.7 | 7.91 | 33.30 | 13.2 |
|
| 406 | 347 | 1.2 | 257 | 97.7 | 97 | 39.0 | 17.2 | 6.20 | 25.50 | 16.5 | |
| Nymphalidae |
| 249 | 242 | 0.6 | 716 | 98.0 | 96 | 31.6 | 16.3 | 8.40 | 28.10 | 15.1 |
|
| 274 | 270 | NA | 194 | 95.6 | NA | 32.8 | 24.9 | 6.38 | 25.40 | 12.8 | |
|
| 390 | 361 | NA | 119 | 83.0 | 97 | 32.6 | 27.5 | 4.34 | 31.20 | 16.7 | |
|
| 475 | 470 | NA | 638 | 97.6 | NA | 36.5 | 25.8 | 4.73 | 38.36 | 22.6 | |
| Riodinidae |
| 809 | 783 | 0.5 | 206 | 95.6 | 99 | 34.9 | 34.8 | 2.25 | 19.60 | 15.4 |
|
| 855 | 824 | 1.3 | 175 | 93.9 | 99 | 35.0 | 38.8 | 2.17 | 20.50 | 15.6 | |
| Lycaenidae |
| 729 | 689 | 1.2 | 233 | 95.5 | 96 | 37.1 | 34.0 | 3.11 | 24.00 | 16.5 |
aThe heterozygosity of P. bianor, P. machaon, and P. xuthus was calculated on the basis of k-mer distribution analysis. The heterozygosity values of others (P. glaucus, A. lyciades, L. accius, M. ursus violae, P. rapae, P. sennae, D. plexippus, C. nemesis, C. virginiensis, C. cecrops) were estimated using the Genome Analysis Toolkit (GATK).
bBUSCO is calculated in this study.
NA: not available in the referenced citation.
Figure 2:Genomic analysis of Papilio bianor. (a) Breakdown of the whole-genome assemblies into different functional classes in Papilio. (b) Venn diagram of the shared gene families of Papilio. (c) The dynamic changes of the effective population size were plotted using PSMC software, with 100 bootstrap replicates to test the robust variations. The parameter “g” represents the generation time in years, and the parameter “μ” means the per generation mutation rate. Pb: Papilio bianor; Pgl: Papilio glaucus; Pma: Papilio machaon; Pme: Papilio memnon; Ppol: Papilio polytes; Pxu: Papilio xuthus. (d) Maximum likelihood phylogenetic tree of Papilionoidea constructed by the concatenated alignment of 1,378 1-to-1 single-copy ortholog genes. The numbers in the square brackets on the nodes are the 95% confidence intervals of divergence time. The red dots are fossil evidence downloaded from the TimeTree website [89], and the black dots are inferred time obtained from the TimeTree website. Both were used to calibrate divergent time.