| Literature DB >> 34076242 |
Emily A Ellis1, Caroline G Storer1, Akito Y Kawahara1.
Abstract
BACKGROUND: The availability of thousands of genomes has enabled new advancements in biology. However, many genomes have not been investigated for their quality. Here we examine quality trends in a taxonomically diverse and well-known group, butterflies (Papilionoidea), and provide draft, de novo assemblies for all available butterfly genomes. Owing to massive genome sequencing investment and taxonomic curation, this is an excellent group to explore genome quality.Entities:
Keywords: Papilionoidea; accessibility; genomics; life sciences; open data
Mesh:
Year: 2021 PMID: 34076242 PMCID: PMC8170690 DOI: 10.1093/gigascience/giab041
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Pre-assembled and de novo assembled genomes for each butterfly and subfamily shown on a topological sketch of [38]. Species-richness estimates and topology are presented for comparison only.
Figure 2:Natural log–normalized N50 and BUSCO scores plotted for both pre-assembled (squares) and de novo (circles) genome assemblies. Colors denote taxonomic family designation, as in Fig. 1. Letters correspond to inset images of representative species.
Highest quality genomes by butterfly subfamily, according to N50 and BUSCO scores
| Taxonomy | Organism | Accession ID | N50 (bp) | BUSCO 3 (C%) |
|---|---|---|---|---|
| Hesperiidae; Coeliadinae |
| SRR7174556 | 2,532 | 87.3 |
| Hesperiidae; Eudaminae |
| SRR7174453 | 9,497 | 76.7 |
| Hesperiidae; Hesperiinae |
| GCA_003671415.1 | 4,153,133 | 98.3 |
| Hesperiidae; Heteropterinae |
| SRR9330377 | 4,259 | 69.8 |
| Hesperiidae; Pyrginae |
| GCA_002930495.1 | 558,064 | 97.3 |
| Hesperiidae; Trapezitinae |
| SRR9330370 | 932 | 21.7 |
| Lycaenidae; Curetinae |
| SRR10158559 | 1,108 | 28.3 |
| Lycaenidae; Polyommatinae |
| SRR6727422 | 13,909 | 91.3 |
| Lycaenidae; Theclinae |
| GCA_001625245.1 | 233,537 | 95.5 |
| Nymphalidae; Charaxinae |
| SRR5175869 | 1,531 | 49.5 |
| Nymphalidae; Danainae |
| GCA_009731565.1 | 9,209,872 | 93.9 |
| Nymphalidae; Heliconiinae |
| LepBase_Heliconius_erato_demophoon_v1 | 10,688,973 | 97.4 |
| Nymphalidae; Limentidinae |
| SRR1504973 | 631 | 12.6 |
| Nymphalidae; Morphinae |
| GCA_009936525.1 | 1,720,500 | 35.2 |
| Nymphalidae; Nymphalinae |
| GCA_002938995.1 | 2,988,984 | 98.3 |
| Nymphalidae; Satyrinae |
| GCA_902806685.1 | 15,230,192 | 97.8 |
| Papilionidae; Baroniinae |
| SRR8954515 | 1,886 | 59.0 |
| Papilionidae; Papilioninae |
| GCA_000836235.1 | 6,198,915 | 97.6 |
| Papilionidae; Parnassiinae |
| SRR8954536 | 3,584 | 59.4 |
| Pieridae; Coliadinae |
| GCA_012273895.1 | 9,214,832 | 95.6 |
| Pieridae; Dismorphiinae |
| GCA_900199415.2 | 857,189 | 97.2 |
| Pieridae; Pierinae |
| LepBase_Pieris_napi_v1.1 | 12,597,868 | 94.4 |
| Riodinidae; Nemeobiinae |
| SRR10158562 | 1,806 | 30.3 |
| Riodinidae; Riodininae |
| GCA_002245505.1 | 206,312 | 95.6 |
C: complete.
Highest 50 quality papilionoid genome assemblies, regardless of subfamily, ranked using natural log–normalized N50 and BUSCO Complete scores
| Rank | Organism | Accession ID | N50 (bp) | BUSCO 3 (C%) |
|---|---|---|---|---|
| 1 |
| GCA_902806685.1 | 15,230,192 | 97.8 |
| 2 |
| LepBase_Pieris_napi_v1.1 | 12,597,868 | 94.4 |
| 3 |
| LepBase_Heliconius_erato_demophoon_v1 | 10,688,973 | 97.4 |
| 4 |
| GCA_009731565.1 | 9,209,872 | 98.0 |
| 5 |
| GCA_012273895.1 | 9,214,832 | 95.6 |
| 6 |
| GCA_000836235.1 | 6,198,915 | 97.6 |
| 7 |
| GCA_011763625.1 | 13,111,833 | 65.0 |
| 8 |
| GCA_003671415.1 | 4,153,133 | 98.3 |
| 9 |
| GCA_003118415.2 | 4,560,862 | 92.9 |
| 10 |
| GCA_000836215.1 | 3,672,263 | 91.8 |
| 11 |
| GCA_002938995.1 | 2,988,984 | 98.3 |
| 12 |
| LepBase_Junonia_coenia_JC_v1.0 | 1,571,165 | 98.2 |
| 13 |
| GCA_004959915.1 | 1,465,393 | 93.9 |
| 14 |
| GCA_001298355.1 | 1,174,287 | 95.5 |
| 15 |
| GCA_008963455.1 | 1,011,763 | 98.1 |
| 16 |
| GCA_900199415.2 | 857,189 | 97.2 |
| 17 |
| GCA_010014825.1 | 889,656 | 89.4 |
| 18 |
| GCA_900239965.1 | 638,282 | 97.6 |
| 19 |
| GCA_001856805.1 | 617,301 | 98.0 |
| 20 |
| GCA_013186455.1 | 596,599 | 94.3 |
| 21 |
| GCA_002930495.1 | 558,064 | 97.3 |
| 22 |
| GCA_001278395.1 | 525,349 | 95.1 |
| 23 |
| GCA_001586405.1 | 299,140 | 91.1 |
| 24 |
| GCA_009936525.1 | 1,720,500 | 35.2 |
| 25 |
| GCA_001625245.1 | 233,537 | 95.5 |
| 26 |
| GCA_000931545.1 | 230,841 | 95.5 |
| 27 |
| GCA_002245505.1 | 206,312 | 95.6 |
| 28 |
| GCA_010014985.1 | 193,720 | 96.5 |
| 29 |
| GCA_000313835.2 | 194,302 | 95.6 |
| 30 |
| GCA_009667785.1 | 212,945 | 88.3 |
| 31 |
| GCA_002245475.1 | 175,106 | 93.9 |
| 32 |
| LepBase_Heliconius_burneyi_helico3 | 106,325 | 96.5 |
| 33 |
| GCA_000716385.1 | 119,328 | 83.0 |
| 34 |
| GCA_009982905.1 | 95,765 | 92.5 |
| 35 |
| LepBase_Heliconius_hecalesia_helico3 | 68,855 | 96.5 |
| 36 |
| Lepbase_Heliconius_demeter_helico3 | 67,995 | 96.8 |
| 37 |
| LepBase_Heliconius_besckei_helico3 | 64,778 | 95.8 |
| 38 |
| LepBase_Heliconius_himera_helico3 | 48,684 | 96.5 |
| 39 |
| LepBase_Heliconius_sara_helico3 | 43,390 | 94.3 |
| 40 |
| LepBase_Heliconius_telesiphe_helico3 | 42,672 | 94.7 |
| 41 |
| LepBase_Eueides_tales_helico3 | 32,552 | 94.7 |
| 42 |
| SRR7174358 | 24,120 | 90.7 |
| 43 |
| LepBase_Agraulis_vanillae_helico3 | 21,413 | 94.6 |
| 44 |
| LepBase_Dryas_iulia_helico3 | 21,916 | 92.3 |
| 45 |
| SRR4341246 | 18,269 | 92.3 |
| 46 |
| GCA_900499025.1 | 16,525 | 88.0 |
| 47 |
| SRR8954516 | 14,618 | 93.5 |
| 48 |
| SRR6727422 | 13,909 | 91.3 |
| 49 |
| SRR6727440 | 13,611 | 87.0 |
| 50 |
| SRR10158585 | 43,550 | 35.1 |