| Literature DB >> 31609423 |
Sarah B Kingan1, Julie Urban2, Christine C Lambert1, Primo Baybayan1, Anna K Childers3, Brad Coates4, Brian Scheffler5, Kevin Hackett6, Jonas Korlach1, Scott M Geib7.
Abstract
BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region.Entities:
Mesh:
Year: 2019 PMID: 31609423 PMCID: PMC6791401 DOI: 10.1093/gigascience/giz122
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1.w. (A) Location of specimen collection (green marker), near the Reading Pagoda on Mt. Penn (Reading, Berks County, Pennsylvania, USA [40.33648 N, 75.90471 W]); (B) adult female Lycorma delicatula; (C) the host Ailanthus altissima tree (tree of heaven) from which the female adult sample was collected on 26 August 2018. Late nymph stage and adults can be seen covering the trunk of this host tree.
Figure 2.Lycorma delicatula input DNA and resulting library. FEMTO Pulse traces and “gel” images (inset) of the genomic DNA (gDNA) input (black) and the final library (blue) before sequencing.
Spotted lanternfly de novo genome assembly stats for the FALCON-Unzip and curated assemblies
| Assembly version | FALCON-Unzip | Curated assembly |
|---|---|---|
| Primary assembly size | 2.395 Gb | 2.252 Gb |
| Number of primary contigs | 4,209 | 2,927 |
| Contig length N50 | 1.423 Mb | 1.520 Mb |
| Haplotig assembly size (proportion of primary length) | 1.249 Gb (52%) | 1.349 Gb (60%) |
| Number of haplotigs | 10,103 | 10,652 |
| Haplotig N50 | 185.5 kb | 178.1 kb |
| BUSCO complete | 96.8% | 96.8% |
| BUSCO duplicate | 3.3% | 2.4% |
Assembly contiguity and BUSCO completeness stats are shown after FALCON-Unzip, and after curation to recategorize duplicated haplotypes in the primary contigs and removal of repetitive and redundant haplotigs and bacterial contigs. For complete BUSCO stats see Table S1.
Figure 3.Assembly visualization. The contiguity and completeness of the L. delicatula genome assembly is visualized as a circle, with the full circle representing the full assembly length of ∼2.3 Gb. The longest contig was 10.0 Mb, and the assembly has uniform GC content throughout, with very few contigs <50 kb in length.
Comparison of the spotted lanternfly genome assembly with previously described planthopper species assemblies, highlighting the improvements with regard to the required number of insect individuals, sequencing libraries, assembly sizes, and contiguity qualities
| Species |
|
|
|
|
|---|---|---|---|---|
| Number of individuals (source) | ∼5,000 (F13 from inbred line) | ∼120 (F6 from inbred line) | ∼100 (F22 from inbred line) | 1 (field-collected) |
| Number of sequencing libraries | 16 (+fosmid libraries) | 17 | 47 | 1 |
| Assembly size | 1.14 Gb | 0.72 Gb | 0.54 Gb | 2.25 Gb |
| Contig N50 | 24 kb | 71 kb | 118 kb | 1,520 kb |
[32].
[31] and Q. Wu (personal communication).
[33] and F. Cui (personal communication).