| Literature DB >> 32317270 |
Graham Wiley1, Matthew J Miller2.
Abstract
Woodpeckers are found in nearly every part of the world and have been important for studies of biogeography, phylogeography, and macroecology. Woodpecker hybrid zones are often studied to understand the dynamics of introgression between bird species. Notably, woodpeckers are gaining attention for their enriched levels of transposable elements (TEs) relative to most other birds. This enrichment of TEs may have substantial effects on molecular evolution. However, comparative studies of woodpecker genomes are hindered by the fact that no high-contiguity genome exists for any woodpecker species. Using hybrid assembly methods combining long-read Oxford Nanopore and short-read Illumina sequencing data, we generated a highly contiguous genome assembly for the Golden-fronted Woodpecker (Melanerpes aurifrons). The final assembly is 1.31 Gb and comprises 441 contigs plus a full mitochondrial genome. Half of the assembly is represented by 28 contigs (contig L50), each of these contigs is at least 16 Mb in size (contig N50). High recovery (92.6%) of bird-specific BUSCO genes suggests our assembly is both relatively complete and relatively accurate. Over a quarter (25.8%) of the genome consists of repetitive elements, with 287 Mb (21.9%) of those elements assignable to the CR1 superfamily of transposable elements, the highest proportion of CR1 repeats reported for any bird genome to date. Our assembly should improve comparative studies of molecular evolution and genomics in woodpeckers and allies. Additionally, the sequencing and bioinformatic resources used to generate this assembly were relatively low-cost and should provide a direction for development of high-quality genomes for studies of animal biodiversity.Entities:
Keywords: Piciformes; hybrid assembly; repetitive elements
Mesh:
Year: 2020 PMID: 32317270 PMCID: PMC7263694 DOI: 10.1534/g3.120.401059
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Male (left) and female (right) Golden-fronted Woodpecker (Melanerpes aurifrons). Photos by Bettina Arrigoni, cropped, and used under CC BY 2.0 license. Original photos available at: https://flickr.com/photos/69683857@N05/39849351035 and https://flickr.com/photos/69683857@N05/26752528708.
Run statistics for three Oxford Nanopore sequencing runs
| Sequencing Run | Reads | Median Read Length | Read Length N50 | Median Read Qual | Total Bases |
|---|---|---|---|---|---|
| MinION Run | 101,989 | 15,105 | 39,203 | 10.5 | 2.29 X 109 |
| PromethION Run 1 | 1.77 X 106 | 9,478 | 30.742 | 10.6 | 27.43 X 109 |
| PromethION Run 2 | 2.09 X 106 | 9,178 | 34,314 | 10.5 | 34.27 X 109 |
Results of genome size estimate using k-mer analysis
| Estimated genome size | ||
|---|---|---|
| 17 | 46 | 1.377 Gb |
| 26 | 42 | 1.404 Gb |
| 31 | 41 | 1.379 Gb |
Assembly contiguity statistics for the initial assembly and after multiple iterations of consensus correction
| Stage | # of contigs | Assembly size | Max contig size | contig L50 | contig L90 | contig N50 | contig N90 |
|---|---|---|---|---|---|---|---|
| flye (no correction) | 1519 | 1,353 Mb | 42.8 Mb | 33 fragments | 136 contigs | 14.5 Mb | 1.4 Mb |
| racon 1 | 847 | 1344 Mb | 46.8 Mb | 29 contigs | 117 contigs | 15.9 Mb | 1.5 Mb |
| racon 2 | 823 | 1343 Mb | 46.8 Mb | 29 contigs | 117 contigs | 15.9 Mb | 1.5 Mb |
| racon 3 | 808 | 1343 Mb | 46.8 Mb | 29 contigs | 117 contigs | 15.9 Mb | 1.5 Mb |
| pilon 2 | 808 | 1346 Mb | 46.8 Mb | 29 contigs | 117 contigs | 15.9 Mb | 1.5 Mb |
| purge_haplotigs | 441 | 1309 Mb | 46.8 Mb | 28 contigs | 100 contigs | 16 Mb | 2.3 Mb |
BUSCO summarized benchmarking at various stages of genome assembly
| Stage | Complete (Single, Duplicate) | Fragmented | Missing |
|---|---|---|---|
| flye (no correction) | 76.2% (74.7%, 1.5%) | 10.6% | 13.2% |
| racon 3 | 76.5% (75.1%, 1.4%) | 11.0% | 12.5% |
| pilon 1 | 92.6% (90.8%, 1.8%) | 4.5% | 2.9% |
| pilon 2 | 92.7% (90.8%, 1.9%) | 4.5% | 2.8% |
| purge_haplotigs | 92.6% (90.9%, 1.7%) | 4.5% | 2.9% |
Figure 2Circularized annotated mitochondrial genome assembly for the Golden-fronted Woodpecker (Melanerpes aurifrons). Figure generated by MitoAnnotator [36].
Figure 3Ideograms showing synteny between Golden-fronted Woodpecker pseudo-chromosomes and chromosomes from three different bird species.
Comparison of various genome assembly statistics for recently published wild bird genomes. * N50 scaffold not reported for assemblies comprised only of ungapped contigs. ** Statistics only reported for scaffolded contigs
| This study |