| Literature DB >> 34499122 |
Camilla Lo Cascio Sætre1, Fabrice Eroukhmanoff1, Katja Rönkä2,3, Edward Kluen2,3, Rose Thorogood2,3, James Torrance4, Alan Tracey4, William Chow4, Sarah Pelan4, Kerstin Howe4, Kjetill S Jakobsen1, Ole K Tørresen1.
Abstract
The reed warbler (Acrocephalus scirpaceus) is a long-distance migrant passerine with a wide distribution across Eurasia. This species has fascinated researchers for decades, especially its role as host of a brood parasite, and its capacity for rapid phenotypic change in the face of climate change. Currently, it is expanding its range northwards in Europe, and is altering its migratory behavior in certain areas. Thus, there is great potential to discover signs of recent evolution and its impact on the genomic composition of the reed warbler. Here, we present a high-quality reference genome for the reed warbler, based on PacBio, 10×, and Hi-C sequencing. The genome has an assembly size of 1,075,083,815 bp with a scaffold N50 of 74,438,198 bp and a contig N50 of 12,742,779 bp. BUSCO analysis using aves_odb10 as a model showed that 95.7% of BUSCO genes were complete. We found unequivocal evidence of two separate macrochromosomal fusions in the reed warbler genome, in addition to the previously identified fusion between chromosome Z and a part of chromosome 4A in the Sylvioidea superfamily. We annotated 14,645 protein-coding genes, and a BUSCO analysis of the protein sequences indicated 97.5% completeness. This reference genome will serve as an important resource, and will provide new insights into the genomic effects of evolutionary drivers such as coevolution, range expansion, and adaptations to climate change, as well as chromosomal rearrangements in birds.Entities:
Keywords: zzm321990 Acrocephalus scirpaceuszzm321990 ; Hi-C sequencing; genome assembly; long reads; reference genome
Mesh:
Year: 2021 PMID: 34499122 PMCID: PMC8459166 DOI: 10.1093/gbe/evab212
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary Statistics of the Reed Warbler Genome Assembly and Annotation
| Genome Assembly | |||
|---|---|---|---|
| Estimated genome size | 1.13 Gb | ||
| Guanine and cytosine content | 41.91% | ||
| N50 length (contig) | 13 Mb | ||
| Longest contig | 48 Mb | ||
| Total length of contigs | 1.07 Gb | ||
| N50 length (scaffold) | 74.44 Mb | ||
| Longest scaffold | 153.80 Mb | ||
| Total length of scaffolds | 1.08 Gb | ||
| Complete BUSCOs | 95.7% | ||
|
| |||
|
|
|
| |
|
| |||
| DNA | 0.22 | 2.35 Mb | |
| LINE | 4.11 | 44.2 Mb | |
| SINE | 0.09 | 0.98 Mb | |
| LTR | 4.50 | 48.4 Mb | |
| Unknown | 0.55 | 5.9 Mb | |
| Other (satellites, simple repeats, low complexity) | 1.49 | 16 Mb | |
| Total | 10.94 | 117.6 Mb | |
|
| |||
|
| |||
|
| |||
| Predicted genes | 14,645 | ||
| Average coding sequence length (bp) | 1,782 | ||
| Average exon length (bp) | 284 | ||
| Average intron length (bp) | 2918 | ||
| Complete BUSCOs | 97.5% | ||
Fig. 1Circos plot showing the synteny between the reed warbler (on the right side, denoted with the prefix as [Acrocephalus scirpaceus]) and the great tit (left side, prefix pm [Parus major]) genome assemblies. The reed warbler chromosome 6 is a fusion of great tit chromosomes 7 and 8, whereas reed warbler chromosome 8 is a fusion of great tit chromosomes 6 and 9 (see Hi-C contact maps in supplementary fig. 2, Supplementary Material online). The reed warbler chromosome Z corresponds to great tit chromosome Z, and a part of great tit chromosome 4A.