María Recuerda, Joel Vizueta, Cristian Cuevas-Caballé, Guillermo Blanco, Julio Rozas, Borja Milá.
Abstract
The common chaffinch, Fringilla coelebs, is one of the most common, widespread, and well-studied passerines in Europe, with a broad distribution encompassing Western Europe and parts of Asia, North Africa, and the Macaronesian archipelagos. We present a high-quality genome assembly of the common chaffinch generated using Illumina shotgun sequencing in combination with Chicago and Hi-C libraries. The final genome is a 994.87-Mb chromosome-level assembly, with 98% of the sequence data located in chromosome scaffolds and an N50 statistic of 69.73 Mb. Our genome assembly shows high completeness, with a complete BUSCO score of 93.9% using the avian data set. Around 7.8% of the genome contains interspersed repetitive elements. The structural annotation yielded 17,703 genes, 86.5% of which have a functional annotation, including 7,827 complete universal single-copy orthologs out of 8,338 genes represented in the BUSCO avian data set. This new annotated genome assembly will be a valuable resource as a reference for comparative and population genomic analyses of passerine, avian, and vertebrate evolution.
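Two of the figures quoted in the abstract can be illustrated with a short calculation. The annotation count of 7,827 complete orthologs out of 8,338 corresponds to roughly 93.9%, and an N50 is the scaffold length at which scaffolds sorted from longest to shortest first cover half of the assembly. The following is a minimal sketch only: the function names and the toy scaffold lengths are assumptions made for illustration, not data or code from the study.

```python
from typing import Sequence


def percent_complete(complete: int, total: int) -> float:
    """Percentage of BUSCO genes recovered as complete."""
    return 100.0 * complete / total


def n50(lengths: Sequence[int]) -> int:
    """Length L such that scaffolds of length >= L cover at least half the total."""
    half = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half:
            return length
    raise ValueError("empty list of scaffold lengths")


# Counts reported in the abstract for the structural annotation.
busco_pct = percent_complete(7_827, 8_338)

# Toy scaffold lengths (hypothetical, for illustration only).
toy_n50 = n50([80, 70, 50, 40, 30, 20, 10])

print(f"Complete BUSCOs: {busco_pct:.1f}%")
print(f"Toy N50: {toy_n50}")
```

In the toy example the total is 300, so the two longest scaffolds (80 + 70) already reach the halfway mark and the N50 is 70.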
Keywords: Fringilla coelebs; common chaffinch; reference genome; whole genome assembly
Year: 2021 PMID: 33616654 PMCID: PMC8046334 DOI: 10.1093/gbe/evab034
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1. (a) Circos plot comparing the zebra finch (right hemisphere) and the common chaffinch (left hemisphere) genome assemblies. The common chaffinch chromosomes marked with an asterisk (*) show inversions with respect to the zebra finch assembly. (b) Linear synteny plots of the common chaffinch chromosomes showing inversions relative to the zebra finch generated with the R package genoPlotR (Guy et al. 2010). The zebra finch assembly (top) is compared with the common chaffinch assembly (bottom), and numbers designate specific chromosomes.
Genome Statistics and Predicted ncRNAs of the Fringilla coelebs Genome Compared with Other Similarly Sized Avian Species (Melospiza melodia, Taeniopygia guttata, Ficedula albicollis, Manacus vitellinus, and Geospiza fortis), Modified from Louha et al. (2020).
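| | Fringilla coelebs | Melospiza melodia | Taeniopygia guttata | Ficedula albicollis | Manacus vitellinus | Geospiza fortis |
|---|---|---|---|---|---|---|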
| Number of genes | 17,703 | 15,086 | 17,561 | 16,763 | 18,976 | 14,399 |
| Mean gene length (bp) | 15,818 | 14,457 | 26,458 | 31,394 | 27,847 | 30,164 |
| Number of CDSs | 17,703 | 15,086 | 17,561 | 16,763 | 18,976 | 14,399 |
| Mean CDS length (bp) | 1,679 | 1,325 | 1,677 | 1,942 | 1,929 | 1,766 |
| Number of exons | 221,872 | 131,940 | 171,767 | 189,043 | 190,390 | 164,721 |
| Mean exon length (bp) | 165 | 153 | 255 | 253 | 264 | 195 |
| Mean number of exons/gene | 10.16 | 8.67 | 10.25 | 12.22 | 11.51 | 11.41 |
| Number of introns | 200,041 | 116,724 | 153,909 | 171,236 | 171,089 | 149,563 |
| Mean intron length (bp) | 1,902 | 1,695 | 2,930 | 3,257 | 3,294 | 2,813 |
| Total proteins | 21,831 | | | | | |
| ncRNA | | | | | | |
| tRNA | 325 | 267 | 184 | 179 | | |
| miRNA | 140 | 166 | 302 | 510 | | |
| snRNA | 18 | 16 | 44 | 32 | | |
| snoRNA | 126 | 154 | 241 | 199 | | |
| rRNA | 5 | 8 | 100 | 22 | | |
| lncRNA | 17 | 20 | 908 | 1,473 | | |
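The Fringilla coelebs counts in the table can be cross-checked with simple arithmetic. A transcript with n exons has n - 1 introns, so the number of exons minus the number of introns should equal the number of transcripts. The sketch below runs that check under the assumption, not stated in the source, that "Total proteins" counts annotated transcripts. The variable names are illustrative.

```python
# Values taken from the Fringilla coelebs column of the table.
n_genes = 17_703
n_exons = 221_872
n_introns = 200_041
total_proteins = 21_831  # assumed here to equal the number of transcripts

# Each transcript contributes one more exon than introns.
implied_transcripts = n_exons - n_introns

# Mean exons per transcript versus per gene.
exons_per_transcript = n_exons / total_proteins
exons_per_gene = n_exons / n_genes

print(f"Implied transcripts: {implied_transcripts}")
print(f"Exons per transcript: {exons_per_transcript:.2f}")
print(f"Exons per gene: {exons_per_gene:.2f}")
```

Under this assumption the implied transcript count matches the reported total of 21,831, and the tabulated mean of 10.16 exons agrees with a per-transcript rather than a per-gene average.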