| Literature DB >> 34146095 |
Gaik Tamazian1, Pavel Dobrynin2, Anna Zhuk2, Daria V Zhernakova2,3, Polina L Perelman4, Natalia A Serdyukova4, Alexander S Graphodatsky4, Aleksey Komissarov5, Sergei Kliver4, Nikolay Cherkasov1,6, Alan F Scott7, David W Mohr7, Klaus-Peter Koepfli8,9, Stephen J O'Brien2,10, Ksenia Krasheninnikova2,11.
Abstract
The Puma lineage within the family Felidae consists of 3 species that last shared a common ancestor around 4.9 million years ago. Whole-genome sequences of 2 species from the lineage were previously reported: the cheetah (Acinonyx jubatus) and the mountain lion (Puma concolor). The present report describes a whole-genome assembly of the remaining species, the jaguarundi (Puma yagouaroundi). We sequenced the genome of a male jaguarundi with 10X Genomics linked reads and assembled the whole-genome sequence. The assembled genome contains a series of scaffolds that reach the length of chromosome arms and is similar in scaffold contiguity to the genome assemblies of cheetah and puma, with a contig N50 = 100.2 kbp and a scaffold N50 = 49.27 Mbp. We assessed the assembled sequence of the jaguarundi genome using BUSCO, aligned reads of the sequenced individual and another published female jaguarundi to the assembled genome, annotated protein-coding genes, repeats, genomic variants and their effects with respect to the protein-coding genes, and analyzed differences of the 2 jaguarundis from the reference mitochondrial genome. The jaguarundi genome assembly and its annotation were compared in quality, variants, and features to the previously reported genome assemblies of puma and cheetah. Computational analyzes used in the study were implemented in transparent and reproducible way to allow their further reuse and modification. © The American Genetic Association. 2021.Entities:
Keywords: zzm321990 Puma yagouaroundizzm321990 ; 10X Genomics Chromium; Felidae; genome annotation; linked reads; whole genome assembly
Mesh:
Year: 2021 PMID: 34146095 PMCID: PMC8558579 DOI: 10.1093/jhered/esab036
Source DB: PubMed Journal: J Hered ISSN: 0022-1503 Impact factor: 2.645
Software and databases used for the assembly and analysis of the jaguarundi genome
| Analysis | Software or Database | Version |
|---|---|---|
| Genome assembly | Supernova | 2.1.1 |
| Assembly assessment | QUAST | 5.0.2 |
| BUSCO | 4.1.4 | |
| OrthoDB’s mammalian_odb10 dataset | Creation date: 2020-09-10, number of BUSCOs: 9226, number of species: 24 | |
| BlobTools2 | 2.3.3 | |
| UpSetR | 1.4.0 | |
| Reference-based repeat annotation | RepeatMasker | 4.0.8 (run with blastp version 2.0MP-WashU) |
| RepBase Update database | 20181026 | |
| Dfam database | Consensus 20181026 | |
| De novo repeat annotation | WindowMasker | 1.0.0 (from package blast 2.2.31) |
| Annotation of protein-coding genes | BUSCO | 4.0.4 |
| OrthoDB’s carnivora_odb10 dataset | Creation date: 2019-11-20, number of BUSCOs: 14502, number of species: 12 | |
| BLAT | 35 | |
| AUGUSTUS | 3.3.3 | |
| BEDTools | 2.27.1 | |
| Read mapping | BWA | 0.7.17-r1188 |
| Samtools | 1.11 (using HTSlib 1.11) | |
| Variant calling | FreeBayes | 1.3.4 |
| Variant effect prediction | SnpEff | 5.0 |
| Mitochondrial genome analysis | NCBI BLAST | 2.9.0+ (from package blast 2.9.0) |
| Circos | 0.69–8 | |
| LAST | 1060 | |
| Auxiliary tools | Snakemake | 5.10.0 and 5.32.0 |
| GNU Parallel | 20150122 | |
| R | 3.6.3 | |
| GenomicRanges | 1.38.0 | |
| tidyverse | 1.3.0 |
The citations are given in the text. The table does not list routines from the JDGA package.
Figure 1.Summary of the jaguarundi genome assembly. The assembly scaffolds are shown by dark-gray bars arranged on the inner side of the circle. The circle radius is the length of the longest scaffold in the assembly. The scaffold bars follow each other clockwise according to the scaffold lengths starting from the longest scaffold, which is represented by the red sector at the center of the circle. The bar widths decrease with the scaffold size resulting in higher density of smaller scaffolds while turning clockwise. The light-gray spiral at the center of the plot shows the cumulative scaffold count for the corresponding point on the circle. Both scaffold lengths (dark gray) and cumulative counts (light gray) are shown on a log scale. The orange circular stripe denotes the area which corresponds to scaffold lengths greater than the N50 scaffold length. The light-orange circular stripe marks the area between the N50 and N90 scaffold lengths. The outer ring of the circle visualizes GC and gap contents of the scaffolds. The wave-like pattern in the outer ring section between 90% and 0% marks is an overplotting artifact caused by multiple small scaffolds corresponding to that region. The summary of OrthoDB’s mammalia_odb10 dataset genes identified in the assembly by BUSCO is given in the upper right corner of the figure. The gap percentage of the assembly and the percentages of GC and AT base pairs in the assembly sequence without gaps are given in the lower right corner of the figure.
Figure 2.Fragments of scaffolds from the jaguarundi genome assembly arranged on domestic cat chromosomes. The fragments are shown by blue blocks with the black outline. Short fragments are shown by black bars without the blue fill. White blocks show domestic cat chromosomes with putative centromere positions as given in the Felis_catus_9.0 assembly of the domestic cat genome. The fragments were formed by consecutive BUSCO genes from OrthoDB’s mammalia_odb10 dataset. The order of the BUSCO genes in the fragments was preserved between jaguarundi scaffolds and domestic cat chromosomes. Each block contains at least 2 BUSCO genes.
Figure 3.Intersections of BUSCO genes from 4 categories: single copy (S), missing (M), fragmented (F), and duplicated (D) in the genome assemblies of jaguarundi (PYA), cheetah (AJU), and puma (PCO).