| Literature DB >> 35106558 |
Xinyuan Zhang1, Chuyu Lin2, Haimeng Li3,4, Sixia Liu1, Qing Wang3,4, Shangchen Yang5, Minhui Shi3,4, Sunil Kumar Sahu3, Yixin Zhu3,4, Jiangang Wang3, Junxuan Huang3, Yiyin Hu6,7, Jieyao Yu6,7, Shaofang Zhang6,7, Guanglong Li8, Wenyuan Guan9, Haorong Lu6,7, Tianming Lan3, Yanchun Xu1.
Abstract
The green peafowl (Pavo muticus) is facing a high risk of extinction due to the long-term and widespread threats of poaching and habitat conversion. Here, we present a high-quality chromosome-level genome assembly of the green peafowl with high contiguity and accuracy assembled by PacBio sequencing, DNBSEQ short-read sequencing, and Hi-C sequencing technologies. The final genome size was estimated to be 1.049 Gb, whereas 1.042 Gb of the genome was assigned to 27 pseudochromosomes. The scaffold N50 length was 75.5 Mb with a complete BUSCO score of 97.6%. We identified W and Z chromosomes and validated them by resequencing 14 additional individuals. Totally, 167.04 Mb repetitive elements were identified in the genome, accounting for 15.92% of the total genome size. We predicted 14,935 protein-coding genes, among which 14,931 genes were functionally annotated. This is the most comprehensive and complete de novo assembly of the Pavo genus, and it will serve as a valuable resource for future green peafowl ecology, evolution, and conservation studies.Entities:
Keywords: PacBio sequencing; chromosome-level genome assembly; conservation; green peafowl
Mesh:
Year: 2022 PMID: 35106558 PMCID: PMC8857919 DOI: 10.1093/gbe/evac015
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Statistics of Genome Assembly and Annotation of Pavo muticus
| Item | Category | Number |
|---|---|---|
| Sequencing data | PacBio (Gb) | 159.45 |
| Genome survey (Gb) | 139.52 | |
| Hi-C (Gb) | 692.45 | |
| RNA-seq (Gb) | 35.60 | |
| Assembly (PacBio) | Estimated genome size (Gb) | 1.05 |
| Assembled genome size (Gb) | 1.049 | |
| Contig number | 2324 | |
| Contig N50 (Mb) | 25.4 | |
| Longest scaffold (Mb) | 113.2 | |
| Assembly (Hi-C) | Assembled genome size (Gb) | 1.049 |
| Scaffold number | 115 | |
| Scaffold N50 (Mb) | 75.5 | |
| Longest scaffold (Mb) | 151.5 | |
| Annotation | GC content (%) | 42.1 |
| Repeat sequences (%) | 14.1 | |
| Number of protein-coding genes | 14,935 | |
| Number of functionally annotated genes | 14,931 | |
| Average gene length (kb) | 20.1 | |
| Average exon length (bp) | 171.9 | |
| Average intron length (kb) | 2.1 | |
| Average exon per gene | 9.8 |
Fig. 1.Genome architecture of Pavo muticus. (a) Genome features and syntenic analysis with the chicken genome. 1: GC content; 2: repeat density; 3: gene density; 4: syntenic result. (b) Improved genome annotation. HHGs represented the number of highly homologous genes with chicken’s gene set. (c) Averaged male: female depth ratios of scaffolds longer than 1 Mb. Red dots represented autosomes. (d) Genome-wide genetic diversity of eight macrochromosomes and two sex chromosomes in female individuals.