| Literature DB >> 33944937 |
Shangzhe Zhang1, Wenyu Liu1, Xinfeng Liu1, Xin Du1, Ke Zhang1, Yang Zhang2, Yongwu Song3, Yunnan Zi4, Qiang Qiu1, Johannes A Lenstra5, Jianquan Liu1.
Abstract
Structural variants (SVs) represent an important genetic resource for both natural and artificial selection. Here we present a chromosome-scale reference genome for domestic yak (Bos grunniens) that has longer contigs and scaffolds (N50 44.72 and 114.39 Mb, respectively) than reported for any other ruminant genome. We further obtained long-read resequencing data for 6 wild and 23 domestic yaks and constructed a genetic SV map of 372,220 SVs that covers the geographic range of the yaks. The majority of the SVs contains repetitive sequences and several are in or near genes. By comparing SVs in domestic and wild yaks, we identified genes that are predominantly related to the nervous system, behavior, immunity, and reproduction and may have been targeted by artificial selection during yak domestication. These findings provide new insights in the domestication of animals living at high altitude and highlight the importance of SVs in animal domestication.Entities:
Keywords: zzm321990 Bos grunnienszzm321990 ; domestication; reference genome; structural variants
Mesh:
Year: 2021 PMID: 33944937 PMCID: PMC8382902 DOI: 10.1093/molbev/msab134
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Assembly statistics comparison between BosGru2.0 and BosGru3.0.
| Assembly | BosGru2.0 | BosGru3.0 |
|---|---|---|
| Total length (bp) | 2,645,161,911 | 2,832,776,395 |
| Number of contigs | 41,192 | 414 |
| Contig N50 (Mb) | 1.41 | 44.72 |
| Scaffold N50 (Mb) | 1.41 | 114.39 |
| Chromosome number | 0 | 31 |
| Unplaced contig number | 41,192 | 383 |
| Number of gaps | 192,002 | 646 |
| GC content (%) | 41.7 | 42.0 |
| Protein-coding genes | 20,499 | 21,232 |
Fig. 1(A) Geographic distribution of all domestic and wild yaks sampled in this research. (B) Genetic groups of 91 domestic and wild yaks in total based on short-read whole-genome sequences with population structure K = 5. Triangles indicate samples selected for long-read whole-genome sequencing. Orange: domestic yak; Blue: wild yak. GS, Gansu; NP, Nepal; PK, Pakistan; QH, Qinghai; SC, Sichuan; XZ, Xizang; YN, Yunnan; WY, Wild yak. (C) Neighbor-joining tree constructed based on SNPs of all long-read samples. (D) Domestication-related SVs in the region of MAGI2.