| Literature DB >> 35883402 |
Jiabo Wang1, Xiaowei Li2, Wei Peng3, Jincheng Zhong1, Mingfeng Jiang1.
Abstract
The yak is the largest meat-producing mammal around the Tibetan Plateau, and it plays an important role in the economic development and maintenance of the ecological environment throughout much of the Asian highlands. Understanding the genetic components of body weight is key for future improvement in yak breeding; therefore, genome-wide association studies (GWAS) were performed, and the results were used to mine plant and animal genetic resources. We conducted whole genome sequencing on 406 Maiwa yaks at 10 × coverage. Using a multiple loci mixed linear model (MLMM), fixed and random model circulating probability unification (FarmCPU), and Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK), we found that a total of 25,000 single-nucleotide polymorphisms (SNPs) were distributed across chromosomes, and seven markers were identified as significantly (p-values < 3.91 × 10-7) associated with the body weight trait,. Several candidate genes, including MFSD4, LRRC37B, and NCAM2, were identified. This research will help us achieve a better understanding of the genotype-phenotype relationship for body weight.Entities:
Keywords: Tibetan Plateau; body weight; genome-wide association study; heritability; yak
Year: 2022 PMID: 35883402 PMCID: PMC9311934 DOI: 10.3390/ani12141855
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 3.231
Figure 1The distribution of body weights in 2019. In total 406 weights of individuals yaks were observed from 23 to 485 Kg. The density of phenotype values was drawn as vertical bar. The red line fits normal distribution.
Figure 2The distribution of genotype and population stratification. In total 25,537 SNPs are marked in the 29 autosomes and two (X and Y) sex chromosomes. The relative positions in the chromosomes were used to indicate marker density (A). The heterozygosity frequency of all 406 individuals is shown as bar plot (B). Marker density and accumulation frequency are plotted in the figure (C). All markers were used to interpret population structure with PCA (D).
Figure 3Manhattan plots with all genotypes with three GWAS methods. The GWAS results of three GWAS methods were integrated into circle multiple Manhattan plots (A). The outer ring is the marker density and the significant markers were marked as red star. The markers detected by more than two methods were drawn with gray string line. The QQ plots of multiple methods were also integrated into figure (B).
The candidate genes’ information of associated significant SNPs in the 200 kb region. The SNP No. indicates order number in the whole marker list. The chromosome and position mean the physical location information in the genome data. The gene names were annotated from GTF file in the BosGru_v3.0 reference genome. The values in the brackets are distance of base pair between such marker and nearest gene. Class indicates the type of gene, and the region transcribed shows the region SNPs located in.
| SNP No. | Chromosome | Position (bp) | Gene Name |
|---|---|---|---|
| rs13559 | 1 | 25,773,622 | ENSBGRG00000000052-ENSBGRG00000000053 |
| rs137207 | 3 | 28,379,856 | |
| rs371363 | 7 | 87,945,452 |
|
| rs10942 | 1 | 22,314,135 |
|
| rs118493 | 2 | 140,542,613 |
|
| rs815163 | 19 | 16,238,800 |
Figure 4Phenotype distribution among the genotypes of the associated SNPs. The population was divided with the genotype of associated SNPs containing rs13559 (A), rs137207 (B), and rs371363 (C). In the figure there is only one individual with AA or GG genotype for SNP rs13559 or rs137207. For these three associated SNPs, there are no heterozygous genotypes observed in all individuals.