| Literature DB >> 36238155 |
Maulana M Naji1, Yifan Jiang2, Yuri T Utsunomiya3, Benjamin D Rosen4, Johann Sölkner1, Chuduan Wang2, Li Jiang2, Qin Zhang2, Yi Zhang2, Xiangdong Ding2, Gábor Mészáros1.
Abstract
Cattle have been essential for the development of human civilization since their first domestication few thousand years ago. Since then, they have spread across vast geographic areas following human activities. Throughout generations, the cattle genome has been shaped with detectable signals induced by various evolutionary processes, such as natural and human selection processes and demographic events. Identifying such signals, called selection signatures, is one of the primary goals of population genetics. Previous studies used various selection signature methods and normalized the outputs score using specific windows, in kbp or based on the number of SNPs, to identify the candidate regions. The recent method of iSAFE claimed for high accuracy in pinpointing the candidate SNPs. In this study, we analyzed whole-genome resequencing (WGS) data of ten individuals from Austrian Fleckvieh (Bos taurus) and fifty individuals from 14 Chinese indigenous breeds (Bos taurus, Bos taurus indicus, and admixed). Individual WGS reads were aligned to the cattle reference genome of ARS. UCD1.2 and subsequently undergone single nucleotide variants (SNVs) calling pipeline using GATK. Using these SNVs, we examined the population structure using principal component and admixture analysis. Then we refined selection signature candidates using the iSAFE program and compared it with the classical iHS approach. Additionally, we run Fst population differentiation from these two cattle groups. We found gradual changes of taurine in north China to admixed and indicine to the south. Based on the population structure and the number of individuals, we grouped samples to Fleckvieh, three Chinese taurines (Kazakh, Mongolian, Yanbian), admixed individuals (CHBI_Med), indicine individuals (CHBI_Low), and a combination of admixed and indicine (CHBI) for performing iSAFE and iHS tests. There were more significant SNVs identified using iSAFE than the iHS for the candidate of positive selection and more detectable signals in taurine than in indicine individuals. However, combining admixed and indicine individuals decreased the iSAFE signals. From both within-population tests, significant SNVs are linked to the olfactory receptors, production, reproduction, and temperament traits in taurine cattle, while heat and parasites tolerance in the admixed individuals. Fst test suggests similar patterns of population differentiation between Fleckvieh and three Chinese taurine breeds against CHBI. Nevertheless, there are genes shared only among the Chinese taurine, such as PAX5, affecting coat color, which might drive the differences between these yellowish coated breeds, and those in the greater Far East region.Entities:
Keywords: Bos taurus; IHS; bos indicus; cattle; fst; iSAFE; selection signature; whole-genome sequence (WGS)
Year: 2022 PMID: 36238155 PMCID: PMC9552183 DOI: 10.3389/fgene.2022.974787
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Alignment summary of the dataset.
| Breeds | Species | N animals | Total reads in millionb | Read lengthc | Mapped readsd | Depth |
|---|---|---|---|---|---|---|
| Fleckvieh |
| 10 | 326.52 | 101 | 0.977 | 5.823 |
| Kazakh |
| 6 | 250.32 | 131 | 0.998 | 8.859 |
| Mongolian |
| 12 | 210.78 | 139 | 0.998 | 8.259 |
| Yanbian |
| 10 | 226.90 | 137 | 0.998 | 8.513 |
| Dabieshan |
| 2 | 345.20 | 96 | 0.992 | 10.257 |
| Dehong |
| 2 | 342.82 | 93 | 0.997 | 10.052 |
| Dengchuan |
| 2 | 346.43 | 94 | 0.997 | 10.346 |
| Fujian |
| 2 | 332.11 | 95 | 0.997 | 9.988 |
| Guanling |
| 2 | 340.23 | 96 | 0.997 | 10.269 |
| Liping |
| 2 | 338.78 | 96 | 0.997 | 10.097 |
| Wenling |
| 2 | 339.14 | 90 | 0.997 | 9.387 |
| Luxi |
| 2 | 324.72 | 95 | 0.998 | 10.133 |
| Nanyang |
| 2 | 369.14 | 96 | 0.998 | 10.333 |
| Tibetan |
| 2 | 301.14 | 96 | 0.998 | 9.636 |
| Qinchuan |
| 2 | 347.76 | 93 | 0.998 | 9.846 |
Assigned species were based on the principal component analysis carried out in this study—Admixed of B taurus and B. indicus; Superscript b, c, d, e were the average values from individuals in each respective breed.
Depth values inferred from SNVs, in the final VCF, file.
FIGURE 1Map of origin for cattle used in this study; (A) Austrian Fleckvieh origin on European map with an inset of China and Austria in the world map; (B) Position of Chinese cattle breeds on the maps.
FIGURE 2Principal component analysis; component 1 explains 47.37 percent of variants and component 2 for 10.20 percent.
FIGURE 3Admixture analysis using (A) K = 22; (B) K = 3; (C) K = 4; and (D) K = 5.
Summary of output scores of SNVs and windows from iHS, iSAFE, and Fst tests.
| Pools | iSAFE test | iHS test | Fst test | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean ± SD | Sign. SNVs | Intergenic | Gene | Mean ± SD | Sign. SNVs | Intergenic | Gene | Mean ± SD | Sign. Windows | Intergenic | Gene | |
| Fleckvieh | 0.07 ± 0.05 | 2,264 | 1,764 | 161 | -0.01 ± 0.70 | 6 | 3 | 2 | 0.08 ± 0.05 | 301 | 177 | 167 |
| Kazakh | 0.10 ± 0.08 | 1,502 | 1,039 | 56 | -0.53 ± 1.07 | 0 | 0 | 0 | 0.06 ± 0.05 | 336 | 190 | 185 |
| Mongolian | 0.06 ± 0.04 | 3,446 | 2,656 | 111 | -0.15 ± 0.64 | 41 | 18 | 7 | 0.08 ± 0.05 | 334 | 185 | 179 |
| Yanbian | 0.05 ± 0.03 | 5,068 | 3,681 | 258 | -0.09 ± 0.65 | 91 | 36 | 8 | 0.07 ± 0.05 | 334 | 188 | 182 |
| CHBI | 0.11 ± 0.07 | 0 | 0 | 0 | -0.01 ± 0.60 | 30 | 23 | 4 | NA | NA | NA | NA |
| CHBI_Med | 0.10 ± 0.07 | 469 | 368 | 3 | -0.49 ± 0.75 | 59 | 43 | 11 | NA | NA | NA | NA |
| CHBI_Low | 0.09 ± 0.06 | 1,648 | 881 | 28 | 0.02 ± 0.70 | 5 | 4 | 1 | NA | NA | NA | NA |
Pools: grouping of individuals - first four are specific B. taurus, CHBI_Low (seven Chinese B. indicus), CHBI_Med (four Chinese admixed), and CHBI (combination of CHBI_Low and CHBI_Med).
Mean and SD, of raw values for SNVs, reported in each respective test.
Number of SNVs, passing the threshold of genome-wide significance (-log10(p) = 7.301).
Number of SNVs, annotated to intergenic regions.
Number of SNVs, annotated to coding regions of genes.
Fst test of respective breed against CHBI.
Mean and SD, of Fst values for all windows.
Number of windows passing the threshold of genome-wide significance (-log10(p) = 7.301).
Number of windows annotated to intergenic regions.
Number of windows annotated to coding regions of gene.
FIGURE 4Manhattan plot for iSAFE and iHS tests for (A) Fleckvieh; (B) Kazakh; (C) Mongolian; and (D) Yanbian.
FIGURE 5Manhattan plot for iSAFE and iHS tests for (A) CHBI; (B) CHBI_Med; and (C) CHBI_Low.
FIGURE 6Manhattan plot for Fst test for (A) Fleckvieh; (B) Kazakh; (C) Mongolian; and (D) Yanbian against CHBI.
FIGURE 7Mean of the highest FPKM for all significant genes indicated by iHS and iSAFE tests in each respective individual pool.