| Literature DB >> 32460767 |
Marina Naval-Sánchez1,2, Laercio R Porto-Neto3, Diercles F Cardoso3,4,5, Ben J Hayes6, Hans D Daetwyler7,8, James Kijas3, Antonio Reverter3.
Abstract
BACKGROUND: Distinct domestication events, adaptation to different climatic zones, and divergent selection in productive traits have shaped the genomic differences between taurine and indicine cattle. In this study, we assessed the impact of artificial selection and environmental adaptation by comparing whole-genome sequences from European taurine and Asian indicine breeds and from African cattle. Next, we studied the impact of divergent selection by exploiting predicted and experimental functional annotation of the bovine genome.Entities:
Year: 2020 PMID: 32460767 PMCID: PMC7251699 DOI: 10.1186/s12711-020-00546-6
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Whole-genome sequences used in the study
| Name | Animal source | Sample size | Genome of origin |
|---|---|---|---|
| Brahman | Australia | 90 | |
| Nelore | Brazil | 5 | |
| Gir | Brazil | 6 | |
| Shahiwal | India/Pakistan | 1 | |
| Composite | Australia | 56 | Indicine-taurine |
| Brangus | USA | 5 | Indicine-taurine |
| Santa Gertrudis | USA | 4 | Indicine-taurine |
| BeefMaster | USA | 15 | Indicine-taurine |
| Charolais | France | 128 | |
| Angus | Great Britain | 51 | |
| Shorthorn | Great Britain | 5 | |
| British shorthorn | Great Britain | 2 | |
| N’ Dama | Africa | 12 | |
| Uganda-mix | Africa | 26 | Sanga |
| Africander | Africa | 5 | Sanga |
| Ankole | Africa | 10 | Sanga |
| Ogaden | Africa | 9 | |
| Boran | Africa | 10 |
Fig. 1Genetic variation and divergence. a PCA of the genetic distance was performed to assess the clustering of sequences according to their genome of origin, taurine (blue), indicine (orange) or admixed (green). For PCA breed composition, (see Additional file 1: Figure S2). b Proportion and number of private and shared SNPs for the taurine (blue), indicine (orange) or admixed (green) cattle datasets. c Estimated nucleotide diversity in 20-kb genomic bins for the taurine (blue), indicine (orange) or admixed (green) sequences. d–f Correlations between estimated reference allele frequencies (RAF) between taurine (blue) and indicine (orange) (d), admixed (green) and indicine (orange) (e) and admixed (green) and taurine (blue) (f). An increasing number of SNP counts is related to warmer colours
Fig. 2Candidate selective sweeps in taurine and indicine cattle. a Population differentiation index (FST) and relative nucleotide diversity between taurine and indicine cattle in genome-wide 20-kb genomic bins. Outlier bins that show evidence of selection in taurine breeds (blue) and indicine breeds (red). b Genome-wide distribution of relative nucleotide diversity. Positive and negative values represent candidate sweeps in taurine and indicine cattle, respectively. Outliner bins are coloured in red. c IGV screenshot of chr5: 47,526,093–48,203,280. In red, the 430 kb long selective sweep in Asian indicine cattle: spanning GRIP1, HELB, IRAK3, ENSBTAG00000026993, LLPH, and part of HMGA2. In green, a selective sweep in GRIP1 in European taurine cattle. In blue, the 6.2 kb tandem duplication HMGA2-CNVR reported by [53]. Below 10 variant files in vcf format for 10 Brahman animals, 10 Angus, and 10 Charolais
Top 10 enriched terms from Mouse Genome Informatics (MGI) phenotype in GREAT for the identified Bos taurus selective sweeps in the comparison Bos indicus versus Bos taurus
| Term name | Binomial raw p-value | Binomial test FDR Q-value | Binomial fold enrichment | Binomial observed region hits | Binomial region set coverage | Genes |
|---|---|---|---|---|---|---|
| Increased exploration in new environment | 2.08 × 10−14 | 1.65 × 10−10 | 7.11 | 26 | 0.04 | |
| Decreased aggression | 5.73 × 10−11 | 1.13 × 10−07 | 5.40 | 24 | 0.04 | |
| Abnormal kidney interstitium morphology | 1.25 × 10−08 | 1.10 × 10−05 | 4.87 | 20 | 0.03 | |
| Abnormal social investigation | 9.88 × 10−07 | 2.17 × 10−04 | 3.42 | 22 | 0.03 | |
| Abnormal strial marginal cell morphology | 1.64 × 10−06 | 3.09 × 10−04 | 10.25 | 8 | 0.01 | |
| Abnormal startle reflex | 1.87 × 10−06 | 3.44 × 10−04 | 2.37 | 37 | 0.06 | |
| Abnormal frontal bone morphology | 8.19 × 10−06 | 9.55 × 10−04 | 2.89 | 23 | 0.04 | |
| Abnormal lens induction | 4.21 × 10−05 | 3.00 × 10−03 | 4.20 | 12 | 0.02 | |
| Abnormal pain threshold | 5.29 × 10−05 | 3.57 × 10−03 | 2.03 | 37 | 0.06 | |
| Abnormal fear-related response | 8.43 × 10−05 | 5.09 × 10−03 | 2.98 | 17 | 0.03 |
242 20-kb windows p- value < 0.05
Fig. 3Genomic feature enrichment in selective sweeps. a The strength of enrichment for 20 genomic features in 72 indicine-specific regions assessed by overlapping genomic regions [50]. b Intersection of the delta allele frequency (ΔAF) with functional annotations derived from the reference UMD3.1 bovine genome. The number of SNPs in ΔAF bins is indicated on the left, and the M-value (log2-fold change) of the relative frequencies of SNPs in each functional category (on the right). The black line shows the number of SNPs within each (ΔAF) bin. c Intersection of the delta allele frequency (ΔAF) with functional annotations from the predicted regulatory elements in the cattle genome [20] and publicly available experimental epigenetic marks [49] and Fr-AgENCODE [48]
Fig. 4Candidate SNPs in the HELB gene. a Representation of the HELB gene based on human hg19 coordinates (chr12: 66,696,335–66,737,423) and bovine bosTau6 coordinates (chr5:47,713,179–47,751,456). b Multiple sequence alignment of amino acids across representative mammalian species. In red: the candidate indicine-specific mutations. c HELB amino acid sequence with the indicine-specific mutations are highlighted in red. Non-synonymous mutations with substitution residues are located at positions 10, 788 and 791. In purple, residue 428, which had been previously associated to temperature sensitive murine cell lines [88]. d Allele frequency of the SNP rs447470311 located in chr5:47726121 across the 36 breeds in Run6 1000 Bull Genomes Project which contain the G variant [24]
Fig. 5HMGA2-CNV. a IGV screenshot of the HMGA2-CNVR locus (in blue) and the visualization of the alignment of paired-end reads. Sample1_No_CNV shows the aligned reads of a composite animal presenting no CNV; Sample2_CNV shows the aligned reads of a Brahman animal presenting the duplication event. Paired-end reads in green imply duplication with respect to the reference genome. These read pairs are oriented towards the outer sides of the predicted insert. Red colour reads indicate paired end reads with an insert size larger than expected (possible deletion). b Contingency table of SNP rs447470311 (chr5:47,726,121) versus HMGA2-CNVR (chr5:48,074,233–48,080,443 (~ 6.2 kb) genotype