| Literature DB >> 23696874 |
Yuri Tani Utsunomiya1, Ana Maria Pérez O'Brien, Tad Stewart Sonstegard, Curtis Paul Van Tassell, Adriana Santana do Carmo, Gábor Mészáros, Johann Sölkner, José Fernando Garcia.
Abstract
As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different methodologies, it is strong evidence that the locus has been under past selection. In this paper, a straightforward frequentist approach based on the Stouffer Method to combine P-values across different tests for evidence of recent positive selection in common variations, as well as strategies for extracting biological information from the detected signals, were described and applied to high density single nucleotide polymorphism (SNP) data generated from dairy and beef cattle (taurine and indicine). The ancestral Bovinae allele state of over 440,000 SNP is also reported. Using this combination of methods, highly significant (P<3.17×10(-7)) population-specific sweeps pointing out to candidate genes and pathways that may be involved in beef and dairy production were identified. The most significant signal was found in the Cornichon homolog 3 gene (CNIH3) in Brown Swiss (P = 3.82×10(-12)), and may be involved in the regulation of pre-ovulatory luteinizing hormone surge. Other putative pathways under selection are the glucolysis/gluconeogenesis, transcription machinery and chemokine/cytokine activity in Angus; calpain-calpastatin system and ribosome biogenesis in Brown Swiss; and gangliosides deposition in milk fat globules in Gyr. The composite method, combined with the strategies applied to retrieve functional information, may be a useful tool for surveying genome-wide selective sweeps and providing insights in to the source of selection.Entities:
Mesh:
Year: 2013 PMID: 23696874 PMCID: PMC3655949 DOI: 10.1371/journal.pone.0064280
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Types of Signatures of Selection detectable from genomic data. Ages of selection are based on estimations for human data in years, assuming a generation interval of 25 years [2].
| Type of signature | Detectable pattern | Methodologies | Underlying selection phenomena | Population level | Age of selection (generations) |
| Function-altering mutation | Changes in non-synonymousto synonymous variation ratio in the open reading frame of a coding region |
| Positive and Purifying selection | Within species | >40,000 |
| Local genetic diversity depression | Deficit of local heterozygosity compared to the rest of the genome |
| Positive selection | Within populations | <10,000 |
| Change in the allele frequency spectrum | Increase in the frequency of derived alleles |
| Positive selection | Within and between populations | <3,200 |
| Populationdifferentiation | Difference in the allele frequencies between populations |
| Positive and Balancing selection | Between populations | <3,000 |
| Extended haplotype homozygosity | LD persistency and unusual long-range haplotypes |
| Positive selection | Within and between populations | <1,200 |
Description of cattle genotypes available for analysis before (BF) and after (AF) filtering for cryptic relatedness and quality control.
| Breed | Code | Subspecies | Purpose | HapMap | BOKU | ZGC | Total | ||||
| BF | AF | BF | AF | BF | AF | BF | AF | ||||
| Angus | ANG |
| Beef | 27 | 24 | 0 | 0 | 0 | 0 | 27 | 24 |
| Brown Swiss | BSW |
| Dairy | 24 | 13 | 48 | 31 | 0 | 0 | 72 | 44 |
| Gir | GIR |
| Dairy | 30 | 23 | 0 | 0 | 0 | 0 | 30 | 23 |
| Nellore | NEL |
| Beef | 35 | 24 | 0 | 0 | 691 | 21 | 726 | 45 |
The Bovine HapMap Consortium [29].
University of Natural Resources and Life Sciences, Vienna.
Zebu Genome Consortium.
The actual number of NEL samples passing control criteria was 581: 557 for ZGC and 24 for HapMap. In order to avoid an unbalanced dataset, we decided to keep a final set of 45 NEL: all 24 HapMap samples plus 21 randomly chosen ZGC samples.
Final base dataset used for the selective sweep analyses.
Figure 1Manhattan plots of genome-wide meta-SS –log10(P-values) for Angus, Brown Swiss, Gyr and Nellore breeds.
Number of SNP indicated represents count of markers crossing the significance line (P<3.17×10−7). Red and blue diamonds are intragenic and intergenic top SNP on peaks, respectively.
Figure 2meta-SS, component tests, EHH and derived allele bifurcation for CNIH3 in Brown Swiss (A) and Nellore (B).
Vertical dashed lines and red diamonds represent the position of the intronic SNP detected as highly significant in Brown Swiss (BTA16:28478192, P = 3.82×10−12). Horizontal dashed lines mark the Bonferroni significance threshold (P<3.17×10−7).
Figure 3Descriptive Network of functional terms in Angus (A) and Brown Swiss (B).
Nodes (red circles) are annotated functional terms. Edges connecting nodes represent gene share, being thickness proportional to the number of genes shared between terms (i.e., the degree of gene set overlap).
Figure 4Protein network of human CNIH3, according to STRING 9.0 action view.
Nodes are proteins; edges and arrows indicate interaction. Blue edges: binding; green arrows: activation; pink edges: post-translational modification; yellow edges: expression.