| Literature DB >> 28957327 |
Leah E Mechanic1, Sara Lindström2, Kenneth M Daily3, Solveig K Sieberts3, Christopher I Amos4, Huann-Sheng Chen5, Nancy J Cox6, Marina Dathe1, Eric J Feuer5, Michael J Guertin7, Joshua Hoffman8, Yunxian Liu7, Jason H Moore9, Chad L Myers10, Marylyn D Ritchie11,12, Joellen Schildkraut13, Fredrick Schumacher14, John S Witte8, Wen Wang10, Scott M Williams14, Elizabeth M Gillanders1.
Abstract
Entities:
Mesh:
Year: 2017 PMID: 28957327 PMCID: PMC5619686 DOI: 10.1371/journal.pgen.1006945
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Stimulation of innovation in U4C.
Existing genome-wide association studies (GWAS), representing thousands of cases and controls. Data were shared and accessed in a manner consistent with informed consent. Some of these data sets were made available for the first time in U4C. Teams competed for a prize to develop innovative analytical methods and make novel discoveries using these data sets.
Overview of U4C entries.
| Team Name | Entry Title | dbGaP Accession Number for U4C-Designated Data Sets Used | Other Data Sets | Strategy | Replication Strategy |
|---|---|---|---|---|---|
| Battalion Y | Integrative Analysis of Diverse Genomic Data Identifies Novel Link Between Immunity Pathways and Inherited Breast Cancer Risk | phs000147, phs000383, phs000517, phs000799, phs000812, phs000851, phs000912 | GTEx, seeQTL, GenoSkyline, TCGA | GWAS, meta-analysis, functional annotation, protein–protein interaction, tissue specific enrichment, somatic mutations, eQTL analysis | Consistency across dbGaP data sets |
| CSMC_TEAM | Identifying Novel Genes and Pathways for Breast Cancer with Semiparametric Modeling | phs000147 | Expression data from European Genome-Phenome Archive (EGAS00000000083), SNP gene annotation databases (KEGG, panther, cell map, BioCarta, etc.) | Linked SNPs to genes and pathways and looked for enrichment of genes and pathways in breast cancer | Compared dbGaP association results with gene expression and annotation databases |
| Gene Fishing | Novel Genetic Variants of Breast Cancer—SNPs, Genes, and Gene-Gene Interactions | phs000799 | None | Linked SNPs to genes and performed gene-based and gene–gene interaction tests | Split data into testing and training |
| hapQTL | Haplotype Associations in Shanghai Breast Cancer Study (Up For A Challenge) | phs000799 | None | Examined haplotype associations and identified nearby genes | Down sampling 100 times |
| MDACC | Association of X-Chromosome Genetic Variants and Breast Cancer Risk | phs000147, phs000383, phs000812, phs000851 | Single SNPs and gene-based tests of association on X chromosome and pathway analysis (IPA) | Consistency across studies and previous gene-expression publication | |
| MDACC | Prediction of Breast Cancer Status Using X- Chromosome Genetic Variants | phs000147, phs000812, phs000851 | Single SNPS and gene-based random forests followed by pathway analysis (IPA) | Consistency across studies (used 1 study for training and others for testing) | |
| muStat | Breast Cancer muGWAS | phs000147, phs000812 | None | u-statistics for multivariate data (neighboring SNPs) integrating knowledge about genetics, leveraging information content and study-specific genome-wide significance | Consistency across studies (CGEMS, phs000147, and cohorts EPIC and PBCS of BPC3, phs000812) and consistency with published results from functional and expression data |
| snpsnbits | Identify Breast Cancer Pathways Using Iscore Screening | phs000147, phs000517, phs000799, phs000812, phs000851 | NHGRI GWAS Catalogue, SNPedia | Used SNPs from literature and identified in GWAS to identify pathways (or gene sets) associated with breast cancer. Interactions and new SNPs were identified using an iscore | 2 data sets for training, 3 data sets for testing |
| Team Transcription | Identification of Breast Cancer Associated Variants That Modulate Transcription Factor Binding | NHGRI GWAS Catalogue | ENCODE, Roadmap Epigenomics, TCGA, GTEx | Integrative genomics approach included identifying transcription factor motifs and association with breast cancer, SNPs in LD with top GWAS, SNPs within motifs and DNase I hypersensitivity sites, and eQTL analysis | Consistency across multiple data sets and cell types |
| Team UCSF | Team UCSF Up For A Challenge Submission | phs000147, phs000383, phs000517, phs000799, phs000812, phs000851, phs000912 | UK Biobank, GTEx | GWAS, GWAGE using PrediXscan, meta-analysis and admixture mapping | Replicated previous GWAS findings in the data sets, entry findings were replicated in UK biobank |
| U4C Maroons | U Chicago Maroons Project for U4C | phs000147, phs000383, phs000799, phs000812, phs000851 | GAME-ON breast cancer GWAS summary statistics, Depression Genes and Network, and GTEx | GWAGE using MetxScan and meta-analysis | Consistency across data sets, replicated in GAME-ON breast cancer GWAS |
| UCLA Team | Multi-Ethnic Meta-Analysis and Fine Mapping in Breast Cancer | phs000383, phs000812, phs000851, phs000912 | None | Mixed-model association, meta-analysis, forestPMplot, and fine mapping (CAVIAR) | Consistency across data sets |
| UMN-CSBIO | Genetic Interactions in Breast Cancer | phs000147, phs000812 | Hapmap PhaseIII, Molecular Signatures Database (MSigDB v3.0) curated pathway database, Gene Annotation (hg19) | Pathway interactions using annotated gene sets from MSigDB v3.0 | Replication in second data set (phs000147) |
| UNC-BIAS | U4C Breast Cancer Challenge | phs000147, phs000517, phs000799, | None | SNP and LD block-based (SNP set) association in subgroups and overall and meta-analysis | Consistency across data sets |
| UNH STATS | Data Mining of Genome-Wide Association Studies for New Hypotheses on the Possible Effect of Pathways on Breast Cancer Risk | phs000799, phs000851 | None | Variable clustering, variable elimination and bootstrap forests | For 2 different studies, divided each study into training and validation sets, i.e., cross validation within study and replication among 2 studies |
aThis team used results from GWAS data as reported in the NHGRI GWAS catalogue (https://www.ebi.ac.uk/gwas/).
Abbreviations: BPC3, Breast and Prostate Cancer Cohort Consortium; CAVIAR, CAusal Variants Identification in Associated Regions; CGEMS, Cancer Genetic Markers of Susceptibility; dbGaP, Database of Genotypes and Phenotypes; ENCODE, Encyclopedia of DNA Elements; EPIC, European Prospective Investigation into Cancer; eQTL, expression quantitative trait loci; GAME-ON, Genetic Associations and Mechanisms in Oncology; GTEx, Genotype-Tissue Expression project; GWAGE, genome-wide association of gene expression; GWAS, genome-wide association studies; IPA, Ingenuity Pathway Analysis; iscore, influence score; LD, linkage disequilibrium; MSigDB, Molecular Signatures Database; NHGRI, National Human Genome Research Institute; PBCS, Polish Breast Cancer Case-Control Study; SNP, single nucleotide polymorphism; TCGA, The Cancer Genome Atlas; U4C, Up For A Challenge.