Christoph D Rau, Brian Parks, Yibin Wang, Eleazar Eskin, Petr Simecek, Gary A Churchill, Aldons J Lusis.
Abstract
Human genome-wide association studies have identified thousands of loci associated with disease phenotypes. Genome-wide association studies also have become feasible using rodent models and these have some important advantages over human studies, including controlled environment, access to tissues for molecular profiling, reproducible genotypes, and a wide array of techniques for experimental validation. Association mapping with common mouse inbred strains generally requires 100 or more strains to achieve sufficient power and mapping resolution; in contrast, sample sizes for human studies typically are one or more orders of magnitude greater than this. To enable well-powered studies in mice, we have generated high-density genotypes for ∼175 inbred strains of mice using the Mouse Diversity Array. These new data increase marker density by 1.9-fold, have reduced missing data rates, and provide more accurate identification of heterozygous regions compared with previous genotype data. We report the discovery of new loci from previously reported association mapping studies using the new genotype data. The data are freely available for download, and Web-based tools provide easy access for association mapping and viewing of the underlying intensity data for individual loci.
Keywords: HMDP; genotyping; mouse; mouse diversity array
Year: 2015 PMID: 26224782 PMCID: PMC4592984 DOI: 10.1534/g3.115.020784
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
All SNPs present in the Mouse Diversity Array
| SNP class | Count |
|---|---|
| Total SNPs | ∼623,000 |
| Total high-quality SNPs | ∼550,000 |
| Intergenic SNPs | ∼337,000 |
| Intronic SNPs | ∼198,000 |
| Exonic SNPs | ∼8,900 |
| 3′ or 5′ UTR SNPs | ∼5,700 |
Shown is a listing of the SNPs and their classification on the Mouse Diversity Array. SNP, single-nucleotide polymorphism; UTR, untranslated region.
Figure 1. Comparisons of Prior genotypes with Mouse Diversity Array (MDA) genotypes. (A) Fraction of single-nucleotide polymorphisms (SNPs) with missing calls in each strain for Prior (left) and MDA (right) genotypes. The red line indicates the average value. (B) Histogram showing the proportion of missing strains for each SNP for the Prior (left) and MDA (right) genotypes. Highlighted in yellow and displayed as a percentage are the numbers of SNPs with more than 10% missing values (7% for Prior, 0.03% for MDA). (C) Fraction of heterozygous SNPs within each strain for Prior (left) and MDA (right) genotypes. The red line indicates the average value. (D) Histogram of concordance between SNPs found in both genotyping sets.
Figure 2. Allele frequencies in genotyping datasets. Histograms of the allele frequency of single-nucleotide polymorphisms (SNPs) in the Prior (left) and Mouse Diversity Array (right) genotypes. Highlighted in yellow and displayed as a percentage are the SNPs whose allele frequencies are too low for genome-wide association studies.
Informative SNPs for performing GWAS in the Hybrid Mouse Diversity Panel
| | Prior Genotypes | Mouse Diversity Array Genotypes |
|---|---|---|
| Total high-quality SNPs | ∼140,000 | ∼550,000 |
| More than 10% missing values | ∼9,000 | ∼200 |
| MAF less than 5% | ∼24,000 | ∼347,300 |
| Final informative SNPs | ∼108,500 | ∼202,500 |
A comparison of the number of SNPs in both the Prior and MDA genotypes, their reasons for removal and the final number of informative SNPs in each set. SNP, single-nucleotide polymorphism; GWAS, genome-wide association studies; MAF, minor allele frequency.
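The two removal criteria in the table (more than 10% missing values, MAF less than 5%) can be illustrated with a minimal sketch. The genotype encoding, the function names, and the toy data below are assumptions made for illustration; they are not taken from the paper or from its released files.

```python
from typing import Dict, List, Optional

# Hypothetical encoding (an assumption, not the paper's format):
# 0 = homozygous reference, 1 = heterozygous, 2 = homozygous alternate,
# None = missing call. Each SNP maps to one call per strain.
Genotype = Optional[int]


def missing_rate(calls: List[Genotype]) -> float:
    """Fraction of strains with no genotype call at this SNP."""
    return sum(c is None for c in calls) / len(calls)


def minor_allele_frequency(calls: List[Genotype]) -> float:
    """Minor allele frequency computed over the non-missing calls."""
    observed = [c for c in calls if c is not None]
    if not observed:
        return 0.0
    alt_freq = sum(observed) / (2 * len(observed))
    return min(alt_freq, 1.0 - alt_freq)


def informative_snps(
    genotypes: Dict[str, List[Genotype]],
    max_missing: float = 0.10,
    min_maf: float = 0.05,
) -> List[str]:
    """Keep SNPs that pass both the missingness and the MAF criteria."""
    keep = []
    for snp_id, calls in genotypes.items():
        if missing_rate(calls) > max_missing:
            continue  # more than 10% missing values
        if minor_allele_frequency(calls) < min_maf:
            continue  # MAF less than 5%
        keep.append(snp_id)
    return keep


# Toy data for 20 hypothetical strains.
example = {
    "snp_ok": [0] * 10 + [2] * 10,          # MAF 0.5, no missing calls
    "snp_missing": [None] * 3 + [0] * 8 + [2] * 9,  # 15% missing
    "snp_rare": [0] * 19 + [1],             # MAF 0.025
}
kept = informative_snps(example)
```

For the MDA column, the approximate counts in the table are consistent with applying the two criteria in sequence: 550,000 − 200 − 347,300 = 202,500.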
Figure 3. Effects of new single-nucleotide polymorphisms (SNPs) on genome-wide association study results. In all panels, the phenotype is total heart weight after isoproterenol treatment. The red line indicates the genome-wide significance threshold (4.1E-6). (A) Results using EMMA on the Prior genotypes reveal a single locus on chromosome 1. (B) Results using EMMA on the Mouse Diversity Array (MDA) genotypes reveal four additional loci. (C) Results using EMMA on the MDA genotypes with a kinship matrix generated from the Prior genotypes do not demonstrably differ from those in (B).
Improved GWAS results due to MDA
| Chromosome | Peak SNP rsID | Peak P-value | Distance to Candidate | Candidate Gene | Evidence |
|---|---|---|---|---|---|
| Associated in prior genotypes | |||||
| 1 | rs33825648 | 1.1E-6 | 55 kb upstream | ||
| Associated in MDA genotypes | |||||
| 1 | rs33825648 | 9.8E-7 | 55 kb upstream | ||
| 2 | rs27922490 | 2.6E-6 | 2 kb upstream | ||
| 9 | rs36770705 | 3.1E-7 | Between Exon 4 and 5 | | Splicing mutation, literature |
| 9 | rs24885538 | 2.9E-7 | Between Exon 2 and 3 | ||
| 10 | rs49270079 | 3.1E-7 | 737 kb upstream | ||
| | | | 2.8 Mb upstream | | |
Significant loci were observed in both the Prior and MDA genotypes. Dashed lines delineate loci from one another. GWAS, genome-wide association studies; MDA, Mouse Diversity Array.
MDA leads to many new significant loci compared with results from Rau
| Phenotype | Prior Genotypes: Suggestive | Prior Genotypes: Significant | MDA Genotypes: Suggestive | MDA Genotypes: Significant | Lost in MDA Genotypes: Suggestive | Lost in MDA Genotypes: Significant | Gained in MDA Genotypes: Suggestive | Gained in MDA Genotypes: Significant |
|---|---|---|---|---|---|---|---|---|
| Left ventricle | 0 | 0 | 2 | 0 | 0 | 0 | 2 | 0 |
| Right ventricle | 6 | 3 | 24 | 14 | 2 | 0 | 20 | 11 |
| Left atrium | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
| Right atrium | 1 | 0 | 4 | 0 | 0 | 0 | 3 | 0 |
| Lung | 5 | 1 | 4 | 0 | 2 | 1 | 1 | 0 |
| Liver | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 1 |
Suggestive (P < 4.1E-6) and significant (P < 4.1E-7) thresholds taken from Rau . See Table S3 for details about each locus. MDA, Mouse Diversity Array.
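The columns of this table are related by simple bookkeeping: for each phenotype and threshold, the MDA count should equal the Prior count minus the loci lost plus the loci gained. The sketch below checks that relation against the counts transcribed from the table; the variable and function names are illustrative only.

```python
# Counts transcribed from the table above. Each entry holds
# (prior, mda, lost, gained), and each of those is a
# (suggestive, significant) pair.
loci = {
    "Left ventricle":  ((0, 0), (2, 0),   (0, 0), (2, 0)),
    "Right ventricle": ((6, 3), (24, 14), (2, 0), (20, 11)),
    "Left atrium":     ((0, 0), (1, 0),   (0, 0), (1, 0)),
    "Right atrium":    ((1, 0), (4, 0),   (0, 0), (3, 0)),
    "Lung":            ((5, 1), (4, 0),   (2, 1), (1, 0)),
    "Liver":           ((2, 1), (2, 1),   (1, 1), (1, 1)),
}


def is_consistent(prior, mda, lost, gained):
    """True when prior - lost + gained == mda at both thresholds."""
    return all(
        p - l + g == m for p, m, l, g in zip(prior, mda, lost, gained)
    )


# Phenotypes whose rows do not satisfy the bookkeeping relation.
inconsistent = [name for name, row in loci.items() if not is_consistent(*row)]

# Totals of significant loci gained and lost across all phenotypes.
gained_significant = sum(row[3][1] for row in loci.values())
lost_significant = sum(row[2][1] for row in loci.values())
```

Every row satisfies the relation; for example, for right ventricle at the suggestive threshold, 6 − 2 + 20 = 24.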