| Literature DB >> 27376238 |
Jérôme Nicod1, Robert W Davies1, Na Cai1, Carl Hassett2, Leo Goodstadt1, Cormac Cosgrove3, Benjamin K Yee4, Vikte Lionikaite5, Rebecca E McIntyre6, Carol Ann Remme7, Elisabeth M Lodder7, Jennifer S Gregory5, Tertius Hough2, Russell Joynson2, Hayley Phelps2, Barbara Nell2, Clare Rowe2, Joe Wood2, Alison Walling2, Nasrin Bopp1, Amarjit Bhomra1, Polinka Hernandez-Pliego1, Jacques Callebert8, Richard M Aspden5, Nick P Talbot9, Peter A Robbins9, Mark Harrison2, Martin Fray2, Jean-Marie Launay8, Yigal M Pinto7, David A Blizard10, Connie R Bezzina7, David J Adams6, Paul Franken11, Tom Weaver2, Sara Wells2, Steve D M Brown12, Paul K Potter12, Paul Klenerman3, Arimantas Lionikas5, Richard Mott1,13, Jonathan Flint1,14.
Abstract
Two bottlenecks impeding the genetic analysis of complex traits in rodents are access to mapping populations able to deliver gene-level mapping resolution and the need for population-specific genotyping arrays and haplotype reference panels. Here we combine low-coverage (0.15×) sequencing with a new method to impute the ancestral haplotype space in 1,887 commercially available outbred mice. We mapped 156 unique quantitative trait loci for 92 phenotypes at a 5% false discovery rate. Gene-level mapping resolution was achieved at about one-fifth of the loci, implicating Unc13c and Pgc1a at loci for the quality of sleep, Adarb2 for home cage activity, Rtkn2 for intensity of reaction to startle, Bmp2 for wound healing, Il15 and Id2 for several T cell measures and Prkca for bone mineral content. These findings have implications for diverse areas of mammalian biology and demonstrate how genome-wide association studies can be extended via low-coverage sequencing to species with highly recombinant outbred populations.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27376238 PMCID: PMC4966644 DOI: 10.1038/ng.3595
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Sequence diversity of the CFW population. (a) Distribution of heterozygosity in 100kbp windows genome-wide. (b) Histogram of genome-wide heterozygosity. (c) Example of novel and total SNP density for a region of chromosome 19. Results are representative of those seen genome-wide. (d) Minor allele frequency (MAF) density for population of wild Indian (n=10, 44.9 M whole genome sequencing SNPs), CFW mice (n=2,073, 5.7M imputed SNPs) and HS mice (n=1,904, 11K SNPs from a genotyping array). Known CFW variation refers to those variants also segregating among 14 sequenced classical inbred strains. (e) The extent of linkage disequilibrium in CFW and HS mice. Values are mean r2 between all pairs of SNPs binned by distance to the kbp.
Figure 2Mapping resolution and effect size of QTLs. Frequency distribution of (a) the size and (b) the number of genes present in the 95% confidence intervals (CI) in 255 QTLs, (c) The sum of variance explained by the QTLs plotted against heritability in 92 measures where heritability could be estimated and at least one QTL was detected. Colour of dots indicates the type of measure: behaviour, physiological (body weight, respiratory, electrocardiography) or tissue (any measure obtained after dissection)
Figure 3Summary Manhattan plot of 92 phenotypes. Genome-wide representation of all unique QTLs (n=156, FDR<5%) identified in this study. Light and dark grey dots show association from the 92 measures where at least one QTL was detected at the tagging SNPs positions (n=359,559). Most significant SNPs at each QTL are marked with a colour dot, depending on the type of measure. Y-axis shows –log10(P) of the imputed allele dosages with tested measures and is truncated at –log10(P)=32. The position of the 2 strongest QTLs with –log10(P) values of 133 (chr4) and 76 (chr17) is marked by triangles.
QTLs mapping to a single gene
| Phenotype | Chr. | Position (Mb) | -logP | Gene | References |
|---|---|---|---|---|---|
| Weight of soleus muscle (g) | 6 | 17.5 | 16.2 | ||
| Total distance travelled in Elevated Plus Maze (cm) | 6 | 110.2 | 5.6 | ||
| CD45+/CD3-/CD19+ cells (%) | 9 | 32.6 | 5.8 | ||
| CD45+/CD3-/DX5+ cells (%) | 1 | 168.2 | 4.7 | ||
| Wound healing | 2 | 134.2 | 5.5 | ||
| Number of long (>1min) sleep episodes | 5 | 51.8 | 6.8 | ||
| Ratio of CD3+/CD4+ to CD3+/CD8+ cells | 8 | 82.4 | 8.7 | ||
| Bone mineral content | 11 | 108.2 | 4.6 | ||
| CD3+/CD8+ cells (%) | 12 | 25.5 | 5.4 | ||
| Length of tibia (mm) | 5 | 51.7 | 4.5 | ||
| Startle pulse reactivity | 6 | 17.5 | 6.7 | ||
| Calcium (mmol/l) | 6 | 17.5 | 8.3 | ||
| Total Cholesterol (mmol/l) | 6 | 17.5 | 6.1 | ||
| Total Protein (g/l) | 6 | 17.5 | 27.7 | ||
| CD45+/CD3-/CD19+ cells (%) | 7 | 72.2 | 6.2 | ||
| Number of long (>1min) sleep episodes | 9 | 73.8 | 5.5 | ||
| Startle pulse reactivity | 10 | 68.0 | 8.0 | ||
| Weight of tibialis anterior muscle (g) | 11 | 17.6 | 6.3 | ||
| Length of tibia (mm) | 12 | 83.6 | 7.1 | ||
| Basal activity | 13 | 7.3 | 10.6 | ||
| Respiratory rate during Hypoxic Ventilatory Decline | 13 | 118.0 | 5.9 | ||
| Total distance travelled in Elevated Plus Maze (cm) | 14 | 82.1 | 6.2 | ||
| Measure of the size of tibia | 15 | 26.6 | 5.3 | ||
| Percentage of Eosinophils (%) | 17 | 70.4 | 5.2 | ||
| Percentage of Eosinophils (%) | X | 155.6 | 6.0 | ||
Figure 4Single-gene resolution mapping at 4 loci using the entire set of SNPs (7.1 M). (a) Weight of soleus muscle on chromosome 6 (n=1832), (b) Measure of the number of long sleep episodes on chromosome 9 (n=1577), (c) Ratio of CD4+ to CD8+ T cells (CD3+) on chromosome 8 (n=1324) and (d) Intensity of reaction to startle on chromosome 10 (n=1740). The plots were drawn using LocusZoom 49. Strongest associated SNP is marked with a purple diamond, the other SNPs that passed post-imputation quality control (IMPUTE2-style INFO scores > 0.4 and HWE r2>1e-6,) are coloured following LD r2 with strongest SNP. The grey dots represent SNPs that failed post-imputation QC and therefore were not used for the analysis.