| Literature DB >> 19668205 |
Hyuna Yang1, Yueming Ding, Lucie N Hutchins, Jin Szatkiewicz, Timothy A Bell, Beverly J Paigen, Joel H Graber, Fernando Pardo-Manuel de Villena, Gary A Churchill.
Abstract
We designed a high-density mouse genotyping array containing 623,124 single-nucleotide polymorphisms that captures the known genetic variation present in the laboratory mouse. The array also contains 916,269 invariant genomic probes targeted to functional elements and regions known to harbor segmental duplications. The array opens the door to the characterization of genetic diversity, copy-number variation, allele-specific gene expression and DNA methylation, and will extend the successes of human genome-wide association studies to the mouse.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19668205 PMCID: PMC2735580 DOI: 10.1038/nmeth.1359
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Final SNP and invariant genomic sequence selection.
| Selection Criteria | Type of Variant | Training Array | Final Array | Probes in + strand | Probes in − strand | Total number of probes | Unmapped Probes | Total number of probes (Build 37) |
|---|---|---|---|---|---|---|---|---|
| Paigen Set | SNP | 187,256 | 157,653 | 4 | 4 | 1,261,224 | 0 | 1,261,224 |
| B6 Singletons | SNP | 3,376 | 2,773 | 4 | 4 | 22,184 | 3 | 22,160 |
| MSM Singletons | SNP | 49,930 | 37,597 | 4 | 4 | 300,776 | 4 | 300,744 |
| NIEHS/Perlegen | SNP | 482,862 | 423,893 | 4 | 4 | 3,391,144 | 14 | 3,391,032 |
| extra | SNP | 569 | 339 | 4 | 4 | 2,712 | 7 | 2,656 |
| Wild-derived | SNP | 990 | 799 | 4 | 4 | 6,392 | 0 | 6,392 |
| chrY | SNP | 83 | 51 | 4 | 4 | 408 | 0 | 408 |
| chrM | SNP | 20 | 19 | 4 | 4 | 152 | 0 | 152 |
| SubTotal | SNP | 725,086 | 623,124 | 4,984,992 | 28 | 4,984,768 | ||
| Exon 1 | CNV | na | 373,667 | 1 | 1 | 747,334 | 14 | 747,306 |
| Exon 2 | CNV | na | 224,091 | 1 | 1 | 448,182 | 6 | 448,170 |
| UCEs | CNV | na | 34,739 | 1 | 1 | 69,478 | 0 | 69,478 |
| MSM Zero Hit | CNV | na | 94,471 | 1 | 1 | 188,942 | 0 | 188,942 |
| MSM Single Hit | CNV | na | 19,399 | 1 | 1 | 38,798 | 0 | 38,798 |
| Gap Filling Nsp | CNV | na | 87,145 | 1 | 1 | 174,290 | 471 | 173,348 |
| Gap Filling Sty | CNV | na | 82,757 | 1 | 1 | 165,514 | 355 | 164,804 |
| SubTotal | CNV | na | 916,269 | 1,832,538 | 846 | 1,830,846 | ||
| Total | na | 725,086 | 1,539,393 | na | na | 6,817,530 | na | 6,815,614 |
Figure 1Spatial distribution of SNP and invariant genomic probes
Number of SNP and IGPs in each 200Kb interval. IGPs are color coded by type (Exon1, Exon2, UCE and Gap Filling). Invariant MSM probes are not shown. The left and right histograms have different scales as indicated by the relative proportions of red and blue in the horizontal bar (lower right).
Figure 2Detection of CNV using the Mouse Diversity array
Open circles of different colors represent different types of probesets. (a) Difference in probeset intensity along the entire chromosome 17 between BALB/cByJ and C57BL/6J strains. (b) Z-score of the intensity of each probe set in BALB/cBy in a 23 Mb window of chromosome 17. (c) Z-score of the intensity of each probe set in BALB/cBy in a 700kb window that span a know duplication on BALB/cBy. Red lines are from HMM fit and black vertical lines denote the boundaries of the duplication described previously9.