| Literature DB >> 15534693 |
Mathew T Pletcher1, Philip McClurg, Serge Batalov, Andrew I Su, S Whitney Barnes, Erica Lagler, Ron Korstanje, Xiaosong Wang, Deborah Nusskern, Molly A Bogue, Richard J Mural, Beverly Paigen, Tim Wiltshire.
Abstract
Rapid expansion of available data, both phenotypic and genotypic, for multiple strains of mice has enabled the development of new methods to interrogate the mouse genome for functional genetic perturbations. In silico mapping provides an expedient way to associate the natural diversity of phenotypic traits with ancestrally inherited polymorphisms for the purpose of dissecting genetic traits. In mouse, the current single nucleotide polymorphism (SNP) data have lacked the density across the genome and coverage of enough strains to properly achieve this goal. To remedy this, 470,407 allele calls were produced for 10,990 evenly spaced SNP loci across 48 inbred mouse strains. Use of the SNP set with statistical models that considered unique patterns within blocks of three SNPs as an inferred haplotype could successfully map known single gene traits and a cloned quantitative trait gene. Application of this method to high-density lipoprotein and gallstone phenotypes reproduced previously characterized quantitative trait loci (QTL). The inferred haplotype data also facilitates the refinement of QTL regions such that candidate genes can be more easily identified and characterized as shown for adenylate cyclase 7.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15534693 PMCID: PMC526179 DOI: 10.1371/journal.pbio.0020393
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1Visualization of the SNP Sets Allows for Mapping in Crosses That Minimize the Number of Potential Modifiers
When the distribution of the SNPs is plotted out genome-wide, the expected irregular clustering of SNPs mark regions where heterozygosity was continuing to segregate during the inbreeding of the C57 family. Likewise, there are regions that were successfully homozygosed before the split of C58/J from the rest of the family members. In all five strain comparisons, no SNPs were found in the distal 25 Mb of MMU19.
Figure 2Genome-Wide SNP Data Accurately Represent the Known Ancestries of the Genotyped Strains
(A) A tree, adapted from Beck et al. (2000), tracing the lineage of the C57 family of mice (upper tree) shows almost perfect correlation with a phylogenic tree based solely on SNP data (lower tree). The only difference in the two trees is the location of C57BLKS/J, which splits from C57BL/6J sooner in the phylogenic tree because of the genomic contributions of the non-C57 strain, DBA/2J. The maximum parsimony phylogenic tree of the strain relatedness was built using the pseudoalignment of the 10,990 SNP alleles for 48 strains with the Phylip package, version 3.6b.
(B) The DBA/2J contribution to C57BLKS/J can be visualized in its allelic patterns. The region from 104 Mb to 109 Mb on MMU9 shows the same SNP alleles for both C57BLKS/J and its other parental strain, C57BL/6J (a period represents identity with the C57BLKS/J allele). At 110 Mb, the pattern switches and every C57BLKS/J allele matches the DBA/2J content through 120 Mb. SNP marker names are positioned above the alleles with the first number representing the chromosome the marker is located on, the second number being the Mb position on the chromosome, and the third number being an approximate location within the Mb.
Figure 3In Silico Mapping Method Correctly Identifies Coat Color, Retinal Degeneration, and Sweet Preference Loci from SNP Data
(A) Presence or absence of the retinal degeneration, albino, or agouti phenotypes was given a numerical value of 1 or 0 for use in the mapping algorithm. In each case, the most significant p-value (indicated by an arrow) was obtained for the region that contains the gene known to produce these phenotypes. A closer inspection of the retinal degeneration mapping shows that the maximum linkage region indicated by the algorithm covered a 0.4-Mb region from 102.4 Mb to 102.8 Mb on MMU5.
(B) Tas1r3 is known to be a major control gene for the complex trait of preference for sweet tastes. Values for the sweet preference of 23 strains of mice produced a highly significant association with the one Mb region of MMU4 that contains Tas1r3.
Comparison of In Silico QTL with Experimentally Derived QTL
B6, C57BL/6J. For definitions of other abbreviations, see Abbreviations section in text
a References indicate source of QTL data
b The in silico mapping algorithm was run twice for each phenotype, and only SNP blocks that obtained a log p-value above the cutoff in both runs are shown here. When more than one marker within the same genomic region obtained a log p-value above the cutoff, only the marker with the most significant p-value is shown
c The median p-value from each algorithm run was averaged
d QTL regions were defined as the experimentally determined 95% CIs for the particular strain-cross referenced unless otherwise noted
e The 95% CIs were not available, so the QTL regions were defined as ± 10 cM from the position with the highest likelihood-of-odds score (peak), as it represents the approximate size of the available 95% CIs for this dataset and has been previously published in X. Wang and Paigen (2002) as the definition of HDL QTL size. One marker 10 cM away from the peak on either side was chosen, and their physical positions were retrieved from Celera Mouse Database
f Additional crosses support this QTL region
g MMUX has not been studied in mouse crosses used to detect HDL QTL
h QTL for gallbladder mucin content, an early step in gallstone formation
Figure 4Analysis of Adcy7 Haplotypes Reveals Amino Acid Change Associated with HDL Phenotype
(A) Sequencing of Adcy7 in multiple strains revealed 28 SNPs distinguishing three distinct haplotype patterns. All strains were typed with markers selected to represent the three haplotypes. The strain distribution pattern predicted by the SNP data and the sample sequencing for this region was confirmed with NZB/BlNJ and BTBR T+ tf/J, I/LnJ and MA/MyJ, and C3H/HeJ, C57BL6/J, and C57L/J, each separating into unique haplotypes.
(B) The SNP represented by marker 08.089.597 resulted in a change from a cysteine to a tyrosine in the resulting protein (asterisk). This cysteine is conserved in orthologs of the gene in human, rat, and cow. It is also found at the beginning of a stretch of ten amino acids (indicated by black line) predicted to be one of the protein's ten transmembrane domains. Identical amino acids are black and conserved amino acid changes are gray.