| Literature DB >> 29186889 |
Abstract
Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.Entities:
Keywords: QTG; QTL; causal analysis; gene expression
Year: 2017 PMID: 29186889 PMCID: PMC5748665 DOI: 10.3390/genes8120347
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Overview of a strategy from quantitative trait loci (QTL) and quantitative trait genes (QTGs) identification using our studies as an example. (a) QTL analysis in an intersubspecific mouse population between wild Mus musculus castaneus and the C57BL/6JJcl (B6) inbred strain. The picture shows adult wild and B6 male mice at 20 weeks after birth (photographed by Keita Makino, Graduate School of Bioagricultural Sciences, Nagoya University, Japan). Twenty-four QTL for body weights at 3 weeks (Wt3) to 10 weeks (Wt10) of age are mapped [12,13,14], and among the QTL the most potent QTL (named Pbwg1) on mouse chromosome 2 is depicted as logarithm of odds (LOD) score plots. The figure was remade from previous data [14]; (b) Fine mapping of Pbwg1 using the founder congenic strain (B6.Cg-Pbwg1) and subsequent subcongenic strains (B6.Cg-Pbwg1/Nga#, abbreviation: SR#). The black and grey bars show minimum intervals derived from the wild and B6 mice, respectively. The hatched bar shows an interval where recombination occurred. The map positions (mega base pairs (Mb)) of DNA markers (D2Mit# and rs#) are approximately shown on the horizontal line. The horizontal double-headed arrows indicate the intervals of QTL for body weight and body composition traits [15,16,18,20], and among the QTL the intervals of Pbwg1.5 and Pbwg1.12 are highlighted by red [18]. The effects of the QTL alleles derived from the wild mouse are indicated with the arrows. The green triangle indicates the position of the Ly75 (lymphocyte antigen 75) gene, a putative QTG for Pbwg1.5 [19]; (c) Candidate gene prioritization using DNA sequence analysis, bioinformatics analysis, transcriptome analysis and causal analysis. In our previous studies, exome-seq analysis of the funder congenic interval [18] and RNA-seq analysis of the SR1 subcongenic interval [19] were performed. Furthermore, bioinformatics analyses (see Table 1) and the causal inference test (see Figure 2) using gene expression data were carried out; (d) QTG identification. To identify a QTG, the quantitative complementation test is performed as shown in Figure 3. To validate the QTG, a transgenic overexpression experiment is performed. Furthermore, to identify a QTN within the QTG, allelic substitution experiments using gene editing techniques such as the CRISPR/Cas9 system are performed. See text for details of each analysis.
Numbers of synonymous single nucleotide polymorphisms SNPs (sSNPs) and nonsynonymous SNPs (nsSNPs) detected by exome-seq analysis of 23 genes in a 2.1 Mb interval of Pbwg1.5 and the neighboring 3.8 Mb interval of Pbwg1.12, ranking of the genes, and damage of protein functions caused by the nsSNPs.
| QTL | Gene | sSNP | nsSNP | Gene Ranking 1 | Damage of Protein 2 | |
|---|---|---|---|---|---|---|
| PolyPhen-2 | SIFT | |||||
| 1 | 0 | |||||
| 21 | 4 | |||||
| 6 | 1 | |||||
| 15 | 6 | |||||
| 6 | 0 | |||||
| 1 | 0 | |||||
| 27 | 9 | 1 | Benign | Tolerated | ||
| 18 | 8 | |||||
| 11 | 3 | 2 | Benign | Affected | ||
| 2 | 0 | |||||
| 1 | 5 | |||||
| 1 | 0 | |||||
| 2 | 1 | |||||
| 6 | 0 | |||||
| 6 | 0 | |||||
| 0 | 1 | 1 | Benign | Tolerated | ||
| 2 | 2 | |||||
| 17 | 5 | |||||
| 3 | 1 | |||||
| 6 | 1 | |||||
| 4 | 1 | |||||
| 7 | 2 | 2 | Benign | Tolerated | ||
| 14 | 18 | |||||
The data are modified from [18]. 1 The top two genes were prioritized as candidate genes for each of the two QTL by Endeavour [29]; 2 The damage caused by nsSNPs was investigated for the ranked genes by PolyPhen-2 [30] and SIFT [31].
Differentially expressed genes in a 5.8 Mb genomic interval harboring Pbwg1.5 and Pbwg1.12 detected by RNA-seq analysis followed by real-time PCR analysis.
| Organ | Gene | Relative Expression Level 1 | Differences 2 | ||
|---|---|---|---|---|---|
| B/B | B/C | C/C | |||
| Liver | 1.00 | 1.81 | 3.19 | C/C>B/C>B/B | |
| 1.00 | −1.58 | 0.58 | B/B≥C/C≥B/C | ||
| 1.00 | 5.89 | 8.03 | C/C>B/C>B/B | ||
| 1.00 | 0.79 | 0.34 | B/B≥B/C≥C/C | ||
| Gonadal fat | 1.00 | 1.43 | 2.11 | C/C>B/C>B/B | |
| 1.00 | −0.47 | −0.53 | B/B>B/C≥C/C | ||
| 1.00 | 0.73 | 0.50 | B/B≥B/C≥C/C | ||
The data are modified from [19]. 1 The relative gene expression levels were investigated in segregating F2 mice with three diplotypes (B/B, B/C and C/C) and are shown as a ratio to B/B. B and C denotes haplotypes derived from B6 and wild mice, respectively; 2 Significantly different between the diplotypes at p < 0.05.
Figure 2Criteria for the causal inference test (CIT). (a) Four component tests of the CIT [32] assessing whether changes in genotype (G) lead to variation in a phenotype (P) through changes in mRNA expression (E); (b) Possible relationship models estimated from CIT results. In the causal model, G acts on P through E. In the reactive model, E changes as a result of changes in P. In the independent model, G acts on E and P independently.
Figure 3Quantitative complementation test. (a) Mating designs using two tester inbred strains (a strain with a knockout (KO) allele (dotted open circle) at a candidate gene locus and its background strain with a wild-type allele (closed circle) at the candidate gene locus and two experimental inbred strains (a congenic strain with a mutant-type allele (green triangle) at a QTL and its background strain with a wild-type allele (purple square) at the QTL. The two tester strains have all the same chromosomes (blue vertical bars) except for the chromosomal position of the KO locus. In the two experimental strains, all chromosomes (red vertical bars) are the same except for the congenic region in which alleles at some loci may be different between the two experimental strains. In the F2 animals, four types of genotypes are segregating on a uniform genetic background; (b) Quantitative complementation (KO locus ≠ QTL), indicated by no statistical interaction between KO and QTL alleles; (c) Quantitative failure to complement (KO locus = QTL), indicated by a significant interaction between KO and QTL alleles.