| Literature DB >> 30068523 |
Chaozhi Zheng1, Martin P Boer2, Fred A van Eeuwijk2.
Abstract
The study of gene flow in pedigrees is of strong interest for the development of quantitative trait loci (QTL) mapping methods in multiparental populations. We developed a Markovian framework for modeling ancestral origins along two homologous chromosomes within individuals in fixed pedigrees. A highly beneficial property of our method is that the size of state space depends linearly or quadratically on the number of pedigree founders, whereas this increases exponentially with pedigree size in alternative methods. To calculate the parameter values of the Markov process, we describe two novel recursive algorithms that differ with respect to the pedigree founders being assumed to be exchangeable or not. Our algorithms apply equally to autosomes and sex chromosomes, another desirable feature of our approach. We tested the accuracy of the algorithms by a million simulations on a pedigree. We demonstrated two applications of the recursive algorithms in multiparental populations: design a breeding scheme for maximizing the overall density of recombination breakpoints and thus the QTL mapping resolution, and incorporate pedigree information into hidden Markov models in ancestral inference from genotypic data; the conditional probabilities and the recombination breakpoint data resulting from ancestral inference can facilitate follow-up QTL mapping. The results show that the generality of the recursive algorithms can greatly increase the application range of genetic analysis such as ancestral inference in multiparental populations.Entities:
Keywords: Ancestral inference; Collaborative Cross (CC); Identical by descent; Junction theory; MPP; Multiparent Advanced Generation InterCross (MAGIC); QTL mapping resolution; Recombinant Inbred Line (RIL); multiparental populations
Mesh:
Year: 2018 PMID: 30068523 PMCID: PMC6169389 DOI: 10.1534/g3.118.200340
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
List of symbols and their brief descriptions
| Symbol | Description |
|---|---|
| An indicator function that equals 1 if statement | |
| An indicator function that equals 0 if | |
| A gene or haplotype is maternally ( | |
| Pedigree members | |
| Father of | |
| Probability of IBD between maternal gene of | |
| Probability of IBD between two genes of | |
| Probability of non-IBD between two genes of | |
| Probability of non-IBD among three genes of | |
| Expected junction density (per Morgan) on the maternal chromosome of | |
| On the chromosome of | |
| A junction type denoted by two-locus four-gene identity state: | |
| on the left-hand (right-hand) side of junction, and | |
| Seven types: 1112, 1121, 1122, 1211, 1213, 1222, and 1232 | |
| Expected density of junction type | |
| parental origin | |
| Expected overall junction density for | |
| A set of distinct FGLs assigned to the founders of a pedigree, | |
| FGLs | |
| Probability that the maternal gene of | |
| Probability that the gene of | |
| Probability that the maternal gene of | |
| Probability that two genes of | |
| Probability that three genes of | |
| Expected junction density (per Morgan) on chromosome of | |
| where the left-hand (right-hand) side of junction has FGL | |
| Expected junction density: the four genes have FGLs | |
| and haplotype | |
| Similarly for |
Figure 1Illustration of some quantities. An identity state may correspond to many ancestral states. Different FGLs are shown by different colors, and the irrelevant chromosomes are shown as gray. (A) Some two- or three-gene ancestral coefficients and their corresponding identity coefficients. (B) Some expected ancestral junction densities and their corresponding expected identity junction densities. See Table 1 for brief explanations of these quantities.
Figure 3The pedigree of Native Americans. It consists of 6 founders and 16 non-founders. Circles denote females, and rectangles for males.
The identity coefficients and the expected identity junction densities for pedigree member ”M22”
| Quantity | AA autosomes | XX autosomes | ||
|---|---|---|---|---|
| Numerical | Simulated | Numerical | Simulated | |
| 0.18359 | 0.18324 | 0.23437 | 0.23459 | |
| 0.18359 | 0.18371 | 0.17969 | 0.17969 | |
| 0.61572 | 0.61516 | 0.35938 | 0.35978 | |
| 2.93652 | 2.93462 | 1.47656 | 1.47587 | |
| 0.61572 | 0.61605 | 0.57031 | 0.57047 | |
| 2.93652 | 2.93824 | 1.71094 | 1.71227 | |
A unique FGL is assigned to each haploid genome of each founder in Figure 3.
The ancestral coefficients and the expected ancestral junction densities for pedigree member ”M22”
| Quantity | AA autosomes | XX autosomes | ||
|---|---|---|---|---|
| Numerical | Simulated | Numerical | Simulated | |
| A | 0.21875 | 0.21885 | 0.125 | 0.12507 |
| B | 0.21875 | 0.21874 | 0.125 | 0.12467 |
| J | 0 | 0 | 0 | 0 |
| L | 0.125 | 0.12487 | 0.25 | 0.25078 |
| P | 0.3125 | 0.31241 | 0.25 | 0.24939 |
| C | 0.125 | 0.12513 | 0.25 | 0.25009 |
| AB | 0.31250 | 0.31272 | 0.14062 | 0.14098 |
| AJ | 0 | 0 | 0 | 0 |
| AL | 0.09375 | 0.09348 | 0.09375 | 0.09364 |
| AP | 0.28906 | 0.28880 | 0.15625 | 0.15602 |
| AC | 0.11719 | 0.11768 | 0.09375 | 0.09330 |
| BJ | 0 | 0 | 0 | 0 |
| BL | 0.09375 | 0.09405 | 0.09375 | 0.09384 |
| BP | 0.28906 | 0.28890 | 0.15625 | 0.15626 |
| BC | 0.11719 | 0.11742 | 0.09375 | 0.09331 |
| JL | 0 | 0 | 0 | 0 |
| JP | 0 | 0 | 0 | 0 |
| JC | 0 | 0 | 0 | 0 |
| LP | 0.15625 | 0.15644 | 0.18750 | 0.18748 |
| LC | 0.03125 | 0.03124 | 0.12500 | 0.12422 |
| PC | 0.10938 | 0.10922 | 0.18750 | 0.18735 |
The label of each founder is used as FGL that is assigned to its diploid genomes.
Figure 4Illustration of different 8-way funnel breeding schemes. The alternating backcross scheme alternates between mother-son and father-daughter matings in generation and the father-daughter backcross alternates between father-daughter and random sibling matings in generation Circles denote females, and rectangles for males.
Figure 5The identity coefficient and the expected overall junction density for the breeding schemes in Figure 4. (A-B) The results for selfing mating schemes 1-2 in Figure 4. (C-D) The results for the autosomes of mating schemes 3-5 with backcross starting from in Figure 4. (E-F) The results for the sex chromosomes.
Figure 6The FGL non-exchangeability patterns for the ancestral coefficient and the expected ancestral junction density in the eight-way RIL by sibling. The FGLs of eight founders are A-H from left to right.
Figure 7The FGL non-exchangeability patterns of the expected ancestral junction density The results are for autosomes of the multi-way RIL by 100 generations of selfing or sibling. The FGLs for founder parents are letters starting from A up to P from left to right.
Evaluation of the prior FGL exchangeability on ancestral inference in the simulated CC population consisting of 100 funnels
| Probability | Simulated data set | EXCH | NON-EXCH | Improvement ( |
|---|---|---|---|---|
| Wrongly assigned | F11-AA | 0.02779 | 0.02715 | 2.3 |
| F11-XX | 0.01821 | 0.01650 | 9.4 | |
| F22-AA | 0.02484 | 0.02476 | 0.3 | |
| F22-XX | 0.01540 | 0.01452 | 5.7 | |
| Wrongly called | F11-AA | 0.01962 | 0.01888 | 3.8 |
| F11-XX | 0.01284 | 0.01175 | 8.5 | |
| F22-AA | 0.01766 | 0.01771 | −0.3 | |
| F22-XX | 0.01170 | 0.01084 | 7.4 |
The percentage decrease of the wrongly assigned (or called) probability for the analysis using the algorithm NON-EXCH, relative to the algorithm EXCH.
One minus the posterior probability of being true ancestral states.
One minus the fraction of called ancestral states being true ancestral states. At each SNP location within an individual, the ancestral state is called by its maximum posterior probability.
Figure 8Evaluation of the recursive algorithms that are applied in the ancestral inference for the first pair of autosomes of the 120 real CC lines. Panels (A-C) show the results for the wrongly assigned probability, the wrongly called probability, and the pedigree inconsistency, respectively. The pedigree inconsistency is measured by the sum of the posterior probabilities over the four mating pairs of founder parents.