| Literature DB >> 28049428 |
Fabian Grandke1,2, Soumya Ranganathan3, Nikkie van Bers4, Jorn R de Haan4, Dirk Metzler3.
Abstract
BACKGROUND: A large share of agriculturally and horticulturally important plant species are polyploid. Linkage maps are used to locate associations between genes and traits by breeders and geneticists. Linkage map creation for polyploid species is not supported by standard tools. We want to overcome this limitation and validate our results with simulation studies.Entities:
Keywords: Heuristic; Linkage mapping; Polyploids
Mesh:
Year: 2017 PMID: 28049428 PMCID: PMC5210299 DOI: 10.1186/s12859-016-1416-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Ordering of recombination matrix. The three stages of ordering (a, b, c) visualized with the pairwise recombination frequency matrix. Each row and column represents one marker. Dark and light shades of grey indicate low and high recombination frequencies, respectively. a) The markers are in random order. The diagonal is dark because the recombination frequency of a matrix with itself is zero. b) The markers are ordered according to their linkage groups. Seven separated rectangles are formed and easy to distinguish. c) The markers are ordered within each linkage group. Most of the low values moved to the diagonal
Observable numbers of recombination events between two biallelic tetraploid markers
| AAAA | AAAB | AABB | ABBB | BBBB | |
|---|---|---|---|---|---|
| AAAA | 0 | 1 | 2 | 3 | 4 |
| AAAB | 1 | 0 | 1 | 2 | 3 |
| AABB | 2 | 1 | 0 | 1 | 2 |
| ABBB | 3 | 2 | 1 | 0 | 1 |
| BBBB | 4 | 3 | 2 | 1 | 0 |
A and B are the major and minor alleles, respectively. Higher numbers of recombination are possible due to double crossovers (i.e. AAAA /AAAA could be 2 or 4), but ignored by the heuristic
Pairwise distance between the six markers A-F
| A | B | C | D | E | |
|---|---|---|---|---|---|
| B | 2 | ||||
| C |
|
| |||
| D |
|
| 2 | ||
| E | 8 | 7 |
|
| |
| F | 12 | 10 |
|
| 3 |
The bold values indicate equal distances to neighboring markers and thus, ambiguous marker orders. The underlined values are taken into account in the ordering step of PERGOLA to obtain a deterministic result
Fig. 2Goodman-Kruskal correlations. Goodman-Kruskal correlation values of simulated hexaploid data sets and corresponding linkage maps generated by PERGOLA. The x-axis shows four groups with different error values, indicating the amount of errors introduced to the data. The y-axis shows the mean Goodman-Kruskal correlation value for 100 simulations per parameter combination. The standard errors are represented by bars. Each group consists of four differently colored bars, indicating different rates of missing values
Fig. 3Global linkage map comparison - PERGOLA and JoinMap®;. Comparison of the linkage map created by PERGOLA and JoinMap®;. Both are split into ten linkage groups, highlighted by different shades of gray. The linkage groups consist of the same markers. White spaces indicate differences in the marker ordering
Pairwise correlations between the four maps. The bottom and top triangles show cophenetic and Goodman-Kruskal correlations, respectively
| PERGOLA | R/qtl | JoinMap®; | MapMaker | |
|---|---|---|---|---|
| PERGOLA | - | 0.999 | 0.999 | 0.999 |
| R/qtl | 0.930 | - | 0.999 | 0.999 |
| JoinMap®; | 0.938 | 0.922 | - | 0.999 |
| MapMaker | 0.961 | 0.928 | 0.915 | - |
Fig. 4Diploid simulation study result. We simulated six setups of diploid populations with two chromosomes and repeated each 100 times. We used population sizes of 50, 100 or 200 and 10 or 20 markers per chromosome. We applied PERGOLA and R/qtl to calculate linkage maps which were compared with the reference map. The bars show the mean correlation value of 100 repetitions and the error bars indicate the standard error