| Literature DB >> 27917289 |
Erica A Di Pierro1, Luca Gianfranceschi1, Mario Di Guardo2, Herma Jj Koehorst-van Putten3, Johannes W Kruisselbrink4, Sara Longhi3, Michela Troggio5, Luca Bianco5, Hélène Muranty6, Giulia Pagliarani7, Stefano Tartarini7, Thomas Letschka8, Lidia Lozano Luis8, Larisa Garkava-Gustavsson9, Diego Micheletti5, Marco Cam Bink4, Roeland E Voorrips3, Ebrahimi Aziz3, Riccardo Velasco5, François Laurens6, W Eric van de Weg3.
Abstract
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.Entities:
Year: 2016 PMID: 27917289 PMCID: PMC5120355 DOI: 10.1038/hortres.2016.57
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Identity and origin of the 21 full-sib families used for developing the integrated genetic linkage map (iGLMap)
| 12_B | ‘Generos’ | X-6417 | 48 | INRA_Angers-France | [ |
| 12_E | ‘Generos’ | X-6683 | 58 | INRA_Angers-France | [ |
| 12_F | X-3318 | X-6564 | 48 | INRA_Angers-France | [ |
| 12_I | X-3263 | X-3259 | 47 | INRA_Angers-France | [ |
| 12_J | X-3318 | ‘Galarina’ | 23 | INRA_Angers-France | [ |
| 12_K | X-6679 | X-6808 | 47 | INRA_Angers-France | [ |
| 12_N | X-3305 | X-3259 | 48 | INRA_Angers-France | [ |
| 12_P | ‘Rubinette’ | X-3305 | 48 | INRA_Angers-France | [ |
| DiPr | ‘Discovery’ | ‘Prima’ | 77 | JKI-Germany | [ |
| DLO.12 | 1980-15-25 | 1973-1-41 | 219 | DLO-Netherlands | [ |
| FuGa | ‘Fuji’ | ‘Gala’ | 141 | UNIBO-Italy | [ |
| FuPi | ‘Fuji’ | ‘Pinova’ | 91 | RCL-Italy | [ |
| GaPi | ‘Gala’ | ‘Pinova’ | 40 | RCL-Italy | [ |
| I_BB | X-6417 | X-6564 | 43 | INRA_Angers-France | [ |
| I_CC | X-6679 | ‘Dorianne’ | 50 | INRA_Angers-France | [ |
| I_J | X-3318 | X-3263 | 48 | INRA_Angers-France | [ |
| I_M | X-6683 | X-6681 | 45 | INRA_Angers-France | [ |
| I_W | X-6398 | X-6683 | 44 | INRA_Angers-France | [ |
| JoPr | ‘Jonathan’ | ‘Prima’ | 174 | DLO-Netherlands | [ |
| PiRea | ‘Pinova’ | ‘Reanda’ | 45 | JKI-Germany | [ |
| TeBr | ‘Telamon’ | ‘Braeburn | 202 | KUL-Belgium | [ |
| Total | 1 586 |
These overview data have been partially presented by Bianco et al.[22] The number of genotyped seedlings has been updated after data curation in the current study during the construction of the iGLMap: 16 pairs of identical individuals were discovered across 6 families for which only 1 individual per pair was kept in the final data set; thus, a total of 16 identicals were removed. The involved families were 12_B (1 pair), DLO.12 (6 pairs), FuGa (1 pair), FuPi (1 pair), GaPi (3 pairs), I_M (1 pair), I_W (1 pair), JoPr (1 pair), and PiRea (1 pair). In addition, two individuals, 12_B058 and 12_J025 that showed a very high recombination rate (>5.0) in almost all linkage groups were considered out-crossers and excluded from the final data set. Most of these populations were part of the previous European project HiDRAS,[41] and four of them derived from other previous studies as reported in the last column. Pedigrees of the X-numbered accessions are given in Bink et al.[13]
X-6690.
Figure 1Graphical visualization of the combined HaploBlock and backcross approach presented in the current study. The figure illustrates the main steps of the process with an example from the true data of five families, each represented by seven individuals, two HaploBlocks (HBs) and one individual SNP on linkage group 1. Genotype codes presented here follow the format of JoinMap v3 and later versions for the cross-pollinated (CP) segregation types (Segr), where
Summary of the 21 single-family genetic maps and the final integrated Genetic Map (iGLMap) by their genetic length (a) and number of markers (b)
| LG1 | 61 | 56 | 55 | 65 | 56 | 74 | 46 | 73 | 58 | 79 | 59 | 50 | 64 | 67 | 64 | 84 | 65 | 68 | 64 | 64 | 81 | 64±9.8 | 63.1 |
| LG2 | 91 | 75 | 93 | 70 | 86 | 86 | 70 | 71 | 94 | 93 | 74 | 68 | 64 | 80 | 66 | 92 | 80 | 81 | 82 | 78 | 78 | 80±9.5 | 78.4 |
| LG3 | 88 | 69 | 70 | 80 | 66 | 70 | 66 | 66 | 73 | 80 | 69 | 70 | 58 | 82 | 87 | 70 | 81 | 103 | 87 | 57 | 79 | 75±11.1 | 74 |
| LG4 | 84 | 62 | 69 | 74 | 55 | 67 | 55 | 63 | 73 | 72 | 60 | 67 | 77 | 60 | 70 | 69 | 62 | 77 | 77 | 65 | 74 | 68±7.7 | 65.5 |
| LG5 | 84 | 84 | 90 | 85 | 81 | 90 | 77 | 78 | 108 | 87 | 82 | 95 | 53 | 77 | 80 | 101 | 99 | 89 | 80 | 69 | 87 | 85±11.6 | 77.8 |
| LG6 | 77 | 73 | 78 | 81 | 93 | 96 | 70 | 63 | 69 | 88 | 67 | 68 | 65 | 77 | 76 | 71 | 78 | — | 62 | 59 | 64 | 74±10.2 | 75.3 |
| LG6P1 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 32 | — | — | — | — | — |
| LG6P2 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 66 | — | — | — | — | — |
| LG7 | 91 | 77 | 48 | 88 | 90 | 93 | 75 | 80 | 91 | 88 | 78 | 68 | 69 | 60 | — | 100 | 77 | 71 | 95 | 79 | 84 | 80±12.8 | 82.4 |
| LG7P1 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 60 | — | — | — | — | — | — | — | — |
| LG8 | 70 | 78 | 55 | 68 | 68 | 63 | 53 | 58 | 77 | 98 | 65 | 56 | 66 | 65 | 65 | 68 | 63 | 66 | 84 | 72 | 67 | 68±10.2 | 68.5 |
| LG9 | 64 | 68 | 72 | 76 | 76 | 95 | 68 | 68 | 90 | 68 | 66 | 57 | 61 | 76 | 72 | 62 | 89 | 72 | 73 | 56 | 67 | 71±10.1 | 67.1 |
| LG10 | 90 | 82 | 93 | 78 | 95 | 105 | 80 | 77 | 90 | 94 | 77 | 76 | 57 | 86 | 74 | 100 | 78 | 102 | 76 | 72 | 87 | 84±11.7 | 81.3 |
| LG11 | 80 | 79 | 77 | 68 | 89 | 155 | 79 | 83 | 91 | 96 | 72 | 89 | 67 | 64 | 85 | 85 | 88 | 100 | 83 | 62 | 85 | 85±19.1 | 80.9 |
| LG12 | 92 | 61 | 90 | 59 | 75 | 82 | 76 | 62 | 69 | 77 | 63 | 61 | 54 | 69 | 64 | 77 | 57 | 82 | 68 | 56 | 62 | 69±11 | 65.4 |
| LG13 | 85 | 57 | 62 | 77 | 111 | 121 | 70 | 57 | 70 | 78 | 63 | 68 | 78 | 54 | 81 | 79 | 71 | 61 | 66 | 54 | 65 | 73±17 | 71.4 |
| LG14 | 62 | 54 | 67 | 61 | 62 | 62 | 70 | 67 | 73 | 79 | 57 | 55 | 49 | 53 | 70 | 67 | 87 | 74 | 78 | 52 | 64 | 65±9.9 | 64.4 |
| LG15 | 145 | 103 | 131 | 114 | 143 | 128 | 111 | 103 | 108 | 140 | 112 | 111 | 105 | 135 | 110 | 116 | 101 | 124 | 115 | 106 | 122 | 118±13.8 | 112.2 |
| LG16 | 68 | 72 | 85 | 74 | 87 | 71 | 79 | 53 | 61 | 69 | 70 | 79 | 68 | 74 | 64 | 74 | 82 | 47 | 69 | 84 | 76 | 72±10 | 67.5 |
| LG16P1T | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 33 | — | — | — | — | — |
| LG17 | 92 | 66 | 79 | 75 | 102 | 96 | 61 | 77 | 66 | 74 | 64 | 55 | 71 | 76 | — | 70 | 73 | 86 | 80 | 67 | 76 | 75±11.8 | 71.8 |
| LG17P1 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 56 | — | — | — | — | — | — | — | — |
| LG17P2 | — | — | — | — | — | — | — | — | — | — | — | — | — | 42 | — | — | — | — | — | — | — | — | |
| Total | 1424 | 1216 | 1312 | 1290 | 1434 | 1551 | 1205 | 1197 | 1360 | 1460 | 1198 | 1195 | 1123 | 1256 | 1232 | 1384 | 1332 | 1418 | 1339 | 1153 | 1319 | 1305±113.3 | 1267 |
| LG1 | 305 | 276 | 296 | 178 | 287 | 189 | 206 | 225 | 340 | 289 | 275 | 314 | 340 | 254 | 146 | 270 | 276 | 241 | 407 | 270 | 265 | 269±59.5 | 672 |
| LG2 | 453 | 320 | 519 | 468 | 553 | 407 | 537 | 543 | 523 | 431 | 507 | 528 | 548 | 431 | 410 | 385 | 436 | 410 | 597 | 491 | 548 | 478±70.5 | 1 042 |
| LG3 | 413 | 420 | 444 | 489 | 362 | 426 | 382 | 460 | 368 | 523 | 418 | 465 | 452 | 480 | 427 | 268 | 323 | 424 | 509 | 474 | 495 | 430±62.8 | 951 |
| LG4 | 291 | 309 | 354 | 353 | 190 | 308 | 320 | 447 | 368 | 338 | 422 | 356 | 346 | 357 | 292 | 413 | 392 | 372 | 453 | 405 | 436 | 358±62.4 | 804 |
| LG5 | 390 | 519 | 491 | 464 | 357 | 512 | 527 | 578 | 548 | 500 | 511 | 544 | 483 | 592 | 408 | 364 | 442 | 485 | 639 | 534 | 594 | 499±75.6 | 1 114 |
| LG6 | 230 | 261 | 362 | 241 | 354 | 255 | 263 | 243 | 378 | 255 | 382 | 345 | 385 | 352 | 293 | 303 | 359 | — | 408 | 409 | 433 | 326±65.6 | 741 |
| LG6P1 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 29 | — | — | — | — | — | |
| LG6P2 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 246 | — | — | — | — | — | |
| LG7 | 279 | 288 | 252 | 230 | 293 | 288 | 187 | 208 | 268 | 217 | 334 | 304 | 226 | 340 | — | 224 | 374 | 295 | 357 | 234 | 365 | 278±55.7 | 674 |
| LG7P1 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 53 | — | — | — | — | — | — | — | — |
| LG8 | 367 | 305 | 386 | 414 | 351 | 348 | 431 | 415 | 258 | 404 | 410 | 379 | 382 | 432 | 459 | 311 | 302 | 324 | 236 | 326 | 477 | 367±63.9 | 862 |
| LG9 | 405 | 419 | 366 | 327 | 425 | 339 | 273 | 281 | 372 | 355 | 383 | 333 | 399 | 410 | 304 | 364 | 333 | 436 | 397 | 374 | 447 | 369±48.9 | 839 |
| LG10 | 459 | 323 | 382 | 399 | 424 | 404 | 402 | 421 | 509 | 460 | 542 | 448 | 458 | 457 | 345 | 438 | 402 | 411 | 560 | 506 | 498 | 440±60.1 | 1 049 |
| LG11 | 539 | 557 | 474 | 434 | 444 | 418 | 324 | 548 | 497 | 444 | 599 | 550 | 539 | 551 | 407 | 504 | 392 | 453 | 586 | 376 | 494 | 482±74.6 | 1 085 |
| LG12 | 457 | 471 | 396 | 559 | 425 | 332 | 495 | 479 | 482 | 428 | 395 | 437 | 295 | 503 | 311 | 429 | 472 | 410 | 589 | 482 | 579 | 449±78.4 | 1 000 |
| LG13 | 437 | 511 | 417 | 438 | 378 | 445 | 397 | 431 | 447 | 333 | 357 | 391 | 432 | 428 | 303 | 410 | 455 | 291 | 567 | 415 | 473 | 417±63.7 | 844 |
| LG14 | 414 | 414 | 432 | 327 | 417 | 416 | 252 | 353 | 364 | 325 | 410 | 438 | 436 | 389 | 303 | 386 | 343 | 380 | 382 | 381 | 380 | 378±47.8 | 813 |
| LG15 | 681 | 663 | 483 | 520 | 481 | 508 | 656 | 576 | 578 | 424 | 670 | 641 | 588 | 713 | 498 | 561 | 713 | 481 | 826 | 645 | 596 | 595±99.9 | 1 260 |
| LG16 | 314 | 436 | 391 | 452 | 273 | 458 | 435 | 383 | 426 | 466 | 429 | 398 | 431 | 459 | 384 | 396 | 362 | 250 | 555 | 466 | 419 | 409±68.6 | 834 |
| LG16P1T | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 94 | — | — | — | — | — |
| LG17 | 314 | 324 | 307 | 310 | 337 | 251 | 150 | 247 | 370 | 301 | 379 | 367 | 163 | 352 | — | 354 | 312 | 318 | 386 | 360 | 419 | 316±69.0 | 833 |
| LG17P1 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 55 | — | — | — | — | — | — | — | — |
| LG17P2 | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 172 | — | — | — | — | — | — | — | — |
| Total | 6748 | 6816 | 6752 | 6603 | 6351 | 6 304 | 6237 | 6838 | 7096 | 6493 | 7423 | 7238 | 6903 | 7500 | 5570 | 6380 | 6688 | 6350 | 8454 | 7148 | 7918 | 6848±634 | 15 417 |
| Avg. dist. (cM) | 0.21 | 0.18 | 0.19 | 0.2 | 0.23 | 0.25 | 0.19 | 0.18 | 0.19 | 0.22 | 0.16 | 0.17 | 0.16 | 0.17 | 0.22 | 0.22 | 0.2 | 0.22 | 0.16 | 0.16 | 0.17 | 0.19 | |
The size of the genetic maps in cM (a) and the number of SNP markers mapped (b), are reported for each of the 21 full-sib families, together with average values (±s.d.) across families (Pop Mean) and the final integrated Genetic Linkage Map (iGLMap). The table includes some single-parent maps that could not be integrated due to the lack of bi-parental markers (LG6 and LG16 in I_W, and LG7 and LG17 in I_CC). Average marker distance (Avg. dist.), is estimated as total genetic length divided by the total number of markers, is reported at the bottom of the table.
The parental maps of LG6 and LG17, in family I_W and I_CC, respectively, could not be integrated due to the absence of bi-parental markers and had to be considered separately. Their total length is therefore an estimate, derived by the sum of the paternal maps (cM) to the non-overlapping region of the maternal map, obtained from the alignment of the single-parent maps to the iGLMap.
In family I_CC, the genetic maps of LG7 is given by the maternal parent data only, as the paternal parent did not show any polymorphic marker.
The top part of the LG16 maternal map in family I_W (LG16P1T) could not be integrated to the entire map due to homozygosity in the paternal LG16 for that region.
For family 12_J, only 2,969 of the 6,351 markers were used in the construction of the iGLMap as a set of 3382 identical markers sensu JoinMap had been overlooked due to an administrative error. The identity of these markers is given in Supplementary File 2, together with their genetic positions on the bi-parental map and their progeny genotypes for family 12_J.
Summary of the number of available, validated and mapped SNPs markers
| FB | 14 628 | 12 508 | 12 349 |
| Customized markers | 103 | 60 | 53 |
| IRSC–RosBREED | 2750 | 2601 | 2481 |
| IRSC–GDsnp | 538 | 528 | 520 |
| Multi-locus SNPs replicates | 115 | 14 | |
| Total | 18 019 | 15 812 | 15 417 |
Abbreviations: FB, FruitBreedomics, designed in the 20 K Infinium SNP array development;[22] IRSC, RosBREED and GDsnp markers, from the previously developed International Rosaceae SNP Consortium (IRSC) 8 K Infinium SNP array by Chagnè et al.;[37] SNP, single-nucleotide polymorphism.
Starting from the available markers as present on the 20 K Infinium SNP array,[22] the number of validated SNPs (as the number of markers that mapped in at least one of the 21 individual families) and the number of SNP markers successful mapped on the integrated Genetic Linkage Map (iGL-mapped SNPs) are reported for each source type. Due to the presence of 111 multi-locus SNPs, the total count of validated and mapped SNPs includes multi-locus SNPs replicates as shown in the table.
Summary of designed and mapped HaploBlocks (HBs)
| FB | 10 | 1696 | 1670 | 11 |
| FB+IRSC | 20 | 642 | 640 | 15 |
| IRSC | 100 | 499 | 487 | 8 |
| Total | 2837 | 2797 |
Abbreviations: FB, FruitBreedomics; HB, HaploBlock; IRSC, International Rosaceae SNP Consortium; SNP, single-nucleotide polymorphism.
The number of designed and mapped HBs is reported for each SNP source type together with the physical distance range that defined the HB. The range of clustered SNPs per HB went from two up to 15 SNPs (FB+IRSC markers).
See legend of Table 3.
Figure 2Frequency distribution of informative meiosis (in percentage) in the initial set of individual SNP markers (light gray bars) and the final set of HaploBlock (HB) together with remaining individual SNP markers (dark gray bars). The graph highlights the different amount of informative meiosis carried by individual SNPs and the more informative HB markers. SNP markers carried a maximum of 50% of the total information when being completely bi-allelic, as expected, and a maximum of 60% when being tri-allelic in some families when accounting for null-alleles and signal intensity differences. However, the latter is true only for a small proportion (0.1%) of the SNPs, while the majority of SNPs is informative for 20–40% of the individuals. On the contrary, the majority of HB markers (+remaining single SNP) are informative for 40–80% of the individuals across all families and even 8.6% of the HBs is fully informative (100%).
Final integrated Genetic Linkage Map (iGLMap) and core map (values in brackets), summarized per linkage group (LG) by number of HB-markers, genetic length, average and maximum interval between adjacent markers
| LG1 | 132 (123) | 43 (16) | 63.08 (63.73) | 0.36±0.37 (0.46±0.50) | 1.68 (3.42) |
| LG2 | 181 (170) | 63 (32) | 78.42 (77.73) | 0.32±0.27 (0.39±0.34) | 1.42 (1.47) |
| LG3 | 183 (171) | 39 (23) | 73.95 (76.43) | 0.31±0.22 (0.40±0.33) | 1.53 (1.99) |
| LG4 | 139 (133) | 30 (18) | 65.51 (66.10) | 0.39±0.37 (0.44±0.40) | 1.84 (1.82) |
| LG5 | 201 (192) | 40 (18) | 77.84 (78.66) | 0.32±0.32 (0.38±0.45) | 1.84 (3.45) |
| LG6 | 146 (135) | 34 (15) | 75.26 (76.60) | 0.42±0.46 (0.52±0.55) | 3.29 (2.99) |
| LG7 | 133 (121) | 27 (12) | 82.39 (82.83) | 0.52±0.52 (0.63±0.70) | 2.99 (5.20) |
| LG8 | 147 (135) | 32 (15) | 68.52 (66.04) | 0.38±0.42 (0.44±0.45) | 2.6 (2.55) |
| LG9 | 151 (144) | 50 (28) | 67.08 (66.79) | 0.34±0.34 (0.39±0.39) | 2.65 (2.58) |
| LG10 | 186 (174) | 45 (23) | 81.3 (81.93) | 0.35±0.39 (0.42±0.46) | 2.66 (2.71) |
| LG11 | 186 (176) | 45 (20) | 80.94 (83.22) | 0.35±0.34 (0.43±0.40) | 1.96 (2.27) |
| LG12 | 169 (161) | 39 (26) | 65.44 (66.26) | 0.32±0.28 (0.36±0.32) | 1.53 (1.50) |
| LG13 | 164 (149) | 29 (17) | 71.36 (71.50) | 0.37±0.36 (0.43±0.44) | 2.59 (2.65) |
| LG14 | 145 (135) | 34 (15) | 64.39 (66.03) | 0.36±0.41 (0.44±0.45) | 3.02 (3.13) |
| LG15 | 234 (227) | 69 (34) | 112.15 (112.98) | 0.37±0.40 (0.44±0.52) | 3.17 (3.30) |
| LG16 | 151 (150) | 30 (21) | 67.48 (68.43) | 0.38±0.45 (0.40±0.53) | 2.92 (3.90) |
| LG17 | 149 (135) | 27 (11) | 71.80 (73.68) | 0.41±0.41 (0.51±0.58) | 2.31 (4.10) |
| Total | 2797 (2631) | 676 (344) | 1266.88 (1278.84) | ||
| Average | 165 (155) | 40 (20) | 74.52 (75.23) | 0.37 (0.44) | |
Figure 3Visual presentation of the distribution of the iGLMap haploblock markers (red lines) and single SNP markers (blue lines) over the 17 linkage group apple. Position and name of the markers is detailed in Supplementary Figure S3.
Figure 4Collinearity between the integrated Genetic Linkage Map (iGLMap) and v1 (a) and v3 (b) of the apple genome sequence. Physical coordinates for SNP markers were retrieved from v1 (15 133 SNP markers traced) and v3 (14 539 SNP markers traced) of the Apple Genome and their physical positions (Mb) are plotted versus marker genetic positions (cM). Filled dots and triangles indicate markers physically positioned on the Primary Assembly sequence and alternative scaffolds sequences, respectively. Different colors indicate different pseudo-chromosomes according to the legend in the figure; thus, markers inconsistently assigned to Linkage Groups (LGs) and pseudo-chromosome are immediately highlighted.