| Literature DB >> 28791035 |
Julie Godbout1, Laurence Tremblay2, Caroline Levasseur1, Patricia Lavigne1, André Rainville3, John Mackay4, Jean Bousquet5, Nathalie Isabel1,5.
Abstract
Biological material is at the forefront of research programs, as well as application fields such as breeding, aquaculture, and reforestation. While sophisticated techniques are used to produce this material, all too often, there is no strict monitoring during the "production" process to ensure that the specific varieties are the expected ones. Confidence rather than evidence is often applied when the time comes to start a new experiment or to deploy selected varieties in the field. During the last decade, genomics research has led to the development of important resources, which have created opportunities for easily developing tools to assess the conformity of the material along the production chains. In this study, we present a simple methodology that enables the development of a traceability system which, is in fact a by-product of previous genomic projects. The plant production system in white spruce (Picea glauca) is used to illustrate our purpose. In Quebec, one of the favored strategies to produce elite varieties is to use somatic embryogenesis (SE). In order to detect human errors both upstream and downstream of the white spruce production process, this project had two main objectives: (i) to develop methods that make it possible to trace the origin of plants produced, and (ii) to generate a unique genetic fingerprint that could be used to differentiate each embryogenic cell line and ensure its genetic monitoring. Such a system had to rely on a minimum number of low-cost DNA markers and be easy to use by non-specialists. An efficient marker selection process was operationalized by testing different classification methods on simulated datasets. These datasets were generated using in-house bioinformatics tools that simulated crosses involved in the breeding program for which genotypes from hundreds of SNP markers were already available. The rate of misidentification was estimated and various sources of mishandling or contamination were identified. The method can easily be applied to other production systems for which genomic resources are already available.Entities:
Keywords: SNP; breeding program; genomic fingerprint; parental assignment; quality control; somatic embryogenesis; tissue culture; traceability
Year: 2017 PMID: 28791035 PMCID: PMC5524734 DOI: 10.3389/fpls.2017.01264
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Overview of the breeding strategy for white spruce as described in this paper. Field sites abbreviations: GP, Grandes-Piles; STM, St-Modeste.
Figure 2Overview of the strategy used to develop a traceability system for the white spruce case study (steps 1–4).
Summary statistics for each SNP locus of the Sequenom® genotyping array.
| A-001 | ss538953861 | 0.404 | 0.484 | 0.367 | 0.724 |
| A-002 | ss538943407 | 0.204 | 0.280 | 0.242 | 0.800 |
| A-003 | ss538953915 | 0.412 | 0.455 | 0.352 | 0.733 |
| A-004 | ss538944742 | 0.399 | 0.491 | 0.371 | 0.722 |
| A-005 | ss511222774 | 0.507 | 0.495 | 0.374 | 0.718 |
| A-006 | ss524300150 | 0.288 | 0.338 | 0.281 | 0.776 |
| A-007 | ss538941338 | 0.007 | 0.007 | 0.007 | 0.993 |
| A-008 | ss538942949 | 0.474 | 0.484 | 0.367 | 0.724 |
| A-009 | ss538954305 | 0.396 | 0.407 | 0.324 | 0.749 |
| A-010 | ss538943025 | 0.494 | 0.491 | 0.371 | 0.722 |
| A-011 | ss511222888 | 0.228 | 0.255 | 0.223 | 0.814 |
| A-012 | ss538940594 | 0.448 | 0.498 | 0.374 | 0.719 |
| A-013 | ss538942166 | 0.447 | 0.498 | 0.374 | 0.719 |
| A-014 | ss538954139 | 0.432 | 0.447 | 0.347 | 0.736 |
| A-015 | ss538943850 | 0.437 | 0.500 | 0.375 | 0.719 |
| A-017 | ss538953932 | 0.376 | 0.436 | 0.341 | 0.740 |
| A-019 | ss538953749 | 0.525 | 0.465 | 0.357 | 0.730 |
| A-020 | ss538941729 | 0.368 | 0.365 | 0.299 | 0.765 |
| A-021 | ss538942910 | 0.510 | 0.492 | 0.371 | 0.721 |
| A-022 | ss538944651 | 0.465 | 0.439 | 0.342 | 0.739 |
| A-023 | ss538942912 | 0.514 | 0.492 | 0.371 | 0.721 |
| A-024 | ss538941273 | 0.456 | 0.493 | 0.372 | 0.721 |
| A-025 | ss538946011 | 0.374 | 0.403 | 0.322 | 0.751 |
| A-026 | ss538951794 | 0.427 | 0.418 | 0.331 | 0.746 |
| A-027 | ss538945503 | 0.433 | 0.459 | 0.353 | 0.732 |
| A-028 | ss538940804 | 0.438 | 0.483 | 0.366 | 0.724 |
| A-029 | ss538945192 | 0.428 | 0.429 | 0.337 | 0.742 |
| A-030 | ss538951576 | 0.507 | 0.477 | 0.363 | 0.726 |
| A-031 | ss538940956 | 0.486 | 0.474 | 0.362 | 0.727 |
| A-033 | ss538944911 | 0.478 | 0.476 | 0.363 | 0.726 |
| A-034 | ss538945049 | 0.428 | 0.465 | 0.357 | 0.730 |
| A-036 | ss538942739 | 0.085 | 0.081 | 0.078 | 0.927 |
| A-039 | ss538945302 | 0.302 | 0.299 | 0.254 | 0.793 |
| A-040 | ss538954082 | 0.385 | 0.405 | 0.323 | 0.750 |
| A-041 | ss538940617 | 0.395 | 0.407 | 0.324 | 0.749 |
| A-042 | ss538944866 | 0.533 | 0.498 | 0.374 | 0.719 |
H.
Figure 3Assessment of the robustness of the assignment exclusion method FAP (A) and of the parental allocation method Papa (B) for detecting the presence of missing data and/or genotyping errors using simulated progeny datasets. Circle size represents the percentage of errors measured.
Figure 4Number of cell lines genotyped and analyzed in this project (in parentheses, the number of trees sampled). Each circle represents one type of material: on the left, embryogenic tissues in cryobanks and on the right, trees in two separate field tests.
Types and frequency of the different errors found among the individuals and stages analyzed.
| Controlled crosses | 43.7 | Correct mother/wrong father | Pollen contamination |
| Controlled crosses | 21.4 | Wrong mother and sometimes wrong father | Misidentification of parental trees in the seed orchard |
| Somatic embryogenesis laboratory, cutting production or planting | 30.2 | Correct embryogenic tissue in cryobanks, but errors found among trees in the field | Misidentification of the material |
| Cryobanks | 4.8 | Wrong embryogenic tissue, but correct trees planted in the field | Misidentification of the embryogenic tissue |
The percentage is the proportion of the 112 cell lines identified as problematic.