| Literature DB >> 32024121 |
Jorge Marques da Silva1, Andreia Figueiredo1, Jorge Cunha2, José Eduardo Eiras-Dias2, Sara Silva3, Leonardo Vanneschi3,4, Pedro Mariano1.
Abstract
When a dark-adapted leaf is illuminated with saturating light, a fast polyphasic rise of fluorescence emission (Kautsky effect) is observed. The shape of the curve is dependent on the molecular organization of the photochemical apparatus, which in turn is a function of the interaction between genotype and environment. In this paper, we evaluate the potential of rapid fluorescence transients, aided by machine learning techniques, to classify plant genotypes. We present results of the application of several machine learning algorithms (k-nearest neighbors, decision trees, artificial neural networks, genetic programming) to rapid induction curves recorded in different species and cultivars of vine grown in the same environmental conditions. The phylogenetic relations between the selected Vitis species and Vitis vinifera cultivars were established with molecular markers. Both neural networks (71.8%) and genetic programming (75.3%) presented much higher global classification success rates than k-nearest neighbors (58.5%) or decision trees (51.6%), genetic programming performing slightly better than neural networks. However, compared with a random classifier (success rate = 14%), even the less successful algorithms were good at the task of classifying. The use of rapid fluorescence transients, handled by genetic programming, for rapid preliminary classification of Vitis genotypes is foreseen as feasible.Entities:
Keywords: Kautsky effect; Vitis; artificial neural networks; chlorophyll a fluorescence; decision trees; genetic programming; k-nearest neighbors; molecular markers; photosynthesis
Year: 2020 PMID: 32024121 PMCID: PMC7076723 DOI: 10.3390/plants9020174
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Phenogram of the genotyped Vitis samples based on the unweighted pair group method with arithmetic mean averages with a squared distances matrix, generated with allelic data from the nine SSR polymorphisms analyzed.
Success rate and parameterization of the machine learning methods used.
| Method | Success Rate | Main Parameters |
|---|---|---|
| K-nearest neighbors | 58.5% | Number of neighbors: 5 |
| Decision tree | 51.6% | Split criterion: entropy |
| Neural network | 71.8% | Number of neurons: 5000 |
| Genetic programming | 75.3% | Number of individuals: 250 |
Figure 2Overlap of all rapid fluorescence induction curves measured.
Figure 3Confusion matrices for all the methods, using the parameterizations specified in Table 1.
Figure 4K-nearest neighbors (KNN) output for each of the possible seven input Vitis genotypes. KNN used k = 5 as the number of neighbors with which to perform the classification. Global success rate is 58.5%.
Figure 5Decision trees (DT) output for each of the possible seven input Vitis genotypes. DT were defined using the entropy criterion, a maximum depth of 19 and a minimum number of samples of five. Global success rate is 51.6%.
Figure 6Neural networks (NN) output for each of the seven possible inputs. NN were defined with 5000 neurons, a single hidden layer, and the neurons in the hidden layer used the logistic activation function. Global success rate is 71.8%.
Figure 7Genetic programming (GP) classifier output for each of the seven possible inputs. Results were obtained using a population of 250 individuals allowed to evolve for at most 100 generations. Global success rate is 75.3%.
Figure 8Histogram of Kautsky curve time point usage in the DT and GP models obtained.
Grapevine genotypes used in this study. Grapevine species, variety and V. vinifera name, accession on the Portuguese National Ampelographic Collection, number on the Vitis International Variety Catalogue [38], leaf traits and country of origin are shown.
| Genotype | Variety | Acession PRT051 | VIVC | Photo (VIVC) * | Leaf Colour | Leaf Bright | Country of Origin |
|---|---|---|---|---|---|---|---|
| Rupestris du Lot | 13,821 | 10,389 | Light green | bright | France | ||
| Riparia Gloire de Montpellier | 13,822 | 4824 | Dark green | dull | France | ||
| Isabella | 13,619 | 5560 | Dark green | dull | United States of America | ||
| Pinot Noir | 10,918 | 9279 | green | dull | France | ||
| Cabernet Sauvignon | 10,714 | 1929 | Light green | Slightly bright | France | ||
| Riesling Weiss | 13,413 | 10,077 | Dark green | Slightly bright | Germany | ||
| Trincadeira | 11,402 | 15,685 | Dark green | Very bright | Portugal |
* Photo credit: Julius Kühn-Institut, Institute for Grapevine Breeding Geilweilerhof, Germany—Vitis International Variety Catalogue—www.vivc.de—(September 2019).
Name, linkage group, microsatellite sequences and references of the simple sequence repeats (SSRs) markers used in this study.
| SSR Name | Linkage Group | Microsatellite Repeat Motif | Reference |
|---|---|---|---|
| VVS2 | 11 | (GA)n | Thomas and Scott [ |
| VVMD5 | 16 | (CT)nAT(CT)nATAG(AT)n | Bowers and Meredith [ |
| VVMD7 | 7 | (CT)n | Bowers and Meredith [ |
| VVMD25 | 11 | (CT)n | Bowers et al. [ |
| VVMD27 | 5 | (CT)n | Bowers et al. [ |
| VVMD28 | 3 | (CT)n | Bowers et al. [ |
| VVMD32 | 4 | (CT)n | Bowers et al. [ |
| VRZAG62 | 7 | (GA)n | Sefc et al. [ |
| VRZAG79 | 5 | (GA)n | Sefc et al. [ |