| Literature DB >> 26312996 |
Carla Lang1, Flávia Regina Capellotto Costa2, José Luís Campana Camargo3, Flávia Machado Durgante4, Alberto Vicentini4.
Abstract
Precise identification of plant species requires a high level of knowledge by taxonomists and presence of reproductive material. This represents a major limitation for those working with seedlings and juveniles, which differ morphologically from adults and do not bear reproductive structures. Near-infrared spectroscopy (FT-NIR) has previously been shown to be effective in species discrimination of adult plants, so if young and adults have a similar spectral signature, discriminant functions based on FT-NIR spectra of adults can be used to identify leaves from young plants. We tested this with a sample of 419 plants in 13 Amazonian species from the genera Protium and Crepidospermum (Burseraceae). We obtained 12 spectral readings per plant, from adaxial and abaxial surfaces of dried leaves, and compared the rate of correct predictions of species with discriminant functions for different combinations of readings. We showed that the best models for predicting species in early developmental stages are those containing spectral data from both young and adult plants (98% correct predictions of external samples), but even using only adult spectra it is still possible to attain good levels of identification of young. We obtained an average of 75% correct identifications of young plants by discriminant equations based only on adults, when the most informative wavelengths were selected. Most species were accurately predicted (75-100% correct identifications), and only three had poor predictions (27-60%). These results were obtained despite the fact that spectra of young individuals were distinct from those of adults when species were analyzed individually. We concluded that FT-NIR has a high potential in the identification of species even at different ontogenetic stages, and that young plants can be identified based on spectra of adults with reasonable confidence.Entities:
Mesh:
Year: 2015 PMID: 26312996 PMCID: PMC4551484 DOI: 10.1371/journal.pone.0134521
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of specimens used to obtain FT-NIR spectra.
| Species | Seedlings/Juveniles | Mature |
|---|---|---|
|
| 12 | 24 |
|
| 10 | 13 |
|
| 7 | 10 |
|
| 20 | 19 |
|
| 19 | 16 |
|
| 18 | 15 |
|
| 10 | 20 |
|
| 9 | 15 |
|
| 12 | 13 |
|
| 8 | 13 |
|
| 11 | 16 |
|
| 9 | 10 |
|
| 5 | 12 |
|
|
|
|
Description of tests based on FT-NIR spectra to discriminate between Burseraceae species.
| Linear Discriminant Analyses (LDAs) Models | ||||
|---|---|---|---|---|
| Dataset | Adult Model | Young Model | Combined Model | |
| Mean of readings | Adaxial+Abaxial | x | x | x |
| Adaxial | x | x | x | |
| Abaxial | x | x | x | |
| Single spectrum | x | x | ||
Model Adult, function generated with samples of adults only. Validation: samples of seedlings and young plants; Young Model, function generated with samples of seedlings/young only. Validation: samples of adults; Combined Model, function generated with 2/3 of the total sample, all developmental stages combined. Validation: the remaining 1/3 of the total sample. All listed combinations were tested both with all wavelength components, as with those identified by stepwise analysis as the most informative.
Results of discriminat analysis using the average of the readings and all wavelengths of the spectrum FT-NIR.
| Adult Model | Young Model | Combined Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset | Dataset | Dataset | |||||||||
| Species | MeanAd+Ab | MeanAd | MeanAb | Single Spectrum | MeanAd+Ab | MeanAd | MeanAb | Single Spectrum | MeanAd+Ab | MeanAd | MeanAb |
|
| 66 | 50 | 59 | 54 | 100 | 100 | 100 | 96 | 100 | 100 | 100 |
|
| 71 | 87 | 71 | 86 | 0 | 8 | 31 | 65 | 100 | 100 | 100 |
|
| 80 | 100 | 100 | 100 | 70 | 50 | 70 | 20 | 100 | 100 | 100 |
|
| 25 | 15 | 30 | 12 | 95 | 100 | 100 | 95 | 100 | 91 | 91 |
|
| 47 | 57 | 21 | 53 | 12 | 0 | 19 | 12 | 98 | 98 | 90 |
|
| 22 | 55 | 94 | 47 | 85 | 77 | 73 | 76 | 100 | 100 | 100 |
|
| 100 | 100 | 90 | 98 | 100 | 95 | 95 | 100 | 100 | 100 | 100 |
|
| 0 | 0 | 0 | 0 | 100 | 100 | 100 | 80 | 100 | 100 | 100 |
|
| 66 | 100 | 58 | 27 | 69 | 92 | 69 | 100 | 100 | 100 | 100 |
|
| 100 | 100 | 100 | 100 | 92 | 100 | 100 | 73 | 100 | 100 | 100 |
|
| 36 | 36 | 45 | 64 | 100 | 87 | 87 | 86 | 100 | 100 | 100 |
|
| 11 | 33 | 33 | 11 | 80 | 70 | 80 | 70 | 100 | 100 | 100 |
|
| 100 | 100 | 100 | 100 | 41 | 58 | 33 | 75 | 100 | 100 | 100 |
|
|
|
|
|
|
|
|
|
|
|
|
|
Mean Ad +Ab, the average of 12 readings; Mean Ad, the average of the readings of the adaxial surface; Mean Ab, the average of the readings abaxial surface; Single reading, a randomly selected reading. Hit percentage for each species and average percentage for models generated with each data set.
Fig 1Matrices with the results of Discriminant Analysis for the three models (Adult, Young and Combined) and using average of 12 readings (abaxial+adaxial), and for both all wavelengths (left) and stepwise selected wavelengths (right).
The observed species names are given in columns, while predicted names are given in rows. Therefore, the values on the diagonal are correct predictions, and off-diagonal values are wrong predictions. Abbreviations: C. rho = Crepidospermum rhoifolium; P. api = Protium apiculatum; P. dec = Protium decandrum; P. gra = Protium grandifolium; P. heb.A = Protium hebetatum forma A; P. heb. B = Protium hebetatum forma B; P. kru = Protium krukoffi; P. occ = Protium occultum; P. pal = Protium pallidum; P. pan.n = Protium paniculatum var. nova; P.pan.r = Protium paniculatum var. reidelianum; P. sag = Protium sagotianum; P. sub = Protium subserratum.
Results of discriminant analysis based on the average of 12 readings and the most informative wavelengths selected by stepwise modelling.
| Adult Model | Young Model | Combined Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset | Dataset | Dataset | |||||||||
| Species | MeanAd+Ab | MeanAd | MeanAb | Single Spectrum | MeanAd+Ab | MeanAd | MeanAb | Single Spectrum | MeanAd+Ab | MeanAd | MeanAb |
|
| 75 | 100 | 33 | 67 | 91 | 96 | 83 | 96 | 100 | 100 | 100 |
|
| 100 | 100 | 100 | 93 | 61 | 54 | 69 | 0 | 100 | 100 | 100 |
|
| 100 | 100 | 100 | 90 | 30 | 20 | 30 | 70 | 87 | 100 | 75 |
|
| 60 | 20 | 65 | 29 | 100 | 100 | 95 | 92 | 91 | 77 | 71 |
|
| 84 | 68 | 79 | 47 | 0 | 0 | 6,0 | 53 | 60 | 70 | 41 |
|
| 89 | 83 | 89 | 16 | 35 | 40 | 35 | 69 | 100 | 100 | 100 |
|
| 60 | 50 | 100 | 100 | 80 | 70 | 95 | 100 | 100 | 100 | 100 |
|
| 89 | 11 | 55 | 0 | 80 | 60 | 87 | 80 | 100 | 100 | 100 |
|
| 67 | 58 | 92 | 11 | 92 | 85 | 54 | 84 | 100 | 100 | 86 |
|
| 75 | 0 | 100 | 98 | 100 | 100 | 100 | 84 | 100 | 100 | 100 |
|
| 27 | 36 | 18 | 45 | 81 | 75 | 75 | 68 | 100 | 87 | 87 |
|
| 89 | 100 | 67 | 44 | 50 | 80 | 10 | 30 | 100 | 100 | 100 |
|
| 100 | 100 | 100 | 100 | 80 | 80 | 80 | 43 | 100 | 100 | 100 |
|
|
|
|
|
|
|
|
|
|
|
|
|
Mean Ad+Ab, the average of 12 readings; Mean Ad. the average of the readings adaxial surface (Ad); Mean Ab, the average of the readings adaxial surface; Single Reading, a randomly selected reading. Hit percentage for each species and average percentage for models generated with each set of data are given.
Fig 2(a, c, e) Graphic representation of a two dimensional Principal Component Analysis (PCA) of FT-NIR spectra of young and adult individuals. (b, d, f) Representation of full spectra of individuals. Spectra is composed of 1557 wavelengths, for the average of 12 readings per individual. Young plants in black, adult plants in red. (a, b) Protium grandifolium; (c, d) Protium subserratum (e, f) Protium apiculatum.