| Literature DB >> 34753418 |
Claire A Holden1, Camilo L M Morais2, Jane E Taylor3, Francis L Martin4, Paul Beckett5, Martin McAinsh3.
Abstract
BACKGROUND: Japanese knotweed (R. japonica var japonica) is one of the world's 100 worst invasive species, causing crop losses, damage to infrastructure, and erosion of ecosystem services. In the UK, this species is an all-female clone, which spreads by vegetative reproduction. Despite this genetic continuity, Japanese knotweed can colonise a wide variety of environmental habitats. However, little is known about the phenotypic plasticity responsible for the ability of Japanese knotweed to invade and thrive in such diverse habitats. We have used attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy, in which the spectral fingerprint generated allows subtle differences in composition to be clearly visualized, to examine regional differences in clonal Japanese knotweed.Entities:
Keywords: Ecosystem; Epigenomics; FTIR spectroscopy; Invasive species; Japanese knotweed; Physiological adaptation; Plants; Principal component analysis; Spectrum analysis; Support vector machine
Mesh:
Substances:
Year: 2021 PMID: 34753418 PMCID: PMC8579538 DOI: 10.1186/s12870-021-03293-y
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 6Soil parameters for each site, with error bars showing standard error; a) percentage mass lost on ignition (LOI), b) % water loss, c) pH, d) plant available phosphorus, e) carbon to nitrogen ratio (C:N). Statistical significance was calculated using a Kruskal-Wallis followed by a post hoc test using the criterium Fisher’s least significant difference (LSD) to determine where the difference lies, signified by lowercase letters above the bars. Within each graph, all bars which share letters are not significantly different from each other. Data are mean +/− standard errors. pH, n = 9; C:N n = 9; % water loss mean, n = 6; % loss on ignition mean, n = 6; plant available phosphorus mean, n = 3 except for EDB and ESB where n = 9
Fig. 1(a) Class means raw and (b) class means pre-processed (SG smoothed and vector normalised) IR spectra in the fingerprint region (1800–900 cm− 1) grouped by the different regions where knotweed samples were collected (NEE: North East England, NWE: North West England, WS: West Scotland); (c) class means raw and (d) class means pre-processed (SG smoothed and vector normalised) spectra in the fingerprint region (1800–900 cm− 1) grouped by the different sites where knotweed samples were collected (Scotland: SRC, SOM, SLM, SAP; North West England: ESA, ESB; North East England: EDB)
Fig. 2(a) PCA scores, (b) PCA-LDA canonical scores and (c) SVM class predicted probability for the IR spectral dataset according to different regions where knotweed samples were collected (NEE: North East England, NWE: North West England, WS: West Scotland). Numbers inside parenthesis indicate the percentage of explained variance in each PC. Each spectral point in these scores plots represents a single spectral acquisition
Quality parameters for spectral classification based on different regions. The predictive performance of PCA-LDA towards the external test set was relatively poor. However, SVM performed well in training and test sets, indicating that knotweed leaf samples can be differentiated by region based on their IR spectral profile
| Algorithm | Class | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| PCA-LDA | North East England | 63% | 30% | 96% |
| North West England | 62% | 37% | 87% | |
| West Scotland | 67% | 88% | 47% | |
| SVM | North East England | 100% | 100% | 100% |
| North West England | 98% | 95% | 100% | |
| West Scotland | 98% | 100% | 97% |
Fig. 3Difference between mean support vector spectra for (a) North East England (+ coefficients) and North West England (− coefficients), (b) North East England (+ coefficients) and West Scotland (− coefficients), (c) North West England (+ coefficients) and West Scotland (− coefficients). The main wavenumbers responsible for class differentiation between the three regions are labelled, and have been used to identify spectral markers
Biomolecular markers for class differentiation between the three regions (Scotland, North West England and North East England) and each site (SOM, SAP, SRC, SLM, ESA, ESB and EDB). Spectral markers were found by investigating the difference-between-mean support vectors spectra found by SVM, and linked to the biomolecules associated with each wavenumber from published literature
| Comparison | Wavenumber/ cm | Tentative Molecular Assignment | Reference |
|---|---|---|---|
| NEE and NW | 1736 | C=O stretching [lipids] | [ |
| 1643 | C=O stretching [Amide I] | [ | |
| 1605 | [ | ||
| 1546 | Amide II: [protein N–H bending, C–N stretching], α-helical structure | [ | |
| 1466 | CH2 bending in lipid | [ | |
| 1446 | aromatic ring stretch vibrations, tannins | [ | |
| 1405 | CH3 asymmetric deformation | [ | |
| 1385 | Ring stretching vibrations mixed strongly with CH in-plane bending | [ | |
| 1158 |
| [ | |
| 1034 | C-O stretch, tannins | [ | |
| 1015 | ν (CO), ν (CC), δ (OCH), ring in pectin | [ | |
| 964 | C-O deoxyribose, C-C | [ | |
| NEE and WS | 1725 | C=O stretching band mode of the fatty acid ester | [ |
| 1662 | Amide I, or fatty acid esters | [ | |
| 1648 | Amide I | [ | |
| 1608 | aromatic ring stretch vibrations, tannins | [ | |
| 1586 | Amide II | [ | |
| 1542 | Amide II | [ | |
| 1531 | Amide II | [ | |
| 1446 | aromatic ring stretch vibrations, tannins | [ | |
| 1530 | C=N adenine, cytosine | [ | |
| 1014 | phosphodiester stretching bands [symmetrical and asymmetrical] | [ | |
| NWE and WS | 1725 | C=O stretching band mode of the fatty acid ester | [ |
| 1678 | Stretching C=O vibrations that are H-bonded [changes in the C=O stretching vibrations could be connected with destruction of old H-bonds and creation of the new ones] | [ | |
| 1662 | Amide I, or fatty acid esters | [ | |
| 1445 | lipids | [ | |
| 1397 | CH3 symmetric deformation | [ | |
| SRC and others | 1748 | C=O stretching vibration of alkyl ester, pectin | [ |
| 1728 | ν (C=O) ester, cutin | [ | |
| 1678 | Stretching C=O vibrations that are H-bonded [changes in the C55O stretching vibrations could be connected with destruction of old H-bonds and creation of the new ones] | [ | |
| 1651 | phenolic compounds/ cutan [aromatic and C=C functional groups] | [ | |
| 1608 | aromatic ring stretch vibrations, tannins | [ | |
| 1542 | Amide II | [ | |
| 1455 | C-O-H | [ | |
| 1443 | [ | ||
| SLM and others | 1755 | lipid | [ |
| 1735 | C=O stretching, the phenolic compound ellagic acid/ the secondary metabolite quercetin | [ | |
| 1512 | ν (C-C) aromatic (conjugated with C=C phenolic compounds | [ | |
| 1481 | symmetric deformation NH2 +, glyphosateX | [ | |
| 1466 | CH2 bending in lipid | [ | |
| SOM and others | 1755 | lipid | [ |
| 1736 | lipids | [ | |
| 1481 | symmetric deformation NH2 +, glyphosateX | [ | |
| 1466 | CH2 bending in lipid | [ | |
| 1161 | carbohydrate; stretching vibrations of hydrogen-bonding C–OH groups (found in serine, threonine and tyrosine residues of cellular proteins); cellulose | [ | |
| 1103 |
| [ | |
| SAP and others | 1755 | lipid | [ |
| 1736 | lipid | [ | |
| 1481 | symmetric deformation NH2 +, glyphosateX | [ | |
| 1466 | CH2 bending in lipid | [ | |
| 1103 |
| [ | |
| ESA and others | 1755 | lipid | [ |
| 1732 | lipid; fatty acid esters; hemicellulose | [ | |
| 1647 | amide I; pectin | [ | |
| 1512 | ν(C=C) in lignin, carotenoid or protein | [ | |
| 1481 | symmetric deformation NH2 +, glyphosateX | [ | |
| 1466 | aromatic ring stretch vibrations, tannins | [ | |
| ESB and others | 1755 | lipid | [ |
| 1736 | C=O stretching [lipids] | [ | |
| 1481 | symmetric deformation NH2 +, glyphosateX | [ | |
| 1466 | CH2 bending in lipid, or aromatic ring stretch vibrations, tannins | [ | |
| EDB and others | 1728 | ν (C=O) ester, cutin | [ |
| 1446 | aromatic ring stretch vibrations, tannins | [ | |
| 1408 | CH3 deformation, | [ |
Quality parameters for spectral classification based on different sites. Separation by PCA-LDA was relatively poor. However, SVM performed much better, indicating that knotweed leaf samples can be differentiated by the site at which they were collected using this method
| Algorithm | Class | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| PCA-LDA (10 PCs, 91% explained variance) | SRC | 85% | 80% | 90% |
| SLM | 70% | 57% | 83% | |
| SOM | 60% | 24% | 97% | |
| SAP | 62% | 35% | 89% | |
| ESA | 73% | 52% | 95% | |
| ESB | 56% | 20% | 92% | |
| EDB | 65% | 43% | 88% | |
| SVM | SRC | 100% | 100% | 100% |
| SLM | 100% | 100% | 100% | |
| SOM | 99% | 98% | 100% | |
| SAP | 100% | 100% | 100% | |
| ESA | 100% | 100% | 100% | |
| ESB | 97% | 94% | 100% | |
| EDB | 99% | 100% | 99% |
Fig. 4SVM class predicted probability for the IR spectral dataset according to different sites where knotweed samples were collected (Scotland: SRC, SOM, SLM, SAP; North West England: ESA, ESB; North East England: EDB). The clear separation indicates that the knotweed samples can be differentiated by the site at which they were collected
Fig. 5Difference between mean support vector spectra for (a) SRC (+ coefficients) and others (− coefficients), (b) SLM (+ coefficients) and others (− coefficients), (c) SOM (+ coefficients) and others (− coefficients), (d) SAP (+ coefficients) and others (− coefficients), (e) ESA (+ coefficients) and others (− coefficients), (f) ESB (+ coefficients) and others (− coefficients), and (g) EDB (+ coefficients) and others (− coefficients). These comparisons can be used to identify the key wavenumbers responsible for the differences between sites, which have been labelled above, and can be used to find spectral biomarkers
Fig. 7(a) PCA scores and (b) loadings for soil data (abbreviations define sites where samples were collected, Scotland: SRC, SOM, SLM, SAP; North West England: ESA, ESB; North East England: EDB). The North East England soil sample, EDB, has a high C:N ratio, and a lower pH than the other samples. EDB soil was found to be naturally different from the others on the PCA scores plot, with a greater separation in the Y-plane (PC2). ESB has a higher C:N than the other sites. SOM and SAP have high phosphorus, water loss, and LOI (organic carbon) compared with SRC, SLM, ESA, ESB, and EDB. SLM was a mixed sample, sharing similar soil traits with ESA and SRC. Note: An extreme sample was removed. SAP3 was a non-homologous urban environment and one of the three soil samples was an outlier. The bar graphs in Fig. 6a and b show high standard deviation for loss on ignition and percentage water loss for SAP, due to this sample