Literature DB >> 35928405

Genome-wide association analysis and pathway enrichment provide insights into the genetic basis of photosynthetic responses to drought stress in Persian walnut.

Mohammad M Arab1, Patrick J Brown2, Rostam Abdollahi-Arpanahi3, Seyed Sajad Sohrabi4, Hossein Askari5, Sasan Aliniaeifard6, Ali Mokhtassi-Bidgoli7, Mohsen B Mesgaran2, Charles A Leslie2, Annarita Marrano2, David B Neale2, Kourosh Vahdati1.   

Abstract

Uncovering the genetic basis of photosynthetic trait variation under drought stress is essential for breeding climate-resilient walnut cultivars. To this end, we examined photosynthetic capacity in a diverse panel of 150 walnut families (1500 seedlings) from various agro-climatic zones in their habitats and grown in a common garden experiment. Photosynthetic traits were measured under well-watered (WW), water-stressed (WS) and recovery (WR) conditions. We performed genome-wide association studies (GWAS) using three genomic datasets: genotyping by sequencing data (∼43 K SNPs) on both mother trees (MGBS) and progeny (PGBS) and the Axiom™ Juglans regia 700 K SNP array data (∼295 K SNPs) on mother trees (MArray). We identified 578 unique genomic regions linked with at least one trait in a specific treatment, 874 predicted genes that fell within 20 kb of a significant or suggestive SNP in at least two of the three GWAS datasets (MArray, MGBS, and PGBS), and 67 genes that fell within 20 kb of a significant SNP in all three GWAS datasets. Functional annotation identified several candidate pathways and genes that play crucial roles in photosynthesis, amino acid and carbohydrate metabolism, and signal transduction. Further network analysis identified 15 hub genes under WW, WS and WR conditions including GAPB, PSAN, CRR1, NTRC, DGD1, CYP38, and PETC which are involved in the photosynthetic responses. These findings shed light on possible strategies for improving walnut productivity under drought stress.
© The Author(s) 2022. Published by Oxford University Press on behalf of Nanjing Agricultural University.

Entities:  

Year:  2022        PMID: 35928405      PMCID: PMC9343916          DOI: 10.1093/hr/uhac124

Source DB:  PubMed          Journal:  Hortic Res        ISSN: 2052-7276            Impact factor:   7.291


Introduction

Walnut (Juglans regia L.) was domesticated in ancient Persia and is today cultivated for its edible nuts throughout temperate and semi-arid regions from Asia to Europe and the Americas [1]. Iran is a prominent centre of diversity for walnut, and ranks third in in-shell walnut production after China and the United States [2]. Walnut production in Central Asia and across the globe is constrained by abiotic stresses, particularly drought, heat, and salinity [3,4]. Drought is likely the most challenging abiotic stress for walnut, with ever-increasing global water scarcity driving large production losses [5-8]. Therefore, understanding the physiological and molecular mechanisms of drought tolerance in walnuts has become more important worldwide, with more extended drought periods expected in the coming decades. Walnut populations adapted to their native habitats around Iran offer an opportunity to better understand walnut responses to drought stress. So far, most studies conducted in walnut have focused on physiological aspects of drought stress, whereas molecular mechanisms are less well documented [3]. Therefore, a priority task for accelerating walnut improvement is deciphering the molecular genetic basis of drought related traits. Photosynthesis is a key physiological mechanism involved in adaption to abiotic stresses and regulation of plant development [9]. Under drought conditions, photosynthesis decreases due to stomatal closure, decreased CO2 availability in the chloroplast, and decreased carboxylation efficiency [10]. Excess light energy during drought-induced stomatal closure can cause serious damage to the plant through generation of reactive oxygen species (ROS) [11]. Plants have evolved diverse protective strategies to cope with excessive light, including non-photochemical quenching (NPQ), antioxidant production, and the regulation of electron transport to moderate ROS formation or detoxify ROS after they form [12]. Because excess light energy absorbed by chlorophyll can either be dissipated through NPQ or re-emitted as fluorescence [11], chlorophyll fluorescence analysis is a powerful tool for tracking fluxes of light absorption by chlorophyll through the electron transport chain [13]. The “OJIP” test represents a fast and non-destructive analysis of polyphasic chlorophyll fluorescence kinetics that has been employed for quick and precise assessment of biophysical aspects of photosynthesis under abiotic stress [14]. The OJIP test, together with gas exchange measurements, have been used successfully to study the photosynthetic apparatus of various tree crops, under abiotic stress conditions, including walnut [7,8]. Despite extensive physiological studies, the genetic basis underlying variation in walnut photosynthesis under both water-stress and re-watering conditions remains largely unknown. A first-draft reference genome of Persian walnut was released in 2016 (Chandler v1.0 [15]), followed by the development of a high-density Axiom™ J. regia 700 K SNP genotyping array [16] that facilitated advanced genomic studies, including QTL mapping and genome wide association studies (GWAS). These new genomic tools have been extensively used for investigating genomic diversity and association mapping in walnut [17-21]. However, there are few studies on the genetic basis of physiological traits in walnut [3,5]. Recently, two annotated, chromosome-level assemblies of the walnut genome [22,23] have been released, enabling SNP identification at chromosome scale and the application of genomic tools in plant breeding programs. SNP arrays provide reliable and robust markers for a multitude of applications in breeding programs and population genomic studies. However, SNP arrays are species- and population specific and ascertainment bias is one of their main drawbacks [24,25]. SNP array development also requires prior knowledge, and has a relatively high cost [24,25]. On the other hand, genotyping-by-sequencing (GBS) and restriction site associated DNA sequencing (RADseq) approaches have the potential to generate large marker datasets at low cost with minimal ascertainment bias, although large amounts of missing data and heterozygote undercalling are significant drawbacks that can only be partially addressed by imputation [24,25]. Therefore, GBS offers a cost-efficient alternative or complement to high-throughput genotyping arrays for gaining genomic information [25]. GWAS successfully identified many SNPs underlying a wide range of traits in plants. However, for complex or polygenic traits, the statistical power of GWAS for identifying variants of small effect is restricted by the stringent levels set for significance threshold and by insufficient numbers of high-frequency polymorphisms identified in most panels [26,27]. So, many small effect SNP markers are always ignored and most of the genetic variants contributing to the trait remains hidden [28]. Further, since many associated SNPs are noncoding it can be problematic to identify the molecular mechanisms by which they may act. Pathway or gene set enrichment analysis as a complementary method to GWAS can help tackle the aforesaid problems through assessing modules of functionally related genes instead of focusing only on one or a few markers that are most significantly associated [26,27]. Therefore, this approach by pooling information across many genetic variants (SNPs) can identify potentially relevant biological pathways or molecular mechanisms even when individual SNPs fail to reach a stringent significance threshold. In this study, we take advantage of natural variation in local walnut populations of Iran to investigate the genetic control of photosynthetic-related traits under well-watered (WW), water-stressed (WS), and water-recovery (WR) conditions, combining GWAS with network and pathway enrichment analyses. Given that the expression of physiological and photosynthetic traits in Persian walnut is under strong genetic control, we hypothesized that locally adapted Persian walnut populations would express different levels of trait plasticity under water-stress (mild and severe) and re-watering conditions. Our main objectives are: (1) to assess natural genetic variation and phenotypic plasticity in photosynthetic traits in a diverse collection of walnut trees under water-stress and re-watering conditions; (2)to uncover genomic regions contributing to photosynthetic trait variation through GWAS; and finally (3) to identify key pathways and hub genes related to photosynthesis under well-watered, water-stressed and recovery conditions through pathway and network analysis.

Results

Natural variation in photosynthetic traits of Persian walnut populations

From the 150 walnut families collected from major walnut-growing regions of Iran (Table S1), 30 photosynthetic-related traits, largely classified into two main categories (gas exchange and chlorophyll fluorescence measurements), were evaluated in walnut plants grown under control, water-stress, and recovery conditions (Figure S1). A brief description of calculations for each measured phenotype is provided in Table 1.
Table 1

Calculations and definitions of water relations, gas-exchange and fluorescence (Strasser et al., 2000, 2004) parameters measured in the study with their broad classification

Phenotype category Phenotype Calculation Definition
Drought score and water relations
DSDrought scoring system based on the appearance characteristics of seedlings
Drought scoreRWC(fresh weight (FW) – dry weight (DW)) / (turgor weight (TW) – DW)Leaf relative water content (RWC)
Gas-exchange parameters
PnNet photosynthetic rate (μmol CO2 m−2 s−1)
TrTranspiration rate (mmol H2O m−2 s−1)
gs Stomatal conductance (mol H2O m−2 s−1)
Gas-exchangeCiIntercellular CO2 concentration (μmol CO2 mol−1 air)
parametersCaAtmospheric CO2 concentration
WUEintriPn/gsWUEintri in μmol CO2 mmol H2O−1
WUEinstPn/TrWUEinst in μmol CO2 mmol H2O−1
CEPn/CiAn estimate of carboxylation efficiency of Rubisco
Fluorescence parameters
F0F0 = F50μsMinimum Fluorescence, when all PSII RCs are open
FJFJ = F2mFluorescence intensity at the J-step (2 ms)
PrimaryFIFI = F60msFluorescence intensity at the I-step (60 ms)
fluorescenceFMFM = F1sMaximum fluorescence, when all PSII RCs are closed
measurementsFVFV = FM - F0Variable fluorescence
VJVJ = (FJ-F0)/(FM-F0)Variable Fluorescence at the J-step
VIVI = (FI-F0)/(FM-F0)Variable fluorescence at the I-step
FM/F0
Fluorescence ratiosFV/F0Maximum efficiency of photochemistry
FV/FM(FM - F0)/FMMaximum yield of primary photochemistry
M04(F300μs - F0)/(FM - F0)Rate of reaction center closure
N(Area/(FM - F0)) × M0 × (1/VJ)Turn-over number QA reduction eventsbetween time 0 and FM
Ψ0Ψo = ET0/TR0 = 1- VJLikelihood that a trapped exciton can movean electron further than QA
Derived parametersφEoΦEo = ET0/ABS = (1-(F0/FM)) × ΨQuantum yield of electron transport to intersystem electron acceptors (between photosystem II and I)
φDoΦDo = 1- Φp0 = (F0/FM)Quantum yield at time 0 for energy dissipation
ΦpavΦpav = Φp0(1-Vav)Average quantum yield of primary photochemistry
PIABS(RC/ABS) × (φPo/1- φPo)Performance Index of PSII normalized for equal absorption
ABS/RCABS/RC = M0 × (1/VJ) × (1/Φp0)Energy absorption by antenna per reactioncenter (indicator of antenna size for PSII)
Energy fluxparametersTR0/RCTR0/RC = M0 × (1/VJ)Flux of excitons trapped per reaction center: reduction of Pheophytin and QA
ET0/RCET0/RC = M0 × (1/VJ) × Ψ0Energy flux for electron transport per reaction center: beyond QA
DI0/RCDI0/RC = (ABS/RC)-(TR0/RC)Flux ratio of energy dissipation per reaction center
Calculations and definitions of water relations, gas-exchange and fluorescence (Strasser et al., 2000, 2004) parameters measured in the study with their broad classification High phenotypic variation was observed among families for all the traits measured. Most traits revealed a near normal distribution (Figure 1; Figures S2-3). There was significant (P ≤ 0.001) genotypic variation in both categories of photosynthetic traits across treatments, except for some gas exchange parameters (Ci and Ci/Ca under drought recovery) and chlorophyll fluorescence measurements (VI under drought, as well as, FI and FM under recovery) (Tables S2-S5). Significant treatment effects (P ≤ 0.001) on all traits pointed out the expression of phenotypic plasticity under drought stress. Most of the photosynthetic related traits showed substantial reductions. However, φDo, ABS/RC, DI0/RC, WUE, and WUEi were increased for plants grown under drought stress compared to the control plants.
Figure 1

Distribution of the photosynthetic related traits of 6-month-old plants of walnut under well-watered (blue), severe stress (red) conditions and overlap between them (purple) in the first-year experiment. (a) Pn, (b) Ci, (c) Pn/Ci, (d) Ci/Ca, (e) WUEi, (f) RWC, (g) FM/F0, (h) FV/F0, and (i) FV/FM in 140 Iranian walnut families. Traits are indicated on the x-axis and number of families on the y-axis. Dashed vertical lines indicate the mean of each distribution.

Figure 3

Box plots of the regional differentiation of the gas exchange parameters in the 140 walnut families grown in a common garden under well-water and severe water stress conditions in the first-year experiment. (A) Pn, (B) Ci, (C) Tr and (D) WUEi. WW: well-water; WS: water-stress. Different letters indicate statistically significant differences at the level of p < 0.01 (Tukey’s test).

Distribution of the photosynthetic related traits of 6-month-old plants of walnut under well-watered (blue), severe stress (red) conditions and overlap between them (purple) in the first-year experiment. (a) Pn, (b) Ci, (c) Pn/Ci, (d) Ci/Ca, (e) WUEi, (f) RWC, (g) FM/F0, (h) FV/F0, and (i) FV/FM in 140 Iranian walnut families. Traits are indicated on the x-axis and number of families on the y-axis. Dashed vertical lines indicate the mean of each distribution. We observed several significant correlations among photosynthetic traits under drought stress (Figure 2A; Figures S4). For instance, drought stress tolerance index (DS) was negatively correlated with the gas exchange parameters gs, Ci, Tr, and Pn but positively correlated with the FV/FM, RWC, and WUE (Figure 2A). Our results also showed a negative correlation between WUE and Ci, gs, and Tr. The photosynthetic trait correlations under recovery conditions were almost in line with the DSI of the studied traits. Furthermore, we observed significant regional differentiation in gas exchange measurements (Figure 3; Figure S5). The highest WUE under severe drought stress was found in Markazi and Isfahan populations.
Figure 2

Correlation plot of photosynthetic traits and principal component analysis (PCA) plots of phenotypic data and SNP markers. (A) Correlation coefficient of plasticity in trait value (DSI), between all gas exchange and chlorophyll fluorescence phenotypes in the 140 walnut families grown in a common garden under severe water stress condition in the first-year experiment. The colour spectrum, bright blue to bright red represents highly positive to highly negative correlations. Stars in circle indicate the significance of correlations (*P ≤ 0.05, **P ≤ 0.01, and ***P ≤ 0.001). (B) Principal component analysis of photosynthetic traits with the first two components showing variation in plasticity in trait value in the 140 walnut families grown in a common garden under severe water stress condition in the first-year experiment. The traits coloured by red contributed more to the variation explained by PC1 and PC2, than those coloured by blue. (C) PCA of the 140 Persian walnut families using 44 207 GBS-derived SNPs (PGBS). (D) PCA of the 87 Persian walnut mother trees using 40 828 GBS-derived SNPs (MGBS). See Table 1 for the definition of measured traits.

Correlation plot of photosynthetic traits and principal component analysis (PCA) plots of phenotypic data and SNP markers. (A) Correlation coefficient of plasticity in trait value (DSI), between all gas exchange and chlorophyll fluorescence phenotypes in the 140 walnut families grown in a common garden under severe water stress condition in the first-year experiment. The colour spectrum, bright blue to bright red represents highly positive to highly negative correlations. Stars in circle indicate the significance of correlations (*P ≤ 0.05, **P ≤ 0.01, and ***P ≤ 0.001). (B) Principal component analysis of photosynthetic traits with the first two components showing variation in plasticity in trait value in the 140 walnut families grown in a common garden under severe water stress condition in the first-year experiment. The traits coloured by red contributed more to the variation explained by PC1 and PC2, than those coloured by blue. (C) PCA of the 140 Persian walnut families using 44 207 GBS-derived SNPs (PGBS). (D) PCA of the 87 Persian walnut mother trees using 40 828 GBS-derived SNPs (MGBS). See Table 1 for the definition of measured traits. Box plots of the regional differentiation of the gas exchange parameters in the 140 walnut families grown in a common garden under well-water and severe water stress conditions in the first-year experiment. (A) Pn, (B) Ci, (C) Tr and (D) WUEi. WW: well-water; WS: water-stress. Different letters indicate statistically significant differences at the level of p < 0.01 (Tukey’s test). We performed principal component analysis (PCA) on all photosynthetic traits of the 140 families during stress to further explore the key parameters and provide an integrated view of the relationships among traits within populations (Figure 2B; Figures S4). PCA using DSI (value of trait under WS / value of trait under WW) of the studied traits showed that the first five components (PC1–5) cumulatively explained more than 90% of the total variation for the photosynthetic traits across the panel under severe drought stress (Figure 2B). The first principal component (PC1), explaining more than 39% of the total variation, was associated with photosynthetic traits; positively with FV/FM (78%), WUE (58%) and RWC (57%), and negatively with Ci (66%), gs (66%), Tr (65%), and Pn (54%) (Figure 2B). Since PC1 has a positive and high correlation with water relation parameters and FV/FM index it could be viewed as a quality of plant water status and photosynthetic efficiency under severe drought stress. As a result, PC1 can be considered as a drought tolerant component in our studied walnut panel. On the other hand, since previous studies in plants have shown that positively correlated traits with PC1 (FV/FM, WUE and RWC) have higher heritability than other studied traits, they can be used as biomarkers for selection of drought tolerant genotypes in future studies. The second principal component (PC2), explaining more than 15% of the total variation, was positively associated with phenotypic variation of WUE (57%) (Figure 2B).

SNP calling and population structure

From the 95 mother trees genotyped using both the SNP array (MArray) and GBS (MGBS), and the 150 families genotyped through GBS (PGBS), 94, 87, and 136 gave good quality data, respectively, and were used as three separate panels for further genomic analysis. The Array-scored SNPs were categorized into six default groups of Affymetrix Power Tools (APT) according to clustering performance as follows; 1) Poly High Resolution (PHR), 2) Mono High Resolution (MHR), 3) No Minor Homozygote (NMH), 4) Call Rate Below Threshold (CRBT), 5) Off-Target Variant (OTV) and 6) Other as described by Arab et al (2019) [17]. PCA and population structure estimates for each set of panels genotyped through GBS (PGBS and MGBS; Figure 2C and D) divided our panels into four main clusters based on their geographical locations. Data from the Axiom J. regia 700 K SNP array (Arab et al., 2020) also classified mother trees into four main groups. These results confirm that our walnut panels (MArray, MGBS and PGBS) comprise mainly four genetic clusters. Therefore, the optimal number of genetic groups was chosen as four for association mapping studies to control the family structure. LD decayed (shown by r [2] < 0.2) within 10 kb across the genome as described by Arab et al. (2020) [3].

Genome wide association study

After filtering for minor allele frequency (MAF > 5%) and missing rate (< 10%), we obtained three SNP panels including 295 685 polymorphisms (MArray), 40 828 SNPs (MGBS), and 43 607 SNPs (PGBS). For ease of reading and understanding, we divided the GWAS results into six categories, based on the correspondence of the genotyping approach and the studied trait results. Also, for each of the categories, we classified the results into 5 distinct groups based on the experimental conditions (WW, WS and WR) and phenotypic plasticity of traits (DSI = WS/WW*100; DRI = WR/WW*100; see Methods) (Table 2; Table S6).
Table 2

Summary of significant marker-trait associations identified by GWAS analysis using two approaches (FarmCPU and MLMM) for all the photosynthetic traits across six categories (A–F) in well-water (WW), water-stress (WS), water-recovery (WR) conditions and for drought stress index (DSI) and drought recovery index (DRI) of traits as a relative measure

Trait Classification Marker-Trait WW WS WR DSI DRI
(A) Chl fluorescence (Mother trees- Array)Associations6369104123
Unique SNPs504193518
(B) Chl fluorescence (Mother trees- GBS)Associations596432319
Unique SNPs434526235
(C) Chl fluorescence (Progeny-GBS)Associations5114261468
Unique SNPs3810231355
(D) Gas exchange (Mother trees- Array)Associations28152111
Unique SNPs25142101
(E) Gas exchange (Mother trees- GBS)Associations5638131424
Unique SNPs3229101421
(F) Gas exchange (Progeny-GBS)Associations245-8
Unique SNPs244-8
Total associationsAssociations25920488111133
Unique SNPs1901437495108
Associations detected by the FarmCPU approachAssociations7147142432
Unique SNPs5340142427
Associations detected by the MLMM approachAssociations1881577487101
Unique SNPs146115657588
SNPs detected by both approachesTotal SNPs25920488111133
Unique SNPs1991557999115
Summary of significant marker-trait associations identified by GWAS analysis using two approaches (FarmCPU and MLMM) for all the photosynthetic traits across six categories (A–F) in well-water (WW), water-stress (WS), water-recovery (WR) conditions and for drought stress index (DSI) and drought recovery index (DRI) of traits as a relative measure Given the significant (α/n) and suggestive (1/n) thresholds (where n is marker number; see Methods), we identified 578 and 1543 unique SNPs, respectively, for the studied phenotypic traits (Table 2, Table S6, Supplementary data sets S1-S3). Our results found a total of 198 (34%), 228 (40%) and 152 (26%) significant associations for at least one photosynthetic-related trait under all conditions using the MArray, MGBS and PGBS datasets, respectively (Table 2; Figure 4). We also identified a total of 544 (35%), 524 (34%) and 481 (31%) suggestive SNPs associated with all photosynthetic traits under all conditions through the MArray, MGBS and PGBS datasets, respectively (Table S6; Figure S6). One and five of the suggestive SNPs identified were in common between the MArray and MGBS, and the MGBS and PGBS datasets, respectively. These results indicate that the strong significance of these marker-trait associations identified using two different genotyping methods.
Figure 4

Venn diagrams depicting the significant SNPs identified for all the photosynthetic traits across experiments (First and second years) and different conditions through GWAS using different datasets MArray (A), MGBS (B) and PGBS (C) and highlight the concordance between different datasets (D). Well-water: WW, water-stress: WS, water-recovery: WR, drought stress index: DSI, drought recovery index: DRI.

Venn diagrams depicting the significant SNPs identified for all the photosynthetic traits across experiments (First and second years) and different conditions through GWAS using different datasets MArray (A), MGBS (B) and PGBS (C) and highlight the concordance between different datasets (D). Well-water: WW, water-stress: WS, water-recovery: WR, drought stress index: DSI, drought recovery index: DRI. We identified 434 and 176 SNPs associated with chlorophyll fluorescence and gas exchange phenotypes, respectively. Of those, 131, 96, 58, 71, and 78 SNPs linked to chlorophyll fluorescence phenotypes were identified under WW, WS, and WR conditions and phenotypic plasticity of traits (DSI and DRI), respectively (Table 2; Supplementary data set S4). We also found 59, 47, 16, 24, and 30 SNPs associated with gas exchange parameters under WW, WS, and WR conditions and phenotypic plasticity of traits (DSI and DRI), respectively. We observed only a few significant trait-locus associations across experiments, indicating different responses of walnut to drought stress and re-watering. In addition, we detected 1699 suggestive SNPs associated with gas exchange (n = 1104) and chlorophyll fluorescence (n = 565) phenotypes (Supplementary data set S4). The highest number of photosynthetic-associated SNPs was identified under WW (508), followed by WS (407), DSI (221), DRI (300), and WR (263). More details are summarized in Tables S6–S8, and Supplementary data set S5. Significant associations were identified on all the walnut chromosomes. The greatest number of associations was found on chromosome 7 (60 SNPs) and the lowest number of associations was found on chromosome 14 (17 SNPs). We also found SNPs that were associated with multiple traits. A total of 11, 28, and 113 significant (p < 0.05/n) SNPs were associated with more than three, two, and one trait, respectively. Among these, the marker AX-170754326 on chromosome 2 was simultaneously associated with thirteen traits related to the chlorophyll fluorescence. We also identified a significant SNP (S11_15758875) located on chromosome 11 associated with seven traits related to chlorophyll fluorescence. Using the suggestive threshold (p < 1/n), 33, 47 and 95 SNPs were associated with more than five, four and three traits, respectively. In particular, the SNP AX-170754326 was simultaneously associated with twenty traits related to chlorophyll fluorescence. Also, the locus S4_5195821 on chromosome 4 was associated with fourteen chlorophyll fluorescence related traits. Our results are in line with the quantitative (multigenic) nature of drought tolerance and the strong correlation among the studied photosynthetic traits. By further lowering the P-values threshold to 9.95.0 × 10−5 (–log10P = 5), we also observed clusters of linked SNPs associated to a single trait. For example, twenty-three, twenty-three, twenty-two, nineteen and twelve suggestive SNPs on chromosome 4 were associated with DRI of FV, PC2, φDo, FV/FM and FM/F0, respectively. Also, under normal condition, thirty and twenty-nine suggestive SNPs on chromosome 4 were identified for the CE and gs traits. For DSI of PC1, FV/FM, FM/F0, φDo, Ci and RWC were identified twenty-six, twenty-four, twenty-three, seventeen, fifteen and fifteen suggestive SNPs on chromosomes 1, 2, 7, 2, 5 and 8 respectively. Under water stress condition, eighteen, fifteen, fourteen, eleven, and ten suggestive SNPs on chromosome 2 were identified for the FM/F0, ABS/RC, PC1, Fi, and FM respectively. On the other hand, forty-eight, forty-seven, thirty-nine suggestive SNPs on chromosome 8 were linked to Ψ0, φEo and VJ under normal respectively. Therefore, at these positions, a putative major quantitative trait locus (QTL) for studied traits may be located. More information regarding the suggestive SNPs is summarized in Supplementary data sets S6-S7. Manhattan plots and QQ plots for most of the important studied traits under different conditions are shown in Figures 5-8 and Figures S7-S12.
Figure 5

(A) and (B) Circular Manhattan plots (left), and quantile-quantile plots (right) of association analysis using the MGBS dataset and MLMM (Q + K) model for gas-exchange related traits from centre to the outside of plot including; (a) gs_WW21, (b) Pn_WS12, (c) Tr_WS12, (d) WUEi_WS21, and (e) PC2_WS21. The outermost circle shows SNP density in 1 Mb windows for each chromosome where green to red indicates low to high marker density. Black bold line (Y-axis) represents –Log10 P-value. The circles of red dashed lines represent the Bonferroni-corrected significance threshold. Red stars indicate genome-wide significantly associated SNPs. Vertical grey dashed lines are drawn through GWAS findings to indicate multi-trait associations. For QQ plots, X-axis represents expected − log10 (p-value) and Y-axis is observed − log10 (p-value) of each SNPs. Well-water: WW, water-stress: WS.12: severe drought stress in the first year, 21: severe drought stress in the second year.

Figure 8

(A) and (B) Circular Manhattan plots (left), and quantile-quantile plots (right) of association analysis using the PGBS dataset and MLMM (Q + K) model for chlorophyll fluorescence related traits from centre to the outside of plot including; (a) FJ_DRI22, (b) FM_DRI22, (c) FV/FM_DRI22, (d) PIABS_DRI22, and (e) PC2_DRI22. The outermost circle shows SNP density in 1 Mb windows for each chromosome where green to red indicates low to high marker density. Black bold line (Y-axis) represents –Log10 P-value. The circles of red dashed lines represent the Bonferroni-corrected significance threshold. Red stars indicate genome-wide significantly associated SNPs. Vertical grey dashed lines are drawn through GWAS findings to indicate multi-trait associations. For QQ plots, X-axis represents expected − log10 (p-value) and Y-axis is observed − log10 (p-value) of each SNPs. Drought recovery index: DRI. 22: recovery condition in the second year.

(A) and (B) Circular Manhattan plots (left), and quantile-quantile plots (right) of association analysis using the MGBS dataset and MLMM (Q + K) model for gas-exchange related traits from centre to the outside of plot including; (a) gs_WW21, (b) Pn_WS12, (c) Tr_WS12, (d) WUEi_WS21, and (e) PC2_WS21. The outermost circle shows SNP density in 1 Mb windows for each chromosome where green to red indicates low to high marker density. Black bold line (Y-axis) represents –Log10 P-value. The circles of red dashed lines represent the Bonferroni-corrected significance threshold. Red stars indicate genome-wide significantly associated SNPs. Vertical grey dashed lines are drawn through GWAS findings to indicate multi-trait associations. For QQ plots, X-axis represents expected − log10 (p-value) and Y-axis is observed − log10 (p-value) of each SNPs. Well-water: WW, water-stress: WS.12: severe drought stress in the first year, 21: severe drought stress in the second year. (A) Circular Manhattan plots (left), and (B) quantile-quantile plots (right) of association analysis using the MArray dataset and MLMM (Q + K) model for chlorophyll fluorescence related traits from centre to the outside of plot including; (a) DI0/RC_DSI21, (b) FV/F0_WS21, (c) φDo_DSI21, (d) TR0/RC_WS21, and (e) PC3_WR22. The outermost circle shows SNP density in 1 Mb windows for each chromosome where green to red indicates low to high marker density. Black bold line (Y-axis) represents –Log10 P-value. The circles of red dashed lines represent the Bonferroni-corrected significance threshold. Red stars indicate genome-wide significantly associated SNPs. For QQ plots, X-axis represents expected − log10 (p-value) and Y-axis is observed − log10 (p-value) of each SNPs. Water-stress: WS, water-recovery: WR, drought stress index: DSI. 21: severe drought stress in the second year and 22: recovery condition in the second year. (A) and (B) Circular Manhattan plots (left), and quantile-quantile plots (right) of association analysis using the MGBS dataset and MLMM (Q + K) model for chlorophyll fluorescence related traits from centre to the outside of plot including; (a) ABS/RC_WS21, (b) DI0/RC_WW21, (c) FV/FM_DSI21, (d) V The outermost circle shows SNP density in 1 Mb windows for each chromosome where green to red indicates low to high marker density. Black bold line (Y-axis) represents –Log10 P-value. The circles of red dashed lines represent the Bonferroni-corrected significance threshold. Red stars indicate genome-wide significantly associated SNPs. Vertical grey dashed lines are drawn through GWAS findings to indicate multi-trait associations. For QQ plots, X-axis represents expected − log10 (p-value) and Y-axis is observed − log10 (p-value) of each SNPs. Well-water: WW, Water-stress: WS, water-recovery: WR, drought stress index: DSI. 21: severe drought stress in the second year and 22: recovery condition in the second year. (A) and (B) Circular Manhattan plots (left), and quantile-quantile plots (right) of association analysis using the PGBS dataset and MLMM (Q + K) model for chlorophyll fluorescence related traits from centre to the outside of plot including; (a) FJ_DRI22, (b) FM_DRI22, (c) FV/FM_DRI22, (d) PIABS_DRI22, and (e) PC2_DRI22. The outermost circle shows SNP density in 1 Mb windows for each chromosome where green to red indicates low to high marker density. Black bold line (Y-axis) represents –Log10 P-value. The circles of red dashed lines represent the Bonferroni-corrected significance threshold. Red stars indicate genome-wide significantly associated SNPs. Vertical grey dashed lines are drawn through GWAS findings to indicate multi-trait associations. For QQ plots, X-axis represents expected − log10 (p-value) and Y-axis is observed − log10 (p-value) of each SNPs. Drought recovery index: DRI. 22: recovery condition in the second year.

Candidate gene identification for significant SNPs

Candidate genes underlying each measured trait were selected, based primarily on the significant and suggestive SNPs within the gene or in the flanking regions (Tables 3-4). A comprehensive list of identified genes is provided in Supplemental data sets S6-S7. Our BLASTX results showed out of 578 and 1543 significant and suggestive SNPs identified by GWAS, 67 (11%) and 204 (13%) SNPs were located inside the gene, respectively (Supplementary data set S6). When we searched 20 kb windows around the significant and suggestive SNPs associated with photosynthetic traits, 382 (66%) and 1043(68%) SNPs were functionally annotated based on the best/top BLAST alignments for each SNP (Supplementary data set S7). Most of the significant or suggestive SNPs located within or nearby the genes which were involved in the regulation of photosynthesis and drought tolerance. More details are described in supplementary file.
Table 3

Functional annotations of the associated SNPs located within the candidate genes for gas exchange parameters measured under drought stress and recovery in the first and second year experiments

MethodModelSNPPositionChromTraitP.valuemafR2SequenceDescriptionBlastTop Hit E-ValueBlast Top HitAccession
MGBSMLMMS10_5 181 3435 181 343Chr10Pn_WS123.06E-110.070.08chaperone protein ClpB3, chloroplastic-like6.00E-09KAF5457676
MArrayFarmCPUAX-17116126825 869 736Chr11CE_DSI122.19E-090.10.05transcription factor UPBEAT16.70E-09XP_018842308
MGBSMLMMS4_20 095 20520 095 205Chr4gs_WR222.19E-080.070.16probable LRR receptor-like serine/threonine-protein kinase At3g475702.90E-09XP_018815270
MGBSMLMMS10_14 590 76514 590 765Chr10RWC_DSI128.37E-080.370.07ethylene-responsive transcription factor ERF054-like3.70E-09XP_018838445
MArrayFarmCPUMLMMAX-1711617826 654 073Chr16CE_WW21gs_WW21PC1_WW21Tr_WW216.48E-072.14E-061.31E-055.11E-053.69E-061.56E-055.16E-050.170.230.200.170.14thylakoid lumenal 17.4 kDa protein, chloroplastic2.10E-11XP_018842827
MArrayFarmCPUMLMMAX-1711959501 291 896Chr13DS_WR22DS_WS21DS_WS129.21E-061.13E-051.91E-052.11E-053.58E-050.380.220.210.2zinc finger protein 4-like2.20E-10XP_018827763
MArrayFarmCPUMLMMAX-1712039065 028 296Chr2gs_WS12CE_WS211.83E-065.65E-062.80E-050.120.190.18ABC transporter A family member 2-like isoform X21.40E-10XP_035543787
PGBSMLMMFarmCPUS1_3 218 6723 218 672Chr1PC3_DSI12CE_DSI123.82E-065.01E-065.32E-062.77E-050.180.110.13serine/threonine-protein phosphatase 4 regulatory subunit 3-like isoform X31.80E-8KAF5464745
MArrayMLMMFarmCPUAX-1712230243 113 160Chr6PC2_WW219.09E-061.12E-050.430.12wall-associated receptor kinase 2-like4.40E-07XP_035546447
MArrayFarmCPUMLMMAX-17096293015 525 358Chr5Tr_DSI12gs_DSI129.34E-062.99E-051.14E-053.92E-050.110.190.15transcription factor bHLH155-like isoform X23.70E-06KAF5482376
MArrayFarmCPUMLMMAX-17115926315 485 089Chr5Tr_DSI12gs_DSI129.34E-062.99E-051.14E-050.110.180.16DEAD-box ATP-dependent RNA helicase 41 isoform X14.90E-09XP_018840854
MGBSFarmCPUS11_4 360 4384 360 438Chr11CE_DRI221.03E-050.340.20protein FAR1-RELATED SEQUENCE 5-like5.10E-10KAF5454297
MArrayFarmCPUMLMMAX-17121067139 073 355Chr1gs_DSI12Tr_DSI121.23E-051.74E-053.12E-050.050.180.17probable chlorophyll(ide) b reductase NYC1,chloroplastic isoform X11.10E-09XP_035539329
MArrayFarmCPUMLMMAX-171595118761 867Chr3Tr_DSI21Pn_DSI21PC1_DSI21gs_DSI211.54E-051.99E-052.53E-055.35E-051.67E-052.41E-054.15E-057.01E-050.230.170.160.040.15pentatricopeptide repeat-containing proteinAt5g12100, mitochondrial4.70E-12KAF5473779
PGBSFarmCPUMLMMS8_12 601 14712 601 147Chr8Tr_WR222.53E-055.12E-050.160.15protein FAR1-RELATED SEQUENCE 5-like3.80E-12XP_035550680
MArrayFarmCPUAX-17121067039 073 346Chr1Tr_DSI122.58E-050.050.17probable chlorophyll(ide) b reductase NYC1, chloroplastic isoform X11.50E-10XP_035539329
MArrayFarmCPUMLMMAX-1712009311 158 989Chr5PC1_WS21Ci_DSI212.74E-053.15E-053.81E-054.05E-050.200.140.14E3 ubiquitin-protein ligase KEG isoform X13.70E-12XP_018818212
MArrayMLMMFarmCPUAX-17059070022 447 120Chr16gs_DRI22Pn_DRI22Tr_DRI222.98E-054.61E-056.67E-054.24E-056.31E-057.95E-050.250.120.150.12serine/threonine-protein kinase PBS1-like1.70E-10XP_018838998
MArrayFarmCPUMLMMAX-1712011192 039 053Chr5PC2_WS213.01E-054.78E-050.110.18ATPase 11, plasma membrane-type-like isoform X22.30E-09XP_018816318
MArrayFarmCPUAX-171163808562 727Chr8PC3_DSI123.05E-050.140.17F-box protein CPR1-like isoform X11.90E-11KAF5461318
MArrayFarmCPUAX-17056252018 034 619Chr4WUE_WS21WUEi_WS123.08E-053.12E-050.140.110.14galactan beta-1,4-galactosyltransferase GALS3-like2.90E-10KAF5472203
PGBSMLMMS14_4 949 1364 949 136Chr14PC2_WS123.46E-050.080.11cytochrome b561 and DOMON domain-containing protein At3g25290-like1.40E-10KAF5446463
MArrayFarmCPUAX-17116686636 403 551Chr7PC3_WS123.87E-050.230.15mitogen-activated protein kinase kinase kinase NPK12.20E-09XP_018840497
MArrayFarmCPUMLMMAX-1711606243 197 511Chr4CE_DSI215.61E-058.85E-050.30.17light-harvesting complex-like protein OHP1, chloroplastic1.00E-10XP_018841680
MArrayFarmCPUAX-17120625315 247 129Chr1WUE_DSI216.38E-050.30.15transcription factor MYB93-like2.70E-10KAF5480862
PGBSMLMMS5_2 941 2822 941 282Chr5PC1_WS21Ci.Ca_WS216.98E-052.29E-050.140.070.09phosphatidylinositol-3-phosphatase myotubularin-1-like isoform X11.20E-10KAF5470068
PGBSFarmCPUS16_22 311 90722 311 907Chr16Pn_DRI227.20E-050.380.12glucose-6-phosphate 1-dehydrogenase, chloroplastic-like8.30E-08KAF5443584
PGBSMLMMS15_531 589531 589Chr15CE_WR227.90E-050.230.12protein EPIDERMAL PATTERNING FACTOR 20.0000016KAF5444497

CHR: chromosome; MAF: minor allele frequency; R2 (%): proportion of variation explained by an SNP.

WW: well-water; WS: water-stress; DSI: drought stress index; WR: water-recovery; DRI: drought recovery index.

12: severe drought stress in the first year, 21: severe drought stress in the second year and 22: recovery condition in the second year.

Table 4

Functional annotations of the associated SNPs located within the candidate genes for chlorophyll fluorescence parameters measured under drought stress and recovery in the first and second year experiments

MethodModelSNPPositionChromTraitP.valuemafR2SequenceDescriptionBlastTop Hit E-ValueBlast Top HitAccession
MGBSMLMMFarmCPUS1_3 020 8913 020 891Chr1FV_WR22FM_WR22FI_WR22FJ_WR229.05E-251.21E-127.61E-115.66E-069.40E-056.79E-051.29E-052.56E-050.090.250.290.300.26probable receptor-like protein kinase At1g305702.90E-09XP_018822983
MGBSMLMMFarmCPUS7_4 088 6614 088 661Chr7FM/F0_DSI21FV/F0_DSI21φPav_WS21PIABS_WS211.07E-193.89E-191.18E-084.25E-100.080.290.280.320.25vacuolar sorting protein 399.30E-10KAF5463988
PGBSMLMMS8_15 239 27315 239 273Chr8PIABS_WW211.00E-130.420.16protein CELLULOSE SYNTHASE INTERACTIVE 3-like8.90E-08XP_018830259
MArrayMLMMAX-17121222625 226 489Chr16FV/F0_WS21FM/F0_WS211.04E-131.04E-130.320.160.16cyclase-associated protein 1-like1.20E-08XP_018807064
PGBSMLMMS13_30 285 76430 285 764Chr13DI0/RC_WW211.58E-090.070.09glucan endo-1,3-beta-glucosidase1.10E-09XP_018824600
MArrayMLMMAX-17117402310 769 441Chr13FI_DSI211.85E-070.450.24ubiquinone biosynthesis protein COQ4 homolog, mitochondrial-like8.20E-11KAF5449503
MArrayFarmCPUAX-1711794547 717 597Chr1FJ_WS21TR0/RC_WS21FI_WS21FM_WS21FV_WS21FV/FM_DSI21FM/F0_WS21FV/F0_WS211.98E-053.24E-053.55E-053.81E-054.77E-055.46E-059.81E-059.81E-050.230.270.260.250.240.250.250.230.23LRR receptor-like serine/threonine-protein kinase RPK25.00E-08XP_018816677
PGBSMLMMS4_1 477 5601 477 560Chr4FM_WS21FI_WS215.78E-076.45E-070.390.080.07BAG family molecular chaperone regulator 41.70E-08KAF5471506
MGBSFarmCPUS12_21 754 47421 754 474Chr12DI0/RC_DSI211.62E-060.280.01pentatricopeptide repeat-containing protein At5g19020, mitochondrial1.00E-08XP_018840533
MArrayFarmCPUAX-1711777651 333 358Chr2DI0/RC_WS21ABS/RC_WS21ABS/RC_DSI21φD0_WS21FV/FM_WS21FV/FM_DSI21DI0/RC_DSI21N_DSI21φPav_WS21PC1_WS212.32E-062.39E-065.31E-068.26E-068.26E-069.33E-069.34E-062.46E-052.97E-053.57E-050.480.340.350.320.30.30.290.290.260.270.26cyclin-dependent kinase G-2-like isoform X16.90E-09XP_018822176
MArrayFarmCPUAX-17117629817 931 021Chr4ET0/RC_WR22VI_WS21φD0_DRI22FV/FM_DRI22N_DSI21FV_DRI223.29E-061.63E-052.42E-053.05E-053.10E-054.87E-050.050.30.240.250.260.260.26leucine-rich repeat receptor-like serine/threonine-protein kinase BAM37.50E-10XP_018814580
MArrayFarmCPUMLMMAX-17120002013 088 532Chr2VI_DRI223.34E-067.97E-060.230.28glucan endo-1,3-beta-glucosidase-like1.20E-08KAF5447740
MArrayFarmCPUMLMMAX-1712182713 371 396Chr6F0_DSI117.35E-061.17E-050.230.19wall-associated receptor kinase-like 14.00E-06XP_018812976
PGBSFarmCPUS6_3 429 3263 429 326Chr6TR0/RC_DRI226.15E-060.160.28wall-associated receptor kinase 2-like1.40E-09XP_035546447
MArrayFarmCPUAX-17121757728 076 649Chr11F0_DSI211.56E-050.440.29glutathione S-transferase-like3.30E-10KAF5455705
MArrayMLMMAX-17118036119 366 994Chr7ABS/RC_WS211.91E-050.140.019-cis-epoxycarotenoid dioxygenase NCED6, chloroplastic3.50E-11XP_018817253
MArrayFarmCPUAX-1711926019 713 679Chr13TR0/RC_DSI21F0_DSI211.934E-055.72E-050.360.300.26pentatricopeptide repeat-containing protein At3g04130, mitochondrial4.40E-10XP_018857296
MArrayFarmCPUMLMMAX-17116617218 425 461Chr1ψ0_DRI222.13E-053.19E-050.490.27beta-amylase-like6.10E-09XP_018846417
MGBSFarmCPUS11_4 360 4384 360 438Chr11φE0_WW21VJ_WW21ψ0_WW21ET0/RC_WW212.66E-052.69E-052.69E-054.34E-050.350.250.260.260.26protein FAR1-RELATED SEQUENCE 5-like5.10E-10KAF5454297
PGBSMLMMFarmCPUS11_10 333 16510 333 165Chr11F0_WS213.13E-053.33E-050.10.24zinc finger BED domain-containing protein DAYSLEEPER-like1.60E-07XP_035547471
MArrayMLMMFarmCPUAX-17119513542 508 998Chr1VJ_WR22ψ0_WR223.20E-053.20E-053.27E-053.27E-050.20.260.24cytochrome P450 94A2-like1.10E-09KAF5482357
MArrayFarmCPUMLMMAX-1712068011 322 756Chr15Fv_WS115.06E-063.81E-050.170.19calcium-dependent protein kinase 32-like7.30E-11XP_018834447
MArrayMLMMAX-17114101411 272 317Chr11TR0/RC_WW213.67E-050.180.01protein FAR-RED IMPAIRED RESPONSE 1-like6.10E-07XP_035546226
MArrayFarmCPUAX-17115975237 390 204Chr10DI0/RC_WW214.37E-050.110.2glucose-6-phosphate 1-dehydrogenase, chloroplastic-like isoform X36.60E-12XP_035550548
MArrayFarmCPUAX-17119957416 297 096Chr8φPav_DRI226.96E-050.430.212-alkenal reductase (NADP(+)-dependent)-like3.30E-09KAF5462701
PGBSFarmCPUMLMMS1_20 893 96420 893 964Chr1FV/FM_WS12F0_WS125.06E-061.21E-059.81E-063.45E-050.110.140.12G-type lectin S-receptor-like serine/threonine-protein kinase LECRK13.30E-11XP_018859482
MArrayMLMMAX-1706957202 303 269Chr11FV_WS121.82E-050.290.14acyl-CoA-binding domain-containing protein 44.70E-09XP_018850062
MArrayMLMMFarmCPUAX-1711974822 063 855Chr10FV/F0_WS121.95E-054.49E-050.280.16protein KINESIN LIGHT CHAIN-RELATED 2-like2.00E-08KAF5457244

CHR: chromosome; MAF: minor allele frequency; R2 (%): proportion of variation explained by an SNP.

WW: well-water; WS: water-stress; DSI: drought stress index; and DRI: drought recovery index.

11: moderate drought stress in the first year; 12: severe drought stress in the first year; 21: severe drought stress in the second year; and 22: recovery condition in the second year.

Functional annotations of the associated SNPs located within the candidate genes for gas exchange parameters measured under drought stress and recovery in the first and second year experiments CHR: chromosome; MAF: minor allele frequency; R2 (%): proportion of variation explained by an SNP. WW: well-water; WS: water-stress; DSI: drought stress index; WR: water-recovery; DRI: drought recovery index. 12: severe drought stress in the first year, 21: severe drought stress in the second year and 22: recovery condition in the second year. Functional annotations of the associated SNPs located within the candidate genes for chlorophyll fluorescence parameters measured under drought stress and recovery in the first and second year experiments CHR: chromosome; MAF: minor allele frequency; R2 (%): proportion of variation explained by an SNP. WW: well-water; WS: water-stress; DSI: drought stress index; and DRI: drought recovery index. 11: moderate drought stress in the first year; 12: severe drought stress in the first year; 21: severe drought stress in the second year; and 22: recovery condition in the second year.

Gene-set enrichment and network analysis

We extracted all genes within 20 kb around the significant and suggestive SNPs. Given a P-value of 9.95 × 10−5 (-log10P = 5), we detected 5907 (MArray), 1588 (MGBS) and 1497 (PGBS) SNPs associated with all the photosynthetic traits under all conditions (Figure 9A). Of these, 61 and SNPs were common between MGBS and PGBS, and 9 SNPs between MArray and MGBS (Figure 9A). We identified 6254 candidate genes adjacent to the SNPs of Array dataset, while 1621 and 1615 candidate genes were detected near the MGBS and PGBS SNPs, respectively (Figure 9B). Of these, 874 candidate genes were identified by more than one GWAS dataset (MArray, MGBS, and PGBS). We also found that 67 genes were common between the three datasets (Figure 9B). On the other hand, 3123, 4098 and 2268 SNPs associated with studied traits under WW, WS and WR conditions were located within or 10 kb upstream or downstream of 3376, 4468 and 2582 genes in the walnut gene annotation v2.0, respectively (Figure 9C and D). The identified genes were characterized, and various KEGG pathways and GO terms were found to be particularly relevant to drought tolerance and photosynthesis.
Figure 9

Complementarity of the three data sets to detect SNP-trait associations and candidate genes. (A) and (B); the number of specific genomic regions (A) and candidate genes (B) detected by each data set for all the photosynthetic traits across experiments (First and second years) and different conditions. (C) and (D); the number of genomic regions (C) and candidate genes (D) detected by both Array and GBS data sets across experiments (First and second years) under well-watered (WW), water-stressed (WS), and water-recovery (WR) conditions.

Complementarity of the three data sets to detect SNP-trait associations and candidate genes. (A) and (B); the number of specific genomic regions (A) and candidate genes (B) detected by each data set for all the photosynthetic traits across experiments (First and second years) and different conditions. (C) and (D); the number of genomic regions (C) and candidate genes (D) detected by both Array and GBS data sets across experiments (First and second years) under well-watered (WW), water-stressed (WS), and water-recovery (WR) conditions. We identified 96, 104 and 96 KEGG pathways using identified genes associated with photosynthetic traits under WW, WS and WR conditions, respectively, of which 15, 26 and 20 were significantly enriched (Figure 10A). Enriched pathways were related to metabolic processes including carbohydrate metabolism, amino acid metabolism, lipid metabolism, energy metabolism and signal transduction. We found that few pathways were shared by genes associated with photosynthetic traits in WW, WS and WR conditions, and several unique pathways were identified for genes associated with photosynthetic traits in each condition (Figure 10A). For instance, a few KEGG pathways related to carbohydrate metabolism including pentose and glucuronate interconversions (KEGG:00040), C5-Branched dibasic acid metabolism (KEGG:00660), and N-Glycan biosynthesis (KEGG:00510), were shared by genes associated with photosynthetic traits in all conditions. We found several pathways, including Ribosome (KEGG:03010), inositol phosphate metabolism (KEGG:00562), tyrosine metabolism (KEGG:00350), pentose phosphate pathway (KEGG:00030), Cysteine and methionine metabolism (KEGG:00270), starch and sucrose metabolism (KEGG:00500), and glycolysis/gluconeogenesis (KEGG:00010) showed an overrepresentation of significant genes associated with photosynthetic traits under WS condition. On the other hand, sphingolipid metabolism (KEGG:00600), sulfur relay system (KEGG:04122), Phosphonate and phosphinate metabolism (KEGG:00440), carotenoid biosynthesis (KEGG:00906), and cyanoamino acid metabolism (KEGG:00460) pathways were unique to genes detected for photosynthetic traits under WR condition. We also identified several others pathways related to lipid and amino acid metabolisms, such as fatty acid biosynthesis (KEGG:00061), nitrogen metabolism (KEGG:00910), tryptophan metabolism (KEGG:00380), and biotin metabolism (KEGG:00780), were specific to genes associated with photosynthetic traits under WW condition (Figure 10A).
Figure 10

Enrichment and network analysis highlighted key pathways and hub genes involved in photosynthetic responses in walnut under drought and recovery conditions. (A) Top KEGG descriptions significantly enriched using genes associated with photosynthetic traits under WW, WS and WR conditions. Different colours in X-axis represent different significant levels of the KEGG pathways. The y-axis represents the KEGG pathways. Sub-networks of the most important hub genes from all the genes associated with photosynthetic traits under (B) well-watered (WW), (C) water-stressed (WS), and (D) water-recovery (WR) conditions are shown.

Enrichment and network analysis highlighted key pathways and hub genes involved in photosynthetic responses in walnut under drought and recovery conditions. (A) Top KEGG descriptions significantly enriched using genes associated with photosynthetic traits under WW, WS and WR conditions. Different colours in X-axis represent different significant levels of the KEGG pathways. The y-axis represents the KEGG pathways. Sub-networks of the most important hub genes from all the genes associated with photosynthetic traits under (B) well-watered (WW), (C) water-stressed (WS), and (D) water-recovery (WR) conditions are shown. As a complementary approach to the KEGG survey, we found 25, 29 and 19 significant GO terms using identified genes, which were associated with photosynthetic traits under WW, WS and WR conditions, respectively (Figure S13). GO terms of the protein metabolic process (GO:0019538), lipid phosphorylation (GO:0046834), phosphatidylinositol phosphate kinase activity (GO:0016307), and carbohydrate biosynthetic process (GO:0016051) under WS condition, as well as, response to photooxidative stress (GO:0080183), membrane lipid metabolic process (GO:0006643), response to oxidative stress (GO:0006979) and antioxidant activity (GO:0016209) under WR condition were the important enriched terms (Figure S13). These results indicated different genetic controls of photosynthetic traits under drought and recovery conditions in Persian walnut. To provide further insight into the interaction in the pathways related to drought-response related genes, we proceeded beyond enriching pathways to identify highly modulated drought and photosynthesis specific sub-networks. The protein–protein interactions (PPI) were identified using the STRING database based on the genes associated with the photosynthetic traits in each condition separately (WW, WS and WR) and the network was subsequently constructed using Cytoscape (Figure 10B-D). The most important identified hub genes were directly or indirectly involved in the photosynthesis and drought stress responses (Figure 10B-D).

Discussion

Given the challenges of climate change and consequent water shortage in Persian walnut production areas, understanding the complex physiological and genetic basis of drought tolerance and adaptation in walnut is becoming increasingly important [3]. Natural variation in photosynthetic traits in Persian walnut is a largely unexploited resource that can provide useful information for breeding or engineering photosynthetic efficiency. In the current study, we explored natural variation in photosynthetic parameters in Persian walnut by combining a common garden approach with GWAS and pathway enrichment analysis. We have identified both genomic regions and pathways that suggest important adaptive mechanisms exist within the walnut populations sampled in this study. Here, we discuss our main findings and their implications for walnut breeding. Photosynthesis is highly susceptible to drought stress and can be studied by measuring gas exchange parameters or analysing chlorophyll fluorescence [29]. In present study, photosynthetic traits varied widely under WW, WS, and WR conditions, suggesting that genetic improvement of these traits is feasible in walnut. Water stress (WS) and subsequent recovery (WR) treatments significantly affected all traits compared to the well-watered control (WW), in agreement with previous reports in walnut [6,8]. Under water stress conditions most families exhibited decreased net photosynthetic rate (Pn), stomatal conductance (gs), transpiration rate (Tr), and intercellular CO2 concentration (Ci), whilst both intrinsic and instantaneous water use efficiency (WUE) increased substantially. These findings are consistent with those of Zhang et al. (2006) [30] and Arab et al. (2020) [3], suggesting that stomatal closure via the ABA-dependent pathway resulted in decreasing water loss and increasing WUE. We also observed high correlation between Pn, Tr, and gs implying regulation of stomatal aperture affected Pn under normal and stress conditions. Our results are consistent with the results of Rosati et al. (2006) [31] who reported that reduced photosynthesis is mainly associated with the stomatal closure, which consequently influences leaf biochemical processes. Furthermore, Our OJIP-test results showed that drought stress caused a significant inhibition of both PSII and beyond PSII electron transport activities. Our finding demonstrated a decrease in the maximal quantum yield of PSII (FV/FM), and decline of Ψ0 and φEo under water stress, especially in drought sensitive genotypes, reflects the accumulation of QA, indicating blockage of electron transfer from QA to QB on the PSII acceptor side, which was reported by Kalaji et al. (2016) [32]. These results are consistent with Liu et al. (2019) [8] who showed that drought stress affects photosynthetic electron transport of walnut plants.

SNP-array and GBS are complementary for understanding genetic basis of photosynthetic response to drought and re-watering in Persian walnut

GWAS identified at least one SNP with a significant or suggestive association with the majority of photosynthetic traits either in the WW, WS and WR conditions or plasticity of the trait across treatments. There was little overlap between MArray, MGBS, and PGBS results (Figure 9A). The first reason is that there is very little SNP overlap between array and GBS datasets (∼5 K SNPs). A second reason is that there are different numbers of individuals (94, 87, and 136) for each analysis. In line with Elbasyoni et al. (2018) [24], the lack of agreement between the GWAS analyses with array and GBS data is likely due to ascertainment bias inherent in the Array-scored SNPs because markers were discovered in a diverse panel independent from our walnut genotypes, as well as low coverage data in GBS. In agreement with Negro et al. (2019) [25], we identified more significant or suggestive SNPs associated with photosynthetic traits using the SNP-array compared to GBS. This result can be due to the higher genome coverage of the SNP-array. Based on our small dataset, we conclude that combining both datasets for GWAS may be expected to boost the likelihood of identifying trait-SNP associations. Our results showed the infrequent occurrence of genomic loci with a significant association for a trait under different treatments (WW, WS and WR conditions). This could be explained by substantial genetic-environment interaction for the majority of the traits. These highlight distinct mechanisms of photosynthetic response to drought and re-watering in Persian walnut. Interestingly, we identified far fewer significant and suggestive loci overlapping with plasticity in studied traits. This suggests independent genetic control of the expression of a trait and its plasticity [3].

Functional annotation revealed the complex and distinct mechanisms of photosynthetic response to drought and re-watering in Persian walnut

The photosynthetic traits characterized here are all quantitative traits, so multiple genomic regions with small effects are expected. We first searched for genes containing significant and suggestive GWAS SNPs (Table 2 and S6), and identified several potential candidates underlying photosynthetic trait variation. Here, we highlight some of the most important (Tables 3-4). A peak SNP on chromosome 10 found to be associated with Pn under water stress condition was located within a CLPB3 gene encoding a chaperone protein ClpB3. Previous study in Arabidopsis has demonstrated that ClpB/Hsp100 family of proteins is involved in chloroplast development [33]. On chromosome 10, a significant SNP located within a gene encoding an ethylene-responsive transcription factor (ERF), was associated with DSI of RWC. Many studies demonstrated that several ERFs bind to both GCC box and dehydration-responsive elements (DRE) and act as a key regulatory hub in plant responses to biotic and abiotic stresses [34]. On chromosome 16, we found a significant SNP associated with CE, gs, Tr and PC1 under well-water condition fell in a gene (TL17) encoding thylakoid lumenal protein. Recently lumen proteins shown to play important roles in regulating thylakoid biosynthesis and the activity of photosynthetic protein complexes, especially photosystem II [35]. On chromosome 1, a significant SNP which has been linked with FV, FM, FI, and FJ under recovery condition, we found to be located inside a gene encoding a receptor-like protein kinase (RLK) which is involved in abiotic stress responses, including calcium signaling and antioxidant defense system [36]. In addition, on chromosome 2, a SNP associated with DI0/RC, ABS/RC, φD0, FV/FM, φPav, and PC1 under drought stress fell in a gene encoding cyclin-dependent kinase (CDK). Previous studies have shown that CDKs as core cell cycle regulators play key role in diverse aspects of plant responses to abiotic stress [36]. Our results also showed that several other SNP markers associated with the studied traits were located within the genes encoding different protein kinases, especially receptor-like kinases (RLKs), calcium-dependent protein kinases (CDPKs), and mitogen-activated protein kinase (MAPK) cascades. As a result, it can be concluded that protein kinases play crucial roles in Persian walnut responses to the both of drought stress and recovery conditions through various signal transduction pathways. On chromosome 4, a significant SNP located within a gene BAG4 encoding protein BAG family molecular chaperone regulator 4, was found to be associated with the FM and FI under water-stress. Among the plant Bcl-2 associated athanogene (BAG) genes, BAG4 has been extensively studied and its overexpression in tobacco plants confers tolerance to abiotic stresses [37]. Three peak SNPs on chromosomes 12, 13 and 3 respectively associated with DSI of DI0/RC, DSI of TR0/RC and F0 and DSI of Pn, Tr, gs and PC1 were located in a gene encoding pentatricopeptide repeat-containing protein, thought to play crucial roles in plant responses to abiotic stresses. Jiang et al. (2015) [38] showed that upregulation of the pentatricopeptide repeat SOAR1 expression in Arabidopsis enhances stomatal closure and plant tolerance to multiple abiotic stresses (drought, salinity and cold). On chromosome 11, a SNP associated with FV under drought stress falls within an ACBP4 gene encoding an Acyl-CoA-binding domain-containing protein that may function as an intracellular carrier of acyl-CoA esters. Du et al. (2013) [39] revealed that overexpression of ACBP2 (ACBP2-OXs) in Arabidopsis confers tolerance to drought by promoting ABA signalling and stomatal closure. On chromosome 6, a wall-associated receptor kinase that may function as a signalling receptor of an extracellular matrix component was found to be linked with the DSI of F0, DRI of TR0/RC and PC2 under well-water condition. Hou et al. (2005) [40] have shown that various cell wall-associated receptor kinase (WAK) gene family members are involved in abiotic stress responses as well they are required for cell elongation and development. Interestingly, we found two SNPs on chromosomes 8 and 11 located within the FRS5 gene that encodes the protein FAR1-RELATED SEQUENCE, which has been linked with multiple traits, including Tr under re-watering condition as well as φE0, VJ, ψ0 ET0/RC, and TR0/RC under well-water condition. On the other hand, of particular interest to us are SNPs/or genes correlated with more than one trait. We searched in windows of ±10 kb around the SNPs associated with multi-traits (Supplementary data set S7). For example, GWAS identified two SNPs on chromosomes 2 linked to the protein FAR1-RELATED SEQUENCE, which individually associated with 10 and 8 traits (some in multiple environments). Recently, Ma and Li (2018) [41] have reported that this protein plays multiple roles in an extensive range of biological processes, including oxidative stress responses, chlorophyll biosynthesis, and starch synthesis. Taken altogether, our functional annotation revealed a high number of identified loci that were either close or within the known genes that play crucial roles in photosynthetic processes, including ABA signalling, regulation of stomatal function, chlorophyll biosynthesis, antioxidant biosynthesis, starch synthesis, lipid metabolism, and transduction of environmental signals. Other candidate genes identified in our study encode transcription factors, such as MYB, WRKY, bHLH, AP2-like ethylene-responsive protein, and HSP70, which are involved in drought responses in plant. Our results show that drought tolerance and recovery involve distinct and diverse mechanisms. Their polygenic nature represents a constraint on development of new trait combinations and needs to be considered when attempting to breed drought tolerance walnut rootstocks and cultivars.

Gene-set enrichment analysis identified key pathways involved in photosynthetic responses to drought and re-watering in Persian walnut

We complement our BLASTx results with gene-set enrichment analysis using KEGG and GO databases to identify molecular mechanisms underlying photosynthetic responses and drought tolerance in walnut. Pathway analysis revealed significantly enriched pathways that were linked to photosynthesis and drought stress responses. Our KEGG results indicate that photosynthesis pathway under both WW and WS conditions as well as carbon fixation in photosynthetic organisms pathway under WW and WR conditions were enriched with identified genes associated with photosynthetic traits. Previous studies showed that photosynthesis rate declined under WS in walnut [8], however, photosynthesis activity is essential for plant acclimation to WS [42]. Also, KEGG results showed that several amino acid and carbohydrate metabolism-related pathways such as tyrosine metabolism, cysteine and methionine metabolism and starch and sucrose metabolism were enriched under WS. In addition, GO terms related to signaling pathways such as phosphatidylinositol phosphate kinase activity and carbohydrate biosynthetic process were the important enriched terms under WS condition. It is well-documented that drought stress causes the accumulation of reactive oxygen species (ROS) in the cells, which in turns mediates multiple biological processes such as signal transduction pathways activation, oxidoreductase activities, and carbohydrate metabolic processes that are implicated in regulating the response to drought stress [43]. Additionally, high ROS doses have negative effects on cell protection [4]. Likewise, our KEGG results showed that ribosome pathway was significantly enriched by genes affecting photosynthetic efficiency under WS. In agreement with these finding, previous study in Arabidopsis thaliana suggest that ribosome biosynthesis highly increased in response to drought and recovery [44]. Plant have evolved several protective adaptations to respond to drought by up regulation of a considerable number of transcripts, therefore, they need a high number of ribosomes to translate these transcripts to proteins [45]. Also, it was suggested that plants need a significant number of ribosomes during the recovery phase to renew or repair their proteins. Moreover, biological functional analysis revealed that genes associated with photosynthetic traits under WS mapped to carbohydrate metabolism GO terms. KEGG finding revealed that sphingolipid metabolism, sulfur relay system, carotenoid biosynthesis, and cyanoamino acid metabolism were enriched by identified genes associated with photosynthetic traits under WR condition. Also, our results showed that the membrane lipid metabolic process, response to oxidative stress and antioxidant activity GO terms were enriched with these genes under WR condition. These findings suggest that plants have begun to grow and repair damaged tissues under WR condition. On the other hand, we found a number of enriched pathways shared between the two experimental conditions including; Galactose metabolism and Amino sugar and nucleotide sugar metabolism between WS and WR conditions, and Fructose and mannose metabolism, photosynthesis, and phenylpropanoid biosynthesis between WW and WS conditions, as well as carbon fixation in photosynthetic organisms between WW and WR conditions. Therefore, in line with previous studies [3,4,6] can be concluded that various metabolic processes are involved in Persian walnut adaptation to drought stress. In agreements with our results it has been suggested that amino acid (i.e. proline) and carbohydrates metabolisms play important roles during the drought stress response in Persian walnut [4,6]. Many studies have found that amino acid metabolism is closely related to drought tolerance [46]. Moreover, several studies have shown that carbohydrate metabolism as one of the key plant processes for absorbing the energy generated during photosynthesis occupies a vital function in drought stress responses in addition to acting as energy sources [47,48]. It was reported that that the increasing of sugars and other compatible solutes, contributes to osmotic adjustment under drought stress [4]. Hence, our results suggest that maybe osmotic adjustment is one of the important mechanisms of response to drought in Persian walnut. Overall, these results suggest that multiple biological processes are involved in drought stress responses as well as amino acid and carbohydrate metabolisms may play important roles in Persian walnut seedling responses to drought stress.

Network analysis highlighted hub genes involved in photosynthetic responses to drought and re-watering in Persian walnut

Among many biological pathways activated in plants under environmental stresses, the photosynthesis and cell growth processes are the most sensitive to drought and recovery. Our network analysis further identified the most important hub genes that are directly or indirectly involved in the complex interaction network linked to photosynthetic and drought-related stress responses. Among these, 15 associated with photosynthetic processes (GAPB, PSAN, and CRR1 under WW condition; PGK1 and NTRC under WS condition; DGD1, CYP38, and PETC under recovery condition), carbon and nitrogen metabolism (PPC1 under WR condition), carbohydrate metabolism (SUS6 and GAPC1 under WS condition),regulation of stomatal movement (PLDALPHA1 under WR condition), and cell growth and development (DXS, RPS13A, and HXK1 under WW and WR conditions) were the most important drought-responsive genes. The GAPA and GAPB genes encode one of the two subunits forming respectively the photosynthetic glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and chloroplast localized glyceraldehyde-3-phosphate dehydrogenase, which play crucial roles in plant metabolism and are involved in abiotic stress response [49]. The CRR1 gene is required for both formation and activity of the chloroplast NAD(P)H dehydrogenase (NDH) complex of the photosynthetic electron transport chain [50]. Naranjo et al. (2016) [51] showed NTRC plays an important role in the control of photosynthetic electron transport in Arabidopsis. The RPS1 gene is required for optimal plastid performance and plays an important role in biosynthesis of thylakoid membrane proteins [52]. The PETC gene is a component of the cytochrome b6-f complex, which mediates electron transfer between PSII and PSI, cyclic electron flow around PSI, and state transitions [53]. Overall, these findings (promising candidate genes) will accelerate future efforts aimed at improving walnut drought tolerance.

Conclusion and perspectives

We characterized photosynthetic traits in diverse walnut families (n = 150) grown in a controlled greenhouse under well-watered, water-stressed, and re-watered conditions. GWAS analysis was performed using over 295 K array-scored and 43 K GBS-scored SNPs. Our main conclusions are the following: (1) Combining two genotyping technologies with different SNP distributions and densities allowed the identification of more marker-trait associations. (2) Identification of different genomic regions under drought stress and drought recovery conditions in walnut suggests that their genetic control is different. (3) Multiple candidate genes previously reported to be associated with photosynthesis and drought tolerance were identified. For the traits obtained from analysis of chlorophyll fluorescence, because of the time and/or labour intensity of data collection, a relatively small number of selected families (60) were evaluated and hence the power of GWAS was reduced. Therefore, a larger walnut population and more SNPs are clearly required to obtain more accurate GWAS results. On the other hand, GWAS of other drought-related traits such as water relations and biochemical parameters in future studies will complement the results of our study. In addition, because of the complex multigenic nature of drought tolerance, the most significant SNPs associated with photosynthetic traits are probably not the true causative loci. (4) The integration of GWAS and enrichment analysis was helpful for identifying promising candidate genes and pathways for further study. Together these findings provide new insight into possible drought tolerance mechanisms in walnut.

Materials and methods

Plant material and experimental design:

Plant materials used for this study consisted of a diverse panel of Persian walnut composed of 150 mother trees (local populations; 50- to 500-year-old open pollinated seedlings) from different geographical regions in Iran. This panel is expected to capture most of the genomic variation within locally-adapted populations (Table S1). Each of the sampled populations were located in a distinct habitat with very diverse environmental conditions (e.g. climate, geology, and topography). GPS coordinates and elevation were used to determine climatic parameters of the sampled areas (WorldClim [54]). At least 60 seeds along with leaf samples were collected from each of the 150 mother trees in 2015. A detailed list of mother trees is presented in Table S1. To evaluation early-lifetime phenotypes of mother trees, and progeny photosynthetic performance under drought condition, collected seeds were established in a common garden. Seeds were stratified to break dormancy and 20 seeds per mother tree (half-sib family) were subsequently planted on 7-liter polyethylene pots (15 cm × 15 cm × 50 cm deep) in a potting mix (2:1:1 (v/v/v), soil:sand:leaf manure). More details are described in the supplementary file and by Arab et al. (2020) [3]. Ten uniform seedlings from each of the 150 families were then selected to initiate a common garden experiment. Two water stress experiments were carried out under greenhouse conditions (25 (±5) °C, 45 (±10) % RH, and photoperiod of ~16 h) over two years at the Research Greenhouses of the Department of Horticulture, University of Tehran, Pakdasht, Tehran, Iran. In the first experiment, 10 uniform seedlings (6-month-old) from each family were randomly assigned to either the well-watered or the water-stressed group. Water stress was applied by withholding and setting three levels of moisture treatment; (i) well-watered (above 75% FC), (ii), mild water-stressed (~40–50% FC), and (iii) severely water-stressed (~25–35% FC) (see Figure S1). In the fall when the buds were dormant, 9-month-old sapling were transplanted into 15-liter polyethylene pots (25 cm × 25 cm × 70 cm deep) containing a mix soil as previous. In the second experiment, 15-month-old sapling were arranged in two groups as first experiment. They then were subjected to water stress by withholding and setting three levels of moisture treatment; (i) well-watered (above 75% FC), (ii), severely water-stressed (~25–35% FC), and (iii) re-watered following severe stress (see Figure S1). Experiments were laid out in a factorial completely randomized design with two factors (family and water treatment) and 2–3 replications.

Drought score index and relative water content (RWC)

Score index was visually graded on a range of 1 to 9 according to the appearance characteristics of the plant (1 to 9 indicates perfectly healthy plants to damaged and dying plants). Fresh leaves samples (ten uniform leaf discs) were collected from each plant, weighed [fresh weight (FW)], and placed in a petri dish filled with distilled deionized water for 24 h. Surface water on the leaves was removed through tissue paper and the leaves were weighed [turgor weight (TW)] and dried at 70°C. After 24 h, dry weight (DW) of samples was recorded and RWC was calculated as follow: (FW-DW)/(TW-DW) × 100 (Table 1).

Photosynthesis measurements

Gas-exchange photosynthetic parameters were measured three times on the 150 families using an infrared gas analysing system, IRGA (LI-6400, Li-Cor, Inc., Lincoln, NE, USA) under severe water-stress level in the first and second experiments, and at the end of recovery period in the second experiment. Measurements were performed on two fully-expanded, upper-canopy leaflets (fifth leaf in basipetal order) per plant, with controlled atmosphere (~400 μmol CO2 mol−1; 25 ± 2°C and ~ 50–60% relative humidity) and photosynthetic active radiation of 1200–1500 μmol photons m−2 s−1. The infra-red gas analyser system (IRGA) was manually adjusted and the levels of CO2 and H2O references were fixed before measurements. For each time-point, measurements were taken on only undamaged leaves during two consecutive days from 9 a.m. till 3 p.m. Net photosynthetic rate (Pn in μmol CO2 m−2 s−1), transpiration rate (Tr in mmol H2O m−2 s−1), stomatal conductance (gs in mol H2O m−2 s−1), and intercellular CO2 concentration (Ci in μmol CO2 mol−1 air) were recorded from two plants per treatment. Pn, Tr and gs were used to calculate instantaneous and intrinsic WUE (WUEinst = Pn/Tr in μmol CO2 mmol H2O−1; WUEintri = Pn/gs in μmol CO2 mol H2O−1, respectively). Pn/C ratio (CE) was taken as an estimate of carboxylation efficiency of Rubisco [55].

Chlorophyll fluorescence measurements in the first experiment

In the first experiment, parameters obtained for the 150 families from the analysis of chlorophyll fluorescence were recorded under mild (3 weeks after drought) and severe (5 weeks after drought) stress. For measuring chlorophyll a fluorescence, we used the same leaf immediately after the gas exchange analysis. The measurements were performed in the greenhouse during two consecutive days from 9 AM until 3 PM, using a portable chlorophyll fluorometer (PAM-2500, Walz, Effeltrich, Germany). After a dark-adapted period (30 min) with dark leaf clips, the minimum fluorescence (F0) was measured using weak modulated irradiation light [<0.1 μmol (photons) m−2 s−1]. Afterwards, a 800 ms saturating flash at 6000 μmol (photons) m−2 s−1 was applied to determine the maximum chlorophyll fluorescence (FM), variable fluorescence (FV = FM- F0) and maximum quantum yield of PSII (FV/FM) using the equation in Table 1.

Chlorophyll fluorescence measurements in the second experiment

In the second experiment, chlorophyll fluorescence parameters of 60 selected extreme families (very drought tolerant to very sensitive) were measured under severe stress (24 days after drought) and recovery (two weeks after re-irrigation). Polyphasic Chl a fluorescence transients (OJIP-test) were measured using a portable fluorometer (Fluorpen FP 100-MAX, Photon Systems Instruments, Drasov, Czech Republic) in the middle part of the sapling in young fully-expanded walnut leaflets with 3 replicates for each treatment (control or drought) after 20 min dark adaptation. To fully ensure that all PSII centers are open, plants were allowed to dark-adapt overnight, and the lights were extinguished in the greenhouse until measurements were concluded pre-dawn (between 1 and 5 a.m.). The fluorescence measurements were taken by a saturating light of ~3000 μmol m−2 s−1. Fluorescence intensities were recorded at four time points: 50 μs (O), 2 ms (J), 60 ms (I), and maximum fluorescence at around 1 s (P). Measurements related to the OJIP test were calculated based on the approaches described by Strasser et al. (2000, 2004) [13,29]. The definition of the measured parameters and detailed calculation formulas are listed in Table 1. More details of the OJIP-test are given in the supplementary file.

Calculation of phenotypic plasticity

The response of genotypes to drought stress for all measured traits was expressed as a relative change in water-stress compared with well-water conditions using the drought stress index (DSI) described in Wójcik-Jagła et al. (2013) [56] and calculated as follows: DSI = (value of trait under water-stressed condition) / (value of trait under well-watered condition) × 100. The drought recovery index (DRI) was calculated using the same equation but substituting the value of trait under recovery condition in place of the value of trait under drought condition.

Statistical analysis

Statistical analyses were performed using Minitab software (Minitab, Inc., State College, PA, USA) and R environment (R Development Core Team, 2017) and the related R packages. Descriptive statistics and normality tests were run on both of the data and their residuals, respectively. The results of descriptive statistics were plotted using ggplot2 R package [57]. Analysis of variance of each experiment was performed separately. We applied general linear models (GLM) to test the effect of families (F), water stress or re-watering treatment (T), and their interaction (F × T) on each photosynthetic-related trait under mild and severe drought stress, and subsequently re-watering. Pearson correlation analysis was conducted via factoextra package [58] in R.

DNA extraction, GBS library construction, and sequencing

In this study, 95 out of 150 mother trees were chosen for retrospective tissue sampling and mother-tree genotyping. In other words, genotype data are only available for 95 of the 150 mother trees. Mature fresh leaves were sampled from each family and mother tree (150 + 95 = 245 in total), immediately frozen in liquid nitrogen, and lyophilized prior to DNA extractions. Genomic DNAs of 95 mother trees and 150 families (pooled leaf tissue from 10 six-month-old individuals per each family) were isolated from young leaflets from 40 mg of dry leaves using the E-Z 96 Plant DNA Kit (Omega Bio-tek; Norcross, GA) according to the manufacturer’s instructions. The DNA concentrations were determined using Qubit dsDNA High Sensitivity (HS) Assay Kits (InVitrogen, Life Technologies), after adjusting to 50 ng/μl 10 μl aliquots (500 ng in total) were used for the library preparation. GBS was performed on 245 walnut samples using a two-enzyme GBS protocol previously described [59] with a few modifications. Briefly, individual DNA samples were digested using HindIII-HF (3 U) and MseI (1.5 U) followed by ligation of individual-specific barcode and common adapters using T4 DNA ligase. Ligated samples were pooled together (96-plex libraries) then amplified by polymerase chain reaction (PCR). Purification of final libraries was performed using magnetic beads (Omega Bio-Tek, GA, USA) for fragment selection in the range of 150-500 bp. Library size distribution was checked using a 2100 BioAnalyzer (Agilent Technologies, CA, USA). Three 96-plex libraries were pooled to generate a single lane (288-plex) sequencing libraries. Libraries were sequenced through single end 90 bp sequencing read length using Illumina HiSeq4000 of the University of California Davis Genome Center (https://dnatech.genomecenter.ucdavis.edu/illumina-library-sequencing/).

SNPs calling, filtering, and imputation

SNP calling was performed using the TASSEL 5 GBS v2 SNP-calling pipeline [60] and the walnut cv. Chandler v2.0 reference genome [22]. 64 bp tags that occurred at least ten times were mapped using Burrows-Wheeler Aligner (BWA) with default parameters [61]. SNPs with average coverage below 1 (n = 22 328) or low coverage across individuals (n = 7885; data in <20% of individuals) were discarded before SNP calling, as were SNPs with extremely high coverage (n = 14 462; log(average coverage) > 2.75; likely repetitive DNA) and SNPs with an excess of heterozygous genotypes (n = 7067; inbreeding coefficient F < −0.05 in the mother trees only; likely paralogous SNPs). The latter two thresholds were established by comparing the proportion of SNPs matching the Axiom array at different thresholds (Figure S14). Since some of our samples represent pooled tissue from multiple individuals, we relaxed the last threshold to discard only the few SNPs that showed the largest excess of heterozygous calls. After SNP calling, the resulting vcf file was filtered to exclude taxa and SNPs with >90% missingness and SNPs with minor allele count <20. Missing genotype calls were then imputed using Beagle 5.0 [62] using default parameters. After imputation, we further filtered the datasets to exclude SNPs with minor allele frequency less than 5% in both mother trees and their offspring (MGBS and PGBS) and SNPs with severe departures from the Hardy–Weinberg equilibrium only in mother trees (MGBS) using PLINK v1.9 software [63].

Genotyping with the Axiom J. regia 700 K SNPs array, and quality control

The 95 mother trees were genotyped at 609658 SNPs evenly distributed throughout the walnut genome using the high-density walnut array from Affymetrix [16] as described by Arab et al (2019) [17].

Single and multi-trait genome-wide association mapping

GWAS was performed for 30 photosynthetic traits under well-water (WW), water-stress (WS) and recovery (WR) conditions. Specifically, the phenotypic data included: (i) average performance of families for each trait under WW, WS and WR conditions; (ii) DSI and DRI; (iii) PC1–PC5 of phenotypic data; and (iv) PCs of DSI and DRI values. Three SNP panels including the array data on 94 mother trees (n = 295 685 SNPs), GBS data on 87 mother trees (n = 40 828 SNPs) and GBS data on 136 families (n = 43 607 SNPs) were used separately for GWAS. A subset of traits was measured on 60 selected families, in which case GWAS was performed using just 60 individuals. Single trait GWAS was conducted by applying two different models implemented in the R package Genome Association and Prediction Integrated Tools (GAPIT v3.0 [64]): (i) the Fixed and Random Model Circulating Probability Unification (FarmCPU [65]), and (ii) the Multiple Loci Linear Mixed Model (MLMM [66]). Population structure (PCA) and genetic relatedness (kinship matrix) were calculated in GAPIT and included in the GWAS models as a random effect to control spurious associations [67]. Population structure analysis is described in detail in the supplementary file. The best number of PCs to include in the GWAS models was determined based on the Bayesian Information Criterion (BIC), as implemented in the “model selection” function of GAPIT and scree-plot of PCA results. Quantile–quantile (QQ) plots were applied to check if the model was correctly fitted. Bonferroni genome-wide thresholds were set to define significant associations in each dataset. The significant Bonferroni P-value thresholds were 1.7E-7, 1.22E-6 and 1.15E-6 for the MArray, MGBS, and PGBS datasets, respectively. The suggestive P-value thresholds (1/number of SNP markers) were 3.38E-6, 2.44E-5, and 2.29E-5 for the MArray, MGBS, and PGBS datasets, respectively. A lower suggestive P-value of 9.99 × 10−5 (-log10P = 5) was also used to detect the suggestive SNP–trait associations for gene set enrichment and network analysis.

Gene annotation and pathway enrichment analysis

Significant and suggestive SNPs identified in the GWAS were used to search for putative candidate genes controlling photosynthetic traits in walnut. Flanking sequences of the significant and suggestive SNPs from both the SNP Array and GBS method were annotated using the National Centre for Biotechnology Information (NCBI; https://www.ncbi.nlm.nih.gov) BLASTX function. A SNP was considered to be located within a gene if it falls within a gene sequence. Then, based on the LD decay in our walnut panel [3], 20-kb windows were drawn around the identified marker-trait associations (10 kb upstream and downstream the SNP position) to search for candidate genes using the annotation v2.0 [22]. Enriched functional annotation clusters of the associated walnut candidate genes were defined using Blast2GO V5.0 tool [68] (E-value ≥1 × 10−5) implemented in Gene Ontology (GO [69]) and Kyoto Encyclopedia of Genes and Genomes (KEGG [70]) databases. The protein–protein interactions (PPI) were identified using the STRING database [71] based on the candidate genes and the network was subsequently constructed using Cytoscape [72]. Furthermore, hub genes were identified in the PPI networks using cytoHubb [73]. Click here for additional data file.
  56 in total

1.  Identification and functional characterization of the BAG protein family in Arabidopsis thaliana.

Authors:  Elena V Doukhanina; Shaorong Chen; Esther van der Zalm; Adam Godzik; John Reed; Martin B Dickman
Journal:  J Biol Chem       Date:  2006-04-24       Impact factor: 5.157

2.  STRING: a database of predicted functional associations between proteins.

Authors:  Christian von Mering; Martijn Huynen; Daniel Jaeggi; Steffen Schmidt; Peer Bork; Berend Snel
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

3.  GAPIT: genome association and prediction integrated tool.

Authors:  Alexander E Lipka; Feng Tian; Qishan Wang; Jason Peiffer; Meng Li; Peter J Bradbury; Michael A Gore; Edward S Buckler; Zhiwu Zhang
Journal:  Bioinformatics       Date:  2012-07-13       Impact factor: 6.937

4.  Combining phenotype, genotype, and environment to uncover genetic components underlying water use efficiency in Persian walnut.

Authors:  Mohammad M Arab; Annarita Marrano; Rostam Abdollahi-Arpanahi; Charles A Leslie; Hao Cheng; David B Neale; Kourosh Vahdati
Journal:  J Exp Bot       Date:  2020-01-23       Impact factor: 6.992

5.  Dihydrodipicolinate reductase-like protein, CRR1, is essential for chloroplast NAD(P)H dehydrogenase in Arabidopsis.

Authors:  Hideyuki Shimizu; Toshiharu Shikanai
Journal:  Plant J       Date:  2007-08-28       Impact factor: 6.417

6.  cytoHubba: identifying hub objects and sub-networks from complex interactome.

Authors:  Chia-Hao Chin; Shu-Hwa Chen; Hsin-Hung Wu; Chin-Wen Ho; Ming-Tat Ko; Chung-Yen Lin
Journal:  BMC Syst Biol       Date:  2014-12-08

7.  Sequencing a Juglans regia × J. microcarpa hybrid yields high-quality genome assemblies of parental species.

Authors:  Tingting Zhu; Le Wang; Frank M You; Juan C Rodriguez; Karin R Deal; Limin Chen; Jie Li; Sandeep Chakraborty; Bipin Balan; Cai-Zhong Jiang; Patrick J Brown; Charles A Leslie; Mallikarjuna K Aradhya; Abhaya M Dandekar; Patrick E McGuire; Daniel Kluepfel; Jan Dvorak; Ming-Cheng Luo
Journal:  Hortic Res       Date:  2019-03-25       Impact factor: 6.793

8.  Association and linkage mapping to unravel genetic architecture of phenological traits and lateral bearing in Persian walnut (Juglans regia L.).

Authors:  Anthony Bernard; Annarita Marrano; Armel Donkpegan; Patrick J Brown; Charles A Leslie; David B Neale; Fabrice Lheureux; Elisabeth Dirlewanger
Journal:  BMC Genomics       Date:  2020-03-04       Impact factor: 3.969

9.  Leveraging GWAS data to identify metabolic pathways and networks involved in maize lipid biosynthesis.

Authors:  Hui Li; Adam Thrash; Juliet D Tang; Linlin He; Jianbing Yan; Marilyn L Warburton
Journal:  Plant J       Date:  2019-03-20       Impact factor: 6.417

10.  High-throughput functional annotation and data mining with the Blast2GO suite.

Authors:  Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Tim D Williams; Shivashankar H Nagaraj; María José Nueda; Montserrat Robles; Manuel Talón; Joaquín Dopazo; Ana Conesa
Journal:  Nucleic Acids Res       Date:  2008-04-29       Impact factor: 16.971

View more
  1 in total

1.  Complete Chloroplast Genomes of 14 Subspecies of D. glomerata: Phylogenetic and Comparative Genomic Analyses.

Authors:  Yongjuan Jiao; Guangyan Feng; Linkai Huang; Gang Nie; Zhou Li; Yan Peng; Dandan Li; Yanli Xiong; Zhangyi Hu; Xinquan Zhang
Journal:  Genes (Basel)       Date:  2022-09-09       Impact factor: 4.141

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.