| Literature DB >> 27617034 |
Mike Robert Allwright1, Adrienne Payne1, Giovanni Emiliani2, Suzanne Milner1, Maud Viger1, Franchesca Rouse1, Joost J B Keurentjes3, Aurélie Bérard4, Henning Wildhagen5, Patricia Faivre-Rampant4, Andrea Polle5, Michele Morgante6, Gail Taylor1.
Abstract
BACKGROUND: Second generation (2G) bioenergy from lignocellulosic feedstocks has the potential to develop as a sustainable source of renewable energy; however, significant hurdles still remain for large-scale commercialisation. Populus is considered as a promising 2G feedstock and understanding the genetic basis of biomass yield and feedstock quality are a research priority in this model tree species.Entities:
Keywords: Genetics; Leaf area; Lignocellulosic; Salicaceae; Short rotation coppice (SRC); Yield
Year: 2016 PMID: 27617034 PMCID: PMC5017058 DOI: 10.1186/s13068-016-0603-1
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Sites of Origin (SO) for P. nigra association mapping population at Northington, UK
| SO | Nation |
| Latitude°N | Longitude°E |
|---|---|---|---|---|
| Basento | Italy | 16 | 40.5 | 16.4 |
| Paglia | Italy | 21 | 42.8 | 11.8 |
| Ticino-North | Italy | 56 | 45.3 | 9.0 |
| Ticino-South | Italy | 37 | 45.2 | 9.1 |
| Bonny | France | 33 | 47.6 | 2.8 |
| Dranse | France | 35 | 46.4 | 6.5 |
| Drome 1 | France | 55 | 44.7 | 5.4 |
| Drome 6 | France | 53 | 44.8 | 4.9 |
| Erstein | France | 13 | 48.4 | 7.7 |
| Guilly | France | 31 | 47.8 | 2.3 |
| Ramieres | France | 37 | 44.7 | 4.9 |
| Rhinau | France | 19 | 48.3 | 7.7 |
| Loire | France | 44 | 46.4 | 3.2 |
| Strasbourg | France | 18 | 48.6 | 7.8 |
| Taubergiessen | France | 4 | 48.3 | 7.7 |
| Val Allier | France | 134 | 46.4 | 3.3 |
| Ebro-Alfranca | Spain | 24 | 41.6 | 1.0 |
| Ebro-Novillas | Spain | 24 | 41.9 | 1.4 |
| Kuhkopf | Germany | 33 | 49.8 | 8.5 |
| Netherlands | Netherlands | 23 | 52.1 | 5.7 |
| Individuals | France (2), Italy (1), Hungary (1) | 4 | – | – |
Subpopulation names are given in the first column followed by the country within which they are located. The number of individual genotypes within each subpopulation is provided (N) and their mean latitudinal and longitudinal coordinates for statistical analysis and calculation of pairwise geographic distances. “Individuals” are unique genotypes from outside of the given subpopulations
Fig. 1Map illustrating the nations and major river locations from which the P. nigra association population is drawn and the colours employed to illustrate these nations in subsequent figures
Fig. 2Histograms illustrate trait frequency distribution following Box-Cox transformation for a Estimated biomass yield 2013 (EB-13); b Epidermal cell number per leaf (CNPL-13) and c Saccharification potential (glucose yield) 2012 (SP-12)
Fig. 3Box plots depict range, interquartile range, median and mean (cross) for a Epidermal cell number per leaf 2013 (CNPL-13); b Estimated oven-dry biomass yield 2013 (EB-13); c Saccharification potential (glucose release) 2012 (SP-12); d Leaf area 2013 (LA-13); e Epidermal cell area 2013 (CA-13); f Stomatal index 2013 (SI-13)
Fig. 4Pairwise trait correlations are visualised with line colours and widths conferred according to the strength and direction of Pearson’s correlation coefficient (r) between trait pairs. Non-significant correlations are depicted with grey, point 1 lines. Significant positive and negative correlations (p < 0.05) are depicted with point 2 lines coloured light green or light red, respectively. Strong positive and negative correlations (r > 0.5) are depicted with point 3 lines coloured dark green or dark red, respectively. Very strong positive correlations (r > 0.8) are also shown in dark green with point 4 lines
Fig. 5Trait heritabilities show significant positive regression with their correlation coefficients (r) for a longitude of origin (r 2 = 0.643) but not b latitude of origin (r 2 = 0.112)
Fig. 6Satellite map of P. nigra association population subpopulation locations and their mean proportional cluster allocations from STRUCTURE for a K = 2 and b K = 7
Fig. 7Genetic distance matrix (pairwise FST) between 20 subpopulations of P. nigra association population. FST values are shaded according to magnitude (white to dark grey) with Italian subpopulations in purple; French in orange; Spanish in red; German in blue and Netherlands in green
Mantel tests reveal IBD and IBA in European P. nigra
| Mantel Test | Hypothesis | Corr. coefficient ( |
|
|---|---|---|---|
| (Gen, Geog) | – | 0.855 | <0.001 |
| (Gen, Pheno) | – | 0.385 | <0.001 |
| (Gen, Geog|Pheno) | IBD | 0.844 | <0.001 |
| (Gen, Pheno|Geog) | IBA | 0.304 | 0.001 |
Reports correlation coefficient (r) and p value (1000 permutations) for full Mantel tests investigating relationship between genetic and geographic (Gen, Geog) and genetic and phenotypic (Gen, Pheno) distance matrices as well as partial Mantel tests for isolation by distance (Gen, Geog|Pheno) and isolation by adaptation (Gen, Pheno|Geog)
Number of significant trait-SNP associations under all models
| Trait | Model I | Model II | Model III | Model IV | Model V | Model VI |
|---|---|---|---|---|---|---|
| EB-11 | 925 | 0 | 27 | 0a | 0 | 0 |
| Height-11 | 600 | 1 | 6 | 1 | 1a | 0 |
| EB-12 | 1492 | 0 | 56 | 0 | 0a | 0 |
| Height-12 | 1385 | 0 | 26 | 0 | 0a | 0 |
| EB-13 | 1750 | 0 | 17 | 0 | 0a | 0 |
| Height-13 | 1517 | 1 | 8 | 1 | 1a | 1 |
| BA-13 | 1690 | 0 | 21 | 0 | 0a | 0 |
| SC-13 | 334 | 0 | 8 | 0a | 0 | 0 |
| LA-13 | 2803 | 1 | 162 | 2 | 0a | 0 |
| SLA-13 | 157 | 0a | 42 | 0 | 0 | 0 |
| CA-13 | 321 | 0 | 0a | 0 | 0 | 0 |
| CNPL-13 | 2908 | 2 | 146 | 3 | 1a | 0 |
| SD-13 | 705 | 0a | 10 | 0 | 0 | 0 |
| SI-13 | 99 | 0a | 18 | 0 | 0 | 0 |
| SP-12 | 1 | 0 | 1 | 0a | 0 | 0 |
Number of significant trait-SNP associations at α < 8.79 × 10−6 under 6 possible models: (1) simple GLM (no genetic structure correction); (2) GLM with seven significant principal components of neutral genetic variation; (3) GLM with Q-matrix (K = 2) from STRUCTURE; (4) MLM with EMMA kinship matrix; (5) MLM with EMMA kinship and Q-matrix; (6) MLM with EMMA kinship and significant principal components of genetic variation. aIndicates the optimal model selected by comparison of log-likelihoods using BIC
Fig. 8QQ and Manhattan plots for the Q + K (optimal) models for the 3 traits with SNPs reaching genome-wide significance. Red and blue lines on Manhattan plots illustrate genome wide (α < 8.79 × 10−6) and putative (α < 1.76 × 10−4) significance levels, respectively. a QQ plot for Height-11 associated SNP on chromosome 7; b Manhattan plot for Height-11 association; c QQ plot for Height-13 associated SNP on chromosome 4; d Manhattan plot for Height-13 associated SNP; e QQ plot for CNPL-13 associated SNP on chromosome 13; f Manhattan plot for CNPL-13 associated SNP
Significant trait-SNP associations under optimal model at three significance levels
| Trait | Model | 5 % (α < 8.79 × 10−6) | 10 % (α < 1.76 × 10−5) | Putative (α < 1.76 × 10−4) |
|---|---|---|---|---|
| EB-11 | K (IV) | 0 | 0 | 1 |
| Height-11 | Q + K (V) | 1 | 2 | 3 |
| EB-12 | Q + K (V) | 0 | 0 | 2 |
| Height-12 | Q + K (V) | 0 | 0 | 1 |
| EB-13 | Q + K (V) | 0 | 0 | 0 |
| Height-13 | Q + K (V) | 1 | 1 | 1 |
| BA-13 | Q + K (V) | 0 | 0 | 0 |
| SC-13 | K (IV) | 0 | 0 | 2 |
| LA-13 | Q + K (V) | 0 | 0 | 2 |
| SLA-13 | P (II) | 0 | 0 | 3 |
| CA-13 | Q (III) | 0 | 0 | 5 |
| CNPL-13 | Q + K (V) | 1 | 1 | 6 |
| SD-13 | P (II) | 0 | 0 | 1 |
| SI-13 | P (II) | 0 | 0 | 1 |
| SP-12 | K (IV) | 0 | 0 | 1 |
Number of significant trait-SNP associations under the optimal model for each trait at 3 significance levels: 5 % α = 0.05/5690 (8.79 × 10−6); 10 %) α = 0.1/5690 (1.76 × 10−5); Putative) α = 1/5690 (1.76 × 10−4)
Fig. 9Bar plots of raw effects sizes (with standard error bars) for each trait-associated SNP with genome-wide significance from trait-specific optimal model for a Height-11 associated SNP; b Height-13 associated SNP and c CNPL-13 associated SNP. The x-axis of each plot gives the identity of each allelic variant (MM, MN or NN) with its sample size (n) within the population given in adjacent brackets
Trait-marker associations and candidate genes reaching genome-wide or putative significance under optimal models
| Trait | SNP | Chromosome | Position (bp) |
| Candidate gene | Additional information |
|---|---|---|---|---|---|---|
| EB-11 | SNP_IGA_6_17929363 | 6 | 17,822,770 | 9.55E−05 | POPTR_0006s18990 | CNGC17; ATCNGC17; calmodulin binding/cyclic nucleotide binding/ion channel |
| Height-11 |
|
|
|
|
|
|
| Height-11 | SNP_IGA_6_18338146 | 6 | 18,228,999 | 1.30E−05 | POPTR_0006s19240 | GAE1; UDP-glucuronate 4-epimerase/catalytic+ |
| Height-11 | SNP_IGA_15_11900175 | 15 | 11,834,554 | 1.44E−04 | POPTR_0015s11190 | Unknown protein |
| EB-12 | SNP_IGA_6_8443540 | 6 | 8,388,882 | 9.47E−05 | POPTR_0006s11060 | ATH9 (thioredoxin H-type 9) |
| EB-12 | SNP_IGA_7_993475 | 7 | 987,055 | 1.54E−04 | POPTR_0007s01700 | GLX2-4 (glyoxalase 2-4); hydrolase/hydroxyacylglutathione hydrolase/zinc ion binding |
| Height-12 | SNP_IGA_7_12319871 | 7 | 12,250,804 | 4.70E−05 | POPTR_0007s11900 | Unknown protein# |
| Height-13 |
|
|
|
|
|
|
| SC-13 | SNP_IGA_1_44937224 | 1 | 44,670,745 | 1.05E−04 | POPTR_0001s44200 | ATK3 (ARABIDOPSIS THALIANA KINESIN 3); ATPase/microtubule binding/microtubule motor |
| SC-13 | SNP_IGA_6_7875360 | 6 | 7,824,407 | 1.55E−04 | POPTR_0006s10480 | FER1; ATFER1; ferric iron binding/iron ion binding |
| LA-13 | SNP_IGA_13_111400 | 13 | 110,728 | 7.83E−05 | POPTR_0013s00340 | RCI2A (RARE-COLD-INDUCIBLE 2A)# |
| LA-13 | SNP_IGA_8_2418643 | 8 | 2,403,351 | 8.24E−05 | POPTR_0008s04290 | Unknown protein |
| SLA-13 | SNP_IGA_14_3311885 | 14 | 3,293,256 | 8.14E−05 | POPTR_0014s04150/POPTR_0014s04160 | Unknown protein/PEX11A (PEROXIN 11A) |
| SLA-13 | SNP_IGA_19_2255781 | 19 | 2,244,885 | 1.53E−04 | POPTR_0019s02450 | SWIM zinc finger protein-related |
| SLA-13 | SNP_IGA_6_23541394 | 6 | 23,399,832 | 1.73E−04 | POPTR_0006s24880 | PP2C; protein phosphatase 2C family protein |
| CA-13 | SNP_IGA_6_3818713 | 6 | 3,794,879 | 7.31E−05 | POPTR_0006s05370 | Unknown protein |
| CA-13 | PnCOL2_69 | 4 | 9,356,516 | 1.14E−04 | POPTR_0004s10800 | COL2 (constans-like 2); transcription factor/zinc ion binding# |
| CA-13 | SNP_IGA_1_29550802 | 1 | 29,371,580 | 1.20E−04 | POPTR_0001s30950 | RD21 (responsive to dehydration 21); cysteine-type endopeptidase/cysteine-type peptidase |
| CA-13 | LG_X_35_SNP_325 | 10 | 14,394,839 | 1.51E−04 | POPTR_0010s14950 | BAS1 (PHYB ACTIVATION TAGGED SUPPRESSOR 1); oxygen binding/steroid hydroxylase |
| CA-13 | LG_X_35_SNP_490 | 10 | 14,395,004 | 1.51E−04 | POPTR_0010s14950 | As above |
| CNPL-13 |
|
|
|
|
|
|
| CNPL-13 | SNP_IGA_6_12938719 | 6 | 12,856,743 | 8.11E−05 | POPTR_0006s15470 | Bacterial transferase hexapeptide repeat-containing protein |
| CNPL-13 | SNP_IGA_8_2308644 | 8 | 2,294,048 | 1.07E−04 | POPTR_0008s04110 | AGL62 (Agamous-like 62); DNA binding/transcription factor |
| CNPL-13 | SNP_IGA_6_23601531 | 6 | 23,459,494 | 1.32E−04 | POPTR_0006s24980 | Unknown protein |
| CNPL-13 | SNP_IGA_10_18449865 | 10 | 18,340,999 | 1.37E−04 | POPTR_0010s20920 | Immunophilin, putative/FKBP-type peptidyl-prolyl cis–trans isomerase, putative |
| CNPL-13 | SNP_IGA_7_14212007 | 7 | 14,130,925 | 1.72E−04 | POPTR_0007s14310 | AGL22 (Agamous-like 22); SVP; transcription factor/translation repressor, nucleic acid binding |
| SD-13 | SNP_IGA_6_11135816 | 6 | 11,064,511 | 1.67E−04 | POPTR_0006s13890 | TES (TETRASPORE); microtubule motor |
| SI-13 | SNP_IGA_6_8990445 | 6 | 8,932,413 | 1.24E−04 | POPTR_0006s11720 | DML1 (DEMETER-LIKE 1); DNA N-glycosylase/DNA-(apurinic or apyrimidinic site) lyase/protein binding |
| SP-12 | SNP_IGA_1_31674244 | 1 | 31,482,266 | 1.60E−04 | POPTR_0001s33290 | Zinc finger (DHHC type) family protein |
Under optimal models there are 29 SNPs (representing 25 candidate genes) reaching at least the putative significance level (α < 1.76 × 10−4) of which 4 are significant at p < 0.1 (indicated by a +) and three are significant at p < 0.05 (in italic typeface and indicated by a). Genes putatively associated with one trait whilst significantly associated with another at p < 0.05 are indicated by a #