| Literature DB >> 31166032 |
John Baison1, Amaryllis Vidalis2, Linghua Zhou1, Zhi-Qiang Chen1, Zitong Li3, Mikko J Sillanpää4, Carolina Bernhardsson1,5, Douglas Scofield6, Nils Forsberg1, Thomas Grahn7, Lars Olsson7, Bo Karlsson8, Harry Wu1, Pär K Ingvarsson5,9, Sven-Olof Lundqvist7,10, Totte Niittylä1, M Rosario García-Gil1.
Abstract
Norway spruce is a boreal forest tree species of significant ecological and economic importance. Hence there is a strong imperative to dissect the genetics underlying important wood quality traits in the species. We performed a functional genome-wide association study (GWAS) of 17 wood traits in Norway spruce using 178 101 single nucleotide polymorphisms (SNPs) generated from exome genotyping of 517 mother trees. The wood traits were defined using functional modelling of wood properties across annual growth rings. We applied a Least Absolute Shrinkage and Selection Operator (LASSO-based) association mapping method using a functional multilocus mapping approach that utilizes latent traits, with a stability selection probability method as the hypothesis testing approach to determine a significant quantitative trait locus. The analysis provided 52 significant SNPs from 39 candidate genes, including genes previously implicated in wood formation and tree growth in spruce and other species. Our study represents a multilocus GWAS for complex wood traits in Norway spruce. The results advance our understanding of the genetics influencing wood traits and identifies candidate genes for future functional studies.Entities:
Keywords: Norway spruce; candidate genes; functional trait mapping; genome-wide association mapping; sequence capture; single nucleotide polymorphisms
Mesh:
Year: 2019 PMID: 31166032 PMCID: PMC6852177 DOI: 10.1111/tpj.14429
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 6.417
Phenotypes, latent traits, SNP, SNP feature, frequency and PVE
| Phenotype | Latent Trait | QTL | SNP | Allele | SNP Feature | Frequency | PVE (%) |
|---|---|---|---|---|---|---|---|
| WD | Intercept | 167610 | MA_10435406_13733 | A/G | Downstream variant | 0.71 | 4.64 |
| Slope | 30469 | MA_33109_11804 | A/G | Upstream variant | 0.72 | 4.50 | |
| β2 | 30469 | MA_33109_11804 | A/G | Upstream variant | 0.551 | 4.15 | |
| β3 | 157442 | MA_10432646_63090 | G/A | Upstream variant | 0.567 | 2.43 | |
| EWD | Intercept | 167610 | MA_10435406_13733 | A/G | Downstream variant | 0.545 | 3.38 |
| Slope | 23798 | MA_20321_44812 | C/T | Upstream variant | 0.53 | 0.69 | |
| 70955 | MA_118446_4316 | T/A | Upstream variant | 0.644 | 0.40 | ||
| TWD | Slope | 131698 | MA_10235390_3386 | G/A | Stop gained | 0.672 | 1.58 |
| 160208 | MA_10433411_3386 | T/C | Intron variant | 0.595 | 3.41 | ||
| β2 | 89044 | MA_212523_6278 | T/C | Upstream variant | 0.534 | 3.34 | |
| LWD | Slope | 43797 | MA_62987_13474 | T/C | Missense variant | 0.524 | 1.81 |
| 165481 | MA_10434805_21408 | C/T | Intron variant | 0.588 | 1.21 | ||
| 171223 | MA_10436058_4902 | G/A | Intron variant | 0.712 | 4.03 | ||
| RW | Intercept | 11535 | MA_10694_9101 | A/C | Synonymous variant | 0.545 | 1.95 |
| 112391 | MA_879270_7373 | C/T,A | Stop gained | 0.532 | 1.45 | ||
| 112394 | MA_879384_3894 | C/A | Splice region variant | 0.692 | 2.56 | ||
| Slope | 165481 | MA_10434805_21408 | C/T | Intron variant | 0.521 | 2.66 | |
| β2 | 23808 | MA_20322_28351 | T/G | Synonymous variant | 0.554 | 1.78 | |
| 165481 | MA_10434805_21408 | C/T | Intron variant | 0.533 | 0.18 | ||
| β3 | 23808 | MA_20322_28351 | T/G | Synonymous variant | 0.55 | 1.20 | |
| 165481 | MA_10434805_21408 | C/T | Intron variant | 0.615 | 1.79 | ||
| TRW | Slope | 111057 | MA_817099_1105 | T/A | Missense variant | 0.685 | 1.12 |
| β2 | 33110 | MA_38472_13803 | T/A | Upstream gene variant | 0.657 | 3.23 | |
| 89295 | MA_214776_1624 | G/A | Upstream gene variant | 0.688 | 4.51 | ||
| β3 | 111057 | MA_817099_1105 | T/A | Missense variant | 0.672 | 1.20 | |
| LRW | Intercept | 143628 | MA_10428744_29330 | C/T | Downstream variant | 0.668 | 0.5 |
| β3 | 164772 | MA_10434624_20686 | C/A | Downstream variant | 0.571 | 0.06 | |
| MOE | Slope | 165481 | MA_10434805_21408 | C/T | Intron variant | 0.602 | 1.00 |
| NC | β2 | 145839 | MA_10429444_12692 | G/C | Upstream variant | 0.645 | 3.82 |
| ENC | Slope | 98508 | MA_402880_2045 | A/C | Upstream variant | 0.667 | 0.03 |
| 167610 | MA_10435406_13733 | A/G | Downstream variant | 0.685 | 0.01 | ||
| TNC | Intercept | 95870 | MA_346723_2241 | T/C | Upstream variant | 0.667 | 3.78 |
| 126785 | MA_9447489_687 | A/C | Upstream gene variant | 0.68 | 4.93 | ||
| LNC | Intercept | 143628 | MA_10428744_29330 | C/T | Downstream variant | 0.66 | 3.14 |
| Slope | 143628 | MA_10428744_29330 | C/T | Downstream variant | 0.672 | 4.77 | |
| EP | Intercept | 16868 | MA_15729_40331 | G/T | Intron variant | 0.609 | 3.32 |
| 91242 | MA_246125_1213 | G/A | Synonymous variant | 0.594 | 3.41 | ||
| TP | Intercept | 101203 | MA_462319_4322 | A/C | Upstream gene variant | 0.594 | 1.16 |
| 132014 | MA_10251995_2442 | A/C | Upstream gene variant | 0.601 | 3.22 | ||
| LP | β2 | 162397 | MA_10434007_77578 | C/T | Upstream gene variant | 0.892 | 1.14 |
| EP/LP | Intercept | 51657 | MA_80954_29644 | G/A | Downstream variant | 0.63 | 0.81 |
| 60787 | MA_98424_947 | C/T | Intron variant | 0.655 | 1.80 | ||
| 123639 | MA_8790100_1384 | A/C | Upstream variant | 0.628 | 0.75 | ||
| β2 | 59480 | MA_96191_7122 | A/G | Synonymous | 0.6 | 2.37 | |
| β3 | 117333 | MA_1045136_4310 | T/C | Missense variant | 0.523 | 1.34 | |
| Mass index (growth × density) | Intercept | 166235 | MA_10435002_4986 | G/A | Intergenic variant | 0.533 | 0.65 |
| Slope | 61096 | MA_99004_17108 | G/A | Synonymous variant | 0.66 | 0.01 | |
| 67181 | MA_109804_10278 | G/A | Missense variant | 0.612 | 0.05 | ||
| 1401 | MA_1378_4718 | C/A | Exon/stop gained | 0.588 | 1.19 | ||
| 138744 | MA_10427214_13968 | G/T | Missense variant | 0.58 | 1.80 | ||
| 162397 | MA_10434007_77578 | C/T | Upstream variant | 0.627 | 1.44 | ||
| β2 | 21924 | MA_19222_1789 | A/G | Upstream variant | 0.71 | 1.82 |
SNP: The SNP name was composed of the contig (MA_number) and SNP position on contig. For example, the first SNP MA_1043540_13733 was located on contig MA_1043540 at position 13 733 bp; PVE is the phenotypic variance explained.
SNP modes of inheritance
| Phenotype | QTL | SNP | Allele | 2a | d | d/a |
|---|---|---|---|---|---|---|
| WD | 167610 | MA_10435406_13733 | A/G | 19.63 | 9.45 | 0.96 |
| 30469 | MA_33109_11804 | A/G | 5.19 | 6.22 | 2.40 | |
| 157442 | MA_10432646_63090 | G/A | 4.49 | 2.80 | 1.25 | |
| EWD | 167610 | MA_10435406_13733 | A/G | 5.495 | 2.270 | 0.83 |
| 23798 | MA_20321_44812 | C/T | 4.814 | 5.905 | 2.45 | |
| 70955 | MA_118446_4316 | T/A | 1.989 | 0.966 | 0.97 | |
| TWD | 131698 | MA_10235390_3386 | G/A | 3.849 | −0.636 | −0.33 |
| 160208 | MA_10433411_3386 | T/C | 2.313 | 3.908 | 3.38 | |
| 89044 | MA_212523_6278 | T/C | 0.703 | −0.962 | −2.73 | |
| LWD | 43797 | MA_62987_13474 | T/C | 5.124 | −1.086 | −0.42 |
| 165481 | MA_10434805_21408 | C/T | 4.684 | 1.165 | 0.50 | |
| 171223 | MA_10436058_4902 | G/A | 0.938 | 2.482 | 5.29 | |
| RW | 11535 | MA_10694_9101 | A/C | 0.111 | 0.049 | 0.88 |
| 112391 | MA_879270_7373 | C/T,A | 0.056 | −0.027 | −0.98 | |
| 112394 | MA_879384_3894 | C/A | 0.194 | −0.045 | −0.45 | |
| 165481 | MA_10434805_21408 | C/T | 0.158 | 0.039 | 0.49 | |
| 23808 | MA_20322_28351 | T/G | 0.025 | 0.030 | 2.62 | |
| TRW | 111057 | MA_817099_1105 | T/A | 0.016 | 0.001 | 0.16 |
| 33110 | MA_38472_13803 | T/A | 0.029 | −0.002 | −0.19 | |
| 89295 | MA_214776_1624 | G/A | 0.026 | −0.001 | −0.13 | |
| LRW | 143628 | MA_10428744_29330 | C/T | 0.006 | −0.002 | −0.67 |
| 164772 | MA_10434624_20686 | C/A | 0.002 | 0.003 | 2.90 | |
| MOE | 165481 | MA_10434805_21408 | C/T | 0.376 | 0.101 | 0.53 |
| NC | 145839 | MA_10429444_12692 | G/C | 0.298 | 0.792 | 5.31 |
| ENC | 98508 | MA_402880_2045 | A/C | 4.144 | 1.314 | 0.63 |
| 167610 | MA_10435406_13733 | A/G | 4.695 | −3.033 | −1.29 | |
| TNC | 95870 | MA_346723_2241 | T/C | 0.529 | −0.187 | −0.71 |
| 126785 | MA_9447489_687 | A/C | 0.083 | −0.429 | −10.21 | |
| LNC | 143628 | MA_10428744_29330 | C/T | 0.219 | −0.057 | −0.52 |
| EP | 16868 | MA_15729_40331 | G/T | 0.542 | 0.149 | 0.55 |
| 91242 | MA_246125_1213 | G/A | 0.183 | −0.129 | −1.40 | |
| TP | 101203 | MA_462319_4322 | A/C | 0.469 | −0.199 | −0.85 |
| 132014 | MA_10251995_2442 | A/C | 0.339 | −0.429 | −1.63 | |
| LP | 162397 | MA_10434007_77578 | C/T | 0.127 | −0.071 | −1.11 |
| EP/LP | 51657 | MA_80954_29644 | G/A | 0.081 | 0.062 | 1.49 |
| 60787 | MA_98424_947 | C/T | 0.254 | −0.181 | −1.43 | |
| 123639 | MA_8790100_1384 | A/C | 0.032 | −0.078 | −4.81 | |
| 59480 | MA_96191_7122 | A/G | 0.120 | −0.117 | −1.95 | |
| 117333 | MA_1045136_4310 | T/C | 0.018 | 0.077 | 8.56 | |
| 166235 | MA_10435002_4986 | G/A | 0.138 | 0.013 | 0.19 | |
| MI | 61096 | MA_99004_17108 | G/A | 0.006 | −0.009 | −3.16 |
| 67181 | MA_109804_10278 | G/A | 0.007 | −0.012 | −3.14 | |
| 1401 | MA_1378_4718 | C/A | 0.003 | 0.004 | 2.67 | |
| 138744 | MA_10427214_13968 | G/T | 0.002 | 0.017 | 17.00 | |
| 162397 | MA_10434007_77578 | C/T | 0.025 | −0.010 | −0.79 | |
| 21924 | MA_19222_1789 | A/G | 0.014 | −0.008 | −1.14 |
Calculated as the difference between the phenotype means observed within each homozygous class (2a = |GBB − Gbb|, where Gij is the trait mean in the ijth genotype class).
Calculated as the difference between the phenotypic mean observed within the heterozygous class and the average phenotypic mean across both homozygous classes [d = GBb − 0.5(GBB+Gbb)], where Gij is the trait mean in the ijth genotypic class.
Figure 1EBV trajectories of four wood quality traits over time: (a) wood density, (b) late wood density, (c) annual ring width and (d) late wood ring width. Individual trajectories for each trait are shown in light blue lines and the black line represents the mean trajectory for the phenotype. These trajectories were used to determine the four latent traits of each tree, using linear splines with two knots.
Figure 2(a) Decay of linkage disequilibrium (LD) across all the tagged genomic sequences, the majority being exonic regions. The squared correlation coefficient between loci (r2) is plotted against distance, in base pairs, separating loci. The fitted curve (red) is representative of the trend of decay from the 178 101 SNPs utilized in the association mapping (AM). (b) Decay of LD with distance in base pairs between sites from across 41 contigs with significant associations. (c) Decay of LD across contig MA_96191 that has a significant association for ratio of percentage earlywood vs latewood on which two probes were captured. (d) Decay of LD on contig MA_80033 indicating the variable LD in the genome.
Figure 3Box plot of the estimated genotypic effect on the phenotypes in the study. The significant SNPs associated and each one of the traits have been correlated to give the impact each genotype has on the average of the overall trait
Figure 4The heatmap showing the expression levels (VST values) of spruce candidate genes in different organs and tissues based on data of Nystedt et al. (2013) available at http://congenie.org.
List of the phenotypes, their abbreviations and measurement unit
| Phenotype | Abbreviation | Unit |
|---|---|---|
| Ring wood density | WD | kg m−3 |
| Early wood density | EWD | kg m−3 |
| Transition wood density | TWD | kg m−3 |
| Late wood density | LWD | kg m−3 |
| Ring width | RW | μm |
| Early wood ring width | ERW | μm |
| Transition wood ring width | TRW | μm |
| Late wood ring width | LRW | μm |
| Ring number of cells | NC | |
| Early wood number of cells | ENC | |
| Transition wood number of cells | TNC | |
| Late wood number of cells | LNC | |
| Early wood percentage | EP | % |
| Transition wood percentage | TP | % |
| Late wood percentage | LP | % |
| Early/late wood percentage | EP/LP | % |
| Modulus of elasticity | MOE | GPa |
| Mass index (density × growth) | MI |