| Literature DB >> 34173826 |
Patrick Woods1,2, Brian J Campbell2, Timothy J Nicodemus3, Edgar B Cahoon3, Jack L Mullen2, John K McKay2.
Abstract
Understanding the genetic basis of complex traits is a fundamental goal of evolutionary genetics. Yet, the genetics controlling complex traits in many important species such as hemp (Cannabis sativa) remain poorly investigated. Because hemp's change in legal status with the 2014 and 2018 U.S. Federal Farm Bills, interest in the genetics controlling its numerous agriculturally important traits has steadily increased. To better understand the genetics of agriculturally important traits in hemp, we developed an F2 population by crossing two phenotypically distinct hemp cultivars (Carmagnola and USO31). Using whole-genome sequencing, we mapped quantitative trait loci (QTL) associated with variation in numerous agronomic and biochemical traits. A total of 69 loci associated with agronomic (34) and biochemical (35) trait variation were identified. We found that most QTL co-localized, suggesting that the phenotypic distinctions between Carmagnola and USO31 are largely controlled by a small number of loci. We identified TINY and olivetol synthase as candidate genes underlying co-localized QTL clusters for agronomic and biochemical traits, respectively. We functionally validated the olivetol synthase candidate by expressing the alleles in yeast. Gas chromatography-mass spectrometry assays of extracts from these yeast colonies suggest that the USO31 olivetol synthase is functionally less active and potentially explains why USO31 produces lower cannabinoids compared to Carmagnola. Overall, our results help modernize the genomic understanding of complex traits in hemp.Entities:
Keywords: genetic architecture; hemp; olivetol synthase
Mesh:
Substances:
Year: 2021 PMID: 34173826 PMCID: PMC9335937 DOI: 10.1093/genetics/iyab099
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.402
Figure 1Trait correlations of measured phenotypes. Correlation plot depicting Spearman’s rank correlation coefficient (ρ), between measured phenotypes in F2 population. Red colors indicate positive correlations while blue colors indicate negative correlations. The vertical bolded black line separates biochemical from agronomic traits. Only significant correlations (P < 0.05) are shown.
Figure 2Linkage map and QTL intervals. (A) Linkage map showing boxplots depicting the 1.5 LOD confidence intervals of QTL for the 8 measured agronomic traits. (B) Linkage map showing boxplots depicting the 1.5 LOD confidence intervals of QTL for the 17 measured biochemical traits. Black bars within each boxplot indicate the location of the peak LOD values while box colors indicate QTL identified for the respectively colored trait.
QTL models, locations, and effect sizes for agronomic traits
| Phenotype | QTL | Linkage group | Marker (bp) | Genetic position (cM) | LOD | Variance explained (%) |
|---|---|---|---|---|---|---|
|
| LWC.1 | 3 | 50,700,151 | 61.73 | 14.65 | 16.25 |
| LWC.2 | 5 | 23,530,617 | 3.00 | 12.37 | 13.48 | |
| LWC.3 | 10 | 58,605,089 | 8.27 | 4.23 | 4.32 | |
|
| PLHT.1 | 3 | 46,248,568 | 60.33 | 19.41 | 22.35 |
| PLHT.2 | 5 | 31,242,637 | 9.30 | 14.01 | 15.38 | |
| PLHT.3 | 10 | 74,603,632 | 5.88 | 4.40 | 4.45 | |
|
| TSM.1 | 3 | 89,444,283 | 74.00 | 8.44 | 8.29 |
| TSM.2 | 5 | 31,294,490 | 10.00 | 17.22 | 18.16 | |
| TSM.3 | 8 | 14,446,033 | 32.62 | 6.93 | 6.73 | |
| TSM.4 | 10 | 16,666,160 | 13.12 | 5.78 | 5.56 | |
|
| SD.1 | 3 | 65,433,658 | 59.44 | 18.86 | 15.71 |
| SD.2 | 4 | 86,599,640 | 53.04 | 6.38 | 4.80 | |
| SD.3 | 5 | 41,915,966 | 12.00 | 23.29 | 20.13 | |
| SD.4 | 5 | 84,759,484 | 109.77 | 4.45 | 3.30 | |
| SD.5 | 9 | 1,825,282 | 38.22 | 8.42 | 6.44 | |
| SD.6 | 10 | 16,666,160 | 13.12 | 5.11 | 3.81 | |
| SD.1:SD.5 | 5.84 | 4.38 | ||||
|
| SB.1 | 3 | 22,597,724 | 58.82 | 11.24 | 9.01 |
| SB.2 | 4 | 15,873,733 | 32.36 | 7.60 | 5.92 | |
| SB.3 | 5 | 31,242,637 | 9.30 | 32.48 | 31.05 | |
| SB.4 | 5 | 84,871,972 | 102.83 | 4.46 | 3.39 | |
| SB.5 | 10 | 24,849,063 | 37.50 | 7.26 | 5.65 | |
|
| SY.1 | 2 | 94,185,707 | 98.47 | 12.59 | 11.35 |
| SY.2 | 3 | 41,816,369 | 61.18 | 6.72 | 5.78 | |
| SY.3 | 4 | 84,450,770 | 49.52 | 4.92 | 4.17 | |
| SY.4 | 5 | 23,530,617 | 3.00 | 14.15 | 12.92 | |
| SY.5 | 5 | 57,357,245 | 36.00 | 4.04 | 3.41 | |
|
| DB.1 | 3 | 22,597,724 | 58.82 | 8.30 | 8.04 |
| DB.2 | 4 | 86,642,457 | 54.30 | 4.17 | 3.91 | |
| DB.3 | 5 | 31,242,637 | 9.30 | 22.29 | 24.19 | |
| DB.4 | 9 | 8,629,041 | 8.00 | 4.17 | 3.91 | |
|
| DTM.1 | 3 | 26,862,649 | 57.70 | 6.30 | 5.22 |
| DTM.2 | 4 | 86,599,640 | 53.00 | 6.05 | 5.01 | |
| DTM.3 | 5 | 23,530,617 | 3.00 | 33.10 | 34.27 | |
| DTM.4 | 9 | 3,462,161 | 43.60 | 5.24 | 4.31 |
Markers are reported as the physical base pair position of the linkage map marker closest to the LOD peak of the respective QTL. Genetic positions (centiMorgans), LOD values, and variance estimates of QTL have been rounded to two decimal places. Colons indicate interactions between the specified QTL.
QTL models, locations, and effect sizes for biochemical traits
| Phenotype | QTL | Linkage group | Marker (bp) | Genetic position (cM) | LOD | Variance explained (%) |
|---|---|---|---|---|---|---|
|
| AP.1 | 1 | 730,085 | 20.00 | 12.11 | 9.90 |
| AP.2 | 2 | 1,811,866 | 15.80 | 5.94 | 4.54 | |
| AP.3 | 6 | 65,509,552 | 41.60 | 15.76 | 13.43 | |
| AP.4 | 7 | 71,316,711 | 56.49 | 4.84 | 3.65 | |
| AP.5 | 8 | 3,907,307 | 24.00 | 8.64 | 6.80 | |
| AP.6 | 9 | 2,778,113 | 43.00 | 32.09 | 31.04 | |
| AP.1:AP.5 | 6.71 | 5.16 | ||||
| AP.3:AP.6 | 9.61 | 7.64 | ||||
|
| AT.1 | 6 | 19,863,635 | 39.10 | 10.17 | 15.70 |
| AT.2 | 9 | 2,941,221 | 44.00 | 14.31 | 23.15 | |
| AT.1:AT.2 | 3.51 | 5.04 | ||||
|
| BC.1 | 2 | 2,134,594 | 13.00 | 9.86 | 9.20 |
| BC.2 | 8 | 47,143,084 | 58.32 | 6.37 | 5.72 | |
| BC.3 | 9 | 2,323,644 | 41.28 | 35.06 | 43.88 | |
|
| CAM.1 | 6 | 17,849,490 | 36.29 | 10.51 | 17.36 |
| CAM.2 | 9 | 2,941,221 | 44.00 | 11.40 | 19.01 | |
| CAM.1:CAM.2 | 4.65 | 7.19 | ||||
|
| CO.1 | 9 | 2,992,284 | 44.38 | 11.75 | 20.42 |
| CO.2 | 10 | 34,580,106 | 93.00 | 10.59 | 18.18 | |
| CO.1:CO.2 | 6.91 | 11.38 | ||||
|
| CBC.1 | 9 | 2,778,113 | 43.00 | 6.56 | 12.57 |
|
| CBDa.1 | 6 | 29,867,178 | 33.73 | 32.56 | 28.21 |
| CBDa.2 | 9 | 2,453,237 | 41.10 | 44.85 | 44.79 | |
| CBDa.1:CBDa.2 | 13.64 | 9.59 | ||||
|
| CBGa.1 | 6 | 45,639,112 | 33.37 | 20.32 | 26.46 |
| CBGa.2 | 9 | 2,453,237 | 41.10 | 25.53 | 35.21 | |
| CBGa.1:CBGa.2 | 7.87 | 8.97 | ||||
|
| CIT.1 | 9 | 2,083,967 | 37.46 | 12.41 | 23.44 |
|
| EUC.1 | 5 | 72,066,134 | 53.98 | 3.52 | 6.39 |
| EUC.2 | 9 | 2,941,221 | 44.00 | 5.52 | 10.23 | |
|
| GT.1 | 6 | 19,863,635 | 39.10 | 7.64 | 11.03 |
| GT.2 | 9 | 2,270,995 | 39.39 | 17.44 | 28.12 | |
| GT.1:GT.2 | 2.84 | 3.89 | ||||
|
| GE.1 | 9 | 2,083,904 | 36.00 | 26.39 | 43.33 |
|
| LI.1 | 7 | 35,578,586 | 31.65 | 6.12 | 12.34 |
|
| LA.1 | 6 | 5,006,390 | 22.00 | 4.56 | 9.35 |
|
| OC.1 | 1 | 2,554,340 | 34.00 | 8.52 | 13.11 |
| OC.2 | 6 | 39,795,404 | 27.98 | 6.42 | 9.65 | |
| OC.3 | 9 | 2,941,221 | 44.00 | 7.67 | 11.68 | |
|
| THCa.1 | 6 | 39,794,190 | 28.90 | 20.58 | 24.54 |
| THCa.2 | 9 | 2,453,237 | 41.10 | 25.56 | 32.10 | |
| THCa.1:THCa.2 | 8.04 | 8.36 | ||||
|
| 3C.1 | 6 | 55,569,529 | 37.18 | 16.34 | 26.70 |
| 3C.2 | 9 | 3,462,212 | 43.48 | 15.23 | 24.59 | |
| 3C.1:3C.2 | 9.94 | 15.11 |
Markers are reported as the physical base pair position of the linkage map marker closest to the LOD peak of the respective QTL. Genetic positions (centiMorgans), LOD values, and variance estimates of QTL have been rounded to two decimal places. Colons indicate interactions between the specified QTL.
Figure 3Epistatic interaction of the QTL clusters LG6.35 (CBDa.1) and LG9.40 (CBDa.2). Line colors indicate F2 plant genotype at CBDa.1 while x-axis positions indicate F2 plant genotype at CBDa.2 (A, Carmagnola; B, USO31). y-axis values indicate the mean (± standard error) quantile normalized CBDa quantities. Lesser y-axis values indicate lower quantities while higher y-axis values indicate greater quantities of CBDa produced.
Figure 4Models for candidate genes underlying LG5.05 and LG9.40. Shown are the physical locations of the TINY (A) and OLS (B) genes with annotations depicting the three letter codes for amino acid substitution(s) identified between Carmagnola (blue) and USO31 (red).
Figure 5Olivetol production in media supplemented with 1 mM hexanoic acid. (A) Synthesis of olivetol from hexanoic acid and hexanoyl-Coenzyme A (CoA) as described by Taura and Kovalchuk . (B) Shown are bar plots comparing the mean (± standard deviation) olivetol production for biological triplicates of yeast colonies expressing the Carmagnola-derived OLS (black) and USO31-derived OLS (white). Yeast colonies transformed with empty pYES2 vector is shown in gray. Single asterisks indicate a t-test P-value less than 0.05 while double asterisks indicate a t-test P-value less than 0.01. The dashed horizontal line is used to indicate no detectible (ND) quantity of olivetol for colonies transformed with empty pYES2 vectors. (C) Mass spectra showing the major diagnostic ionization fragment and molecular ion (M+) for olivetol for the standard (top) and Carmagnola-derived OLS (bottom). (D) Chromatograms showing the retention time (min) for the peak containing the major diagnostic ionization fragment for olivetol for the olivetol standard, Carmagnola-derived OLS, and USO31-derived OLS.