| Literature DB >> 30519394 |
Nicholas R LaBonte1, Keith E Woeste2.
Abstract
Dispersal of seeds by scatter-hoarding rodents is common among tropical and temperate tree species, including chestnuts in the genus Castanea. Backcrossed (BC) interspecific hybrid chestnuts exhibit wide variation in seed traits: as the parent species (Castanea dentata and C. mollissima) have distinct seed phenotypes and tend to be handled differently by seed dispersers, phenotypic variation in BC trees is likely due to inheritance of genes that have undergone divergent evolution in the parent species. To identify candidate genomic regions for interspecific differences in seed dispersal, we used tagged seeds to measure average dispersal distance for seeds of third-generation BC chestnuts and sequenced pooled whole genomes of mother trees with contrasting seed dispersal: high caching rate/long distance; low caching rate/short distance; no caching. Candidate regions affecting seed dispersal were identified as loci with more C. mollissima alleles in the high caching rate/ long-distance pool than expected by chance and observed in the other two pools. Functional annotations of candidate regions included predicted lipid metabolism, dormancy regulation, seed development, and carbohydrate metabolism genes. The results support the hypothesis that perception of seed dormancy is a predominant factor in squirrel caching decisions, and also indicate profitable directions for future work on the evolutionary genomics of trees and coevolved seed dispersers.Entities:
Keywords: Fagales; coevolution; forest regeneration; mutualism; seed dispersal
Year: 2018 PMID: 30519394 PMCID: PMC6262733 DOI: 10.1002/ece3.4336
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Summary of seed dispersal data for trees in three genotyping pools, showing the pool each individual parent tree was assigned to, its mean seed weight, the number of seeds cached, the average distance seeds were cached away from the feeding site, and the total number of seeds found for that individual (cached + eaten)
| Pool | Species | Year | Mean seed mass (g) |
| Mean distance (m) | Total found | Total offered |
|---|---|---|---|---|---|---|---|
| A | CC | 2014–15 | 8.53 | 8 | 10.49 | 27 | 75 |
| A | BC1 | 2015 | 3.77 | 5 | 8.25 | 20 | 25 |
| A | BC1 | 2015 | 6.82 | 13 | 7.341 | 21 | 25 |
| A | BC1 | 2016 | 4.14 | 1 | 10.98 | 15 | 30 |
| A | BC1 | 2016 | 3.35 | 2 | 7.8 | 13 | 30 |
| A | BC1 | 2014 | 4.51 | 7 | 9.08 | 19 | 25 |
| A | BC1 | 2014 | 4.03 | 2 | 9.64 | 28 | 50 |
| A | BC1 | 2014 | 2.55 | 6 | 6.40 | 30 | 50 |
| B | BC1 | 2014 | 3.55 | 3 | 4.53 | 23 | 50 |
| B | BC1 | 2016 | 3.16 | 2 | 4.92 | 20 | 40 |
| B | BC1 | 2016 | 4.57 | 1 | 4.42 | 25 | 50 |
| B | BC1 | 2016 | 2.67 | 1 | 3.69 | 23 | 40 |
| B | BC1 | 2016 | 3.50 | 1 | 2.85 | 9 | 30 |
| B | BC1 | 2016 | 3.82 | 1 | 3.52 | 13 | 30 |
| B | BC1 | 2016 | 3.48 | 1 | 4.92 | 9 | 30 |
| C | BC1 | 2016 | 2.53 | 0 | 0 | 12 | 20 |
| C | BC1 | 2016 | 3.22 | 0 | 0 | 9 | 40 |
| C | BC1 | 2016 | 4.01 | 0 | 0 | 24 | 50 |
| C | BC1 | 2016 | 3.66 | 0 | 0 | 14 | 20 |
| C | BC1 | 2014 | 2.99 | 0 | 0 | 24 | 50 |
| C | BC1 | 2014 | 3.47 | 0 | 0 | 24 | 50 |
| C | BC1 | 2015 | 2.54 | 0 | 0 | 19 | 25 |
| C | BC1 | 2015 | 1.47 | 0 | 0 | 7 | 25 |
| C | BC1 | 2014 | 3.09 | 0 | 0 | 24 | 50 |
| C | AC | 2014‐15 | 1.52 | 1 | 1.85 | 30 | 50 |
aStrong‐dispersal pool; bModerate‐dispersal pool; cWeak‐dispersal pool.
Figure 1Scatterplot with simple linear regression line of average distance to caching (m) over average seed mass (g) for 25 BC3, 2 Castanea dentata, and 2 C. mollissima mother trees measured 2014–2016
Figure 2Histograms of heterozygosity estimates for SNPs in the pooled genome sequences of (a) seven BC3 and one Chinese chestnut with more frequent and longer‐distance nut dispersal (b) seven BC3 chestnut with intermediate dispersal distance and low caching frequency and (c) eight BC3 and one American chestnut with low caching frequency and short dispersal distance
Notable predicted genes within genome intervals identified based on lower major allele fraction (higher proportion of Cm alleles) in a pool of BC3 chestnut mother trees with long caching distance and high proportion of seeds cached, relative to other BC3 chestnuts in the study, including predicted molecular function and simulation‐based statistical support for an excess of Cm alleles at the predicted gene
| LG | BP interval |
| Gene | Uniprot hit | Annotation | SNPs |
| P(A) | P(B) | P(C) | BC1 Het. |
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LGA | 83,772,158–8,408,9674 | 3.32 | A.g10648 | LRX3_ARATH (77%) | Extensin‐like protein; cell wall formation | 10 | 2 | 0.17 | 0.59 | 0.59 | 0.742 | 0.000 |
| A.g10657 | VIL1_ARATH (48%) | Involved in vernalization, flowering regulation | 19 | 8 | 0.19 | 0.87 | 0.41 | 0.522 | 0.042 | |||
| LGC | 48,273,510–48,654,688 | 3.09 | C.g6050 | EMF2_ARATH (46%) | Polycomb group protein; flower development | 26 | 8 | 0.24 | 0.37 | 0.82 | 0.838 | 0.068 |
| LGG | 40,408,926–40,687,727 | 2.61 | G.g5214 | DLO2_ARATH (39%) | Salicylic acid catabolic process | 6 | 4 | 0.06 | 0.91 | 0.03 | 0.446 | 0.175 |
| LGH | 46,084,023–46,409,469 | 2.29 | H.g5097 | LRP1_ARATH (50%) | Involved in formation of female flower parts | 9 | 2 | 0.38 | 0.84 | 0.82 | 0.093 | 0.018 |
| LGI | 25,672,061–25,978,423 | 2.71 | I.g3291 | NLTL5_ARATH (33%) | Lipid binding and transport | 1 | 0 | 0.05 | 0.00 | na | 0.429 | 0.000 |
| I.g3304 | C94A2_VICSA (52%) | Hydroxylation of fatty acids | 1 | 0 | 0.64 | 0.91 | 0.64 | 0.600 | 0.000 | |||
| LGL | 59,955,058‐60,179,331 | 2.92 | L.g7556 | PME51_ARATH (59%) | Demethylesterification of cell wall pectin | 2 | 0 | 0.05 | 0.91 | na | 0.160 | 0.031 |
| L.g7302 | CESA2_ARATH (78%) | Crystallization of cell wall microfibrils | 10 | 0 | 0.48 | 0.91 | 0.86 | 0.203 | 0.092 | |||
| LGL | 65,537,114‐65,794,683 | 2.89 | L.g8192 | TPS13_RICCO (62%) | Sesquiterpene synthase | 50 | 10 | 0.18 | 0.91 | 0.42 | 0.700 | 0.022 |
| L.g8191 | NES1_FRAAN (61%) | Synthesis of volatile mono‐, sesquiterpenes | 3 | 1 | 0.43 | 0.91 | 0.35 | 0.671 | 0.155 | |||
| L.g8198 | NES1_FRAVE (58%) | Synthesis of volatile mono‐, sesquiterpenes | 31 | 3 | 0.39 | 0.91 | 0.27 | 0.554 | 0.142 | |||
| L.g8208 | NES2_FRAAN (60%) | Synthesis of volatile mono‐, sesquiterpenes | 9 | 2 | 0.30 | 0.91 | 0.42 | 0.667 | 0.123 |
aLinkage group and bbase position in the Chinese chestnut draft pseudochromosome assembly (Carlson et al., 2014); cNumber of standard deviations that the heterozygosity estimator (HE) for the strong‐dispersal pool exceeded either the weak‐dispersal or no‐dispersal pools; dNumeric code for a predicted gene (AUGUSTUS) in the Chinese chestnut draft reference genome (Carlson et al., 2014); eTop hit from the Uniprot/Swissprot curated protein sequence database for the predicted chestnut gene, with percent amino acid identity; fLikely function based on the top Uniprot hit; gInformative SNPs in the gene sequence; hSNPs where Cm allele frequency in the strong‐dispersal‐pool was in the top 5% of simulated random Cm allele frequencies; iAverage percentile value of all SNPs (smaller value = more Cm alleles than expected) in the predicted gene for the strong‐dispersal pool; jAverage percentile value of SNPs in the weak‐dispersal pool; kAverage percentile value of SNPs in the no‐dispersal pool; lHeterozygosity for all SNPs in the gene for “Clapper,” the Cm allele donor for all BC3 trees in the experiment; mHeterozgosity for all SNPs in the gene, averaged over two individual Cd samples.
Individual predicted genes identified as candidates for differences in seed dispersal based on significant departures from expected allele frequencies in a pool of BC3 chestnut mother trees with long caching distance and high proportion of seeds cached, relative to other BC3 chestnuts in the study
| Gene | Strong‐dispersal SNPsA | Significant SNPs |
| Uniprot | Annotation |
|---|---|---|---|---|---|
| A.g11556 | 9 | 0.82 | 0.343 | PME_PRUPE | Pectin methylesterase; cell wall remodeling |
| A.g1855 | 6 | 0.86 | 0.349 | GWD2_ARATH | Alpha‐glucan water dikinase |
| A.g1991 | 5 | 1.00 | 0.655 | BH030_ARATH | bHLH transcription factor |
| A.g3246 | 5 | 1.00 | 0.731 | C94A2_VICSA | Cytochrome p450 oxidase; fatty acid hydrolysis |
| A.g5184 | 3 | 1.00 | 0.811 | BZR1_ARATH | Modulates ovule development; brassinosteroid signaling |
| A.g5562 | 8 | 0.89 | 0.319 | GEX1_ARATH | Gametophyte development and embyrogenesis |
| A.g6140 | 7 | 0.78 | 0.965 | AOP1C_ARATH | Dioxygenase potentially involved in producing glucosinolates |
| A.g7734 | 1 | 1.00 | 0.564 | GEML5_ARATH | Maintenance of seed dormancy by abscisic acid |
| A.g8091 | 5 | 0.83 | 0.496 | LHT1_ARATH | Lysine histidine amino acid transporter |
| B.g3059 | 4 | 0.80 | 0.252 | PERR_RAUSE | Biosynthesis of monoterpenoid indole alkaloids |
| E.g1046 | 11 | 0.92 | 0.662 | C76AD_BETVU | Cytochrome p450 oxidase in the betalain synthesis pathway |
| E.g4071 | 14 | 0.78 | 0.240 | WAK4_ARATH | Cell‐surface kinase that binds to pectin; cell expansion |
| E.g7467 | 3 | 1.00 | 0.589 | STC_RICCO | Sugar carrier protein C; carbohydrate transport |
| E.g8229 | 8 | 0.80 | 0.899 | HPPR_PLESU | Hydroxyphenylpyruvate reductase; Biosynthesis of rosmarinic acid |
| E.g911 | 5 | 0.83 | 0.719 | GAIP_CUCMA | DELLA protein; repressor of gibberellin signal pathway |
| F.g2981 | 21 | 0.81 | 0.865 | TRA1_MAIZE | Putative AC transposase from transposon “Activator” |
| F.g927 | 4 | 1.00 | 0.365 | RKS1_ARATH | Carbohydrate‐binding serine/threonine protein kinase |
| F.g941 | 10 | 0.77 | 0.885 | SD16_ARATH | Receptor‐like kinase; regulation of cellular expansion |
| H.g315 | 6 | 0.86 | 0.433 | BPG2_ARATH | Brassinosteroid‐mediated transcriptional regulation |
| J.g6035 | 4 | 1.00 | 0.583 | AKR1_SOYBN | Aldo‐keto reductase |
| K.g1622 | 3 | 1.00 | 0.749 | BIOF_ARATH | 8‐amino‐7‐oxononoanoate synthase |
| K.g2221 | 4 | 0.80 | 0.565 | FRL4A_ARATH | FRIGIDA‐like protein; flower development |
| L.g6163 | 4 | 0.80 | 0.910 | MTEF8_ARATH | Transcription termination factor |
| L.g6184 | 13 | 0.77 | 0.722 | GIF2_ARATH | Transcription coactivator; active in cotyledon tissue |
aAUGUSTUS predicted gene from the Chinese chestnut draft reference genome (Carlson et al., 2014) preceded by linkage group letter; the initial letter indicates the pseudochromosome in which the predicted gene was found; bSNPs in the upper 5% of the distribution of simulated random Cm allele frequencies in the strong‐dispersal pooled genome. No SNPs significant at this level were identified in these intervals in the weak‐ or no‐dispersal pools; cPercent of informative SNPs in the gene sequence, for the high‐dispersal pool, in the upper 5% of simulated Cm allele frequencies; d F ST calculated for SNPs within the predicted gene among a sample of individual Chinese and American chestnut genome sequences; eTop hit for the predicted protein from the Uniprot/Swissprot curated protein database.
Figure 3Depiction of the role of candidate genes in Castanea nut development (upper right) and the perceptions of nut phenotypes by squirrels (Sciurus) that are hypothesized to cause some BC3 chestnuts to be dispersed farther and cached more frequently than others
Transcriptome alignments from members of the order Fagales, for selected predicted genes from chestnut genome regions associated with interspecific differences in seed dispersal
| Gene | Uniprot |
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A.g1855 | GWD2_ARATH | 100 | 100 | 94.5 | 98.7 | 100 | 97.9 | – | 77.5 | 83.6 | – | – | – | – |
| A.g1991 | BH030_ARATH | 100 | – | – | – | – | – | – | 62.3 | 65.3 | – | – | – | 58.3 |
| A.g3246 | C94A2_VICSA | 98.7 | 97.3 | 87.2 | 89.2 | 87.6 | 95.7 | – | – | – | – | – | – | – |
| A.g5184 | BZR1_ARATH | 100 | 100 | 80.3 | 81.5 | 97.4 | 81.0 | – | 75.0 | 75.6 | – | 90.2 | 79.7 | 74.0 |
| A.g7734 | GEML5_ARATH | 69.4 | 60.6 | 99.6 | 99.4 | – | 94.2 | – | 68.0 | 70.3 | 90.6 | 84.4 | 70.6 | 83.6 |
| A.g8091 | LHT1_ARATH | 95.5 | 100 | – | – | – | – | 78.1 | – | – | – | – | – | – |
| A.g10648 | LRX3_ARATH | 100 | 100 | 97.9 | 98.2 | 93.1 | 95.1 | 95.5 | – | 94.3 | 95.7 | 94.3 | 92.5 | 90.3 |
| A.g10657 | VIL1_ARATH | 100 | 100 | 100 | 93.1 | 96.1 | 96.2 | – | 83.0 | 83.0 | 84.4 | 92.4 | 89.0 | 87.9 |
| A.g11556 | PME_PRUPE | 100 | – | – | – | – | 87.7 | – | 78.5 | – | 84.6 | – | 72.4 | 76.8 |
| C.g6050 | EMF2_ARATH | 98.9 | 98.4 | – | – | – | – | – | – | – | – | – | – | 74.3 |
| E.g7467 | STC_RICCO | 94.8 | 86.8 | 88.1 | 91.6 | 78.6 | 85.9 | 77.5 | – | – | – | – | – | 81.2 |
| E.g8229 | HPPR_PLESU | 97.9 | – | – | – | – | 82.3 | – | 72.3 | 79.2 | 81.8 | – | 84.3 | 78.1 |
| F.g927 | RKS1_ARATH | 100 | 84.7 | 96.0 | 92.6 | – | 75.8 | 77.1 | 68.4 | – | – | – | – | 72.2 |
| G.g5214 | DLO2_ARATH | 100 | 99.4 | – | – | – | 72.5 | – | – | – | – | – | – | – |
| H.g315 | BPG2_ARATH | 89.6 | 94.0 | – | – | – | 81.5 | 94.5 | 81.1 | 80.7 | – | – | 85.1 | 85.6 |
| H.g5907 | LRP1_ARATH | 79.4 | 100 | 80.2 | 69.6 | 100 | 88.9 | – | 58.3 | 63.0 | – | – | – | 82.6 |
| I.g3291 | NLTL5_ARATH | 89.0 | 100.0 | 85.0 | – | – | 77.3 | – | 50.3 | 65.1 | 61.8 | – | – | 59.8 |
| J.g6035 | AKR_SOYBN | 100 | 84.9 | – | 72.3 | – | – | 88.2 | – | 86.4 | – | – | – | 80.7 |
| K.g2221 | FRL4A_ARATH | 81.9 | 100 | 85.5 | 86.1 | 100 | 84.5 | 79.7 | 67.9 | 65.0 | 84.6 | 97.3 | 70.2 | 78.4 |
| L.g6163 | MTEF8_ARATH | 100 | – | – | – | – | 100 | – | 72.6 | 72.5 | 68.9 | – | – | 85.9 |
| L.g6184 | GIF2_ARATH | 93.0 | – | – | – | 74.5 | 98.0 | – | 40.2 | 57.1 | – | 70.8 | 49.1 | 76.7 |
| L.g7302 | CESA2_ARATH | 85.6 | 100 | 90.0 | 88.3 | 100.0 | 87.3 | 86.0 | 84.4 | 88.5 | 100.0 | 79.1 | 60.5 | 81.9 |
| L.g8191 | NES1_FRAAN | 93.1 | – | – | – | – | – | – | – | 38.9 | – | – | – | – |
| L.g8192 | TPS_RICCO | – | – | – | – | 100.0 | – | – | – | – | – | – | – | – |
| L.g8198 | NES1_FRAVE | – | 92.1 | – | – | – | 95.7 | – | – | – | – | – | – | – |
| L.g8208 | NES2_FRAAN | – | 100 | – | – | – | 90.8 | – | 68.2 | 81.2 | 81.2 | – | – | 78.8 |
aHealthy stems, blight cankers, and whole‐plant tissues (Barakat et al. 2009, 2012) for Chinese chestnut, Castanea mollissima (Cm) and bAmerican chestnut, C. denata (Cd); cRoot transcriptomes for Japanese chestnut, C. crenata (Cc), and dEuropean chestnut, C. sativa (Cs) (Serrazina et al. 2015); Whole‐plant libraries of ewhite oak (Quercus alba; Qa), fnorthern red oak (Quercus rubra; Qr), gAmerican beech (Fagus grandifolia; Fg), hblack walnut (Juglans nigra; Jn), ired alder (Alnus rubra; Aru), and jwhite alder (Alnus rhomboids; Arh) downloaded from hardwoodgenomics.org; kPersian walnut (Juglans regia; Jr) whole‐plant transcriptome data (Martinez‐Garcia et al., 2016); lCommon hazelnut (Corylus avellana; Ca) all‐tissue library (Rowley et al., 2012); m Betula platyphylla (Mu et al., 2012); nPercent amino‐acid identity in a blastx alignment of the predicted chestnut protein with the cDNA transcript; oPredicted protein in chestnut was not the best blastx alignment for any transcript.
Selected candidate genes with summary of evidence for involvement in seed dispersal. Boldface indicates candidate genes with the strongest evidence
| Site | Gene | Pool A | Pool B | Pool C | Het. | Het. | UniProtg | % ID |
|---|---|---|---|---|---|---|---|---|
| Sd01 | A.g4714 | 0.488 | 0.026 | 0.067 | 0.120 | 0.013 | DBR_TOBAC | 64 |
| Sd02 | A.g8635 | 0.750 | 0.088 | 0.242 | 0.176 | 0.062 | ASR1_SOLLC | 82 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Sd04 | A.g13357 | 0.585 | 0.124 | 0.154 | 0.026 | 0.055 | SEC5A_ARATH | 57 |
| Sd04 | A.g13359 | 0.471 | 0.104 | 0.063 | 0.007 | 0.043 | SEC5B_ARATH | 71 |
| Sd05 | B.g1118 | 0.470 | 0.093 | 0.082 | 0.108 | 0.035 | GOLS2_ARATH | 72 |
| Sd06 | B.g3452 | 0.422 | 0.089 | 0.119 | 0.119 | 0.021 | SUP_ARATH | 41 |
| Sd06 | B.g3458 | 0.534 | 0.031 | 0.180 | 0.305 | 0.088 | CSLG2_ARATH | 36 |
| Sd06 | B.g3460 | 0.522 | 0.000 | 0.174 | 0.227 | 0.068 | CSLG2_ARATH | 38 |
| Sd07 | B.g5164 | 0.449 | 0.289 | 0.260 | 0.459 | 0.183 | UXS2_ARATH | 70 |
| Sd08 | C.g3074 | 0.437 | 0.077 | 0.093 | 0.236 | 0.042 | PME31_ARATH | 79 |
| Sd09 | C.g3725 | 0.432 | 0.399 | 0.062 | 0.755 | 0.008 | AAPT1_ARATH | 85 |
| Sd09 | C.g3728 | 0.516 | 0.287 | 0.061 | 0.787 | 0.025 | AAPT1_ARATH | 98 |
| Sd10 | C. g6050 | 0.583 | 0.325 | 0.155 | 0.838 | 0.068 | EMF2_ARATH | 46 |
| Sd12 | G.g4366 | 0.175 | 0.000 | 0.483 | 0.857 | 0.033 | ABR1_ARATH | 55 |
| Sd12 | G.g4369 | 0.484 | 0.012 | 0.369 | 0.55 | 0.130 | AMYA_VIGMU | 71 |
| Sd12 | G.g4370 | 0.460 | 0.018 | 0.434 | 0.592 | 0.198 | LAG12_ARATH | 66 |
|
|
|
|
|
|
|
|
|
|
| Sd14 | H.g5907 | 0.647 | 0.078 | 0.072 | 0.093 | 0.018 | LRP1_ARATH | 50 |
|
|
|
|
|
|
|
|
|
|
| Sd15 | I.g3304 | 0.550 | 0.000 | 0.082 | 0.600 | 0.000 | C94A2_VICSA | 52 |
| Sd16 | L.g7302 | 0.535 | 0.066 | 0.108 | 0.203 | 0.092 | CESA2_ARATH | 78 |
|
|
|
|
|
|
|
|
|
|
| Sd18 | L.g8191 | 0.516 | 0.236 | 0.419 | 0.671 | 0.155 | NES1_FRAAN | 61 |
|
|
|
|
|
|
|
|
|
|
| Sd18 | L.g8198 | 0.667 | 0.098 | 0.429 | 0.554 | 0.142 | NES1_FRAVE | 58 |
| Sd18 | L.g8208 | 0.527 | 0.078 | 0.455 | 0.667 | 0.123 | NES2_FRAAN | 60 |
aPool of individuals with highest mean dispersal distance and largest % of seeds cached rather than consumed; bPool of individuals with lower mean dispersal distance and lower caching %; cPool of seeds that were rarely or never cached; dProportion of heterozygous snps in gene for “Clapper” calculated using Vcftools; eProportion of heterozygous snps in gene for Cd calculated in Vcftools; fPercentile of expected Cm allele frequency distribution based on 1,000,000 simulated pooled genotypes for each pool, averaged over all SNP loci within gene.