| Literature DB >> 27748400 |
Zhijiao Song1,2,3, Miaomiao Zhang2,4, Fagen Li2, Qijie Weng2, Chanpin Zhou2, Mei Li2, Jie Li2, Huanhua Huang5, Xiaoyong Mo4, Siming Gan1,2.
Abstract
Identification of loci or genes under natural selection is important for both understanding the genetic basis of local adaptation and practical applications, and genome scans provide a powerful means for such identification purposes. In this study, genome-wide simple sequence repeats markers (SSRs) were used to scan for molecular footprints of divergent selection in Eucalyptus grandis, a hardwood species occurring widely in costal areas from 32° S to 16° S in Australia. High population diversity levels and weak population structure were detected with putatively neutral genomic SSRs. Using three FST outlier detection methods, a total of 58 outlying SSRs were collectively identified as loci under divergent selection against three non-correlated climatic variables, namely, mean annual temperature, isothermality and annual precipitation. Using a spatial analysis method, nine significant associations were revealed between FST outlier allele frequencies and climatic variables, involving seven alleles from five SSR loci. Of the five significant SSRs, two (EUCeSSR1044 and Embra394) contained alleles of putative genes with known functional importance for response to climatic factors. Our study presents critical information on the population diversity and structure of the important woody species E. grandis and provides insight into the adaptive responses of perennial trees to climatic variations.Entities:
Mesh:
Year: 2016 PMID: 27748400 PMCID: PMC5066178 DOI: 10.1038/srep34941
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Geographic distribution of the 16 Eucalyptus grandis populations studied.
The map was generated using software ArcGIS 10.0 (http://www.esri.com/software/arcgis/). Full description of the populations can be found in Table 1. SF, state forest; ACT, Australian Capital Territory.
Eucalyptus grandis populations, their origins and sample size as well as the mean values of three non-correlated climatic variables (during 1950–2000).
| No. | Code | Population | Latitude (S) | Longitude (E) | Altitude (m) | N | MAT (°C) | MAT partition | IT | IT partition | AP (mm) | AP partition |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Pic | Piccaninny CK Windsor, QLD | 16°13′ | 144°58′ | 1,160 | 9 | 19.7 | 5 | 60.1 | 2 | 1,352 | 3 |
| 2 | Cop | Copperlode, QLD | 16°58′ | 145°40′ | 425 | 9 | 21.7 | 4 | 56.9 | 2 | 2,195 | 2 |
| 3 | MS | MT Spec SF Paluma, QLD | 18°56′ | 146°07′ | 850 | 9 | 20.4 | 3 | 54.1 | 4 | 1,001 | 4 |
| 4 | Fin | Finch Hatton Gorge, QLD | 21°04′ | 148°37′ | 200 | 10 | 22.1 | 4 | 51.1 | 1 | 1,336 | 3 |
| 5 | Cre | Credition SF, QLD | 21°13′ | 148°28′ | 720 | 11 | 19.8 | 5 | 51.6 | 1 | 983 | 4 |
| 6 | Kin | Kin Kin, QLD | 26°12′ | 153°10′ | 40 | 9 | 20.8 | 3 | 50.6 | 3 | 1,517 | 1 |
| 7 | Bel | Belli, QLD | 26°29′ | 152°50′ | 100 | 9 | 19.9 | 5 | 51.7 | 1 | 1,498 | 1 |
| 8 | Bor | Borumba Range, QLD | 26°35′ | 152°36′ | 500 | 11 | 18.4 | 1 | 51.7 | 1 | 1,319 | 3 |
| 9 | Con | Connondale, QLD | 26°40′ | 152°36′ | 560 | 10 | 17.5 | 2 | 51.5 | 1 | 1,384 | 3 |
| 10 | Kil | Kilcop Creek, QLD | 26°45′ | 152°35′ | 400 | 10 | 18.0 | 2 | 51.8 | 1 | 1,276 | 3 |
| 11 | MM | MT Mee, QLD | 27°08′ | 152°43′ | 200 | 12 | 19.0 | 1 | 51.9 | 1 | 1,178 | 3 |
| 12 | MT | MT Tamborine, QLD | 27°55′ | 153°11′ | 500 | 12 | 17.7 | 2 | 52.0 | 1 | 1,522 | 1 |
| 13 | ML | MT Lindsay, QLD | 28°21′ | 152°45′ | 340 | 10 | 17.2 | 6 | 51.8 | 1 | 1,262 | 3 |
| 14 | Bag | Bagawa, NSW | 30°07′ | 152°54′ | 440 | 10 | 17.0 | 6 | 50.0 | 3 | 1,841 | 2 |
| 15 | Ora | Orara West SF, NSW | 30°20′ | 153°00′ | 293 | 9 | 17.5 | 2 | 50.3 | 3 | 1,951 | 2 |
| 16 | Bul | Bulahdelah SF, NSW | 32°20′ | 152°15′ | 20 | 9 | 17.7 | 2 | 50.2 | 3 | 1,307 | 3 |
| Total | 159 | 6 | 4 | 4 |
Using a k-means analysis42 on each of the non-correlated climatic variables, climatic partitioning assigned the populations to four or six groups. QLD, Queensland; MT, mountain; SF, state forest; NSW, New South Wales; N, number of individuals; MAT, mean annual temperature; IT, isothermality; AP, annual precipitation.
Genetic diversity parameters of E. grandis populations based on 31 putatively neutral gSSR loci.
| No. | Pop. code[ | ||||||
|---|---|---|---|---|---|---|---|
| 1 | Pic | 8.484 | 4 | 0.708 | 0.753 | 0.049 | 4.624 |
| 2 | Cop | 8.161 | 8 | 0.701 | 0.716 | 0.009 | 4.295 |
| 3 | MS | 7.903 | 8 | 0.717 | 0.748 | 0.040 | 4.557 |
| 4 | Fin | 8.903 | 7 | 0.692 | 0.706 | 0.005 | 4.459 |
| 5 | Cre | 9.903 | 4 | 0.738 | 0.736 | –0.002[ | 4.693 |
| 6 | Kin | 8.613 | 7 | 0.761 | 0.801 | 0.050 | 5.280 |
| 7 | Bel | 8.323 | 3 | 0.760 | 0.791 | 0.045 | 5.256 |
| 8 | Bor | 10.548 | 6 | 0.752 | 0.801 | 0.055 | 5.235 |
| 9 | Con | 8.484 | 6 | 0.769 | 0.803 | 0.039 | 5.199 |
| 10 | Kil | 9.032 | 6 | 0.752 | 0.773 | 0.029 | 5.000 |
| 11 | MM | 11.516 | 10 | 0.709 | 0.809 | 0.127 | 5.144 |
| 12 | MT | 11.355 | 8 | 0.769 | 0.806 | 0.056 | 5.077 |
| 13 | ML | 9.065 | 8 | 0.768 | 0.775 | 0.004 | 4.918 |
| 14 | Bag | 9.516 | 13 | 0.742 | 0.792 | 0.065 | 4.953 |
| 15 | Ora | 8.290 | 11 | 0.788 | 0.804 | 0.022 | 5.300 |
| 16 | Bul | 8.258 | 12 | 0.767 | 0.770 | 0.000 | 4.870 |
| Mean (SE) | 9.147 (1.129) | 7.560 (2.851) | 0.743 (0.009) | 0.774 (0.005) | 0.037 (0.011) | 4.929 (0.318) |
†See Table 1 for full description of the populations. ‡The negative F value was treated as zero. ANA, average number of alleles per locus; NPA, number of private alleles; HO, observed heterozygosity; HE, expected heterozygosity; F, fixation index; AR, allelic richness.
Figure 2Genetic structure of 16 Eucalyptus grandis populations based on 31 putatively neutral gSSR loci.
Full description of the populations can be found in Table 1. (a) Principal coordinates analysis (PCoA). (b) Unweighted pair group method with arithmetic mean (UPGMA) dendrogram. (c) Individual proportion and population membership to each of the clusters inferred in STRUCTURE analysis (K = 2).
FST outliers detected for the three un-correlated climatic factors in LOSITAN45, ARLEQUIN47 and BAYESCAN49.
| SSR locus | MAT | Isothermality | Annual precipitation | ||||||
|---|---|---|---|---|---|---|---|---|---|
| L[ | A[ | B[ | L[ | A[ | B[ | L[ | A[ | B[ | |
| EUCeSSR1061 | |||||||||
| Embra180 | ** | * | * | ||||||
| EUCeSSR347 | |||||||||
| Embra98 | |||||||||
| EUCeSSR0502 | |||||||||
| EUCeSSR0224 | |||||||||
| EUCeSSR0276 | |||||||||
| EUCeSSR0979 | ** | ||||||||
| Embra227 | |||||||||
| Embra280 | ** | ** | |||||||
| EUCeSSR313 | |||||||||
| EUCeSSR384 | |||||||||
| EUCeSSR0599 | |||||||||
| Embra130 | ** | ||||||||
| EUCeSSR0035 | |||||||||
| EUCeSSR151 | |||||||||
| EUCeSSR686 | |||||||||
| Embra242 | |||||||||
| Embra120 | * | ||||||||
| EUCeSSR626 | |||||||||
| EUCeSSR0455 | ** | * | |||||||
| Embra358 | * | ||||||||
| Embra304 | *** | ||||||||
| EUCeSSR0103 | * | ||||||||
| Embra188 | ** | * | |||||||
| EUCeSSR0906 | |||||||||
| Embra37 | |||||||||
| Embra187 | ** | ||||||||
| EUCeSSR0755 | ** | ** | * | *** | ** | ||||
| Embra233 | * | ||||||||
| EUCeSSR231 | |||||||||
| EUCeSSR0620 | * | ||||||||
| EUCeSSR880 | * | ||||||||
| EUCeSSR479 | |||||||||
| EUCeSSR1042 | * | ** | ** | * | |||||
| Embra83 | |||||||||
| Embra369 | * | ||||||||
| EUCeSSR1087 | |||||||||
| EUCgSSR21 | |||||||||
| EUCeSSR522 | * | ||||||||
| Embra197 | |||||||||
| EUCeSSR1070 | *** | ** | * | * | |||||
| Embra88 | ** | ** | * | * | |||||
| Embra53 | * | * | * | ||||||
| EUCeSSR0497 | * | ||||||||
| EUCeSSR0845 | |||||||||
| Embra217 | * | ||||||||
| EUCeSSR0679 | * | ||||||||
| EUCeSSR1044 | * | ||||||||
| EUCeSSR0568 | |||||||||
| Embra394 | *** | *** | *** | ** | *** | ** | |||
| EUCeSSR0893 | *** | ** | *** | *** | ** | ** | *** | ** | *** |
| Embra269 | * | ||||||||
| EUCeSSR292 | *** | ** | * | ** | |||||
| EUCeSSR349 | * | ||||||||
| EUCeSSR209 | * | ||||||||
| EUCeSSR0849 | ** | * | * | * | |||||
| EUCeSSR1145 | |||||||||
| Sub-total | 28 (24 + | 18 (8 + | 1 | 20 (9 + | 12 (9 + | 1 | 15 (13 + | 10 (5 + | 1 |
| L ∩ A | 11 (10.0%; 8 + | 10 (9.1%; 8 + | 5 (4.5%; 5 + | ||||||
| L ∩ A ∩ B | 1 (0.9%) | 1 (0.9%) | 1 (0.9%) | ||||||
| L ∪ A ∪ B | 35 (31.8%; 24 + | 22 (20.0%; 10 + | 20 (18.2%; 13 + | ||||||
| gSSRs | 13 (28.9%)†† | 8 (17.8%)†† | 7 (15.6%)†† | ||||||
| EST-SSRs | 22 (33.8%)†† | 14 (21.5%)†† | 13 (20.0%)†† | ||||||
| Total | 58 (52.7%; 32 + | ||||||||
†Confidence levels of the distribution relative to neutral selection: *, above the 95% level; **, above the 99% level; ***, above the 99.9% level; , below the 5% level; , below the 1% level; , below the 0.1% level. ‡Significance levels at the 95% confidence: *, 0.01 < P ≤ 0.05; **, 0.001 < P ≤ 0.01; ***, P ≤ 0.001. Significance levels at the 5% confidence: , 0.01 < P ≤ 0.05; , 0.001 < P ≤ 0.01. §Grades of evidence of selection: **, very strong evidence; ***, decisive evidence. ¶Percentage of the total number of markers used (110). ††Percentage of the total number of genomic SSR markers (45) or EST-SSR markers (65) used. The underlined asterisks and numbers are for balancing selection loci. MAT, mean annual temperature; L, LOSITAN; A, ARLEQUIN; B, BAYESCAN; L ∩ A, LOSITAN and ARLEQUIN; L ∩ A ∩ B, LOSITAN, ARLEQUIN and BAYESCAN; L ∪ A ∪ B, LOSITAN, ARLEQUIN or BAYESCAN.
Loci and alleles significantly associated with the three non-correlated climatic variables in E. grandis.
| Locus-allele (bp) | Scaffold[ | MAT | Isothermality | MAP | Putative function at E ≤ 10−5 (Organism; BlastX E value) |
|---|---|---|---|---|---|
| Embra180-120 | 1 | + | No significant match | ||
| EUCeSSR0755-266 | 6 | + | No significant match | ||
| EUCeSSR0755-276 | 6 | + | + | + | |
| EUCeSSR1044-406 | 10 | + | Zinc finger, C3HC4 type (RING finger) protein ( | ||
| Embra394-215 | 10 | + | Predicted: thionin-like protein 2 ( | ||
| EUCeSSR0849-212 | 11 | + | Predicted: uncharacterized protein LOC104426570 ( | ||
| EUCeSSR0849-232 | 11 | + |
†Scaffolds as aligned to E. grandis genome sequence (version 1.1, http://www.phytozome.net/eucalyptus.php). A plus sign (+) indicates the significant association detected by both G and Wald tests. MAT, mean annual temperature; MAP, mean annual precipitation.
Figure 3Linear regression for three significant associations between FST outlier allele frequencies and climatic variables.
Each dot represents a group of homogeneous populations in K-means climatic partition. (a) The 120 bp allele of locus Embra180 associated with mean annual temperature. (b,c) The 276 bp allele of EUCeSSR0755 associated with mean annual temperature and isothermality, respectively.