| Literature DB >> 29898656 |
Bochra A Bahri1,2, Guillaume Daverdin3,4, Xiangyang Xu3,5, Jan-Fang Cheng6, Kerrie W Barry6, E Charles Brummer7, Katrien M Devos3.
Abstract
BACKGROUND: Advances in genomic technologies have expanded our ability to accurately and exhaustively detect natural genomic variants that can be applied in crop improvement and to increase our knowledge of plant evolution and adaptation. Switchgrass (Panicum virgatum L.), an allotetraploid (2n = 4× = 36) perennial C4 grass (Poaceae family) native to North America and a feedstock crop for cellulosic biofuel production, has a large potential for genetic improvement due to its high genotypic and phenotypic variation. In this study, we analyzed single nucleotide polymorphism (SNP) variation in 372 switchgrass genotypes belonging to 36 accessions for 12 genes putatively involved in biomass production to investigate signatures of selection that could have led to ecotype differentiation and to population adaptation to geographic zones.Entities:
Keywords: Single nucleotide polymorphisms, Biomass genes, Selection, Subgenome, Population structure, Evolution
Mesh:
Year: 2018 PMID: 29898656 PMCID: PMC6000970 DOI: 10.1186/s12862-018-1193-2
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Summary statistics for the 251 SNPs analyzed in 12 biomass genes
| Polymorphism | Nucleotide diversity | Tajima’s test | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Non Syn | |||||||||||||||
| Gene | Contig | Chromosome | No. of samples | No. of sites (coding length bp) | Stot | Non Cod | Syn | Con | Non Con | d | πtot × 10-3 (SD × 10-3) | θw × 10-3 (SD × 10-3) | h | Hd (SD) | D |
| PHYC | contig03093 | Chr09N | 353 | 3224 (2279) | 20 | 7 | 6 | 2 | 5 | 0.620 | 1.68 (0.04) | 0.87 (0.25) | 91 | 0.963 (0.002) | 0.362 |
| TE | contig99597 | Chr05K | 351 | 2088 (278) | 21 | 21 | 0 | 0 | 0 | 1.006 | 3.43 (0.08) | 1.40 (0.31) | 84 | 0.883 (0.010) | 0.138 |
| TE | contig04674 | ChrOSNa | 334 | 2088 (278) | 21 | 20 | 1 | 0 | 0 | 1.006 | 3.78 (0.09) | 1.58 (0.34) | 110 | 0.959 (0.003) | 0.316 |
| VRN3 | contig07490 | Chr03Na | 337 | 2258 (1771) | 12 | 1 | 4 | 2 | 5 | 0.531 | 1.30 (0.05) | 0.74 (0.21) | 22 | 0.759 (0.013) | 0.723 |
| VRN3 | contigl6433 | Chr03K | 356 | 2258 (1773) | 10 | 3 | 1 | 3 | 3 | 0.443 | 0.95 (0.03) | 0.62 (0.19) | 30 | 0.728 (0.016) | 0.62 |
| DW3 | contig26301 | Chr06K | 327 | 2207 (2207) | 5 | 2 | 0 | 2 | 1 | 0.227 | 0.56 (0.02) | 0.32 (0.14) | 10 | 0.629 (0.018) | 1.267 |
| DW3 | contigll7938 | Chr06N | 337 | 2198 (2198) | 4 | 2 | 0 | 1 | 1 | 0.182 | 0.55 (0.02) | 0.25 (0.13) | 14 | 0.714 (0.010) | 0.793 |
| FLD | contigl02960 | Chr07K | 346 | 2329 (1395) | 3 | 2 | 1 | 0 | 0 | 0.129 | 0.10 (0.01) | 0.18 (0.10) | 5 | 0.207 (0.020) | −0.588 |
| FLD | contig01920 | Chr07N | 339 | 2330 (1395) | 1 | 0 | 0 | 0 | 1 | 0.043 | 0.15 (0.01) | 0.06 (0.06) | 2 | 0.346 (0.017) | 1.253 |
| Gl | contigl5400 | Chr05K | 342 | 3456 (1185) | 20 | 13 | 2 | 1 | 4 | 0.608 | 1.71 (0.06) | 0.81 (0.18) | 83 | 0.904 (0.006) | −0.264 |
| Gl | contig01489 | ChrOSN | 312 | 3464 (1185) | 10 | 7 | 0 | 1 | 2 | 0.289 | 1.05 (0.03) | 0.40 (0.13) | 22 | 0.757 (0.013) | 1.655 |
| FLT | contig09545 | Chr07K | 365 | 1826 (195) | 17 | 17 | 0 | 0 | 0 | 0.931 | 1.97 (0.05) | 1.30 (0.31) | 64 | 0.931 (0.005) | −0.103 |
| FLT | contig08422 | Chr07N | 339 | 2181 (195) | 21 | 21 | 0 | 0 | 0 | 0.963 | 2.85 (0.04) | 1.34 (0.29) | 62 | 0.899 (0.006) | 0.63 |
| PHYB | contigl3571 | Chr09N | 350 | 3198 (2758) | 4 | 1 | 0 | 1 | 2 | 0.125 | 0.55 (0.01) | 0.17 (0.09) | 7 | 0.697 (0.009) | 3.265** |
| PHYB | contig21054 | Chr09K | 342 | 3194 (2758) | 7 | 6 | 1 | 0 | 0 | 0.219 | 0.49 (0.01) | 0.30 (0.12) | 12 | 0.666 (0.012) | 0.771 |
| HD1 | contig03275 | Chr04K | 306 | 2220 (1104) | 7 | 2 | 2 | 0 | 3 | 0.360 | 0.93 (0.44) | 0.44 (0.17) | 8 | 0.776 (0.008) | 2.082 |
| HD1 | contig05584 | Chr04N | 363 | 2140 (1113) | 17 | 17 | 0 | 0 | 0 | 0.280 | 1.66 (0.04) | 1.11 (0.27) | 30 | 0.901 (0.005) | −0.142 |
| PGM | contigl7299 | Chr09K | 343 | 3120 (1634) | 8 | 5 | 2 | 0 | 1 | 0.256 | 0.59 (0.36) | 0.36 (0.13) | 28 | 0.848 (0.008) | 1.27 |
| PGM | contig200892 | Chr09N* | 315 | 696 (391) | 7 | 4 | 3 | 0 | 0 | 1.006 | 3.3 (0.05) | 1.40 (0.53) | 13 | 0.780 (0.007) | 1.328 |
| TB1 | contig06045 | Chr09K | 339 | 2428 (1109) | 10 | 2 | 5 | 1 | 2 | 0.412 | 1.42 (0.03) | 0.59 (0.21) | 38 | 0.817 (0.010) | 1.625 |
| TB1 | contig76312 | Chr09N | 314 | 2597 (1115) | 26 | 16 | 5 | 3 | 2 | 1.001 | 2.53 (0.07) | 1.41 (0.37) | 84 | 0.926 (0.005) | 0.037 |
| Total | 51,500 (28313) | 251 | 169 | 33 | 17 | 32 | |||||||||
| Sugenome Nb | 372 | 6709 (2884) | 106 | 73 | 12 | 9 | 13 | 1.61(0.04) | 0.65(0.31) | 610 | 0.99(0.20) | −0.353 | |||
| Sugenome Kb | 372 | 6709 (2884) | 100 | 67 | 13 | 6 | 10 | 0.31(0.03) | 0.53(0.11) | 637 | 0.99(0.10 | −0.269 | |||
Stot Total No. of polymorphic sites, Non Cod No. of SNPs in non-coding regions, Syn No. of synonymous SNPs, Con No. of conservative amino acid changes, Non Con No. of non-conservative amino acid changes, d Percentage polymorphism (SNP density), π Nucleotide diversity, θw Theta per site from Watterson estimator, D Tajima’s D, h No. of haplotypes, Hd Haplotype diversity, SD Standard deviation
aChromosome and subgenome allocation were corrected according to mapping data (P Qi and KM Devos, unpublished data)
bSubgenome comparisons were limited to overlapping regions
Fig. 1Number of SNPs in different classes of SNP frequencies and their heterozygosity level (a) (Heterozygous and homozygous SNPs are indicated in blue and red respectively) or their ecotype prevalence (b) (Blue indicates SNPs predominantly present in upland ecotypes (C1), red indicates SNPs predominantly present in lowland ecotypes (C2 and C3) and grey indicates SNPs that occur with similar frequencies in both ecotypes)
Characteristics of non-synonymous mutations in biomass genes and corresponding amino acid changes
| Amino Acid | Allele frequencies % | SNP characteristics | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Contig | Chromosome | Amino Acid position in Setaria Reference sequence | SNP region | Setaria italica reference sequence | Ref | Alt1 | Alt2 | Setaria italica | Wild-type allele | Mutant 1 | Mutant 2 | Conservative (Con) vs. non conservative (Non Con) |
| GI | contig15400 | Chr05K | 155 | Exon7 | Si000107m | E | K | E | 91.59 | 8.41 | Con | ||
| GI | contig15400 | Chr05K | 175 | Exon7 | Si000107m | S | Y | S | 80.32 | 19.68 | Non Con | ||
| GI | contig15400 | Chr05K | 177 | Exon7 | Si000107m | G | D | D | 32.80 | 67.20 | Non Con | ||
| GI | contig15400 | Chr05K | 180 | Exon7 | Si000107m | G | R | G | 92.79 | 7.21 | Non Con | ||
| GI | contig15400 | Chr05K | 234 | Exon9 | Si000107m | C | F | Y | C | 79.82 | 19.59 | 0.58 | Non Con |
| GI | contig01489 | Chr05N | 63 | Exon5 | Si000107m | W | S | S | 37.50 | 62.50 | Non Con | ||
| GI | contig01489 | Chr05N | 185 | Exon7 | Si000107m | R | Q | R | 65.84 | 34.16 | Con | ||
| GI | contig01489 | Chr05N | 960 | Exon14 | Si000107m | F | L | L | 35.83 | 64.17 | Non Con | ||
| PHYB | contig13571 | Chr09N | 661 | Exon2 | Si033968m | S | C | S | 63.64 | 36.36 | Non Con | ||
| PHYB | contig13571 | Chr09N | 702 | Exon2 | Si033968m | V | I | V | 79.13 | 20.87 | Con | ||
| PHYB | contig13571 | Chr09N | 713 | Exon2 | Si033968m | Y | D | D | 50.30 | 49.70 | Non Con | ||
| HD1 | contig03275 | Chr04K | 11 | Exon1 | Seita.4G122700.1 | E | M | E | 85.98 | 14.02 | Non Con | ||
| HD1 | contig03275 | Chr04K | 32 | Exon1 | Seita.4G122700.1 | A | T | A | 94.56 | 5.44 | Non Con | ||
| HD1 | contig03275 | Chr04K | 35 | Exon1 | Seita.4G122700.1 | G | S | S | 60.63 | 39.37 | Non Con | ||
| PGM | contig17299 | Chr09K | 351 | Exon6 | Si034948m | A | V | A | 73.21 | 26.79 | Non Con | ||
| TB1 | contig06045 | Chr09K | 57 | Exon1 | Si038692m | G | D | G | 91.61 | 8.39 | Non Con | ||
| TB1 | contig06045 | Chr09K | 213 | Exon1 | Si038692m | G | D | G | 89.76 | 10.24 | Non Con | ||
| TB1 | contig06045 | Chr09K | 337 | Exon1 | Si038692m | I | V | L | 56.64 | 43.36 | Con | ||
| TB1 | contig76312 | Chr09N | 51 | Exon1 | Si038692m | H | Y | H | 88.06 | 11.94 | Con | ||
| TB1 | contig76312 | Chr09N | 89 | Exon1 | Si038692m | A | P | A | 92.42 | 7.58 | Non Con | ||
| TB1 | contig76312 | Chr09N | 137 | Exon1 | Si038692m | S | P | S | 92.77 | 7.23 | Non Con | ||
| TB1 | contig76312 | Chr09N | 193 | Exon1 | Si038692m | I | V | V | 60.26 | 39.74 | Con | ||
| TB1 | contig76312 | Chr09N | 321 | Exon1 | Si038692m | N | S | N | 91.87 | 8.13 | Con | ||
| PHYC | contig03093 | Chr09N | 422 | Exon1 | Si034030m | V | L | L | 45.71 | 54.29 | Con | ||
| PHYC | contig03093 | Chr09N | 966 | Exon3 | Si034030m | E | V | V | 42.90 | 57.10 | Non Con | ||
| PHYC | contig03093 | Chr09N | 1029 | Exon3 | Si034030m | P | A | P | 81.03 | 18.97 | Non Con | ||
| PHYC | contig03093 | Chr09N | 1031 | Exon3 | Si034030m | K | E | K | 76.66 | 23.34 | Con | ||
| PHYC | contig03093 | Chr09N | 1041 | Exon3 | Si034030m | K | N | K | 83.44 | 16.56 | Non Con | ||
| PHYC | contig03093 | Chr09N | 1069 | Exon1 | Si034030m | L | W | L | 94.26 | 5.74 | Non Con | ||
| PHYC | contig03093 | Chr09N | 1104 | Exon1 | Si034030m | L | H | L | 67.70 | 32.30 | Non Con | ||
| VRN3 | contig07490 | Chr03N | 238 | Exon2 | Si021330m | E | D | E | 89.78 | 10.22 | Con | ||
| VRN3 | contig07490 | Chr03N | 286 | Exon3 | Si021330m | L | M | L | 93.11 | 6.89 | Con | ||
| VRN3 | contig07490 | Chr03N | 295 | Exon3 | Si021330m | T | A | T | 90.10 | 9.90 | Non Con | ||
| VRN3 | contig07490 | Chr03N | 412 | Exon4 | Si021330m | P | R | P | 91.53 | 8.47 | Non Con | ||
| VRN3 | contig07490 | Chr03N | 493 | Exon4 | Si021330m | L | S | S | 84.26 | 15.74 | Non Con | ||
| VRN3 | contig07490 | Chr03N | 511 | Exon4 | Si021330m | N | K | N | 93.63 | 6.37 | Non Con | ||
| VRN3 | contig07490 | Chr03N | 600 | Exon4 | Si021330m | G | V | E | 88.59 | 11.41 | Non Con | ||
| VRN3 | contig16433 | Chr03K | 144 | Exon2 | Si021330m | P | A | P | 90.45 | 9.55 | Non Con | ||
| VRN3 | contig16433 | Chr03K | 398 | Exon4 | Si021330m | V | I | A | 93.98 | 6.02 | Con | ||
| VRN3 | contig16433 | Chr03K | 409 | Exon4 | Si021330m | S | N | S | 89.42 | 10.58 | Con | ||
| VRN3 | contig16433 | Chr03K | 450 | Exon4 | Si021330m | D | G | G | 43.61 | 56.39 | Non Con | ||
| VRN3 | contig16433 | Chr03K | 569 | Exon4 | Si021330m | Q | E | R | 88.36 | 11.64 | Con | ||
| VRN3 | contig16433 | Chr03K | 682 | Exon4 | Si021330m | L | Q | Q | 28.61 | 71.39 | Non Con | ||
| DW3 | contig26301 | Chr06K | 672 | Exon3 | Si013123m | E | G | E | 94.29 | 5.71 | Non Con | ||
| DW3 | contig26301 | Chr06K | 751 | Exon3 | Si013123m | M | V | M | 21.22 | 78.78 | Con | ||
| DW3 | contig117938 | Chr06N | 713 | Exon3 | Si013123m | I | M | M | 87.84 | 12.16 | Con | ||
| DW3 | contig26301 | Chr06N | 897 | Exon3 | Si013123m | I | V | V | 82.96 | 17.04 | Con | ||
| DW3 | contig117938 | Chr06N | 872 | Exon3 | Si013123m | T | A | S | A | 78.04 | 21.96 | 0.30 | Non Con |
| FLD | contig01920 | Chr07N | 511 | Exon1 | Si009376 | S | G | G | 76.70 | 23.30 | Non Con | ||
Fig. 2STRUCTURE output assuming K = 3 for 372 switchgrass genotypes based on 251 SNPs. Each genotype is represented by a thin vertical line divided into K colored segments representing the individual’s estimated membership probability to each of K clusters (C1, C2 and C3 are in blue, green and red, respectively). Genotypes were grouped by ecotype
Comparison of genetic diversity between the three switchgrass subpopulations
| Diversity indexes** | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| N | Np | P | Representative accessions* | Ne (SE) | I (SE) | Ho (SE) | He (SE) | F (SE) | |
| C1 | 158 | 186(11) | 72.9 | PI 315724, PI 337553, PI 414066, PI 414067, PI 414068, PI 421520, PI 431575, PI 476292, PI 476294, PI 476295, PI 476296, PI 642190 and PI 642191 | 1.221a (0.017) | 0.236a (0.014) | 0.077a (0.005) | 0.145a (0.009) | 0.402a (0.019) |
| C2 | 81 | 142(3) | 55.6 | PI 414065, PI 414070, PI 421521, PI 421999 and SNF | 1.210 a (0.019) | 0.203 a (0.015) | 0.071a (0.007) | 0.129 a (0.010) | 0.361 a (0.023) |
| C3 | 71 | 200(5) | 78.4 | PI 315727, PI 422001, Citrus Co-FL, OSSP-FL and SWFWMD-FL | 1.211 a (0.020) | 0.217 a (0.014) | 0.050b (0.004) | 0.131 a (0.010) | 0.450 a (0.024) |
| Admix | 62 | 233(0) | 91.4 | PI 315725#, PI 422003, PI 476290 | 1.304 (0.018) | 0.323 (0.013) | 0.078 (0.004) | 0.198 (0.009) | 0.520 (0.020) |
| Total | 372 | 251 | 74.6 | 26 | 1.236 (0.009) | 0.245 (0.007) | 0.069 (0.002) | 0.152 (0.005) | 0.443 (0.011) |
N number of genotypes, Np No. of polymorphic loci (number of Private Alleles), P percentage of polymorphic loci, Ne No. of Effective Alleles, I Shannon’s Information Index, Ho Observed Heterozygosity; He Expected Heterozygosity, F Fixation Index, SE Standard Error
* Accession for which more than 70% of the genotypes belong to a specific subpopulation
**Mean and SE over loci for each subpopulation; Means with the same letter are not significantly different based on a Tukey HSD test
#Accession 3 is represented by only two genotypes
Fig. 3Principal coordinates analysis for 251 SNPs across the switchgrass genotypes. The genotypes are color-coded according to their affiliation to STRUCTURE subpopulations at K = 3. The cluster C1 comprises mainly upland ecotypes whereas genotypes from C2 and C3 are mostly lowland ecotypes
Pairwise Nei’s genetic distance (upper diagonal) and F values (lower diagonal) among the three switchgrass subpopulations C1, C2 and C3 using 251 SNPs
| C1 | C2 | C3 | |
|---|---|---|---|
| C1 | 0 | 0.359 | 0.231 |
| C2 | 0.313 | 0 | 0.144 |
| C3 | 0.221 | 0.167 | 0 |
AMOVAs calculating the percentages of molecular variance explained by latitude bins (A), accessions (B) and genetic subpopulations (C)
| Source | df | SS | MS | Est. Var. | % |
|---|---|---|---|---|---|
| (A) | |||||
| Among Latitude bins | 12 | 6207.75 | 517.31 | 8.09 | 15% |
| Among Individuals | 349 | 29,954.89 | 85.83 | 39.32 | 72% |
| Within Individual | 362 | 2605.00 | 7.20 | 7.20 | 13% |
| Total | 723 | 38,767.64 | 54.60 | 100% | |
| (B) | |||||
| Among Accessions | 35 | 12,965.07 | 370.43 | 14.92 | 28% |
| Among Individuals | 326 | 23,197.37 | 71.16 | 31.98 | 59% |
| Within Individual | 362 | 2605.00 | 7.20 | 7.20 | 13% |
| Total | 723 | 38,767.44 | 54.10 | 100% | |
| (C) | |||||
| Among Subpopulations | 2 | 8818.28 | 4409.14 | 22.69 | 37% |
| Among Individuals | 306 | 22,088.76 | 72.19 | 32.61 | 52% |
| Within Individual | 309 | 2151.00 | 6.96 | 6.96 | 11% |
| Total | 617 | 33,058.04 | 62.27 | 100% | |
df degrees of freedom, SS sum of squares, MS mean square
Fig. 4Genetic composition of switchgrass populations by geographic region. The genotypes are color-coded according to their affiliation to STRUCTURE subpopulations at K = 3 (C1: blue; C2: green; C3: red; admixed: gray) and grouped by geographic area. The circle size in each geographic area is proportional to the number of genotypes. USA Map was downloaded from https://upload.wikimedia.org/wikipedia/commons/c/ca/Blank_US_map_borders.svg
Fig. 5Differences in the percentage of polymorphic SNPs between switchgrass homoeologous regions for selected genes. TB1: Teosinte branched 1; FLT: Flowering Locus T; DW3: Dwarf 3; TE: Terminal ear; PHYB: Phytochrome B; FLD: Flowering Locus D; VRN3: Vernalization 3; GI: Gigantea; PGM: Phosphoglycerate mutase. PHYC (Phytochrome C), Rht1 (Gibberellin-insensitve gene) and HD1 (Heading date 1) were removed from the analysis because only genes with mapping data in > 80% of the accessions and overlapping regions between the two subgenomes were used
Fig. 6SNP divergence between switchgrass subgenomes N and K in the exon2-exon3 region of PHYB in chromosome 9. Relative frequencies of individuals carrying the SNP alleles are color-coded in blue, red, green or gray according to their genetic group as defined by STRUCTURE. Subgenomes N and K are represented in plain and dots, respectively. SNPs are displayed in 100 bp bin sizes