| Literature DB >> 31061417 |
Meki S Muktar1, Abel Teshome2, Jean Hanson1, Alemayehu T Negawo1, Ermias Habte1, Jean-Baka Domelevo Entfellner3, Ki-Won Lee4, Chris S Jones5.
Abstract
Napier grass is an important tropical forage-grass and of growing potential as an energy crop. One-hundred-five Napier grass accessions, encompassing two independent collections, were subjected to genotyping by sequencing which generated a set of high-density genome-wide markers together with short sequence reads. The reads, averaging 54 nucleotides, were mapped to the pearl millet genome and the closest genes and annotation information were used to select candidate genes linked to key forage traits. 980 highly polymorphic SNP markers, distributed across the genome, were used to assess population structure and diversity with seven-subgroups identified. A few representative accessions were selected with the objective of distributing subsets of a manageable size for further evaluation. Genome-wide linkage disequilibrium (LD) analyses revealed a fast LD-decay, on average 2.54 kbp, in the combined population with a slower LD-decay in the ILRI collection compared with the EMBRAPA collection, the significance of which is discussed. This initiative generated high-density markers with a good distribution across the genome. The diversity analysis revealed the existence of a substantial amount of variation in the ILRI collection and identified some unique materials from the EMBRAPA collection, demonstrating the potential of the overall population for further genetic and marker-trait-association studies.Entities:
Year: 2019 PMID: 31061417 PMCID: PMC6502793 DOI: 10.1038/s41598-019-43406-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Napier grass collections used in the study.
| ILRI collections | EMBRAPA collections | ||
|---|---|---|---|
|
| Hybrid ( | Elite lines | Accessions |
| 52 | 8 | 25 | 20 |
The detail of each accession is shown in the Supplementary Table S1.
Figure 1Distribution of polymorphic information content (PIC) values for the SilicoDArT (orange) and SNP (blue) markers.
Figure 2Genome-wide distribution of SilicoDArT (a) and SNP (b) markers across the seven chromosomes of the pearl millet genome. The markers that were not mapped are indicated by a 0, and those markers that were mapped onto different scaffolds are indicated by an S. The number of markers mapped per chromosome is shown on the x-axis.
SilicoDArT and SNP markers correlated with genes associated with different traits in pearl millet (Varshney et al.[34]).
| SNP_ID | Chr | Pos | Ref | Alt | Closest_gene_PM | Closest_gene_function | Traits | Treatment | P value | R2 (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| 9999783 | 5 | 65917816 | A | G | Pgl_GLEAN_10010048 | AP2/ERF domain | 1000-Grain Mass (g) | Early stress | 2.46E-11 | 19 |
| 8171327 | 3 | 3033078 | G | T | Pgl_GLEAN_10005840 | Ionotropic glutamate receptor | Fresh Stover Yield (t/ha) | Early stress | 5.80E-12 | 12 |
| 23610697 | 6 | 184529223 | C | G | Pgl_GLEAN_10022294 | NA | Fresh Stover Yield (t/ha) | Early stress | 4.93E-11 | 12 |
| 23598607 | 5 | 66047648 | T | C | Pgl_GLEAN_10002412 | Protein kinase, catalytic domain | Grain Number /m2 (No.) | Control | 2.41E-12 | 20 |
| 23588862 | 5 | 66273007 | T | C | Pgl_GLEAN_10007383 | UbiA prenyltransferase family | Grain Number /m2 (No.) | Late stress | 9.71E-12 | 17 |
| 9972446 | 5 | 60443354 | G | A | Pgl_GLEAN_10009273 | Fatty acid hydroxylase | Grain Number /m2 (No.) | Early stress | 5.39E-11 | 16 |
| 9968140 | 5 | 59276838 | C | T | Pgl_GLEAN_10025573 | Alcohol dehydrogenase superfamily, zinc-type | Grain Number /m2 (No.) | Early stress | 1.62E-11 | 17 |
| 23640298 | 3 | 5347418 | T | G | Pgl_GLEAN_10000839 | Peptidase S8/S53, subtilisin/kexin/sedolisin | Grain Number /Panicle (No.) | Control | 6.91E-11 | 15 |
| 23588558 | 5 | 64510034 | C | T | Pgl_GLEAN_10002983 | Phospholipase D/Transphosphatidylase | Grain Number /Panicle (No.) | Control | 3.09E-11 | 18 |
| 23602204 | 5 | 67552785 | G | A | Pgl_GLEAN_10006368 | Protein kinase, catalytic domain | Panicle Number (‘000/ha) | Control | 1.63E-11 | 18 |
| 23623063 | 3 | 8675580 | T | A | Pgl_GLEAN_10008425 | Proteasome, alpha-subunit, N-terminal domain | Panicle Number (‘000/ha) | Control | 3.48E-14 | 24 |
| 9967966 | 2 | 135590394 | A | T | Pgl_GLEAN_10018209 | Sodium/solute symporter | Panicle Number (‘000/ha) | Control | 8.84E-12 | 18 |
| 23617275 | 2 | 72418790 | G | C | Pgl_GLEAN_10021161 | NA | Panicle Number (‘000/ha) | Early stress | 2.99E-11 | 15 |
| 9966416 | 2 | 43404632 | G | C | Pgl_GLEAN_10021658 | NA | Panicle Number (‘000/ha) | Control | 4.92E-11 | 16 |
| 23634420 | 2 | 2182294 | C | T | Pgl_GLEAN_10023314 | Zinc finger, RING-type | Panicle Number (‘000/ha) | Control | 6.18E-11 | 16 |
| 23615392 | 4 | 162832391 | G | A | Pgl_GLEAN_10008211 | Raffinose synthase | Plant Height (cm) | Control | 2.79E-11 | 17 |
| 23618303 | 4 | 119817514 | G | C | Pgl_GLEAN_10012722 | Heat shock protein DnaJ, N-terminal | Plant Height (cm) | Control | 6.63E-11 | 15 |
| 9975905 | 4 | 78593371 | A | G | Pgl_GLEAN_10019616 | NA | Plant Height (cm) | Late stress | 1.00E-11 | 12 |
| 23624988 | 4 | 140312557 | C | A | Pgl_GLEAN_10031827 | Protein of unknown function DUF914, eukaryotic | Plant Height (cm) | Late stress | 1.75E-12 | 12 |
| 23588605 | 4 | 34598785 | C | T | Pgl_GLEAN_10036604 | Ubiquitin-associated/translation elongation factor | Plant Height (cm) | Control | 8.23E-13 | 18 |
| 23603442 | 2 | 214449399 | A | G | Pgl_GLEAN_10031324 | NA | Plant Population (‘000/ha) | Late stress | 5.38E-13 | 23 |
| 17974203 | 5 | 80840668 | G | A | Pgl_GLEAN_10038503 | Transcription factor, SBP-box | Plant Population (‘000/ha) | Late stress | 2.13E-13 | 23 |
Chr = chromosome; Pos = position within chromosome; Ref = reference allele; Alt = alternative allele; PM = pearl millet; NA = information not available.
Figure 3Estimated linkage disequilibrium decay (LD-decay) in 104 Napier grass accessions (blue), 45 EMBRAPA accessions (orange) and 59 ILRI accessions (red) (a). In (b), the LD-decay per chromosome is shown.
Figure 4Clusters of the 104 Napier grass accessions using 980 selected SNP markers. (a) UPGMA tree showing seven groups; (b) PCA plot for PC1 and PC2; (c) The delta K suggesting two major groups and up to 5 subgroups; (d) Bar plots based on the admixture model in STRUCTURE, for K = 2 and K = 5. The colors in (a) and (b) are according to the STRUCTURE analysis with k = 5.
Results of the analysis of molecular variance (AMOVA) for groups detected by different population structure analyses.
| Methods/markers used in population structure analysis | Source of variation | Degrees of freedom (df) | Sum of squares | Mean sum of squares | Percentage of variation |
|
|---|---|---|---|---|---|---|
| Two sub groups by STRUCTURE | VBG | 1 | 1958.79 | 1958.79 | 7.60 | 0.001 |
| VBGWG | 102 | 23258.73 | 228.03 | 7.09 | 0.003 | |
| VWG | 104 | 20336.38 | 195.54 | 85.31 | 0.001 | |
| TV | 207 | 45553.91 | 220.07 | 100 | ||
| Five subgroups by STRUCTURE | VBG | 4 | 5563.54 | 1390.88 | 13.95 | 0.001 |
| VBGWG | 99 | 19653.99 | 198.53 | 0.65 | 0.434 | |
| VWG | 104 | 20336.38 | 195.54 | 85.40 | 0.001 | |
| TV | 207 | 45553.91 | 220.07 | 100 | ||
| Seven subgroups using selected SNPs, and UPGMA tree inference | VBG | 6 | 6360.07 | 1060.01 | 13.28 | 0.001 |
| VBGWG | 97 | 18857.45 | 194.41 | −0.25 | 0.518 | |
| VWG | 104 | 20336.38 | 195.54 | 86.98 | 0.001 | |
| TV | 207 | 45553.91 | 220.07 | 100 | ||
| Seven subgroups using selected SilicoDArTs, and UPGMA tree inference | VBG | 6 | 6765.94 | 1127.66 | 14.42 | 0.001 |
| VBGWG | 97 | 18451.59 | 190.22 | −1.18 | 0.669 | |
| VWG | 104 | 20336.38 | 195.54 | 86.65 | 0.001 | |
| TV | 207 | 45553.91 | 220.07 | 100 |
VBG = Variation between groups; VBGWG = Variation between genotypes within groups; VWG = Variation within genotypes; TV = Total variation.
Napier grass subsets representing the diversity in the collection from the ILRI genebank.
| Optimal_water | Water-deficit | ||||||
|---|---|---|---|---|---|---|---|
| NAME | Species | Origin | Collection | NAME | Species | Origin | Collection |
| ILRI_1026* |
| Burundi | ILRI | ILRI_1026* |
| Burundi | ILRI |
| ILRI_16840* | Zimbabwe | ILRI | ILRI_14389 |
| Nigeria | ILRI | |
| ILRI_14982 | USA | ILRI | ILRI_14983 |
| USA | ILRI | |
| ILRI_14984 |
| USA | ILRI | ILRI_16811 |
| USA | ILRI |
| ILRI_16793* |
| Cuba | ILRI | ILRI_16791 |
| Swaziland | ILRI |
| ILRI_16794 |
| Mozambique | ILRI | ILRI_16793* |
| Cuba | ILRI |
| ILRI_16814* |
| USA | ILRI | ILRI_16816 |
| USA | ILRI |
| ILRI_16839 |
| Zimbabwe | ILRI | ILRI_16796 |
| Zimbabwe | ILRI |
| ILRI_16819 |
| USA | ILRI | ILRI_16806* |
| USA | ILRI |
| ILRI_16797 |
| Zimbabwe | ILRI | ILRI_16782 |
| Tanzania | ILRI |
| ILRI_16806* |
| USA | ILRI | ILRI_16814* |
| USA | ILRI |
| ILRI_16822 |
| Malawi | ILRI | ILRI_16840* | Zimbabwe | ILRI | |
| BAGCE_30* |
| Brazil | EMBRAPA | BAGCE_30* |
| Brazil | EMBRAPA |
| BAGCE_97* |
| Brazil | EMBRAPA | BAGCE_97* |
| Brazil | EMBRAPA |
*Accession selected in both subsets.
Comparisons between Napier grass accessions in the whole collection and the subsets for genetic and phenotypic diversity.
| Trait | Whole collection | Subset (OW) | Subset (WD) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Min | Max | Average | Min | Max | Average | Min | Max | Average | |
| EN-MR | 0.20 | 0.21 | 0.20 | 0.32 | 0.48 | 0.46 | 0.35 | 0.48 | 0.46 |
| EN-GD | 0.08 | 0.09 | 0.08 | 0.12 | 0.22 | 0.19 | 0.11 | 0.22 | 0.19 |
| Se | 7.51 | 7.52 | 7.52 | 7.49 | 7.52 | 7.51 | 7.50 | 7.52 | 7.51 |
| He | 0.44 | 0.44 | 0.44 | 0.41 | 0.44 | 0.44 | 0.42 | 0.44 | 0.44 |
| PIC | 0.23 | 0.38 | 0.36 | 0.17 | 0.38 | 0.35 | 0.17 | 0.38 | 0.35 |
| TFWPP | 4.55 (37.01) | 434.76 (313.16) | 239.40 (139.71) | 13.78 | 416.31 | 275.38 | 47.71 | 266.45 | 147.67 |
| TDWPP | 1.70 (7.92) | 127.17 (87.85) | 65.01 (39.86) | 3.29 | 117.29 | 73.27 | 12.06 | 73.15 | 42.35 |
| Fv/Fm | 0.56 | 0.77 | 0.73 | — | — | — | 0.61 | 0.75 | 0.70 |
| PI | 1.09 | 5.37 | 2.86 | — | — | — | 1.11 | 4.82 | 2.62 |
EN-MR = Average entry-to-nearest-entry distance according to the Modified Rogers (MR) distance using the genetic data; EN-GD = Average entry-to-nearest-entry distance according to Gower distance (GD) using the phenotype data; SH = Shannon’s allelic diversity index; He = expected heterozygosity; PIC = polymorphic information content; TFWPP = total fresh-weight per plant; TDWPP = total dry-weight per plant; FvFM = the ratio of variable fluorescence to maximum fluorescence; PI = performance index. The phenotype of the whole collection under water-deficit conditions is in parentheses. OW = optimal-water condition; WD = water-deficit condition.
Figure 5UPGMA tree for the subsets under optimal-water (a) and water-deficit (b) conditions. In (c), the positions of the subsets in the whole collection (68 accessions) is shown by different colours, accessions not selected for the subsets are shaded a tan-color. Accessions common to the two subsets are indicated with asterisks.