| Literature DB >> 31139243 |
Dawei Sun1,2, Haiyan Cen1,2,3, Haiyong Weng1,2, Liang Wan1,2, Alwaseela Abdalla1,2, Ahmed Islam El-Manawy1,2, Yueming Zhu1,2, Nan Zhao1, Haowei Fu4, Juan Tang5, Xiaolong Li5, Hongkun Zheng5, Qingyao Shu6, Fei Liu1,2, Yong He1,2,3.
Abstract
BACKGROUND: The advances of hyperspectral technology provide a new analytic means to decrease the gap of phenomics and genomics caused by the fast development of plant genomics with the next generation sequencing technology. Through hyperspectral technology, it is possible to phenotype the biochemical attributes of rice seeds and use the data for GWAS.Entities:
Keywords: Genome-wide association study (GWAS); Phenotyping; Protein content; Rice (Oryza sativa); Spectrology
Year: 2019 PMID: 31139243 PMCID: PMC6532189 DOI: 10.1186/s13007-019-0432-x
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Summary of trait categories
| Category | Names | Acronym |
|---|---|---|
| Biochemical traits | Amylose content (%) | AC |
| Gel consistency | GC | |
| Crude protein content (%) | PC | |
| Alkali spreading value | ASV | |
| Hyperspectral traits | Reflectance at wavelength 1177 (nm) | R1177 |
| Reflectance at wavelength 1227 (nm) | R1227 | |
| Normalized Differential Spectral Index | NDSI | |
| Differential Spectral Index | DSI | |
| Simple Ratio Index | SRI |
Fig. 1Schematic overview of the primary procedures of extracting hyperspectral traits for GWAS
Summary statistics for biochemical traits
| Min | Max | Mean | SD | CV | |
|---|---|---|---|---|---|
| ASV (mm) | 3.70 | 7.00 | 6.13 | 0.89 | 0.15a |
| GC (mm) | 42.00 | 82.00 | 64.94 | 6.94 | 0.11 |
| AC (%) | 8.30 | 20.10 | 14.20 | 2.11 | 0.15a |
| PC (%) | 7.40 | 10.50 | 8.62 | 0.77 | 0.09 |
aCV of ASV and AC are high (≥ 15%)
Fig. 2Frequency distribution for all the traits. AC, PC, and GC fit normalized distribution, except for ASV, which is heavily right skewed due to many rice accessions reached the maximum measurement limit
Pearson correlation analysis of extracted indices with selected wavelengths
| SRI | NDSI | DSI | ||||
|---|---|---|---|---|---|---|
| R2 | Formula | R2 | Formula | R2 | Formula | |
| GC (mm) | 0.21 | R1730/R1731 | 0.21 | (R1731-R1730)/(R1731 + R1730) | 0.26 | R2208-R2203 |
| PC (%) | 0.66 | R1227/R1177 | 0.68 | (R1227-R1177)/(R1227 + R1177)a | 0.67 | R1227-R1177 |
| AC (%) | 0.24 | R1638/R1799 | 0.24 | (R1639-R1799)/(R1639 + R1799) | 0.31 | R2028-R2002 |
aNDSI was selected for correlation analysis and GWAS
Fig. 3The mean spectra of rice accessions and Pearson correlation analysis. a The average spectra of all rice samples. The blue shade indicates 3 times of standard deviation. b The correlation analysis of hyperspectral index NDSI and PC used for GWAS, with correlation coefficients of R2 = 0.68
Fig. 4The population structure analysis based on all rice accessions. a Neighbor-joining tree of all rice accessions which was constructed from simple matching distance based on whole-genome SNP studies. Branch and circle block colors indicate different subgroup distributions. b Principal component (PCA) analysis plots of first and third components for all rice accessions, using the same colors as in a. c Cross-validation (CV) errors graph. d Population structure based on c (K = 6). e Genome-wide average LD decay analysis estimated from all rice accessions. f Histogram of kinship
A subset of associated loci and candidate genes numbers according to GWAS analysis for PC and NDSI
| Trait | Chr. | SNP# | SNP positiona | P_value | Gene# |
|---|---|---|---|---|---|
| NDSI | 1 | 3 | 7837287 | 2.75E−10 | 65 |
| 1 | 10727694b | 2.40E−10 | |||
| 2 | 21222443b | 4.72E−19 | |||
| PC | 1 | 2 | 10727694b | 8.53E−12 | 43 |
| 2 | 21222443b | 6.09E−34 |
aPosition in bp; Chr. Chromosome
bHyperspectral index NDSI could locate the exact SNP positions as PC
Fig. 5Manhattan plot and QQ plot from genome-wide association studies with local LD block of three selected loci. a Manhattan plot by NDSI. b Quantile–quantile plot for NDSI. c Local LD block of around Os01g0243400. The square lattice panel represents the extent of LD based on r. d Local LD block surrounding the locus Os01g0293000. e Local LD block surrounding the locus Os02g0560200. Blue horizontal dashed line and red horizontal dashed line indicates two genome-wide significance thresholds. The red dashed line is the negative logarithm of the 0.1/SNP number and the blue dashed line is the negative logarithm of the 0.01/SNP number. LD block heat maps (c–e) were surrounding the peak on chromosome 1, 2 respectively. Navy blue vertical dashed lines indicate the position of SNP loci
Subset of potential candidate gene loci related to PC
| #Gene name | Chr | SNP location | Gene description | References |
|---|---|---|---|---|
|
| 1 | 7837287 | C2 domain-containing protein At1g53590 | [ |
|
| 1 | 7837287 | Transcription factor bHLH150-like | [ |
|
| 1 | 7837287 | Glucan endo-1,3-beta- | [ |
|
| 1 | 10727694 | Sphingosine-1-phosphate lyase gene | [ |
|
| 1 | 10727694 | S-adenosylmethionine synthase 3 (SAS3) | [ |
|
| 2 | 21222443 | Two-component response regulator | [ |
|
| 2 | 21222443 | Cyclin-dependent kinase C-2-like; probable serine/threonine-protein kinase At1g54610 | [ |
|
| 2 | 21222443 | E3 ubiquitin-protein ligase EL5 (EL5) | [ |
|
| 2 | 21222443 | 3-(3-Hydroxy-phenyl)propionate/3-hydroxycinnamic acid hydroxylase-like |
Fig. 6KEGG enrichment histogram and Vernn diagram. a Pathway association identified using biochemical trait, PC. b Pathway association identified using hyperspectral trait NDSI. c Venn diagrams of pathways identified by PC and NDSI. d Venn diagrams of genes identified by PC and NDSI