| Literature DB >> 29616055 |
Yan Zhao1, Weipeng Zhao1, Conghui Jiang1, Xiaoning Wang2, Huaiyang Xiong2, Elena G Todorovska3, Zhigang Yin1, Yanfa Chen1, Xin Wang1, Jianyin Xie1, Yinghua Pan1,4, Muhammad A R Rashid1,5, Hongliang Zhang1, Jinjie Li1, Zichao Li1.
Abstract
Dry direct-seeding of rice is rapidly increasing in China, but variable planting depth associated with machine sowing can lead to low seedling emergence rates. Phenotype analysis of 621 rice accessions showed that mesocotyl length (ML) was induced by deep soil covering and was important in deep-sowing tolerance in the field. Here, we performed and compared GWAS using three types of SNPs (non-synonymous SNP, non-synonymous SNPs and SNPs within promoters and 3 million randomly selected SNPs from the entire set of SNPs) and found that Non-Syn GWAS (GWAS using non-synonyomous SNP) decreased computation time and eliminated confounding by other loci relative to GWAS using randomly selected SNPs. Thirteen QTLs were finally detected, and two new major-effect genes, named OsML1 and OsML2, were identified by an integrated analysis. There were 2 and 7 non-synonymous SNPs in OsML1 and OsML2, respectively, from which 3 and 4 haplotypes were detected in cultivated rice. Combinations of superior haplotypes of OsML1 and OsML2 increased ML by up to 4 cm, representing high emergence rate (85%) in the field with 10 cm of soil cover. The studies provide key loci and naturally occurring alleles of ML that can be used in improving tolerance to dry direct-seeding.Entities:
Keywords: Oryza sativa; deep-sowing tolerance; genome-wide association study; mesocotyl length; non-synonymous SNP
Year: 2018 PMID: 29616055 PMCID: PMC5864933 DOI: 10.3389/fpls.2018.00332
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Mesocotyl lengths (MLs) in 621 cultivated rice and relationship among ML, emergence rate (ER) and planting depth. (A) Principal component plot and distribution of accessions with different ML. (B) Neighbor-joining tree and distribution of accessions with ML of more than 3 cm on the tree. (C) Relationship between ER and ML (upper) and histogram of ML (lower) in field plantings with 10 cm of soil cover. (D) Differentiation of MLs in cultivated accessions. (E) Comparison of MLs of indica and japonica. (F) Relationship between ER and ML in box plantings with different depths of soil cover. Different letters above bars indicate significant differences (p < 0.05) detected by Duncan's multiple range test. (G) Relationship between ML and depth of soil cover for accessions with different mesocotyl elongation capacities.
Figure 2Summary of non-synonymous SNPs detected in the full population and their distribution associated with mesocotyl length in the indica genome. (A) Termination codon SNP (T-C SNP) along the genome. (B) Non-synonymous SNPs (N-S SNP) along the genome. Histograms of minor allele frequencies (MAFs) of non-synonymous SNPs in the (C) full population, (D) indica, and (E) japonica. Distributions of (F) SDSs and (G) ESDSs along the indica genome. SDS and ESDS show SNPs with significant and extremely significant differences (p < 0.05; p < 0.01) in allele frequency between polar pools. For each 500 kb sliding window, the numbers of SDSs and ESDSs were plotted on the entire genome. The sliding step is 50 kb. Adjacent chromosomes are delineated using different colors. Horizontal black lines show the thresholds for the 99th percentile of 10,000 permutations of the SDS and ESDS numbers.
Figure 3Genome-wide association studies of mesocotyl length under CMLM in full population using three sets of SNPs and different PC and kinship. Quantile-quantile plots and Manhattan plots of the CMLM using groups (A) I, (B) II, and (C) III in full population. Quantile-quantile plots and Manhattan plots of CMLM using groups (D) I and (E) II with PC and kinship derived from group III in full population. In quantile-quantile plots, gray dots show GLM, and other colored points show CMLM. The horizontal black dashed lines in Manhattan plots of CMLM show thresholds at p = 0.01 after Bonferroni-adjusted multiple test correction. Yellow stripes show two important signals in three GWAS using three sets of SNPs in the full population.
Summary of SNPs associated with ML by GWAS using CMLM and group I in the full population.
| LOC_Os02g07480 | Chr2_3858289 | 6.12 | 8.13 | 6.07 | 1.39 | 0 | G/T | T/N | 0.011 | Transglycosylase SLT domain containing protein, expressed | |
| LOC_Os03g51340 | Chr3_29376315 | 6.26 | 9.65 | 4.66 | 11.86 | 0.51 | C/T | P/L | 0.074 | Expressed protein | |
| LOC_Os03g53320 | Chr3_30603087 | 7.31 | 12.69 | 7.24 | 20.72 | 0.51 | G/A | A/V | 0.069 | Hypothetical protein | |
| LOC_Os03g53340 | Chr3_30606285 | 6.74 | 11.89 | 6.6 | 19.12 | 0.51 | G/T | D/E | 0.076 | HSF-type DNA-binding domain containing protein, expressed | |
| LOC_Os04g58590 | Chr4_34837207 | 6.53 | 3.52 | 3.54 | 0 | 2.37 | A/T | L/P | 0.01 | RNA recognition motif containing protein, putative, expressed | |
| LOC_Os07g22360 | Chr7_12552125 | 6.53 | 5.78 | 6.54 | 9.26 | 0 | G/A | V/I | 0.047 | Expressed protein | |
| LOC_Os07g23990 | Chr7_13602658 | 6.77 | 9.71 | 7.04 | 26.52 | 0.51 | A/T | M/L | 0.366 | Tetratricopeptide repeat domain containing protein, putative, expressed | |
| LOC_Os07g24010 | Chr7_13611491 | 10.51 | 14.9 | 10.35 | 31 | 0 | A/T | S/T | 0.26 | Hypothetical protein | |
| LOC_Os07g24170 | Chr7_13728692 | 6.73 | 9.66 | 6.95 | 28.82 | 0.51 | T/A | N/K | 0.39 | Expressed protein | |
| Chr7_13728704 | 6.27 | 9.1 | 6.43 | 26.45 | 0.51 | G/T | Q/H | 0.388 | |||
| Chr7_13729329 | 6.18 | 9.26 | 6.32 | 31.25 | 0 | G/A | V/M | 0.393 | |||
| LOC_Os07g24190 | Chr7_13746039 | 6.32 | 9.05 | 6.53 | 24.22 | 0 | C/T | M/I | 0.385 | CESA3 - cellulose synthase, expressed | |
| LOC_Os07g25460 | Chr7_14579544 | 6.11 | 9.06 | 5.96 | 22.65 | 0.51 | G/A | R/C | 0.394 | Ankyrin repeat domain containing protein, expressed | |
| LOC_Os07g27610 | Chr7_16129890 | 7.58 | 13.11 | 7.56 | 33.63 | 0 | G/A | R/Q | 0.438 | Expressed protein | |
| LOC_Os07g27630 | Chr7_16135146 | 6.57 | 12 | 6.5 | 29.53 | 0 | G/A | S/L | 0.185 | Expressed protein | |
| LOC_Os07g27680 | Chr7_16151805 | 6.65 | 12.31 | 6.65 | 33.63 | 0 | T/C | T/A | 0.457 | Expressed protein | |
| LOC_Os07g39660 | Chr7_23772528 | 6.14 | 9.65 | 6.47 | 27.42 | 0 | A/T | H/L | 0.221 | Hypothetical protein | |
| LOC_Os08g17350 | Chr8_10613231 | 6.17 | 6.24 | 6.18 | 0 | 0 | G/A | S/F | 0.004 | Expressed protein | |
| LOC_Os09g11800 | Chr9_6598055 | 6.63 | 8.52 | 6.25 | 2.37 | 0 | T/A | V/E | 0.009 | Expressed protein | |
| LOC_Os10g03730 | Chr10_1681425 | 6.64 | 9.13 | 6.63 | 4.8 | 0 | T/C | E/G | 0.024 | OsFBX347 - F-box domain containing protein, expressed | |
| LOC_Os10g03780 | Chr10_1713500 | 6.69 | 9.5 | 6.59 | 5.64 | 0 | G/A | Q/* | 0.02 | OsFBX351 - F-box domain containing protein, expressed | |
| LOC_Os11g10920 | Chr11_6031396 | 8.07 | 9.64 | 8.18 | 2.37 | 0 | A/C | V/G | 0.011 | Carboxyl-terminal proteinase, putative, expressed | |
| LOC_Os11g10990 | Chr11_6065939 | 6.91 | 8.42 | 6.92 | 2.37 | 0 | C/T | G/R | 0.01 | Heat shock protein DnaJ, putative, expressed |
−log(p) are association signals of CMLM using PC and kinship derived from non-synonymous SNPs (group I).
−log(p) are association signals of GLM using PC derived from group I.
−log(p) are association signals of CMLM using PC and kinship derived from group III.
x.
x.
Figure 4Identification of OsML1 for mesocotyl length in qFML3-2. (A) Genome-wide association signals in the region at 30.3–30.9 Mb on chromosome 3 and LD heatmap (bottom) for 28 non-synonymous SNPs (colored red) with different alleles between parents of known bi-parental maps. Blue arrow indicates possible functional locus in OsML1 (LOC_Os03g53320). Triangular block shows region with strong local LD (r2 > 0.6). (B) Expression analysis of OsML1 in the mesocotyls from 6 accessions with long- and short-mesocotyls. Names and ML of accessions are plotted on the X-axis. Red and blue bars show long- and short-mesocotyl accessions, respectively. Data represent means ±s.d (n = 3). Asterisks represent significant differences between accessions with long- and short-mesocotyls (**p < 0.01, Student's t test). (C) Structure of OsML1 showing non-synonymous SNPs and amino acid polymorphisms. (D) Haplotype analysis of two non-synonymous SNPs in OsML1. Red arrow indicates possible functional locus. (E) Comparison of ML among haplotypes of OsML1 in indica and japonica. Accessions with haplotype colored red had significantly longer mesocotyls than members of haplotypes colored green.
Figure 5Identification of OsML2 for mesocotyl length. (A) Genome-wide association signals (top) in the region 13.4–16.4 Mb on chromosome 7 and LD heatmap (bottom) of 10 non-synonymous SNPs above the threshold. A total of 10 red points show non-synonymous SNPs above threshold in qFML7-2, qFML7-3 and qFML7-4, and blue arrow indicates possible functional locus in OsML2 (LOC_Os07g24010). Triangular block (named Block 1) shows region with strong local LD (r2 > 0.6). (B) Haplotype analysis (left) of 7 non-synonymous SNPs above threshold in Block 1 and comparison of ML (right) among haplotypes in indica and japonica. Different letters above the violins indicate significant differences (p < 0.05) when analyzed by Duncan's test. Red arrow indicates possible functional locus. (C) Independent-sample T-tests for ML differences between alleles of each of the 6 non-synonymous SNPs in Block 1 using accessions carrying allele (A) of the peak signal (Chr7_13611491). Data in parentheses show number of accessions. Manhattan plots for ML in Block 1 using (D) 373 accessions and (E) 222 indica accessions carrying allele (A) of the peak signal. Red points show 6 non-synonymous SNPs in Block 1. (F) Expression analysis of OsML2 in the mesocotyls from 6 accessions with long- and short-mesocotyls. Names and ML of accessions are plotted on the X-axis. Red and blue bars show long- and short-mesocotyl accessions, respectively. Data represent means ± s.d (n = 3). Asterisks represent significant differences between accessions with long- and short-mesocotyls (**p < 0.01, Student's t test). (G) Haplotype analysis (left) of 12 non-synonymous SNPs in OsML2 and comparison of ML (right) among haplotypes of OsML2 in indica and japonica, respectively. Different letters above the violins indicate significant differences (p < 0.05) detected by Duncan's test.
Figure 6Functional validation of haplotype combinations of OsML1 and OsML2. (A) ML of different combinations of haplotypes in OsML1 and OsML2. Different letters indicate significant differences (p < 0.05) detected by one-way ANOVA. (B) Cross-validation by phenotyping of 10 accessions representing different genotypes of OsML1 and OsML2.
Nucleotide diversity and Tajima's D Test.
| 0.00224 | −0.65 | |
| 0.00301 | 0.20 | |
| 0.00365 | 1.97 | |
| 0.00049 | −2.20 | |
| 0.0061 | 1.31 | |
| 0.01671 | 1.27 | |
| 0.01093 | 0.30 | |
| 0.00604 | −1.02 |