| Literature DB >> 31798613 |
Yunqing Cheng1, Siqi Jiang1, Xingzheng Zhang1, Hongli He1, Jianfeng Liu1.
Abstract
Yield loss in the economically important hazelnut (Corylus spp.) occurs through the frequent formation of blank nuts. Although the condition is associated with embryo abortion, we have not yet identified the regulatory genes involved. Therefore, this study aimed to determine the genes related to embryo abortion in hazel. We performed whole-genome re-sequencing and single-nucleotide polymorphism (SNP) analysis on four mutant hazelnut trees (Empty1 to Empty4, C. heterophylla) bearing blank nuts and four wild-type trees (Full1 to Full4, C. heterophylla). A paired comparison of Empty1 vs. Full1, Empty2 vs. Full2, Empty3 vs. Full3, and Empty4 vs. Full4, along with the intersection of Empty1 to Empty4, revealed 3 081 common SNPs in the four blank-nut mutants. Of these, 215 synonymous SNPs in exonic regions were distributed across 178 candidate genes. Heterozygosity analysis showed that average homozygous and heterozygous SNP ratios were respectively 0.409 and 0.591 in the samples. According to Gene Ontology classification, candidate genes were enriched in the categories of binding, catalysis, molecular transducer, transporter, and molecular function regulator. Among these, 18 of 178 genes had homozygous SNPs in Empty1-4. Cis elements in the promoter region of starch synthase 4 (SS4) contain the RY-element, implying seed-specific expression. Starch granules were absent from Empty1-4 cotyledon cells, but abundantly present in Full1-Full4 cotyledon cells. The blank-nut phenotype has heavier nut shells. Overall, we conclude that single-nucleotide variants of Acetyl-CoA carboxylase 1 (ACC1), intracellular sodium/hydrogen exchanger 2 (NHX2), UDP-glycosyltransferase 74E2 (UGT74E2), DEFECTIVE IN MERISTEM SILENCING 3 (DMS3), DETOXIFICATION 43 (FRD3), and SS4 may induce embryo abortion, leading to blank-nut formation. Our results will benefit future research on how the gain or loss of candidate genes influences seed development. Moreover, our study provides novel prospects for seedless cultivar development.Entities:
Keywords: Corylus heterophylla; blank-nut mutants; embryo abortion; single-nucleotide polymorphism; whole-genome re-sequencing
Year: 2019 PMID: 31798613 PMCID: PMC6863972 DOI: 10.3389/fpls.2019.01465
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Nut characters of the eight hazel (Corylus heterophylla) trees sampled in northeastern China. (A) Empty1 (the first line) with blank nut trait and Full1 (the second line) with edible seed from Siping; (B) Empty2 (the first line) with blank nut trait and Full2 (the second line) with edible seed from Tieling; (C) Empty3 (the first line) with blank nut trait and Full3 (the second line)with edible seed from Kaiyuan; (D) Empty4 (the first line) with blank-nut trait and Full4 (the second line) with edible seed from Huludao.
Statistical analysis results of clean sequencing data, for eight hazel (Corylus heterophylla) trees sampled in Northeast China.
| Sample | Total reads | Total bases | Error % | Q20% | Q30% | GC % | Depth | Coverage |
|---|---|---|---|---|---|---|---|---|
| Empty1 | 32685090 | 4807907701 | 0.0183 | 95.01 | 88.72 | 38.94 | 15.82 | 0.71 |
| Empty2 | 32862042 | 4792174301 | 0.0184 | 94.92 | 88.62 | 37.26 | 15.77 | 0.73 |
| Empty3 | 39550994 | 5744594343 | 0.0187 | 94.78 | 88.34 | 37.67 | 18.90 | 0.67 |
| Empty4 | 40402514 | 5925325425 | 0.0156 | 96.20 | 91.20 | 38.87 | 19.50 | 0.71 |
| Full1 | 39614452 | 5725619379 | 0.0186 | 94.79 | 88.37 | 38.39 | 18.84 | 0.70 |
| Full2 | 38213274 | 5537932084 | 0.0183 | 94.96 | 88.71 | 37.52 | 18.22 | 0.71 |
| Full3 | 33925356 | 4978413145 | 0.0192 | 94.62 | 87.91 | 43.02 | 16.38 | 0.71 |
| Full4 | 51797148 | 7606639739 | 0.0152 | 96.30 | 91.58 | 38.88 | 25.03 | 0.70 |
Error %, Base error rate. Q20 and Q30, the percentage of bases with the Phred value > 20 and > 30, respectively. GC %, the percentage of G and C bases. Depth, sequencing depth; Coverage, coverage ratio, obtained base number relative to the total base number of reference hazelnut genome.
Statistical analysis results of SNP annotation, for eight hazel (Corylus heterophylla) trees sampled in Northeast China.
| Location | Empty1 | Empty2 | Empty3 | Empty4 | Full1 | Full2 | Full3 | Full4 |
|---|---|---|---|---|---|---|---|---|
| Downstream | 104704 | 116897 | 161416 | 162810 | 155144 | 154536 | 60405 | 212824 |
| Exonic | 283283 | 230763 | 296316 | 353058 | 315313 | 292243 | 223867 | 409446 |
| Intergenic | 647664 | 713970 | 969730 | 946126 | 876796 | 912011 | 398024 | 1290521 |
| Intronic | 258128 | 286216 | 375095 | 399487 | 378273 | 366754 | 155008 | 488644 |
| Splicing | 1588 | 1252 | 1677 | 2055 | 1859 | 1595 | 1261 | 2592 |
| Upstream | 117029 | 113107 | 157114 | 165578 | 149542 | 151003 | 71747 | 225400 |
| Upstream; downstream | 8034 | 8341 | 11881 | 12793 | 11979 | 11523 | 4797 | 16430 |
| UTR3 | 44169 | 45400 | 60344 | 67548 | 61792 | 59537 | 28643 | 82809 |
| UTR5 | 29731 | 23887 | 31639 | 36992 | 32175 | 30549 | 21462 | 46911 |
| Total | 1494330 | 1539833 | 2065212 | 2146447 | 1982873 | 1979751 | 965214 | 2775577 |
Statistical analysis results of the effect of SNP mutation sites in the exon regions on protein translation, for eight hazel (Corylus heterophylla) trees sampled in Northeast China.
| Empty1 | Empty2 | Empty3 | Empty4 | Full1 | Full2 | Full3 | Full4 | |
|---|---|---|---|---|---|---|---|---|
| Nonsynonymous SNV | 146644 | 120223 | 155071 | 185796 | 164205 | 151959 | 116828 | 216738 |
| Stopgain SNV | 4205 | 3273 | 4413 | 5504 | 4759 | 4166 | 3263 | 6636 |
| Stoploss SNV | 674 | 568 | 726 | 874 | 737 | 715 | 560 | 1117 |
| Synonymous SNV | 116226 | 94620 | 120275 | 141946 | 128290 | 119902 | 91310 | 161742 |
| Unknown | 15534 | 12079 | 15831 | 18938 | 17322 | 15501 | 11906 | 23213 |
SNV, single-nucleotide variant.
Figure 2Common and unique single-nucleotide polymorphisms (SNPs) in blank-nut mutants of hazel (Corylus heterophylla) sampled in Northeast China. (A) Paired comparison of Full1 vs. Empty1; 82 075 unique SNPs were found in the blank-nut mutant Empty1; (B) paired comparison of Full2 vs. Empty2; 797 299 unique SNPs were found in the blank-nut mutant Empty2; (C) paired comparison of Full3 vs. Empty3; 478 031 unique SNPs were found in the blank-nut mutant Empty3; (D) paired comparison of Full3 vs. Empty3; 880 405 unique SNPs were found in the blank-nut mutant Empty4; (E) unique SNPs in the blank-nut mutants revealed using Venn Diagrams.
Figure 3Heterozygosity statistical results of single-nucleotide polymorphisms (SNPs) in hazel (Corylus heterophylla) trees sampled in Northeast China. 1/1: homozygous genotype; 0/1 and 1/2: heterozygous genotypes. The pie charts each show three parts representing three genotypes, and three parameters that represent the genotype of SNP, the number of SNPs, and the corresponding percentage of SNPs, respectively.
Figure 4Gene Ontology (GO) classification analysis of the 178 candidate genes identified in hazel (Corylus heterophylla) sampled in Northeast China.
Cis element in the promoter region of SS4.
| Cis element | Organism | Location | Matrix score | Sequence | Biological function |
|---|---|---|---|---|---|
| ABRE |
| 1843/+ | 5 | ACGTG | Cis-acting element involved in the abscisic acid responsiveness |
| Box 4 |
| 429/+ | 6 | ATTAAT | Part of a conserved DNA module involved in light responsiveness |
|
| 1450/- | 6 | ATTAAT | ||
|
| 767/+ | 6 | ATTAAT | ||
| G-Box |
| 1842/- | 6 | CACGTT | Cis-acting regulatory element involved in light responsiveness |
| GARE-motif |
| 579/- | 7 | TCTGTTG | Gibberellin-responsive element |
| GATA-motif |
| 401/+ | 7 | GATAGGA | Part of a light responsive element |
|
| 466/- | 10 | AAGATAAGATT | ||
| P-box |
| 1052/+ | 7 | CCTTTTG | Gibberellin-responsive element |
| RY-element |
| 1040/- | 8 | CATGCATG | Cis-acting regulatory element involved in seed-specific regulation |
| TATC-box |
| 230/+ | 7 | TATCCCA | Cis-acting element involved in gibberellin-responsiveness |
|
| 1639/- | 7 | TATCCCA | ||
|
| 1361/+ | 7 | TATCCCA | ||
| TCT-motif |
| 742/- | 6 | TCTTAC | Part of a light responsive element |
Figure 5Comparison of the amino acid sequence of starch synthase 4 (SS4) in hazel (Corylus heterophylla) sampled in northeastern China with SS4 from other plants. AtSS4, Arabidopsis thaliana SS4; QsSS4, Quercus suber SS4; JrSS4, Juglans regia SS4; g16468.t1, Corylus avellana SS4; Empty, blank-nut mutant; Full, edible seed.
Figure 6Starch granules observed in cotyledons of hazel (Corylus heterophylla) sampled in Northeast China. (A) Full1; (B) Full2; (C) Full3; (D) Full4; (E) Empty1; (F) Empty2; (G) Empty3; (H) Empty4. Empty, blank-nut mutant; Full, edible seed.
Characteristics of nuts used in the present study, from eight hazel (Corylus heterophyll a) trees sampled in northeastern China.
| Sample | Shell | Kernel | Total nut | Shell | Kernel | Diameter (cm) | Blank-nut |
|---|---|---|---|---|---|---|---|
| Empty1 | 1.215 b | 0.003 e | 1.218 cd | 99.754 a | 0.246 c | 2.40 ab | 100 a |
| Full1 | 1.091 c | 0.642 a | 1.733 a | 62.954 c | 37.046 a | 2.44 ab | 2.47 c |
| Empty2 | 1.148 bc | 0.003 e | 1.151 d | 99.739 a | 0.261 c | 2.29 b | 100 a |
| Full2 | 0.964 d | 0.573 b | 1.537 b | 62.720 c | 37.280 a | 2.34 ab | 3.33 b |
| Empty3 | 1.32 a | 0.003 e | 1.323 c | 99.773 a | 0.227 c | 2.53 a | 100 a |
| Full3 | 1.181bc | 0.526 c | 1.707 a | 69.186 b | 30.814 b | 2.52 a | 0 d |
| Empty4 | 0.942 d | 0.003 e | 0.945 e | 99.683 a | 0.317 c | 1.89 c | 100 a |
| Full4 | 0.799 e | 0.358 d | 1.157 d | 69.058 b | 30.942 b | 1.94 c | 0 d |
Values within the same column followed by different lowercase letters are significantly different at P ≤ 0.05 as indicated by the LSD test. A nut with less than 5% kernel ratio is defined as a blank nut.
Interest common, unique, and nonsynonymous SNPs in coding regions of Empty1, Empty2, Empty3, and Empty4.
| No. | CHROM | POS | REF | ALT | HE-HO | Gene Name | Gene Annot |
|---|---|---|---|---|---|---|---|
| 1 | 00025 | 108257 | G | T | 1/1 | g526 | Protein DEFECTIVE IN MERISTEM SILENCING 3, DMS3 |
| 2 | 00355 | 58664 | G | C | 1/1 | g3976 | Protein DETOXIFICATION 43, DTX43 or FRD3 |
| 3 | 00852 | 15014 | C | T | 1/1 | g7423 | Probable xyloglucan glycosyltransferase 5, CSLC5 |
| 4 | 01112 | 39903 | G | A | 1/1 | g8887 | O-fucosyltransferase 34, OFUT34 |
| 5 | 01581 | 4652 | C | T | 1/1 | g11171 | Probable terpene synthase 9, TPS9 |
| 6 | 01736 | 26257 | G | A | 1/1 | g11831 | Acetyl-CoA carboxylase 1, ACC1 |
| 7 | 01830 | 22598 | C | A | 1/1 | g12219 | Sodium/hydrogen exchanger 2, NHX2 |
| 8 | 02132 | 19138 | A | G | 1/1 | g13442 | Non-functional pseudokinase ZED1, ZED1 |
| 9 | 05024 | 10274 | G | C | 1/1 | g21444 | Ankyrin repeat-containing protein BDA1, BAD1 |
| 10 | 07995 | 10257 | C | A | 1/1 | g26083 | U-box domain-containing protein 34, PUB34 |
| 11 | 08504 | 7843 | T | A | 1/1 | g26696 | Disease resistance protein At4g27190, At4g27190 |
| 12 | 08703 | 4254 | A | T | 1/1 | g26905 | UDP-glycosyltransferase 74E2, UGT74E2 |
| 13 | 03041 | 13510 | A | G | 0/1, 1/1 | g16468 | Starch synthase 4, SS4 |
All the SNP mutation types in coding regions are exonic and nonsynonymous; Chrom, reference sequence name; Abbreviation of reference sequence name was used. Eg. C.avellana_Jefferson_00025 was abbreviated as 00025. POS, SNPs (single-nucleotide polymorphisms) position on reference sequences. Ref, Reference base of SNP; ALT, Variant bases of SNP; HE-HO, Homozygous and heterozygous genotype of SNP. The homozygous genotype was 1/1, and the heterozygous genotype was 0/1, 1/2; Gene Annot, Annotation of gene with singl- nucleotide variant.