| Literature DB >> 28683715 |
Martin Schmidt1,2, Michiel Van Bel1,2, Magdalena Woloszynska1,2, Bram Slabbinck1,2, Cindy Martens3, Marc De Block3, Frederik Coppens1,2, Mieke Van Lijsebettens4,5.
Abstract
BACKGROUND: Cytosine methylation in plant genomes is important for the regulation of gene transcription and transposon activity. Genome-wide methylomes are studied upon mutation of the DNA methyltransferases, adaptation to environmental stresses or during development. However, from basic biology to breeding programs, there is a need to monitor multiple samples to determine transgenerational methylation inheritance or differential cytosine methylation. Methylome data obtained by sodium hydrogen sulfite (bisulfite)-conversion and next-generation sequencing (NGS) provide genome-wide information on cytosine methylation. However, a profiling method that detects cytosine methylation state dispersed over the genome would allow high-throughput analysis of multiple plant samples with distinct epigenetic signatures. We use specific restriction endonucleases to enrich for cytosine coverage in a bisulfite and NGS-based profiling method, which was compared to whole-genome bisulfite sequencing of the same plant material.Entities:
Keywords: Cytosine methylation; DNA methylation; Epiline; Oryza sativa; Plant; RRBS; Reduced representation bisulfite sequencing; Rice
Mesh:
Substances:
Year: 2017 PMID: 28683715 PMCID: PMC5501559 DOI: 10.1186/s12870-017-1070-y
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Workflow of plant-RRBS for plant science. (a) Main laboratory steps of library construction for Illumina sequencing, (b) main bioinformatics steps of plant-RRBS data analysis and (c) main steps for determination of coverage and cytosine methylation aspects
Genome and cytosine coverage in biological replicates of the control line and the LR2 epiline (fourth selfing) using plant-RRBS, and comparison to WGBS
| Line with biological replicatesa | Restriction endonuclease combination | Genome coverage (%)b | Cytosine coverage (%)c | Efficiencyd | Number of analyzed cytosine sites | ||
|---|---|---|---|---|---|---|---|
| CG | CHG | CHH | |||||
| (millions)e | |||||||
| Plant-RRBS | |||||||
| Control-1 |
| 39.6 | 42.8 | 1.1 | 15.3 | 13.1 | 48.1 |
| Control-2 |
| 42.3 | 44.9 | 1.1 | 15.6 | 13.6 | 51.1 |
| Control-3 |
| 30.8 | 32.8 | 1.1 | 11.6 | 10.0 | 37.0 |
| Control-4 |
| 35.0 | 37.1 | 1.1 | 13.0 | 11.3 | 42.0 |
| Control-5 |
| 21.3 | 22.9 | 1.1 | 8.3 | 7.1 | 25.5 |
| LR2–1 |
| 46.0 | 48.7 | 1.1 | 16.7 | 14.7 | 55.6 |
| LR2–2 |
| 45.7 | 48.5 | 1.1 | 16.7 | 14.7 | 55.3 |
| LR2–3 |
| 27.6 | 28.8 | 1.0 | 9.4 | 8.6 | 33.5 |
| LR2–4 |
| 45.1 | 48.4 | 1.1 | 17.0 | 14.7 | 54.7 |
| LR2–5 |
| 41.2 | 43.9 | 1.1 | 15.1 | 13.2 | 50.0 |
| Control-6 |
| 29.9 | 34.1 | 1.1 | 13.1 | 11.1 | 36.6 |
| Control-7 |
| 26.2 | 30.0 | 1.1 | 11.7 | 10.0 | 31.9 |
| Control-8 |
| 25.3 | 28.7 | 1.1 | 11.0 | 9.5 | 30.7 |
| Control-9 |
| 21.2 | 24.5 | 1.2 | 9.6 | 8.3 | 25.9 |
| Control-10 |
| 27.8 | 32.4 | 1.2 | 13.0 | 10.8 | 34.1 |
| LR2–6 |
| 23.0 | 26.0 | 1.1 | 9.8 | 8.8 | 27.9 |
| LR2–7 |
| 21.8 | 24.9 | 1.1 | 9.5 | 8.5 | 26.5 |
| LR2–8 |
| 22.3 | 25.4 | 1.1 | 9.6 | 8.7 | 27.1 |
| LR2–9 |
| 23.2 | 26.6 | 1.1 | 10.2 | 9.0 | 28.3 |
| LR2–10 |
| 24.0 | 27.1 | 1.1 | 10.0 | 9.0 | 29.3 |
| WGBS | |||||||
| Control −11 | - | 84.3 | 52.4 | 0.6 | 14.4 | 15.4 | 63.9 |
| LR2–11 | - | 83.7 | 52.6 | 0.6 | 14.5 | 15.6 | 63.9 |
Leaf material from five individual plants per line and per restriction endonuclease combination was used
The bisulfite conversion efficiency rate per biological replicate was higher than approximately 99%
a Name scheme: line–individual plant number (1–11) from selfing generation 4
b Genome coverage: coverage as number of genome nucleotide positions covered by at least one read *100% / 427,026,737 nucleotides in the reference genome [38]
c Cytosine coverage: proportion of analyzed (sufficiently covered) cytosine positions in the genome = sum of analyzed cytosines in CG, CHG and CHH context covered by at least ten informative nucleotides (means C or T) * 100% /178,637,468 cytosines in the reference genome for both strands [38]
d Ratio of cytosine coverage per genome coverage
e Millions of positions of a certain cytosine context (CG, CHG and CHH) in the reference genome for both strands that are sufficiently covered by at least ten informative nucleotides (means C or T) and therefore methylation level of cytosine sites was analyzed [38]
Detected cytosine positions relative to the genome-wide cytosine positions per five biological replicates of the control line and the LR2 epiline (fourth selfing) discriminated by restriction endonuclease combination and cytosine context (CG, CHG and CHH)
| Group of biological replicates | Restriction endonuclease combination | Detected cytosine positions in genome | |||
|---|---|---|---|---|---|
| CG | CHG | CHH | C (%) | ||
| Union (collection of all covered positions in at least one replicate) | |||||
| Control |
| 18,507,834 | 15,887,099 | 58,570,249 | 52.0 |
| LR2 |
| 19,434,811 | 16,678,896 | 61,436,046 | 54.6 |
| Control |
| 15,715,023 | 12,901,771 | 42,018,210 | 39.5 |
| LR2 |
| 14,573,921 | 12,185,636 | 39,744,568 | 37.2 |
| Intersection (common positions in all replicates) | |||||
| Control |
| 6,459,314 | 5,448,976 | 19,305,085 | 17.5 |
| LR2 |
| 8,023,858 | 7,435,066 | 29,372,103 | 25.1 |
| Control |
| 7,053,953 | 6,410,114 | 19,063,516 | 18.2 |
| LR2 |
| 5,454,292 | 5,509,874 | 15,910,607 | 15.0 |
| Jaccard indexa (proportion of common on all detected positions) | |||||
| (%) | (%) | (%) | (%) | ||
| Control |
| 34.9 | 34.3 | 33.0 | 33.6 |
| LR2 |
| 41.3 | 44.6 | 47.8 | 46.0 |
| Control |
| 44.9 | 49.7 | 45.4 | 46.1 |
| LR2 |
| 37.4 | 45.2 | 40.0 | 40.4 |
a Jaccard index or Jaccard similarity coefficient = intersection / union
Fig. 2Detected cytosine methylation levels of CG sites in a given annotation within the control line and epiline LR2 (fourth selfing). The proportion of cytosine positions with a particular methylation level bin is displayed as a stacked bar diagram. Ten methylation level bins are considered with an interval of 10%. Proportions of methylation level bins are visualized for different annotation features (O. sativa ssp. indica ASM465v1.27): protein-coding gene, non-coding gene, transcript, exon, intron, coding DNA sequence (CDS) and promoter (2000-nucleotide region upstream of the TSS)
Differential cytosine methylation and annotation of the control line versus the LR2 epiline (fourth selfing)
| Restriction endonuclease combination | Cytosine site | Number of differentially methylated sitesa | Number of diff. meth. sites within features | ||||
|---|---|---|---|---|---|---|---|
| Annotated region | Not-annotated region | Gene-associated region | |||||
| Hypo- | Hyper- | Gene | Promoterb | ||||
|
| CG | 541 | 463 | 182 | 822 | 160 | 362 |
| CHG | 169 | 94 | 28 | 235 | 28 | 81 | |
| CHH | 20 | 8 | 11 | 17 | 4 | 4 | |
| sum | 730 | 565 | 221 | 1074 | 192 | 447 | |
|
| CG | 29 | 74 | 9 | 94 | 8 | 27 |
| CHG | 10 | 25 | 0 | 35 | 0 | 11 | |
| CHH | 2 | 2 | 1 | 3 | 1 | 0 | |
| sum | 41 | 101 | 10 | 132 | 9 | 38 | |
Cytosine positions, represented by the numbers above, can be counted to the different annotations multiple times
a hypo−/hyper-methylated means a 25% lower/higher methylation level in LR2 compared with the control line
b Promoter is defined as 2000 nucleotides upstream of the TSS and belongs to the not-annotated region in the used genomic features