| Literature DB >> 29250096 |
Karolina Chwialkowska1, Urszula Korotko2, Joanna Kosinska3, Iwona Szarejko2, Miroslaw Kwasniewski1.
Abstract
Epigenetic mechanisms, including histone modifications and DNA methylation, mutually regulate chromatin structure, maintain genome integrity, and affect gene expression and transposon mobility. Variations in DNA methylation within plant populations, as well as methylation in response to internal and external factors, are of increasing interest, especially in the crop research field. Methylation Sensitive Amplification Polymorphism (MSAP) is one of the most commonly used methods for assessing DNA methylation changes in plants. This method involves gel-based visualization of PCR fragments from selectively amplified DNA that are cleaved using methylation-sensitive restriction enzymes. In this study, we developed and validated a new method based on the conventional MSAP approach called Methylation Sensitive Amplification Polymorphism Sequencing (MSAP-Seq). We improved the MSAP-based approach by replacing the conventional separation of amplicons on polyacrylamide gels with direct, high-throughput sequencing using Next Generation Sequencing (NGS) and automated data analysis. MSAP-Seq allows for global sequence-based identification of changes in DNA methylation. This technique was validated in Hordeum vulgare. However, MSAP-Seq can be straightforwardly implemented in different plant species, including crops with large, complex and highly repetitive genomes. The incorporation of high-throughput sequencing into MSAP-Seq enables parallel and direct analysis of DNA methylation in hundreds of thousands of sites across the genome. MSAP-Seq provides direct genomic localization of changes and enables quantitative evaluation. We have shown that the MSAP-Seq method specifically targets gene-containing regions and that a single analysis can cover three-quarters of all genes in large genomes. Moreover, MSAP-Seq's simplicity, cost effectiveness, and high-multiplexing capability make this method highly affordable. Therefore, MSAP-Seq can be used for DNA methylation analysis in crop plants with large and complex genomes.Entities:
Keywords: DNA methylation; MSAP; large genomes; methylome analysis; new technique; next generation sequencing
Year: 2017 PMID: 29250096 PMCID: PMC5714927 DOI: 10.3389/fpls.2017.02056
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
The sequences of the primers and adapters that were used for the MSAP-Seq analyses.
| Adapter | EcoRI_ A1 | CTCGTAGACTGCGTACC |
| EcoRI_ A2 | AATTGGTACGCAGTCTAC | |
| Adapter | HpaII_A1 | GACGATGAGTCTAGAA |
| HpaII_A2 | CGTTCTAGACTCATC | |
| PCR primer | E-AC | GACTGCGTACCAATTCAC |
| PCR primer | H-TG | GATGAGTCTAGAACGGTG |
Figure 1Detailed MSAP-Seq assay overview (A) Genomic DNA is cleaved using rare cutter (EcoRI) and methylation sensitive restriction enzyme (HpaII); only unmethylated recognition sites are digested by HpaII. Then adapters specific to sticky ends are ligated and obtained fragments are amplified in PCR using selective primers; only fragments generated from unmethylated regions are amplified as they contain ends complementary to adapters. Products are then purified and fragmented by sonication to create shorter tags. (B) Purified fragments are used for standard library preparation involving following steps: end repair, adenylation, barcoded adapters ligation and purification, PCR amplification and purification. Then libraries quality and quantity is estimated. (C) Prepared libraries are pooled and processed thru cluster generation and high-throughput sequencing. (D) Sequencing data are analyzed using dedicated automatic pipeline—MSEQER. Firstly, reads are filtered for presence of HpaII adapter and adapters are clipped. Then only reads containing CGG tags on the ends are mapped to the reference genome and functionally annotated. Obtained counts at each of the CCGG sites are normalized and differential methylation analysis among sets of samples is performed.
The statistics of MSAP-Seq reads during different data processing steps during evaluation of changes in barley methylome in leaves under water-deficiency stress.
| Initial number of reads [mln] | 14.16 | 19.88 | 9.57 | 16.45 | 14.17 | 15.18 | 16.62 | 10.10 | 13.66 | 14.42 |
| Number of reads after filtering | 6.96 | 10.3 | 5.10 | 6.60 | 6.47 | 6.50 | 7.50 | 5.22 | 6.58 | 6.81 |
| Percentage of mapped reads [%] | 86.9 | 86.7 | 86.5 | 84.3 | 84.7 | 84.3 | 85.4 | 84.3 | 84.9 | 85.3 |
| Number of different sites | 65,980 | 79,929 | 52,164 | 91,374 | 86,706 | 89,271 | 85,101 | 74,211 | 84,738 | 78,830 |
| Number of different genes (gene bodies) | 7,437 | 8,351 | 6,333 | 9,403 | 9,311 | 9,470 | 9,028 | 8,202 | 8,900 | 8,493 |
| Number of different genes (promoters) | 1,830 | 2,140 | 1,910 | 2,418 | 2,430 | 2,417 | 2,372 | 2,083 | 2,344 | 2,216 |
Filtering based on HpaII-related adapter presence followed by CGG sequence and 50 bp of minimal read length.
Figure 2Hierarchical clustering of MSAP-Seq data of three samples (control, drought, and re-watering) performed in three independent biological replicates.
Figure 3Percentages of different genic features identified within MSAP-Seq tags.
The statistics of MSAP-Seq reads during different data processing steps during comparative evaluation of DNA methylation changes in barley leaf and root in response to water deficiency stress.
| Initial number of reads [mln] | 77.52 | 72.00 | 56.46 | 68.66 | 31.18 | 37.13 | 16.05 | 28.12 |
| Number of reads after filtering | 18.29 | 19.22 | 11.44 | 16.31 | 7.21 | 8.14 | 3.32 | 6.22 |
| Percentage of mapped reads [%] | 88.3 | 85.9 | 89.4 | 87.9 | 64.1 | 45.3 | 37.9 | 49.1 |
| Number of different sites | 117,259 | 122,631 | 95,172 | 111,687 | 107,044 | 111,879 | 57,749 | 92,224 |
| Number of different genes (gene bodies) | 10,961 | 11,248 | 10,336 | 10,848 | 11,819 | 11,905 | 7,728 | 10,484 |
| Number of different genes (promoters) | 3,067 | 3,247 | 2,805 | 3,039 | 3,107 | 3,050 | 1,785 | 2,647 |
Filtering based on HpaII-related adapter presence followed by CGG sequence and 50 bp of minimal read length.