| Literature DB >> 33203349 |
Xinyu Hu1, Li Tang1, Linconghua Wang1, Fang-Xiang Wu2, Min Li3.
Abstract
BACKGROUND: DNA methylation in the human genome is acknowledged to be widely associated with biological processes and complex diseases. The Illumina Infinium methylation arrays have been approved as one of the most efficient and universal technologies to investigate the whole genome changes of methylation patterns. As methylation arrays may still be the dominant method for detecting methylation in the anticipated future, it is crucial to develop a reliable workflow to analysis methylation array data.Entities:
Keywords: DNA methylation; Differential methylation analysis; Downstream analysis; Normalization
Mesh:
Year: 2020 PMID: 33203349 PMCID: PMC7672854 DOI: 10.1186/s12859-020-03734-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1MADA Pipeline. It includes four stages: Pre-processing (Quality controls, Filtering, Normalization, batch effect correction), DMPs, DMRs and downstream analysis. The visualization of Pre-processing, DMP, DMR, and downstream analysis are also provided
Fig. 2MADA web application interface and case results. a WorkFlow form, where the user uploads their custom datasets and submits the request of processing. b Densityplot shows DNA methylation levels (as β values) for pre-receptive (LH + 2) and receptive (LH+ 8) endometrium samples from 17 women. c Mdsplot drawn with the largest difference between the first 1000 samples, can be used to reflect the similarity of samples. d Boxplot shows the data of each chip is still tidy so that we can detect DMPs or DMRs in the next step. e Volcano plot shows the degree of difference in CpG sites within different methylation periods. f Scatter plot can be used to reflect the distribution of CpG sites on chromosomes. g Pie plot shows the gene region feature category (UCSC) of significant CpG sites, and the percentage of significant CpG sites in different gene annotation region directly. h More detailed numerical information can be seen in table
List of the top 20 DMPs
| Probe | chr | pos | Relation_to_Island | UCSC_RefGene_Name | Islands_Name |
|---|---|---|---|---|---|
| cg15294279 | 3 | 174,842,010 | OpenSea | NAALADL2 | |
| cg00597723 | 5 | 158,691,793 | S_Shore | UBLCP1 | chr5:158690013–158,690,541 |
| cg16995742 | 2 | 237,992,612 | N_Shore | COPS8; | chr2:237994004–237,994,876 |
| cg13956086 | 7 | 2,434,521 | OpenSea | ||
| cg23432430 | 12 | 125,538,377 | S_Shelf | chr12:125534060–125,534,527 | |
| cg26845082 | 3 | 13,555,664 | OpenSea | ||
| cg17779733 | 22 | 49,589,242 | OpenSea | ||
| cg23928292 | 12 | 21,815,474 | OpenSea | ||
| cg06052372 | 16 | 83,967,808 | OpenSea | ||
| cg10360725 | 8 | 1.44E+ 08 | OpenSea | ||
| cg13885829 | 1 | 17,482,041 | OpenSea | ||
| cg14782559 | 6 | 33,131,893 | S_Shelf | COL11A2; | chr6:33129291–33,129,718 |
| cg26822175 | 22 | 27,018,010 | OpenSea | CRYBA4 | |
| cg20757478 | 6 | 31,012,262 | OpenSea | ||
| cg18673341 | 7 | 22,481,962 | OpenSea | MGC87042; | |
| cg17628377 | 5 | 180,121,337 | OpenSea | ||
| cg08161337 | 22 | 45,814,116 | OpenSea | RIBC2 | |
| cg15865722 | 11 | 68,860,657 | OpenSea | ||
| cg07584620 | 1 | 2,265,881 | N_Shore | MORN1 | chr1:2266007–2,266,432 |
| cg22664298 | 5 | 128,795,827 | Island | ADAMTS19 | chr5:128795503–128,797,417 |
Current typical platforms for methylation array data analysis
| platform | system | installation | Interface | Functions & tools | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| workflow | Preprocessing | DMP | DMR | Data visualization | Gene ontology analysis | Pathway analysis | Cluster analysis | ||||
| Unix/Linux, Mac OS, Windows | R package manager | Command line | Funnorm, SWAN, Illumina, SQN | Bumphunter | Preprocessing | ||||||
| Unix/Linux, Mac OS, Windows | R package manager | Command line interface, GUI | BMIQ.PBC SWAN, Funnorm,Combat | Limma | DMRcate, Bumphunter, ProbeLasso | Preprocessing, DMP/DMR/GO | |||||
| Unix/Linux, Mac OS, Windows | R package manager | Command line | Dasen, BMIQ, SWAN | ||||||||
| Unix/Linux, Mac OS, Windows | R package manager | command line | SWAN, Noob, Dasen | Limma | Preprocessing | ||||||
| Unix/Linux, Mac OS, Windows | not needed | Web server/Command line | Funnorm, SWAN, Illumina, SQN | A statistical testing | Preprocessing, DMR | ||||||
| Unix/Linux, Mac OS, Windows | R package manager | command line | Noob | Preprocessing | |||||||
| Unix/Linux, Mac OS, Windows | R package manager | Command line interface | SWAN | Limma | Preprocessing DMP | ||||||
| Unix/Linux, Mac OS, Windows | not needed/compilation from script | Web server/local web server | BMIQ, PBC, Noob, Funnorm, SWAN, Illumina, SQN, RAW, Combat | Limma | DMRcate, Bumphunter, ProbeLasso, Seqlm | Preprocessing DMP/DMRGO/KEGG | |||||