| Literature DB >> 34908915 |
Xiangjie Guo1,2, Yaqin Bai1, Hualin Guo1, Peng Wu1, Hao Li1, Liqin Zhai3,4, Yan Feng4, Jianguo Li2, Cairong Gao1, Keming Yun1.
Abstract
Anaphylaxis has rapidly spread around the world in the last several decades. Environmental factors seem to play a major role, and epigenetic marks, especially DNA methylation, get more attention. We discussed several GEO opening data classifications with TOP 100 specific methylation region values (normalized M-values on line) by machine learning, which are remarkable to classify specific anaphylaxis after monoallergen exposure. Then, we sequenced the whole-genome DNA methylation of six people (3 wormwood monoallergen atopic rhinitis patients and 3 normal-immune people) during the pollen season and analyzed the difference of the single nucleotide and DNA region. The results' divergences were obvious (the differential single nucleotides were mostly distributed in nongene regions but the differential DNA regions of GWAS, on the other hand), which may have caused most single nucleotides to be concealed in the regions' sequences. Therefore, we suggest that we should conduct more "pragmatic" and directly find special single-nucleotide changes after exposure to atopic allergens instead of complex correlativity. It is possible to try to use DNA methylation marks to accurately diagnose anaphylaxis and form a machine learning classification based on the single methylated CpGs.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34908915 PMCID: PMC8635942 DOI: 10.1155/2021/8202068
Source DB: PubMed Journal: Contrast Media Mol Imaging ISSN: 1555-4309 Impact factor: 3.161
Figure 1The ubiety between DNA methylation and gene. These include “TSS,” “Intragenic,” “Intergenic,” and CpG alone.
Figure 2(a) The result of GSE73745 by t-SNE. “a” represents respiratory allergy patients, and “c” represents healthy controls. (b) The result of GSE104471 by t-SNE. “a” represents allergic asthma patients, and “c” represents healthy controls. (c) The result of GSE59999 by t-SNE. “Egg” represents egg-allergic patients, “pea” represents peanut-allergic patients, and “c” represents healthy controls. (d) The result of GSE59999 by randomForest.
Figure 3(a) The result of GSE37853 by t-SNE and randomForest. “a-a” represents atopic asthmatic patients, “n-a” represents nonatopic asthmatic patients, and “hea” represents healthy controls. (b) The result of GSE50222 by t-SNE and randomForest. “a-d” represents allergic patients during the pollen season, “a-o” represents allergic patients outside of the pollen season, “h-d” represents healthy people during the pollen season, and “h-o” represents healthy people outside of the pollen season.
Figure 4The result of GSE40736 by t-SNE and randomForest. “l-f” represents lung_function, “pc20” represents PC20, “rever” represents reversible, and “c” represents healthy control.
The sequencing quality evaluation.
| Sample | Reads num. | Total bases (bp) | GC (%) | Error <1 (%) | Error <0.1 (%) |
|---|---|---|---|---|---|
| a9 | 305,727,176 | 45,859,076,400 | 21.34 | 95.31 | 90.30 |
| a10 | 333,038,672 | 49,955,800,800 | 21.37 | 95.28 | 90.36 |
| a12 | 300,768,056 | 45,115,208,400 | 21.24 | 95.49 | 90.64 |
| c1 | 297,128,974 | 44,569,346,100 | 21.19 | 95.35 | 90.34 |
| c2 | 301,946,840 | 45,292,026,000 | 21.23 | 95.23 | 90.25 |
| c3 | 337,787,768 | 50,668,165,200 | 21.28 | 95.48 | 90.63 |
Error: single-nucleotide distinguished error. The quality scores all bases of per sample in the attachment document.