| Literature DB >> 29619235 |
Shohei Komaki1, Yuh Shiwa1,2,3, Ryohei Furukawa1, Tsuyoshi Hachiya1, Hideki Ohmomo1,2, Ryo Otomo1,2, Mamoru Satoh1,2, Jiro Hitomi4,5, Kenji Sobue6, Makoto Sasaki5,7, Atsushi Shimizu1.
Abstract
We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4+ T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.Entities:
Year: 2018 PMID: 29619235 PMCID: PMC5874393 DOI: 10.1038/hgv.2018.8
Source DB: PubMed Journal: Hum Genome Var ISSN: 2054-345X
Demographic and profile statistics of iMETHYL
| Monocytes | CD4+ T cells | Neutrophils | |
|---|---|---|---|
| 102 | 102 | 94 | |
| Males, | 48 (47.1) | 49 (48.0) | 48 (51.1) |
| Median age (range), years | 62.5 (35–75) | 62.0 (35–75) | 58.0 (24–81) |
| Sequencing depth | 31.1±1.8 | 31.0±1.6 | 54.7±1.6 |
| No. of autosomal CpGs | 23 ,941,821 | 24,037,518 | 25,483,031 |
| No. of sequencing reads | 33,917,157±3,153,528 | 35,175,996±1,275,575 | 47,040,140±6,289,540 |
| No. of genes | 16,282 | 18,299 | 14,534 |
| Sequencing depth | 27.2±1.0 | 27.2±1.0 | 53.3±13.2 |
| No. of SNVs | 8,945,669 | 8,951,822 | 8,792,880 |
Abbreviations: DNAm, DNA methylation; SNV, single-nucleotide variant.
Both cell types were obtained from the same 95 individuals out of a cohort of 102.
Average±standard deviation.
CpGs that were retained in ≥50% of subjects for each cell type.
Genes that were expressed with a fragments per kilobase of exon per million mapped fragments ≥0.1 in ≥50% of subjects for each cell type.
SNVs with a minor allele count >1.
List of available tracks in iMETHYL
| Track name | Description | Source |
|---|---|---|
| IMM_CpG_CD4T | Information for each CpG site of CD4T | |
| IMM_CpG_CD4T_avg | Average DNAm level of each CpG site of CD4T | |
| IMM_CpG_CD4T_sd | DNAm variations of each CpG site of CD4T measured by SD | |
| IMM_CpG_CD4T_RI | DNAm variations of each CpG site of CD4T measured by RI | |
| IMM_CpG_Mono | Information for each CpG site of monocytes | |
| IMM_CpG_Mono_avg | Average DNAm level of each CpG site of monocytes | |
| IMM_CpG_Mono_sd | DNAm variations of each CpG site of monocytes measured by SD | |
| IMM_CpG_Mono_RI | DNAm variations of each CpG site of monocytes measured by RI | |
| IMM_CpG_Neu | Information for each CpG site of neutrophils | This study |
| IMM_CpG_Neu_avg | Average DNAm level of each CpG site of neutrophils | This study |
| IMM_CpG_Neu_sd | DNAm variations of each CpG site of neutrophils measured by SD | This study |
| IMM_CpG_Neu_RI | DNAm variations of each CpG site of neutrophils measured by RI | This study |
| IMM_FPKM_CD4T | FPKM values of each transcript of CD4T | |
| IMM_FPKM_Mono | FPKM values of each transcript of monocytes | |
| IMM_FPKM_Neu | FPKM values of each transcript of neutrophils | This study |
| IMM_SNV_CD4T | Information for each SNV of CD4T | |
| IMM_SNV_Mono | Information for each SNV of monocytes | |
| IMM_SNV_Neu | Information for each SNV of neutrophils | This study |
| Reference sequence | Human genome hg19/GRCh37 sequence | UCSC genome browser |
| RepeatMasker | Repetitive elements | UCSC genome browser |
| CpGIslandsExt | CpG island locations | UCSC genome browser |
| HM450 | Probe information for Illumina Infinium HumanMethylation450 | UCSC genome browser |
| gencode_v19 | Information of genes obtained from GENCODE version 19 | GENCODE |
| gencode_v19_trs | Information of transcripts obtained from GENCODE version 19 | GENCODE |
Abbreviations: CD4T, CD4+ T-lymphocyte; DNAm, DNA methylation; FPKM, fragments per kilobase of exon per million fragments mapped; RI, reference interval; SD, standard deviation; SNV, single-nucleotide variant.
Figure 1Graphical view of iMETHYL. (a) Three-layer omics data are provided as browser tracks. The browser displays several tracks, which are shown for the region surrounding the DNAm biomarker for tobacco smoking, cg05575921. Users can select tracks that provide information from external sources on gene structure, expression, and SNVs and cell-type-specific original tracks (e.g., CD4T, monocytes, and neutrophils) that show average DNAm levels and different measures of variation (SD and RI). (b–d) Detailed information on CpG tracks for CD4T, monocytes, and neutrophils. The frequencies of the three DNAm categories among individuals are shown as Mlf_high (≥ 67%), Mlf_mid (34–66%), and Mlf_low (≤ 33%). CD4T, CD4+ T-lymphocytes; DNAm, DNA methylation; Mlf_high, frequency of hypermethylated DNA; Mlf_high, frequency of hypermethylated DNA; Mlf_low, frequency of hypomethylated DNA; Mlf_mid, frequency of intermediate methylation DNA; RI, reference interval; SD, standard deviation; SNV, single-nucleotide variation.