| Literature DB >> 26978482 |
Aniruddha Chatterjee1,2, Peter A Stockwell3, Euan J Rodger1, Ian M Morison1.
Abstract
Methylation of DNA molecules is a key mechanism associated with human disease, altered gene expression and phenotype. Using reduced representation bisulphite sequencing (RRBS) technology we have analysed DNA methylation patterns in healthy individuals and identified genes showing significant inter-individual variation. Further, using whole genome transcriptome analysis (RNA-Seq) on the same individuals we showed a local and specific relationship of exon inclusion and variable DNA methylation pattern. For RRBS, 363 million, 100-bp reads were generated from 13 samples using Illumina GAII and HiSeq2000 platforms. Here we also present additional RRBS data for a female pair of monozygotic twins that was not described in our original publication. Further, We performed RNA-Seq on four of these individuals, generating 174 million, 51-bp high quality reads on an Illumina HiSeq2000 platform. The current data set could be exploited as a comprehensive resource for understanding the nature and mechanism of variable phenotypic traits and altered disease susceptibility due to variable DNA methylation and gene expression patterns in healthy individuals.Entities:
Mesh:
Year: 2016 PMID: 26978482 PMCID: PMC4792177 DOI: 10.1038/sdata.2016.19
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Description of the study and experimental and bioinformatics pipeline.
(a) Overview of the main experimental and bioinformatic components and steps and used to generate genome-wide DNA methylation profiles from neutrophils of healthy individuals and identify inter-individual epigenetic variation. (b) Overview of the steps and approaches used to generate summaries of abundance of transcripts in healthy neutrophils and identify the relationship of exon usage and DNA methylation variation in our primary research paper.
Figure 2A representative example of signatures and quality of RRBS sequenced reads (sample: X9015).
(a) Per base sequence content as indicated by FastQC of the raw sequenced reads. The X-axis plots the sequencing cycle or positions in reads. The Y-axis represents percentages if the occurrence of the bases along the read. (b) Per base sequence content as indicated by FastQC after adaptor sequence removal. (c) A representative example of per base sequence quality for RRBS data (sample: X9015). For each position a Box and Whisker plot of the Phred quality scores is drawn. The central red line is the median value. The yellow box represents the inter-quartile range (25–75%). The upper and lower whiskers represent the 10 and 90% points. The blue line represents the mean quality.
Details of the participants for which RRBS data was generated.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| X9015 | Male | 26 | Indian subcontinent | 18.4 | 75 |
| X9006 | Female | 31 | Southeast Asia | 21.7 | 65 |
| X9010 | Female | 34 | United Kingdom | 30.9 | 75 |
| X9007 | Female | 25 | Indian subcontinent | 9.1 | 100 |
| X9019 | Female | 33 | Eastern Europe | 28.3 | 80 |
| X9020 | Male | 34 | Southeast Asia | 17.9 | 80 |
| X9014 | Male | 31 | United Kingdom | 18.7 | 90 |
| X9012 | Male | 29 | South America | 31.3 | 85 |
| X9018 | Male | 33 | Pacific islands | 43.8 | 85 |
| X9016 | Female | 32 | Western Europe | 45.6 | 80 |
| X9017 | Female | 32 | Western Europe | 23.6 | 80 |
| X9021 | Female | 28 | Pacific islands | 43.1 | 80 |
| X9012 _replicate | Male | 29 | South America | 16.3 | 85 |
Details of the participants for which RNA-Seq data was generated.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| X9015 | Male | 26 | Indian subcontinent | 7.90 | 38.2 |
| X9010 | Female | 34 | United Kingdom | 7.80 | 72.1 |
| X9019 | Female | 33 | Eastern Europe | 8.20 | 44.5 |
| X9014 | Male | 31 | United Kingdom | 8.90 | 19.6 |
Figure 3A representative example of per base sequence quality for RNA-Seq data (sample: X9015).
The plot was generated using FastQC program. See Fig. 2c legend for explanation.