| Literature DB >> 29653520 |
Lisa Olohan1, Laura-Jayne Gardiner2, Anita Lucaci1, Burkhard Steuernagel3, Brande Wulff3, John Kenny1, Neil Hall2,4, Anthony Hall5,6.
Abstract
BACKGROUND: Bread wheat has a large complex genome that makes whole genome resequencing costly. Therefore, genome complexity reduction techniques such as sequence capture make re-sequencing cost effective. With a high-quality draft wheat genome now available it is possible to design capture probe sets and to use them to accurately genotype and anchor SNPs to the genome. Furthermore, in addition to genetic variation, epigenetic variation provides a source of natural variation contributing to changes in gene expression and phenotype that can be profiled at the base pair level using sequence capture coupled with bisulphite treatment. Here, we present a new 12 Mbp wheat capture probe set, that allows both the profiling of genotype and methylation from the same DNA sample. Furthermore, we present a method, based on Agilent SureSelect Methyl-Seq, that will use a single capture assay as a starting point to allow both DNA sequencing and methyl-seq.Entities:
Keywords: Combining genotype and epi-type analysis; Methodology; Sequence capture
Mesh:
Year: 2018 PMID: 29653520 PMCID: PMC5899405 DOI: 10.1186/s12864-018-4640-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Workflow of the modified sequence capture method. Following fragmentation of the genomic DNA, a SureSelect Methyl-Seq library was constructed and hybridised to custom baits. Bait/target hybrids were bound to streptavidin beads, which were then washed to remove non- specifically bound DNA fragments. Target enriched DNA was eluted from the streptavidin beads and the eluate divided; ~ 3/4 of the eluate was bisulphite converted and then amplified, ~ 1/4 was neutralised, purified and then amplified. The quality of the purified libraries was assessed prior to sequencing
Mapping statistics for the reference sequence
| Sample | % of reads aligned pre-filtering | Average % coverage per ref. contig | Average depth of coverage per ref. contig | Number of ref. contigs mapped | % of ref. contigs mapped | Base-space mapped (bp) |
|---|---|---|---|---|---|---|
| BTF | 24.2 | 59.6 | 30.7 | 82,873 | 99.6 | 39,868,184 |
| BTS | 24.2 | 59.3 | 30.5 | 82,862 | 99.6 | 39,602,695 |
| NBTF | 73.3 | 70.7 | 36.9 | 83,107 | 99.9 | 48,808,952 |
| NBTS | 72.8 | 77.9 | 34.5 | 82,999 | 99.7 | 54,641,687 |
Detailing the mapping output statistics for the two enriched wheat DNA samples, NBTF and BTF (non-bisulphite treated and bisulphite treated) that were taken through separate capture reactions and the two samples that were split and one bisulphite treated while the other was non-bisulphite treated after a single capture (NBTS and BTS). Mapping statistics are in relation to the 82.5 Mb mapping reference
Fig. 2Methylkit Pearson correlation coefficient computations to compare methylation and SNPs between split and non-split samples. Figures demonstrate comparisons of methylation levels across the bisulphite and non-bisulphite treated samples at positions that are associated with a sub-genome A b sub-genome B and c sub-genome D. Comparisons of SNPs using allele frequencies were also computed for the same comparisons and are shown for SNPs in d sub-genome A e sub-genome B and f sub-genome D. Individual samples are labeled diagonally with an axis through the middle of the plot that acts as a mirror image division; comparative correlation plots lie to the left of the axis at the intersection between the two samples, with the corresponding correlation co-efficient for the plot to the left of the axis at the intersection between the two samples
Mapping statistics for the chloroplast genome
| Sample | % cytosine bases successfully converted | Average depth of coverage | % of chloroplast genome mapped | Base-space mapped (bp) |
|---|---|---|---|---|
| BTF | 98.73 | 391.5 | 99.73 | 114,672 |
| BTS | 98.82 | 386.6 | 99.75 | 114,691 |
Detailing the mapping output statistics for the bisulphite treated enriched wheat DNA sample, BTF, that was taken through an individual capture reaction and the sample that was split after capture and bisulphite treated (BTS). Mapping statistics are in relation to the chloroplast genome mapping reference
Mapping statistics for the reference sequence (Watkins line 1190103)
| Sample | Average % coverage per ref. contig | Average depth of coverage per ref. contig | Number of ref. contigs mapped | % of ref. contigs mapped | Base-space mapped (bp) |
|---|---|---|---|---|---|
| BTS | 64.8 | 42.4 | 81,634 | 98.1 | 42,266,334 |
| NBTS | 76.3 | 66.7 | 82,970 | 99.7 | 51,111,629 |
Detailing the mapping output statistics for the two Watkins wheat 1190103 samples that were split and one bisulphite treated while the other was non-bisulphite treated after a single capture (NBTS and BTS). Mapping statistics are in relation to the 82.5 Mb mapping reference
Fig. 3Representation of capture sequence data for Watkins accession 1190103 across the wheat chromosomes. Normalized frequency plots of SNP (blue) and cytosine positions for which DNA methylation information is available (black) per 1Mbp window across each wheat chromosomal pseudomolecule. Normalization of frequencies is to a scale of 0–1. Pre-normalization SNP maximum frequency was 1759 and minimum frequency was 0, cytosine maximum was 9214 and minimum was 0