| Literature DB >> 26722221 |
Zhihua Jiang1, Hongyang Wang2, Jennifer J Michal1, Xiang Zhou1, Bang Liu3, Leah C Solberg Woods4, Rita A Fuchs5.
Abstract
Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals.Entities:
Keywords: dominant and co-dominant markers, genome wide association study, human, animals and plants; restriction site associated DNA sequencing, genotyping by sequencing, reduced representation library sequencing, complexity reduction of polymorphism sequencing
Mesh:
Year: 2016 PMID: 26722221 PMCID: PMC4679402 DOI: 10.7150/ijbs.13498
Source DB: PubMed Journal: Int J Biol Sci ISSN: 1449-2288 Impact factor: 6.580
Figure 1Classification of GWSS methods. 1) GWSS without size selection after DNA digestion and ligation. Size selection may be employed after PCR amplification. 2) GWSS with semi-size selection. These methods perform size selection after ligation with adapters. Some methods may combine random shearing with size selection and then ligate with the second adapter. 3) GWSS with size selection immediately after DNA digestion. 4) GWSS with selective amplification. These methods are dependent on primer design for selective amplification of products for sequencing. Theoretically, the former two groups of methods can lead to reduction of genome complexity, while the latter two groups of methods may result in reduction of genome representation.
Figure 2Examples of GWSS NGS library construction methods used to discover genome-wide representative variants, mainly SNPs, among individuals. RE indicates the use of a single restriction enzyme in the assay, while RE1 and RE2 indicate the use of two enzymes for digestion of genomic DNA. The buffer zone contains additional nucleotides to tighten the restriction site ligation. The first four examples are designed for assessment with Illumina analyzers, while the last example is a library built to meet Ion Torrent platform requirements.
Restriction enzymes frequently used in the construction of GWSS libraries.
| Enzyme | Recognition Sequence | Dam Blocked? | Dcm Blocked? | CpG blocked? |
|---|---|---|---|---|
| AluI [11,27] | AG/CT | No | No | No |
| ApeKI [23,28,34,37,38] | G/CWGC | No | No | Yes |
| AvaII [25] | G/GWCC | No | Yes | Yes |
| BsaXI [31] | (9/12)ACNNNNNCTCC(10/7) | No | No | No |
| DraI [11,20] | TTT/AAA | No | No | No |
| EcoRI [22,26,33] | G/AATTC | No | No | Yes |
| EcoRV [11] | GAT/ATC | No | No | Yes |
| EcoT22I [37] | ATGCA/T | ? | ? | ? |
| HaeIII [11,20,30] | GG/CC | No | No | No |
| HincII [11] | GTY/RAC | No | No | Yes |
| HindIII [29] | A/AGCTT | No | No | No |
| HpaII [21] | C/CGG | No | No | Yes |
| MluCI [26] | /AATT | No | No | No |
| MlyI [11] | GAGTC(N)5/ | No | No | No |
| MseI [21,33] | T/TAA | No | No | No |
| MspI [24,26,29,40] | C/CGG | No | No | No |
| NlaIII [26] | CATG/ | No | No | No |
| PstI [10,24,37,38,40] | CTGCA/G | No | No | No |
| RsaI [11] | GT/AC | No | No | Yes |
| SbfI [10,12,26] | CCTGCA/GG | No | No | No |
| SphI [26] | GCATG/C | No | No | No |
| StuI [11] | AGG/CCT | No | Yes | No |
| TaqI [33] | T/CGA | Yes | No | No |
Source: Reprinted from www.neb.com (2015) with permission from New England Biolabs.