| Literature DB >> 23282328 |
Ewen F Kirkness1, Rashel V Grindberg, Joyclyn Yee-Greenbaum, Christian R Marshall, Stephen W Scherer, Roger S Lasken, J Craig Venter.
Abstract
There is increasing evidence that the phenotypic effects of genomic sequence variants are best understood in terms of variant haplotypes rather than as isolated polymorphisms. Haplotype analysis is also critically important for uncovering population histories and for the study of evolutionary genetics. Although the sequencing of individual human genomes to reveal personal collections of sequence variants is now well established, there has been slower progress in the phasing of these variants into pairs of haplotypes along each pair of chromosomes. Here, we have developed a distinct approach to haplotyping that can yield chromosome-length haplotypes, including the vast majority of heterozygous single-nucleotide polymorphisms (SNPs) in an individual human genome. This approach exploits the haploid nature of sperm cells and employs a combination of genotyping and low-coverage sequencing on a short-read platform. In addition to generating chromosome-length haplotypes, the approach can directly identify recombination events (averaging 1.1 per chromosome) with a median resolution of <100 kb.Entities:
Mesh:
Year: 2013 PMID: 23282328 PMCID: PMC3638138 DOI: 10.1101/gr.144600.112
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Genotyping of 238,872 informative loci in the genomes of 16 sperm cells. (A) The cumulative percentage of unique informative loci that was genotyped in at least one (•) or two (■) sperm cells. The calculation of expected values (○,□) assumes a random genotyping dropout rate of 55% per sperm cell. (B) After genotyping all 16 sperm cells, the percentage of informative loci is plotted with respect to the number of independent sperm cells in which they were genotyped.
Figure 2.Genome coverage of mapped sequence reads from amplified sperm DNA and unamplified blood DNA. (A) Distribution of mapped reads from amplified DNA of three haploid sperm cells (yellow, orange, and red bars) and unamplified DNA from blood cells (black bars) after dividing the reference genome into nonoverlapping bins of 1-kb length. (B) Unique genome coverage with increasing read depth for libraries prepared from a single sperm cell (▴) or diploid blood cells (■). The ideal mapping to a nonrepetitive genome is included for comparison (♦).
Figure 3.Depth of read coverage at 1.95 million heterozygous SNPs after shallow sequencing of amplified DNA from 11 independent sperm cells. (A) The read depth at target loci for each cell is indicated by the color-coded key. (B) The cumulative read coverage at target loci.
Figure 4.Heterozygous SNPs and multiple mutations that are predicted to be damaging in the HuRef genome.