Literature DB >> 20305652

Direct determination of molecular haplotypes by chromosome microdissection.

Li Ma1, Yan Xiao, Hui Huang, Qingwei Wang, Weinian Rao, Yue Feng, Kui Zhang, Qing Song.   

Abstract

Direct observation of haplotypes is still technically challenging. Here we report a method for the determination of haplotypes through chromosome microdissection. We determined human long-range chromosomal haplotypes with more than 98.85% accuracy at 24,245 genome-wide heterozygous single-nucleotide polymorphism (SNP) loci.

Entities:  

Mesh:

Year:  2010        PMID: 20305652      PMCID: PMC2871314          DOI: 10.1038/nmeth.1443

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


Haplotype refers to a group of alleles inherited on a single chromosome. Haplotype analysis plays an important role for mapping disease genes, elucidating population histories, studying the evolutionary genetics, and exploring cis-interactions in the regulation of gene expression 1,2. Statistical and computational methods have been developed to reconstruct haplotypes from conventional genotype data. Although statistical and computational methods can provide highly accurate estimates of haplotypes in most situations, even the best method can have a large error rate of more than 35% for some real datasets 3. The accuracy of statistical methods for haplotype reconstruction can be greatly improved when they are applied to trio datasets using information from relatives 2; however, the parental samples are not always available to make this feasible. Experimental approaches have been developed to determine haplotypes 4,5 with limited capability of haplotyping SNPs across a large chromosomal region spanning megabase distances, except the conversion technology, which can determine the long-range haplotypes by creating human-mouse hybrid cells that contain a subset of human chromosomes 6,7, and the polony (polymerase colony) technology, which can determine the long-range haplotype of a pair of distant SNP loci on the same immobilized colony within an acrylamide gel 8,9. Conversion technology involves considerable cost and time for large-scale applications, the polony technology is limited by the small number of SNPs that can be haplotyped in genome-wide studies 2,10. Here we report , 7 (poly) Dimensional DNA (7DDNA), a method built upon the concept that a somatic cell of a diploid organism contains two and only two sets of homologous chromosomes. If we collect ~ 23 chromosomes from one cell instead of its entire 46 chromosomes, the collection may contain only one copy of some chromosomes, it may also contain no copy or both copies of other chromosomes (Supplementary Fig. 1).We genotype the 7DDNA sample and an entire genomic sample in parallel; the latter allows us to determine which SNPs are heterozygous or homozygous in each sample. . If a 7DDNA sample shows “homozygous” genotypes at all polymorphic loci along a chromosome known to be heterozygous, this 7DDNA sample contains one single copy of this chromosome (Fig. 1, Supplementary Fig. 1 and Supplementary Table 1), which will enable us to read one haplotype of this chromosome directly from its conventional high-throughput genotyping output. The unobserved dropout alleles will constitute the other haplotype of this chromosome. If a 7DDNA sample shows heterozygous genotypes at loci known to be polymorphic, this harvest contains both chromosome copies. If a 7DDNA sample fails to call any SNPs along a chromosome, this chromosome is absent in this sample.
Figure 1

The principle and results on chromosome 5 of the 7DDNA haplotyping method. (a) 7DDNA haplotyping procedure. (b) haplotyped chromosome 5, on which 2,514 SNPs were resolved. Each dot represents a SNP. Scale bars, 10 μM.

We have performed a proof-of-principle study with a HapMap sample 11, GM10847, since the haplotypes can be computationally reconstructed with the parental information2,12. These computationally configured haplotypes should be accurate on those non-triple-heterozygote SNP loci unless there is mis-inheritance, genotyping errors, SNP recurrence, or an overlapping of a SNP with an insertion-deletion polymorphism (indel), a repeat, or a copy number variation (CNV), or segmental duplications. In our experiment, we microdissected three pieces from the chromosome spread of an individual (Online Methods). The microdissection harvests were subjected to whole genome amplification (WGA), which provided us with about 5-8 μg DNA from each harvest. After genotyping with an Illumina BeadChip (CNV370), we collectively haplotyped 15 chromosomes in these three harvested samples, among which eight chromosomes were covered by one harvest, five chromosomes were covered by two harvests, and one chromosome was covered by all three harvests (Supplementary Table 2). Totally 24,245 heterozygous SNPs on these 15 chromosomes were haplotyped (Supplementary Table 2), among which 2,089 SNPs were phased by more than one microdissection harvests, 98.85% of these SNPs (2,065 SNPs) showed consistent results between different microdissection harvests (Table 1). Our data showed a faithful amplification of our microdissection harvests, in which the error rate was 0.73% (Supplementary Table 3). We noticed that the no-call rate in the Illumina genotyping on haploid chromosomes of the microdissection harvests was as high as ~70% (Supplementary Table 4). The possible reasons may be that our WGA method was insufficient to amplify every locus when the quantity of input DNA was as small as one single copy, or this genotyping platform was insensitive to the DNA template amplified from one single molecule of template. We anticipate that this issue may be solved by a new robust WGA method for amplifying single-copy template or other genotyping platforms such as next-generation sequencers.
Table 1

Replication of the 7DDNA haplotyping results.

ChromosomeRepeated SNPs*Consistent SNPs**Consistency rate %
Chr351751699.81
Chr456555498.05
Chr1618918698.41
Chr1721721599.08
Chr1821220898.11
Chr2038938699.23

Total2,0892,06598.85

Repeated SNPs are those SNPs that were haplotyped in 2 or 3 microdissection harvests in our study.

Consistent SNPs are those repeated SNPs that obtained the identical results on haplotypes from different microdissection harvests.

Compared with computationally reconstructed haplotypes of this person (GM10847) on those non-triple-heterozygous SNP loci, our 7DDNA haplotypes reported discordant allele phases on only 0.65% of SNPs (Supplementary Table 2). Comparing our experimentally determined haplotypes of GM10847 with her parents’ haplotypes downloaded from the HapMap database that were computationally configured by IMPUTE++ 13, the difference was only 2.15%, among which 58.7% of those discrepancies occurred on triple-heterozygote loci. This data not only demonstrated a high accuracy of haplotypes configured with trio information 2, it also validated our 7DDNA haplotype results. About 10.8% of heterozygous SNPs on the diploid chromosomes in our microdissection harvests remained to be heterozygous (Supplementary Table 5); the other heterozygous SNPs on those diploid chromosomes received a homozygous genotype call due to allele dropout. However, on the haploid chromosomes in our microdissection harvests, only 0.08% of heterozygous SNPs appeared to be heterozygous (Supplementary Table 5). This substantial disparity (10.8% vs. 0.08%) enabled us to distinguish those haploid chromosomes from diploid chromosomes in a microdissection harvest (Supplementary Fig. 2 and Supplementary Fig. 3). To haplotype all chromosomes of each individual, multiple microdissection harvests should be collected to cover all of a person's chromosomes. Our statistical analysis (Supplementary Note-1) showed that 4-8 microdissection samplings of each individual will provide a 93% probability to obtain a person's whole genome haplotype (Supplementary Table 6 and Supplementary Fig. 4). In other words, 52% of individuals may have completed their genome haplotyping with no more than five microdissections, 93% of individuals will need no more than eight microdissections to complete their genome haplotyping (Supplementary Table 6 and Supplementary Fig. 4). When we reach 12 microdissection samplings, 99.6% of all individuals will have their whole genome haplotyped. In addition to chromosome coverage, this cumulative strategy will also effectively increase the SNP coverage on resolved haplotypes (Supplementary Table 7). We have estimated the cost of this strategy to be $0.0023 per SNP per sample (Supplementary Note-2). The entire 7DDNA procedure will take three days, with a ~10-min work on the first day, no work on the second day, and a 6-8 hour work on the third day. Based on this schedule, we may do five experiments each week, and handle 24 individuals in each experiment. A 1,000-sample project may need only 2-6 months. Because many researchers may be interested in particular chromosomes rather than the entire genome, such as in human leukocyte antigen (HLA) studies, we have explored an alternative approach in which a single chromosome is collected in a microdissection with an Objective HCX 150X/0.90 lens. Chromosome recognition can be achieved by karyotype morphology in combination with PCR validation. It is possible that a chromosome is broken by the microdissection procedure. When only a partial chromosome is harvested without accompaniment of its sister chromosome, there will be failed allele calls at those SNP loci located on the absent chromosomal segment. If a partial chromosome is harvested accompanied by its full-length sister chromosome in the microdissection, the heterozygous SNPs located on this partial chromosomal segment will receive heterozygous genotypes in the 7DDNA genotyping results, but those heterozygous SNPs located outside this partial chromosome piece will appear as “homozygous” loci in the 7DDNA genotyping results (Supplementary Fig. 2). The segmental boundaries will be deciphered from the genotyping results and the haplotypes of entire chromosome will be assembled from broken pieces in different microdissection harvests by their overlapping portion. The 7DDNA technology provides the first high-throughput experimental method for determination of long-range haplotypes for large-scale and genome-wide studies. Compared with conversion technology 6,7,14 and polony haplotyping 8,9, 7DDNA exhibits advantages on labor, time, throughput and the number of SNPs. Chromosomal haplotypes will be important for functional interpretations of SNPs in cis-interactions and the integration of genetic and epigenetic datasets along the chromosomes. AOP: Comparing the haplotypes of a few randomly microdissected chromosomes to a full genome wide haplotype allows one to determine the long-range haplotype of chromosomes for which only one copy was captured. Issue Comparing the haplotypes of a few randomly microdissected chromosomes to a full genome wide haplotype allows one to determine the long-range haplotype of chromosomes for which only one copy was captured.
  14 in total

1.  Conversion of diploidy to haploidy.

Authors:  H Yan; N Papadopoulos; G Marra; C Perrera; J Jiricny; C R Boland; H T Lynch; R B Chadwick; A de la Chapelle; K Berg; J R Eshleman; W Yuan; S Markowitz; S J Laken; C Lengauer; K W Kinzler; B Vogelstein
Journal:  Nature       Date:  2000-02-17       Impact factor: 49.962

2.  Experimentally-derived haplotypes substantially increase the efficiency of linkage disequilibrium studies.

Authors:  J A Douglas; M Boehnke; E Gillanders; J M Trent; S B Gruber
Journal:  Nat Genet       Date:  2001-08       Impact factor: 38.330

3.  Repbase update: a database and an electronic journal of repetitive elements.

Authors:  J Jurka
Journal:  Trends Genet       Date:  2000-09       Impact factor: 11.639

4.  A comparison of bayesian methods for haplotype reconstruction from population genotype data.

Authors:  Matthew Stephens; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2003-10-20       Impact factor: 11.025

5.  Digital genotyping and haplotyping with polymerase colonies.

Authors:  Robi D Mitra; Vincent L Butty; Jay Shendure; Benjamin R Williams; David E Housman; George M Church
Journal:  Proc Natl Acad Sci U S A       Date:  2003-05-02       Impact factor: 11.205

6.  The International HapMap Project.

Authors: 
Journal:  Nature       Date:  2003-12-18       Impact factor: 49.962

7.  A comparison of phasing algorithms for trios and unrelated individuals.

Authors:  Jonathan Marchini; David Cutler; Nick Patterson; Matthew Stephens; Eleazar Eskin; Eran Halperin; Shin Lin; Zhaohui S Qin; Heather M Munro; Goncalo R Abecasis; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2006-01-26       Impact factor: 11.025

Review 8.  Genotyping technologies for genetic research.

Authors:  Jiannis Ragoussis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

9.  Long-range polony haplotyping of individual human chromosome molecules.

Authors:  Kun Zhang; Jun Zhu; Jay Shendure; Gregory J Porreca; John D Aach; Robi D Mitra; George M Church
Journal:  Nat Genet       Date:  2006-02-19       Impact factor: 38.330

10.  Are molecular haplotypes worth the time and expense? A cost-effective method for applying molecular haplotypes.

Authors:  Mark A Levenstien; Jürg Ott; Derek Gordon
Journal:  PLoS Genet       Date:  2006-06-28       Impact factor: 5.917

View more
  49 in total

1.  Noninvasive whole-genome sequencing of a human fetus.

Authors:  Jacob O Kitzman; Matthew W Snyder; Mario Ventura; Alexandra P Lewis; Ruolan Qiu; Lavone E Simmons; Hilary S Gammill; Craig E Rubens; Donna A Santillan; Jeffrey C Murray; Holly K Tabor; Michael J Bamshad; Evan E Eichler; Jay Shendure
Journal:  Sci Transl Med       Date:  2012-06-06       Impact factor: 17.956

2.  Completely phased genome sequencing through chromosome sorting.

Authors:  Hong Yang; Xi Chen; Wing Hung Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-15       Impact factor: 11.205

3.  GenomeLaser: fast and accurate haplotyping from pedigree genotypes.

Authors:  Wenzhi Li; Guoxing Fu; Weinian Rao; Wei Xu; Li Ma; Shiwen Guo; Qing Song
Journal:  Bioinformatics       Date:  2015-08-18       Impact factor: 6.937

4.  A comprehensively molecular haplotype-resolved genome of a European individual.

Authors:  Eun-Kyung Suk; Gayle K McEwen; Jorge Duitama; Katja Nowick; Sabrina Schulz; Stefanie Palczewski; Stefan Schreiber; Dustin T Holloway; Stephen McLaughlin; Heather Peckham; Clarence Lee; Thomas Huebsch; Margret R Hoehe
Journal:  Genome Res       Date:  2011-08-03       Impact factor: 9.043

Review 5.  Haplotype-resolved genome sequencing: experimental methods and applications.

Authors:  Matthew W Snyder; Andrew Adey; Jacob O Kitzman; Jay Shendure
Journal:  Nat Rev Genet       Date:  2015-05-07       Impact factor: 53.242

6.  High-resolution whole-genome haplotyping using limited seed data.

Authors:  Weinian Rao; Yamin Ma; Li Ma; Jian Zhao; Qiling Li; Weikuan Gu; Kui Zhang; Vincent C Bond; Qing Song
Journal:  Nat Methods       Date:  2013-01       Impact factor: 28.547

7.  Chromosome-Range Whole-Genome High-Throughput Experimental Haplotyping by Single-Chromosome Microdissection.

Authors:  Li Ma; Wenzhi Li; Qing Song
Journal:  Methods Mol Biol       Date:  2017

Review 8.  Haplotype phasing: existing methods and new developments.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Nat Rev Genet       Date:  2011-09-16       Impact factor: 53.242

9.  Whole-genome molecular haplotyping of single cells.

Authors:  H Christina Fan; Jianbin Wang; Anastasia Potanina; Stephen R Quake
Journal:  Nat Biotechnol       Date:  2010-12-19       Impact factor: 54.908

10.  Probing meiotic recombination and aneuploidy of single sperm cells by whole-genome sequencing.

Authors:  Sijia Lu; Chenghang Zong; Wei Fan; Mingyu Yang; Jinsen Li; Alec R Chapman; Ping Zhu; Xuesong Hu; Liya Xu; Liying Yan; Fan Bai; Jie Qiao; Fuchou Tang; Ruiqiang Li; X Sunney Xie
Journal:  Science       Date:  2012-12-21       Impact factor: 47.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.