Literature DB >> 20300295

Identification of FVIII gene mutations in patients with hemophilia A using new combinatorial sequencing by hybridization.

M Chetta, A Drmanac, R Santacroce, E Grandone, S Surrey, P Fortina, M Margaglione.   

Abstract

BACKGROUND: Standard methods of mutation detection are time consuming in Hemophilia A (HA) rendering their application unavailable in some analysis such as prenatal diagnosis.
OBJECTIVES: To evaluate the feasibility of combinatorial sequencing-by-hybridization (cSBH) as an alternative and reliable tool for mutation detection in FVIII gene. PATIENTS/
METHODS: We have applied a new method of cSBH that uses two different colors for detection of multiple point mutations in the FVIII gene. The 26 exons encompassing the HA gene were analyzed in 7 newly diagnosed Italian patients and in 19 previously characterized individuals with FVIII deficiency.
RESULTS: Data show that, when solution-phase TAMRA and QUASAR labeled 5-mer oligonucleotide sets mixed with unlabeled target PCR templates are co-hybridized in the presence of DNA ligase to universal 6-mer oligonucleotide probe-based arrays, a number of mutations can be successfully detected. The technique was reliable also in identifying a mutant FVIII allele in an obligate heterozygote. A novel missense mutation (Leu1843Thr) in exon 16 and three novel neutral polymorphisms are presented with an updated protocol for 2-color cSBH.
CONCLUSIONS: cSBH is a reliable tool for mutation detection in FVIII gene and may represent a complementary method for the genetic screening of HA patients.

Entities:  

Keywords:  Combinatorial sequencing-by-hybridization; FVIII gene; hemophilia A

Year:  2008        PMID: 20300295      PMCID: PMC2840789          DOI: 10.4103/0971-6866.44106

Source DB:  PubMed          Journal:  Indian J Hum Genet        ISSN: 1998-362X


Introduction

Hemophilia A (HA) is a common inherited recessive X-linked disorder of blood clotting caused by deficiency of factor VIII in the coagulation cascade and affects approximately 1 in 5,000 males world-wide.[12] The FVIII gene, comprises 26 exons ranging from 69 bp (exon 5) to 3.1 kb (exon 14) in size, spans 186 kb of genomic DNA and produces a 9030 nt mRNA. According to the UK Hemophilia Centre Doctors' Organisation (UKHCDO) Hemophilia Genetics Laboratory Network, the severity of HA in the pedigree should be determined first as this will influence the diagnostic strategy to be employed. Severe Hemophiliacs should be screened for the intron 22 inversion mutation followed by the intron 1 inversion mutation. This approach identifies the underlying mutation in 45-50% of severe HA patients.[34] The remaining severe HA pedigrees should then be analyzed further either by full mutation or linkage analysis. Mutations have been found in nearly all 26 exons of the factor VIII gene, over 400 mutations have been identified[56] and de novo mutations represent approximately 30% of all cases.[7] The most common detection methods include DNA sequence analysis which requires numerous reactions and individual analysis of each exon or alternative screening methods such as single-stranded conformation polymorphism (SSCP),[8] denaturing gradient gel electrophoresis (DGGE)[8] and denaturing high performance liquid chromatography (dHPLC).[9] We applied the new combinatorial sequencing-by-hybridization (cSBH) as an alternative method to the traditional Sanger dideoxy chain termination approach.[1011] Previous works have shown that cSBH is an efficient rapid and alternative method for mutation detection.[1213] We increased the quality of results with a new cSBH method that use two different colors (TAMRA and QUASAR). The platform is an indirect method which uses standard chemistry of base-specific hybridization of complementary nucleic acids to indirectly assemble the order of bases in a target DNA. Short oligonucleotide probes are arrayed in the form of high-density arrays of universal sequence and hybridized to sample DNA molecules. The resulting hybridization pattern is used to generate the target sequence using computer algorithms. We report development of a strategy to implement 2-color cSBH to screen a range of mutations within the FVIII gene.

Materials and Methods

Sample

After approval of the local ethics committee, the study was carried out according to the Principles of the Declaration of Helsinki. After informed consent the genomic DNA was isolated from clinically diagnosed individuals with HA, regularly investigated for the detection of the causative mutation within the F8 gene. Samples were labelled using an internal lab code. We selected samples that presented a point mutation at different position in the F8 gene. We selected samples 1 and 20, which present a nonsense mutation; samples 73 and 95, which present a 2 bp deletion; samples 2, 6, 37, and 89, which present a single bp deletion; and a series of randomly selected samples in which a missense mutation was identified. Samples 1 and 20 and 2, 6, and 37 carry the same mutation and were used to verify the quality of results in the analysis. In addition, we randomly selected 7 HA male patients, who were previously excluded to carry the inversion of the intron 1 or of the intron 22 and in whom no gene mutation was detected by a first FVIII gene screening, using common strategies (SSCP, DGGE, or dHPLC). All the patients involved in this study are regularly followed at one of 50 hemophilia treatment centers spread all over the country and belonged to the Italian Association of Hemophilia Centers (AICE; URL: ), an organization which was founded with the aim to organize clinical and experimental national activities in inherited bleeding disorders. Patients were considered to have HA according to the international consensus of the 2001 International Society on Thrombosis and Haemostasis (ISTH) Factor VIII and Factor IX Subcommittee.[9] All known HA patients are registered and followed at one of these centers. For each patient clinical and laboratory data (including FVIII clotting activity and inhibitor) were recorded. DNA was extracted from 5 to 10 ml of EDTA treated peripheral blood leukocytes with in standard salting out method. Numbering of mutations and polymorphisms was in agreement with genomic DNA sequence in Hamsters database mutations (http://europium.csc.mrc.ac.uk/WebPages/Main/main.htm). Patient DNA was amplified for exons spanning the FVIII gene and encompassing each different disease-causing mutation.

Primer design

Primers for multiplex and nested PCR were obtained from MWG Deutschland, (HPLC purified) and designed using Primer 3 software (www.genome.wi.mit.edu/cgibin/primer/primer3_www.cgi) to span the entire HA gene coding regions with intron-exon boundary coverage.

DNA amplification

The amplification of target for cSBH was performed in two consecutive steps [Table 1]. The first required three different multiplex PCR reactions to amplify all exons while the second step, a nested-asymmetric PCR, generated single-stranded templates corresponding to sense and antisense strands. The multiplex PCR was performed in 25 µ l containing 100 ng of genomic DNA, 33 pmol of each primer suspended in three different mixes (Mix 1, 2, and 3 with primers for exons 1-9, 10-13 and 15-25, and 14 and 26, respectively). Reactions were performed for 40 cycles with an initial denaturation at 96°C for 15 min. The protocols to amplify Mix 1 required an initial denaturation at 96°C for 15 min, followed by 20 cycles of denaturation (96°C, 30 sec), annealing (52°C for 30 sec), and extension (72°C, 50 sec); while for Mix 2, 20 cycles of denaturation (96°C, 30 sec), annealing (55°C for 30 sec) and extension (72°C, 50 sec) with a final extension at 72°C for 7 min were required. In Mix 3 the annealing temperatures are 55°C and 58°C for exons 14 and 26, respectively. This separation is required to optimize the amplification conditions for all exons.
Table 1

Sequence and position of primers used for multiplex and nested PCR reactions

ExonMultiplex PCR Sequence primersPosition*ExonNested PCR Sequence primersPosition*PCR Product
Ex1_Fgttcttcctgtggctgcttc-55/-35°Ex1_Fngctgcttcccactgataaaaa-43/-22 °387 bp
Ex1_Ragcagacttacatccccacaa353-373Ex1_Rnccgatcagaccctacaggac325-344
Ex2_Fttcaaatttgcctccttgct23012-23032Ex2_Fncagctgctttttgaagtgtcc23068-23089217 bp
Ex2_Rtttggcagctgcacttttta23328-23348Ex2_Rncccaatttcataaatagcattcaa23261-23284
Ex3_Ftttggaataacaggttttctgga25562-25585Ex3_Fnctttggcggacatctcattc25602-25623180 bp
Ex3_Rgccaccattacaaagcacac25808-25828Ex3_Rntgacaggacaataggagggta25661-25682
Ex4_Fcagtggatatagaaaggacaatttta29534-29560Ex4_Fcagtggatatagaaaggacaatttta29534-29560281 bp
Ex4_Rttcagttgtttgtacttctctgctt29823-29848Ex4_Rttcagttgtttgtacttctctgctt29823-29848
Ex5_Fttgttcttactgtcaagtaactgatga35313-35340Ex5_Fnttgttcttactgtcaagtaactgatga35313-35340168 bp
Ex5_Rtgatctttctataatcacctcctctt35544-35570Ex5_Rntgatctttctataatcacctcctctt35544-35570
Ex6_Fgagcagggaaggagaaagg37840-37859Ex6_Fngcggtcattcatgagacaca37863-37883215 bp
Ex6_Rccgagctgtttgtgaactga38101-38121Ex6_Rnttggaagaccctgaggattg38058-38078
Ex7_Ftttgtccattctgtcctagcaa53089-53111Ex7_Fntcatagccataggtgtcttattcc53138-53162350 bp
Ex7_Raatgtccccttcagcaacac53481-53501Ex7_Rngcaacacactatattcctgtacattg53462-53488
Ex8_Ftagcctgcagaaacatgagc55919-55939Ex8_Fntgtttggtttgtctgactcca56015-56036322 bp
Ex8_Rtggcttcaggatttgttggt56365-56385Ex8_Rntggggaagagagagtaccaat56316-56337
Ex9_Fcactccttgccttgattgaa56481-56501Ex9_Fntttttcttcccaacctctcatc56543-56565260 bp
Ex9_Rccattggagacaaggctgaa56833-56853Ex9_Rntccagactttttcttcttacctga56779-56803
Ex10_Fgaggccacttttatttatctgga61462-61485Ex10_Fngaccacagttttcttgttgatcc61490-61511192 bp
Ex10_Rggaccaacataattttagttgttattg61711-61738Ex10_Rnaaaaagagctataaatcgagggaat61656-61680
Ex 11_Fgaacccttgcaacaacaaca65470-65490Ex11_Fnatggttttgcttgtgggtag65533-65553260 bp
Ex11_Rggggacatacactgagaatgaa65808-65830Ex11_Rnagcacttggaaaggcaagaa65783-65803
Ex12_Ftgctagctcctacctgacaaca68617-68638Ex12_Fngcatttctttacccctttcaa68645-68666230 bp
Ex12_Rcattcattatctggacatcactttg68890-68915Ex12_Rnctttattcaccacccactgga68854-68875
Ex13_Ftcatgacaatcacaatccaaaa74746-74767Ex13_Fncttttccccattgtttttgc74795-74815273 bp
Ex13_Rgagcatacgaatggctagtgaa75068-75083Ex13_Rngcattcacagctgttggtaca75047-75067
Ex14-1_Fngagtcttattcttcctcatctcca91023-910471022 bp
Ex14-1_Rngggccatcaatgtgagtctt92025-92044
Ex14_Fctgggaatgggagagaacct90965-90985Ex14-2_Fncaaaacttccaataattcagcaa91994-920171176 bp
Ex14_Rttgttggtgtcatcatctgg94206-94226Ex14-2_Rnacctttgcaatgggtaatgg93150-93170
Ex14-3_Fncgagcaccctcacacagata93037-930571156 bp
Ex14-3_Rncacaagagcagagcaaagga94173-94193
Ex15_Fgcttggcaagagtatttcaagg116010-116032Ex15_Fnggcacctaggaaaatgagga116067-116087269 bp
Ex15_Raaaatttccaaaagtgggaataca116371-116393Ex15_Rnttttcttgtaattccactgtcctt116312-116336
Ex16_Faaagacaggtatttcttttagggatg117604-117630Ex16_Fngatcctagaagattattcttgcttttt117648-117676300 bp
Ex16_Rttgcacgtaggataaatatcaaaa117981-118003Ex16_Rnccaaaaagtggtcagcacaa117929-117949
Ex17_Ftctggttcataggtgagagagc118144-118166Ex17_Fngattgatgtcttccctcccta118178-118199301 bp
Ex17_Rccctggatcaagtctcatttg118484- 118505Ex17_Rnagtgcaatctgcatttcacag118458-118479
Ex18_Ftggtggagtggagagaaagaa118570-118591Ex18_Fnggcttctctgtgtccttctcc118613-118634229 bp
Ex18_Rgtgttcccagtgcctagacc118880-118900Ex18_Rnaaagagcacaaacaagctcatac118819-118842
Ex19_Fttcgcataaaccaatgtatctca120481-120504Ex19_Fnaatttctgttcctgttggttttt120529-10551169 bp
Ex19_Rgcaaccattccagaaaggaa120733-120753Ex19_Rnacctctgcccacattgctac120678-120698
Ex20_Fttgacgttctcccattttca121194-121213Ex20_Fngtgttccacccgtttcattt121259-121279134 bp
Ex20_Rgaagcatggagatggattcatta121430-121453Ex20_Rnatcagcccaggttcttgga121374-121393
Ex21_Ftgtctaggactaacccagctgaa122719-122742Ex21_Fntctgtttctttacttgggcaaa122750-122761146 bp
Ex21_Rtttgagcttgcaagaggaataa122914-122936Ex21_Rntgtgatacatttcccatcattg122874-122896
Ex22_Ftgatggacacacctgtagcaa126381-126402Ex22_Fntttcaggaggtagcacatacattt126431-126455250 bp
Ex22_Rccaatatctgaaatctgccaaa126695-126716Ex22_Rncaggcattccctttaaatgac126659-126680
Ex23_Fgttgagaaacgcacaaagca159393-159413Ex23_Fnccatgtatttgtgctaatctctcc159467-159491195 bp
Ex23_Rcatggttgagggaagaagga159664-159684Ex23_Rntgggatgacttggcacttac159642-159661
Ex24_Ftcagtggaagctgctcagta160784-160803Ex24_Fnactgaggctgaagcatgtcc160806-160826250 bp
Ex24_Raggccttccccgattttt161059-161102Ex24_Rncccaaccactgctctgagtc161036-161056
Ex25_Ftgagaagtgctgtggtatggtt162023-162055Ex25_Fnagggatttgggaatttctgg162059-162067336 bp
Ex25_Rtgttaagctctaggagaggtggt162336-162359Ex25_Rntctttccaaggagaccagctt162294-162315
Ex26-1_Fnctgtcagacaaccaataaatgct184933-184956999bp
Ex26_Fcagtgaccattgtcctgtca184919-184939Ex26-1_Rnttgcatcctcctgacttattttt185909-185932
Ex26_Rtcagtgttcacatttttatttcca+11/+36#Ex26-2_Fncaggaggatgcaattgttga185919-1859381048 bp
Ex26-2_Rntccagtgtaaaaacacaattctcaa+33/+56#

numbering according to the GeneBank NG_005114 sequence.

nucleotides upstream the start of the NG_005114 sequence.

nucleotides downstream the NG_005114 sequence.

Sequence and position of primers used for multiplex and nested PCR reactions numbering according to the GeneBank NG_005114 sequence. nucleotides upstream the start of the NG_005114 sequence. nucleotides downstream the NG_005114 sequence. The nested PCR amplification for all exons was performed in 100 µ l reactions containing a 1:4 dilution of multiplex PCR reaction (25 µ l multiplex reaction in 75 µ l of ddH20), 30 pmol of forward primer and 3 pmol of reverse primer, 0.6 U of polymerase (England Biolabs) to amplify all exons except exons 14 and 26 that required 1 U of polymerase (England Biolabs), and 1X PCR Callida's buffer from Callida's Genomics. Exon 14 (3.1 kb) and 26 (1.9 kb) required three and two overlapping nested asymmetric PCR, respectively. All PCR reactions were performed using Gene Amp PCR system 9700. PCR reactions were for 45 cycles with an initial denaturation at 96°C and for 3 min, followed by 96°C for 30 sec, annealing at 55°C for 30 sec and extension at 72°C for 1 min. Final extension was at 72°C for 1 min. An aliquot of the PCR product was analyzed on a 1% (w/v) agarose gel to verify size and quality of the PCR product. The single strands are detected with gel-star Nucleic acid Gel stain (Cambrex, Rockland, UK).

Probes

Amino-modified unlabeled hexamer (6-mer) probes as well as TAMRA and QUASAR labelled pentamers (5-mer) were manufactured and provided by Callida Genomics. HyChip™ slides contain eight replica arrays of a complete set of 4,096 glass-bound 6-mer probes and hundreds of fluorescent and empty marker dots to guide image analysis. A standard universal set of eight probe pools contains a complete set of 1,024 split in two colors: 512 probes are made with TAMRA and 512 with QUASAR. The pools were prepared by mixing 128 specific TAMRA-dye tagged probes and 128 labelled with QUASAR (Callida Genomics, Sunnyvale, CA) divided in 4 separated pools for one strand and four for the other strand. The emission wavelength of TAMRA is 552 nm and for QUASAR is 670 nm.

Target preparation

Prior to hybridization, the PCR nested–asymmetric products were precipitated using isopropanol, creating eight different mixes containing single-stranded DNA of the forward and eight containing the reverse strand. The mixes were performed to optimize the analysis results. We created eight different mixes as follow: exons 1 to 5, exons 6 to 9, exons 10 to17, exons 18 to 25, overlapping fragments 14a, 14b, 14c separately and fragments 26a and 26b in the same mix. Precipitated DNA was resuspended in 40 µ l of dH2O and therefore, the double-stranded PCR template was eliminated by restriction digestion leaving single strands intact. The digestion was performed using Callida's pre-made frozen mix in a total volume of 50 µ l with an optimized amount of DNase I (Gibco-BRL, Rockville, MD) at 37°C for 15-20 min followed by inactivation at 95°C for 5 min.

Hybridization

Single-strand DNA mixes (50 µ l) were adjusted to 73 µ l with dH2O and added to a frozen aliquot of Callida's pre-made ligation mix (47 µ l), containing hybridization buffer (New England Biolabs, Beverly, MA) and an optimized amount of T4 DNA ligase (New England Biolabs, Beverly, MA). The volume of 120 µ l of DNA-ligase mix (27 µ l) was dispensed into each of the eight labelled 5-mer TAMRA and QUASAR linked probe pools pre-aliquoted in a strip of eight small tubes supplied by Callida. Eight probe pools containing target DNA and ligase were mixed thoroughly and 27 µ l was loaded by pipette onto the HyChip cartridge, where it was drawn by capillary forces into the selected hybridization chamber. Hybridization and ligation occurred in a humidity chamber at room temperature for 60 min. Slides then were hot-washed (60-65°C) to remove non-ligated labelled probes using Callida's detergent (containing washing buffer) for 15 min in an orbital shaker 150 rpm, rinsed four times in dH2O and spin-dried at 1500 rpm for 3 min. Slides then were scanned at 20 µ m pixel resolution using a standard array reader GenePix® 4000B setting PMT (532 nm) = 650 and PMT (635 nm) = 750.

Analysis

The image analysis and base-calling programs are written in C++ programming language. The image analysis program takes advantage of an optimized pattern of fluorescent markers and empty spots on the HyChip™ slide to locate the spotting grid. This allows highly accurate automatic grid and spot detection without the need of any user input (such as clicking a corner spot on a display of the tagged image file format [TIFF] image). Robust pattern recognition statistics provide the correct location of spots in imperfect arrays made of shifted subarrays and tolerates a number of missing marker spots. Automated image analysis allows integration of chip scanning with final data analysis. The base-calling software uses an iterative approach to finding the most likely sample sequence, starting from the input reference sequence. For different analyses, it is assumed that the input reference matches the sample sequence more than 90 to 97%. In addition, other limitations (e.g., maximal length of insertions and deletions) are specified. Under these conditions, the algorithm starts by determining the high confidence reference bases, and then testing for various possible mutations to the reference. The program automatically combines hybridization data from two strands or repeated experiments. The following outline lists the main steps of the base-calling process: Normalize/correct raw spot scores for local variation in array preparation and chip loading. Create table of normalized scores for all 4,000,000 11-mers by combining all 128 labelled 5-mer probes from each pool with 4,096 6-mer sequences. Groups of 128 11-mers have the same score. Calculate a base score for each reference position by applying robust statistics on 11 (22 for two strands) overlapping 11-mer intensity scores reading a given base. Use reference sequence segments that have a strong base score (very likely have no mutation) to remove all full match and mismatch signals expected to be contributed by these sequences to the normalized spot scores. Generate all single-base changes to the reference (including insertions and deletions) and calculate mutation base intensity scores (the same statistics used as in step 3) and corresponding P-values. Identify candidate mutations with significant P-values. Two sequence variants with a high P-value would indicate a heterozygote site. For each candidate mutation, modify the reference and calculate the new P-values for the region surrounding the potential mutation. Apply a confidence threshold to the new P-values and record the accepted mutation. For sequence sites that have low P-value for the reference and all single-base variants, multiple mutations (currently including up to 20 base deletions and up to three base insertions), can be tested following the same general scheme (steps 5 and 6). To detect compound heterozygotes, multiple mutation variants have to be searched across the entire sequence length by a special function. The implemented base-calling algorithm efficiently uses available data for fast determination (few minutes total for both image analyses of two chips and base-calling of a 3-kb sequence) of most likely sequences present in the analyzed sample using common computers (1 GHz processor, 512 MB of RAM). The speed and accuracy is achieved by combining: 1) iterative sequence assembly starting by first finding non-mutated segments; 2) robust base-calling statistics that use carefully normalized hybridization scores of 11–22 overlapping probes; and, 3) a spot signal deconvolution function based on a database of expected hybridization/ligation efficiency factors for each full-match and mismatch 11-mer. An accurate database of probe behavior is established by statistical analysis of sequencing data obtained on 64 kb of specially prepared DNA. This recently introduced signal deconvolution function is critical for accurate analysis of complex multikilobase samples contributing several single mismatch results per each of 32,768 dot scores representing 128 11-mers each.

Results

The new 2-color cSBH is an indirect sequencing method in which overlapping probes of known sequence are hybridized to DNA target molecules. In cSBH, which is represented in a schematic overview in Figure 1, target DNA from a PCR product is exposed to two universal sets of short probes in the presence of DNA ligase. When both array-bound and solution-phase labelled probe hybridize to target DNA at precisely adjacent complementary positions, they are covalently linked by DNA ligase, creating one long, array-bound labeled probe of known sequence that is complementary to the solution-phase, single-stranded PCR-generated DNA template. A schematic presentation of a cSBH base-calling chromatogram is shown in Figure 2. The y-axis represents the relative intensity score of the hybridization signal of all overlapping probes at each base. In the presence of strong hybridization signals, the SBH readout appears high, and the reverse is true when there are weak probe hybridization signals. The x-axis represents the position of each base in the sequence analyzed. The computer-called sequence, compared to the known reference sequence used, is represented as the top line of the sequence (in black), and the actual SBH sample sequence readout is directly below (in blue).
Figure 1

cSBH process. A: Single-stranded nested-PCR template is combined with a ligation mix. B: Mix is dispensed into 8 separate probe pools of 5 mers each containing a complete set of 1,024 split in two colors: 512 TAMRA-labeled and 512 QUASAR-labeled. C: Eight replica arrays of 4096 hexamers each are used to generate a HyChip. A pool mix is loaded onto the array through the loading ports on the slide cover. D: Hybridization at room temperature for 60 min. E: Slides are hot-washed. F: Results are scanned and read on a standard slide scanner. G: Results are scored by image analysis Callida software

Figure 2

Graphs generated using HyChip™ for three patients. The computer-called sequence is represented on the top line in black; below in blue is the combinatorial sequencing-by-hybridization (cSBH) sequence signal readout on the specific sample tested. At the bottom of the graph are not called by the computer software bases (background). In the panel A the letter D represents an hemizygous deletion of A in Ex12 (sample 2)(DelC aa1194). In the panel B the letter D represents an hemizygous deletion of C in Ex12 (sample 3). In the panel C the letter S represents a T (Sample 64)µ substitution of A

cSBH process. A: Single-stranded nested-PCR template is combined with a ligation mix. B: Mix is dispensed into 8 separate probe pools of 5 mers each containing a complete set of 1,024 split in two colors: 512 TAMRA-labeled and 512 QUASAR-labeled. C: Eight replica arrays of 4096 hexamers each are used to generate a HyChip. A pool mix is loaded onto the array through the loading ports on the slide cover. D: Hybridization at room temperature for 60 min. E: Slides are hot-washed. F: Results are scanned and read on a standard slide scanner. G: Results are scored by image analysis Callida software Graphs generated using HyChip™ for three patients. The computer-called sequence is represented on the top line in black; below in blue is the combinatorial sequencing-by-hybridization (cSBH) sequence signal readout on the specific sample tested. At the bottom of the graph are not called by the computer software bases (background). In the panel A the letter D represents an hemizygous deletion of A in Ex12 (sample 2)(DelC aa1194). In the panel B the letter D represents an hemizygous deletion of C in Ex12 (sample 3). In the panel C the letter S represents a T (Sample 64)µ substitution of A Background bases, which are not called by the computer software, are present at the bottom of all graphs. Use of informative pools of labelled probes allows combinatorial ligation of all labelled and bound probes on one glass slide with only eight universal replica arrays of bound probes. This is the most efficient way to obtain hybridization data for all 4,194,304 11-mers using 32,768 (8 X 4,096) probe spots, an array that can fit only one half of all possible 8-mers. Data of pooled 11-mers are much more informative than data for individual 8-mers because each base is read with 11 instead of eight overlapping probes.

Mutation Detection in Italian Haemophilic A Patients

A total of 16 clinically diagnosed Italian haemophilic A patients and one obligate carrier previously genotyped and found to have disease-causing mutations in the FVIII gene were included in this study. Patient samples included a range of DNA mutations mapping throughout the FVIII gene. cSBH results are shown in Table 2. We confirmed single-base changes in samples 1, 20, 64, 65, 80, 88, 90, 93, 99. Previous genotyping of samples 2 [Figure 2A], 3 [Figure 2B], 37, and 89 carrying a single-base deletion in exon 14 (fragment b) as well as in sample 6 [Figure 2C] containing a single deletion in exon 12 was confirmed by cSBH. In addition, we found polymorphism in the 3’UTR region (Ex 26, nucleotide 8900 (G→A)) in sample 2, 3, 37, 73, and another polymorphism in the 3’UTR region at nucleotide 8519 A→G in sample 3.
Table 2

HA cSBH screen results in 16 patients with known and in 7 patients with unknown mutations

SampleSequencing resultscSBH results

Sporadic mutationsSporadic mutationsGene polymorphism
# 1Lys1040StopConfirmed
# 2Del A Aa 1191-4ConfirmedUTR 3’ G→A 8900
# 3Del C Aa 596 novelConfirmedUTR 3’; A→G 8519
UTR 3’ G→A 8900
# 6Del A Aa 1191-1194Confirmed
# 20Lys1270StopConfirmed
# 37Del A Aa 1191-1194ConfirmedUTR 3’ G→A 8900
# 64Asn235Ile novelConfirmed
# 65Asn235Ile novelConfirmed
# 73▵DelTT Aa 2275Confirmed*UTR 3’ G→A 8900
# 80Arg1997TrpConfirmed
# 88His94GlnConfirmed
# 89Del A Aa 1441Confirmed
# 90Pro74Ser novelConfirmed
# 93Arg1997TrpConfirmed
# 95Del CT Aa 49 novelConfirmed*
# 99Arg1997TrpConfirmed
# Cz36unknownunknownUTR 3’ A→G 8519
UTR 3’ G→A 8900
# P21unknownunknownUTR 3’ G→A 8900
# P25unknownLeu1843Thr
# R32unknownunknownSer 2182 His
# Cz1unknownunknownUTR 3’ A→G 8519
UTR 3’ G→A 8900
# Cz13unknownunknownUTR 3’ G→A 8900
# Cz34unknownunknownUTR 3’ G→A 8900

Genotyping was accomplished using a more advanced version of the software

HA cSBH screen results in 16 patients with known and in 7 patients with unknown mutations Genotyping was accomplished using a more advanced version of the software The correct genotyping of a patient (sample 95) and an obligate carrier (sample 73) with 2-bp deletions starting at codons 2275 and 49, respectively, was accomplished using a more advanced version of the software. We also analyzed samples of 7 haemophilic patients with unknown mutations, who were excluded to carry one of the common HA mutations, inversion of the intron 22 or intron 1. In this setting, we have found a previously undetected missense mutation Leu1843Thr in exon 16 in the sample P25. The mutation was then confirmed by direct gene sequencing. In sample R32 we found another polymorphic change in Ex 23 (Ser 2182 His). In addition, we found a polymorphism in the 3’UTR region (Ex 26, nucleotide 8900 (G→A)) in sample P21, Cz1,Cz13 and Cz34 and Cz36 and another polymorphism in the 3’UTR region at nucleotide 8519 A→G (samples Cz 36 and Cz 1).

Discussion

We demonstrated that cSBH, and the related HyChip product, is capable of identifying a range of point mutations located within the FVIII gene making this platform an attractive alternative to standard sequencing methods. We confirmed known FVIII gene mutations and found that large regions of genomic DNA can be sequenced with excellent readability using only one HyChip array. Because the maximal length of a single PCR product that could be sequenced by the chip is 1.2 kb in size, long exons such as exons 14 and 26 had to be divided in three and two fragments, respectively. The other exons are shorter and could be sequenced in entirety. cSBH sequence data showed 100% accuracy with only 0.2% of bases not called.[10] In addition, we demonstrated that the smaller exons 1-25 could be pooled together in groups in a manner allowing their complete sequencing on four different chips, with base readability of 100% and an accuracy of 100%. The reliability of this technique has been further provided by analyzing a series of HA patients in whom no gene mutation was detected by the first FVIII gene screening, using common strategies (SSCP, DGGE, or dHPLC). This approach was able to identify a new point mutation (Leu1843Thr), which was subsequently confirmed by direct gene sequencing. In the remaining patients, no mutation was found also when the direct gene sequencing approach was applied. Finally, how well this technique does work was provided by the identification of the mutant allele in a heterozygous carrier with one normal factor VIII gene. Recently, a different approach using a microarray technology has been applied for the detection of FVIII gene mutations.[14] As compared with that, the method we used was able to screen all exons using universal probes, rendering this approach reliable for all HA patients carrying a point mutation wherever it is located and chips available for a screening of different HA patients or distinct genes. cSBH is an efficient, reliable, and easy to use re-sequencing technique that offers an alternative approach for screening of large genes. cSBH allows sequencing of large segments of DNA on only one chip, thereby drastically reducing the time it takes to analyze a specific sequence. Actually the mean time to screen the coding regions of FVIII gene was three days. This period was reduced as compared to the time used in standard methods and comparable to that needed for sequencing analysis. The only equipment required, aside from basic tools (e.g., thermal cyclers, pipettes), is access to a standard array scanner, which makes it a very affordable technique to implement. The required data analysis software, which runs on standard PC computers, will be provided as part of the universal HyChip system and is expected to be economically priced. cSBH is a non-radioactive method and generates sequencing results that can be obtained from genomic DNA within a day. Another major advantage of cSBH are the truly universal HyChip arrays which can be applied to screen any DNA sequence without further probe or oligonucleotide changes, which can be very costly. cSBH offers an efficient and accurate screen for most DNA sequences. The cSBH method is highly reproducible because each base is currently read with 22 overlapped probes (11 per strand). In addition, sample preparation and ligation kits, aliquoted for individual assays, are prepared with all components including enzymes to yield very reproducible biochemical reactions. Furthermore, the base-calling software includes several normalization functions to minimize the influence of experimental variations.[13] The combinatorial ligation of universal probe sets coupled with informative probe pools provides a large number of long (currently 11-mers, potentially 12- to 15-mers in two-probe ligation and 17- to 24-mers in three probe ligation), overlapping probes of predetermined behavior that interrogate each base. Robust statistics take advantage of this redundant dataset to provide accurate base calling in spite of array imperfections (0.5–5.0% of missing dots; high droplet size variation), some impaired probe-target hybridizations due to target–target hybridization, significant experimental noise, and the always-present statistical noise related to probe-pooling and sequence repeats. The main software improvement in the future should address target DNA behavior (e.g., primers/primer dimers, palindromes, direct and inverted repeats, and uneven amounts of target DNA) to aid in a more accurate interpretation of base scores and associated P-values. Actually, the presence of short palindromes or repeats may contribute to the number of bases with lower “scores”, as seen in sample 2 [Figure 2A]. Furthermore, detection of low-signal AT rich sequences can be improved with optimization of probe concentrations and incorporation of 1,600 AT-rich 7-mers and 8-mers in the arrays, as well as 500 AT-rich 6-mers and 7-mers in the labelled probe pools, which are currently under development. Universal arrays use probes shorter than gene specific arrays. As a consequence, there is a higher chance to have repeated sequences longer than the length of probes (currently 11-mers in cSBH) occurring in a DNA sample. Such sequences may lead to uncertainty in the sequence assembly. For example, a stretch of 12 As can be perceived as 11, 12, 13, or more As. Also, a mutation in the middle of a repeated 20-mer or any longer segment may not be assigned to the actual copy of the repeat. This situation can be avoided by amplicon design that separates repeats into two sequencing reactions. Short identical repeats several bases in length or longer imperfect repeats, both found in exons or immediate intronic sequences that flank each exon, usually do not confuse the advanced base-calling software. There are limitations with cSBH analysis: the current software and probe set must be improved to detect DNA alterations within simple repeat regions, which is a problem common to other sequence technologies as well.[10] Another potential complication, which is common to other high resolution methods, including standard Sanger sequencing, is that all sequence changes, polymorphisms and missense changes that do not result in an obvious pathogenic mutation such as a stop codon and are of unknown clinical significance will be detected.[10] In summary, mutation detection for the FVIII gene requires a multimodal approach that can detect the full range of mutations present in this gene. First-stage mutation screening requires a fast, efficient and possibly economical approach to scan the entire gene. cSBH is a technique that is ideally suited. In this study, we showed that cSBH is capable of detecting a broad range of mutations, currently has the capacity to analyze long continuous read lengths of up to 2 kb per chip, and is able to analyze pooled PCR fragments (such as exons 1–25 in this study) on a single universal chip.
  12 in total

1.  DNA microarray analysis for the detection of mutations in hemophilia A.

Authors:  E Berber; J Leggo; C Brown; E Berber; N Gallo; H Feilotter; D Lillicrap
Journal:  J Thromb Haemost       Date:  2006-08       Impact factor: 5.824

2.  Factor VIII gene mutations found by a comparative study of SSCP, DGGE and CMC and their analysis on a molecular model of factor VIII protein.

Authors:  R Schwaab; J Oldenburg; M R Lalloz; U Schwaab; S Pemberton; P Hanfland; H H Brackmann; E G Tuddenham; K Michaelides
Journal:  Hum Genet       Date:  1997-12       Impact factor: 4.132

3.  The factor VIII Structure and Mutation Resource Site: HAMSTeRS version 4.

Authors:  G Kemball-Cook; E G Tuddenham; A I Wacey
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

4.  Evaluation of DHPLC in the analysis of hemophilia A.

Authors:  J Oldenburg; V Ivaskevicius; S Rost; A Fregin; K White; E Holinski-Feder; C R Müller; B H Weber
Journal:  J Biochem Biophys Methods       Date:  2001-01-30

Review 5.  Sequencing by hybridization (SBH): advantages, achievements, and opportunities.

Authors:  Radoje Drmanac; Snezana Drmanac; Gloria Chui; Robert Diaz; Aaron Hou; Hui Jin; Paul Jin; Sunhee Kwon; Scott Lacy; Bill Moeur; Jay Shafto; Don Swanson; Tatjana Ukrainczyk; Chongjun Xu; Deane Little
Journal:  Adv Biochem Eng Biotechnol       Date:  2002       Impact factor: 2.635

6.  Reproduction in hemophilia.

Authors:  R B Francis; C K Kasper
Journal:  JAMA       Date:  1983-12-16       Impact factor: 56.272

7.  Molecular characterization of severe hemophilia A suggests that about half the mutations are not within the coding regions and splice junctions of the factor VIII gene.

Authors:  M Higuchi; H H Kazazian; L Kasch; T C Warren; M J McGinniss; J A Phillips; C Kasper; R Janco; S E Antonarakis
Journal:  Proc Natl Acad Sci U S A       Date:  1991-08-15       Impact factor: 11.205

Review 8.  Haemophilias A and B.

Authors:  Paula H B Bolton-Maggs; K John Pasi
Journal:  Lancet       Date:  2003-05-24       Impact factor: 79.321

9.  Combinatorial sequencing-by-hybridization: analysis of the NF1 gene.

Authors:  Annalisa Schirinzi; Snezana Drmanac; Bruno Dallapiccola; Steve Huang; Kathryn Scott; Alessandro De Luca; Donald Swanson; Radoje Drmanac; Saul Surrey; Paolo Fortina
Journal:  Genet Test       Date:  2006

10.  Identification of APC gene mutations in colorectal cancer using universal microarray-based combinatorial sequencing-by-hybridization.

Authors:  Shannon Cowie; Snezana Drmanac; Donald Swanson; Kathleen Delgrosso; Steve Huang; Desirée du Sart; Radoje Drmanac; Saul Surrey; Paolo Fortina
Journal:  Hum Mutat       Date:  2004-09       Impact factor: 4.878

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.