Literature DB >> 21802048

Genetic flux between h1 and h2 haplotypes of the 17q21.31 inversion in European population.

Libin Deng1, Xiaoli Tang, Xiangwen Hao, Wei Chen, Jiari Lin, Yangyu Yu, Dake Zhang, Changqing Zeng.   

Abstract

The chromosome 17q21.31 inversion is a 900-kb common structural polymorphism found primarily in European population. Although the genetic flux within inversion region was assumed to be considerable suppressed, it is still unclear about the details of genetic exchange between the H1 (non-inverted sequence) and H2 (inverted sequence) haplotypes of this inversion. Here we describe a refined map of genetic exchanges between pairs of gene arrangements within the 17q21.31 region. Using HapMap phase II data of 1,546 single nucleotide polymorphisms, we successfully deduced 96 H1 and 24 H2 haplotypes in European samples by neighbor-joining tree reconstruction. Furthermore, we identified 15 and 26 candidate tracts with reciprocal and non-reciprocal genetic exchanges, respectively. In all 15 regions harboring reciprocal exchange, haplotypes reconstructed by clone sequencing did not support these exchange events, suggesting that such signals of exchange between two sister chromosomes in certain heterozygous individual were caused by phasing error regions. On the other hand, the finished clone sequencing across 4 of 26 tracts with non-reciprocal genetic flux confirmed that this kind of genetic exchange was caused by gene conversion. In summary, as crossover between pairs of gene arrangements had been considerably suppressed, gene conversion might be the most important mechanism for genetic exchange at 17q21.31.
Copyright © 2011 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21802048      PMCID: PMC5054451          DOI: 10.1016/S1672-0229(11)60014-4

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

Genetic flux plays an important role at genome evolution. Crossover and gene conversion are important forms of genetic information exchange. Although recombination has been and is being studied across human genome at variable levels (high and low resolution), studies of gene conversion have only covered several specific regions (such as major histocompatibility complex regions and globin genes), or duplication genes distributed at different locus (such as human endogenous retroviral sequence in human Y chromosome) 1, 2, 3. It is less known about gene conversions across autosomes, since it is hard to distinguish gene conversion events from double crossovers on mammal meiotic products (. As one kind of chromosomal rearrangements, inversion is believed to be an important model to discover gene conversions. Since gene flow among different oriented chromosome segments is inhibited, these segments would accumulate mutations independently and evolve towards different directions (. According to previous nucleotide diversity researches in Drosophila genus, gene conversion rate would be uniform along inversion loop, whereas double-crossover rate would be higher in central part of inversion loop when compared with inversion breakpoints (. In previous research, human chromosome 17q21.31 region has been identified as a 900-kb inversion polymorphism by assembly of chromosome-specific BAC clones (. Two orientations of this inverted segment represent distinct lineages, H1 (non-inverted sequence) and H2 (inverted sequence). The inverted configuration is linked to H2, which is relatively common in Europen samples but nearly absent in other HapMap populations. Recent studies have suggested no evidence of having recombined between two haplotype groups. Therefore, such inversion region provides a good sample to discover the gene conversions within autosomal regions. In this study, we used phase II haplotype data across the human chromosome 17q21.31 inversion polymorphism region to detect conversion events in HapMap CEU samples. Based on the assumption that no recombination events had taken place between H1 and H2 haplotypes, 26 non-reciprocal gene conversions were detected. Meanwhile, several conversion events (reciprocal genetic flux) were false due to haplotype phasing error in HapMap dataset. Our study shows that haplotype-based methods can be applied to scan for genetic flux at inversion polymorphism regions.

Results

Haplotype substructures in 17q21.31 region

To determine the segmental orientations, we carried out cluster analyses of haplotypes by considering this 900-kb inversion segment as a specific polymorphism frame. In total, 120 haplotypes were constructed based on 1,546 single nucleotide polymorphisms (SNPs) from HapMap phase II data in all unrelated individuals. Further haplotype tree analysis showed that the CEU population was composed of two distinct subgroups (Figure 1). Using two SNPs, namely rs1800547 and rs9468, for orientation determination, the larger clade containing 96 haplotypes corresponded to the H1 lineage as in the study of Stefansson et al. (, and the other clade containing 24 haplotypes corresponded to the H2 lineage as inversion group (Table S1). These haplotypes were consistent with Stefansson’s result by combining microsatellites and SNPs.
Figure 1

Neighbor-joining tree of long-range haplotypes from CEU population samples. With each branch representing a haplotype, clusters are constructed from data of 120 HapMap chromosomes at 17q21.31. In the two distinct clades, “○” represents H1, and “♦” represents H2.

Candidate conversion events within inversion region

Using phased haplotypes in H1 and H2 lineages, we predicted 41 candidate conversion regions (Table S2) with Betran’s algorithm (. The average length of them was 14,118 bp, longer than the length of other known conversion tracts. The longest one was 66,767 bp, containing 157 SNPs. Within these predicted conversions, 15 candidate regions had the corresponding conversion events observed in the same position of the sister chromosome from same individuals. We defined these paired predicted regions as reciprocal genetic exchange (Table 1). As shown in Figure 2, group 1 illustrated these reciprocal exchanges, and individuals with same candidate regions came from the same family, such as NA11881 and NA11882, NA12155 and NA12156. Moreover, non-reciprocal genetic exchanges could be observed in both H1 and H2 lineages (group 2 in Figure 2). In all, 26 regions of this kind were identified to have diverse length. These regions may be introduced from other population by gene conversion.
Table 1

Predicted and observed haplotypes in three reciprocal conversion regions.

RegionSampleHapMap haplotypeObserved haplotype
41,163,838 - 41,182,076
NA11881:C1T-A-A-T…G-G-A-G-CT-A-G-C…A-A-G-G-C
NA11881:C2C-C-G-C…A-A-G-A-CC-C-A-T…G-G-A-A-C
NA11882:C1C-C-G-C…A-G-A-G-TC-C-A-T…A-A-G-G-C
NA11882:C2T-A-A-T…A-A-G-A-CT-A-G-C…G-G-A-A-C

41,458,711 - 41,460,355
NA11881:C1T-G-C-C-C…C-C-A-A-C-TT-G-A-T-T…T-T-C-G-C-T
NA11881:C2C-C-A-T-T…T-T-C-G-T-GC-C-C-C-C…C-C-A-A-T-G
NA11882:C2T-G-C-C-C…T-T-C-G-T-GT-G-A-T-T…C-C-A-?-?-G
NA11882:C1C-C-A-T-T…T-C-A-A-C-TC-C-C-C-C…T-T-C-G-C-T

41,578,112 - 41,644,878NA12156:C1C-T-A-C-G-G-A-C-GC-T-A-T-A-A-A-C-G
NA12156:C2T-C-G-T-A-A-C-T-TT-C-G-C-G-G-C-T-T

Note: C1, one sister chromosome; C2, the other sister chromosome.

Figure 2

Distribution of predicted candidate conversion regions. In the upper panel, the left column shows the individuals. Black lines represent H1 and red lines represent H2; red bars in H1 and black bars in H2 represent predicted candidate conversion regions. Two dash lines demonstrated the inversion region. The bottom panel shows the recombination hotspots within this inversion region (retrieved from the HapMap official website at http://www.hapmap.org/).

For the 15 reciprocal genetic exchanges, candidate regions were observed in samples from the same family, implying that this kind of conversion may be the result of phasing error. To test this possibility, we obtained the real haplotypes of these candidate regions by clone sequencing. As shown in Table 1, comparison between the HapMap-predicted haplotypes and the observed ones demonstrated the existence of phasing errors. Factors contributed to phasing errors include conversion events occurred in parent generation and inaccuracy in algorithms adopted by software for phasing. We further sequenced candidate regions in NA10859 and in her parents (NA11881 and NA11882), and found that haplotypes were identical in both generations. Therefore, these errors were most likely due to algorithms in haplotype deduction.

Verification of conversion events

To validate the conversion event of the 26 non-reciprocal genetic exchanges, we selected four regions for sequencing verification (Table 2). Most of the candidate conversion regions were confirmed by clone sequencing, and small conversion regions seemed to be easier for verification. For instance, a 15-bp predicted conversion region, chr17: 41,111,640-41,111,654, was successfully validated in both NA11831 and NA12716. Large candidate regions may contain more SNP phasing errors, and haplotype deduction may be inaccurate. The candidate region chr17: 41581663–41634438 was longer than all known conversion tracts. It was predicted in both NA12236 and NA11839 samples, but was only confirmed in the latter one. The longest candidate region was predicted in NA12156, spanning 66,767 bp. In our attempt for its haplotype construction, we only obtained one haplotype at each breakpoint using clone sequencing. In addition, we found that both upstream and downstream haplotypes had phasing error SNPs, precluding us to identify this candidate as a conversion region.
Table 2

Verification of four “interspersed” conversion regions.

Region
41,012,678-41,015,758
41,111,640-41,111,654
41,195,723-41,195,778
41,581,663-41,634,438
SampleHaplotypeSampleHaplotypeSampleHaplotypeSampleHaplotype
NA07000:C1C-G-T-C-TNA07000:C1G-G-A-CNA07000:C1A-T-G-CNA07000:C1G-G-A-T…T-A-G-A
NA07000:C2C-G-T-C-TNA07000:C2G-G-A-CNA07000:C2A-T-G-CNA07000:C2G-G-A-T…T-A-G-A
NA11839:C1C-G-T-T-TNA11831:C1G-G-A-CNA07357:C1T-G-G-CNA07055:C1G-G-A-T…T-A-G-A
NA11839:C2C-G-T-C-TNA11831:C2A-G-C-ANA07357:C2A-T-G-CNA07055:C2G-G-A-T…T-A-G-A
NA12236:C1C-A-T-C-CNA12716:C1A-G-C-ANA12236:C1T-G-A-ANA12236:C1A-A-T-C…C-G-A-G
NA12236:C2C-A-T-C-CNA12716:C2G-G-A-CNA12236:C2T-G-A-ANA12236:C2G-A-T-C…C-G-G-G
NA12236:C1G-G-C-A
NA12236:C2G-G-C-A

Note: Bold, H1 haplotype; Unbolad, H2 haplotype; Underlined, conversion region; C1, one sister chromosome; C2, the other sister chromosome.

Discussion

In 17q21.31 inversion region, haplotype subgroups corresponded to the two alleles with opposite segmental orientations, H1 and H2, respectively (. Considering the high genetic divergence between these two arrangements (F=0.81), the crossover between two oriented chromosome segments has been significantly inhibited. In this study, we used phased haplotype data from CEU samples to detect gene conversion within 17q21.31 inversion region. After excluding the phasing error from HapMap haplotype construction, we identified 26 confident regions subjected to historic gene conversions. As conversion event can weak the strength of linkage disequilibrium (LD) (, it should be concerned at searching related disease genes within this region. The goal of International HapMap Project was to help researchers to mapping genes related to diseases using LD among markers. Although Bayesian statistical method was more accurate than other algorithms, such as expectation-maximization algorithm (, the genotyping error and structure variation could still affect the data qualification of HapMap. It reminds that researchers should sequence the target regions and determine the correct associated markers. For most applications, although it is easy to estimate the switch error rate of phasing based on computer simulation, the identification of phasing errors in particular loci is inefficient. In this study, we did verify several haplotype phasing error sites compared to real autosomal haplotypes from clone sequencing. However, the predicted method based on detection of “reciprocal genetic exchange” cannot apply to autosomal region without chromosome rearrangement.

Conclusion

Here we described a refined map of genetic exchanges between pairs of gene arrangements within the 17q21.31 region. Using HapMap phase II data of 1,546 SNPs, we successfully deduced 96 H1 and 24 H2 haplotypes in European samples by neighbor-joining (NJ) tree reconstruction. In particular, the large genetic differentiation between these two haplotype clades might be caused by the suppression of gene flux between these two arrangements. Furthermore, we applied methods of Betran et al. ( to scan for genetic flux between H1 and H2 in the 120 CEU haplotypes, and identified 15 and 26 candidate tracts with reciprocal and non-reciprocal genetic exchange, respectively. In all 15 regions harboring reciprocal exchange, haplotypes reconstructed by clone sequencing did not support these exchange event, suggesting that such signals of exchange between two sister chromosomes in certain heterozygous individual were caused by phasing error regions. On the other hand, the finished clone sequencing across 4 of 26 tracts with non-reciprocal genetic flux confirmed that this kind of genetic exchange was caused by gene conversion. In summary, as crossover between pairs of gene arrangements had been considerably suppressed, gene conversion might be the most important mechanism for genetic exchange at 17q21.31.

Material and Methods

Haplotype analysis and conversion prediction

Phasing haplotype data of 60 unrelated CEU samples (Utah residents with ancestry from northern and western Europe) were downloaded from the HapMap website (HapMap Phase II/rel#21a data files; http://www.hapmap.org/) (. In total, 1,546 autosomal markers across chr17q21.31 (41.02–41.98 Mb), which were segregating in CEU HapMap samples, were applied for analysis. To infer the evolutionary relationships of haplotypes, NJ tree algorithm was carried out for haplotype clustering (. Haplotype genetic distances were estimated by the allele sharing distance method (. The haplotype tree was then constructed by means of the NJ algorithm implemented in the MEGA3 (. Next, we used Betran’s method ( to detect gene conversion tracts from haplotype data between arrangements (or haplotype subgroups). The algorithm was provided by the DnaSP software (version 4.10.9) (. Based on the parameter φ (Equation A4 in Betran et al.) (, which measures the probability of detecting a converted site, we estimated the number and the length distribution of conversion tracts across 17q21.31 region.

Clone sequencing and conversion verification

DNA samples of the International HapMap Project were obtained from Coriell Repositories (Camden, NJ, USA). To verify the candidate conversions, seven regions within inversion region were selected for clone sequencing (Table 1, Table 2). All the PCR reactions were performed with TaKaRa LA Taq kit. PCR products were electrophoresed on gel of 1% agrose and were retrieved using AxyPrep DNA Gel Extraction kit (AxyGen Company) according to the recommendations of the manufacturer. In order to get the single haplotype, we cloned the products with pGEM-T vector (pGEM-T vector system, Promega) under the instructions of manufacturer. For each sample, we selected five clones. All the clones were cultured at 37°C for 12 h, and plasmids were extracted using the AxyPrep Plasmid kit according to the manufacturer instruction. DNA sequencing was performed on ABI 3730 DNA sequencer system using Big Dye chemistry. SNPs were found by aligning with reference sequence according to assembly 37 using Lasergene software.

Authors’ contributions

LD, XT and XH performed major data analyses and drafted the manuscript. XH, WC, JL and YY collected the dataset and performed clone sequencing. LD, DZ and CZ designed the study, supervised the project and co-wrote the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors have declared that no competing interests exist.
  13 in total

Review 1.  Contribution of gene conversion in the evolution of the human beta-like globin gene family.

Authors:  M N Papadakis; G P Patrinos
Journal:  Hum Genet       Date:  1999-02       Impact factor: 4.132

2.  DnaSP, DNA polymorphism analyses by the coalescent and other methods.

Authors:  Julio Rozas; Juan C Sánchez-DelBarrio; Xavier Messeguer; Ricardo Rozas
Journal:  Bioinformatics       Date:  2003-12-12       Impact factor: 6.937

3.  A common inversion under selection in Europeans.

Authors:  Hreinn Stefansson; Agnar Helgason; Gudmar Thorleifsson; Valgerdur Steinthorsdottir; Gisli Masson; John Barnard; Adam Baker; Aslaug Jonasdottir; Andres Ingason; Vala G Gudnadottir; Natasa Desnica; Andrew Hicks; Arnaldur Gylfason; Daniel F Gudbjartsson; Gudrun M Jonsdottir; Jesus Sainz; Kari Agnarsson; Birgitta Birgisdottir; Shyamali Ghosh; Adalheidur Olafsdottir; Jean-Baptiste Cazier; Kristleifur Kristjansson; Michael L Frigge; Thorgeir E Thorgeirsson; Jeffrey R Gulcher; Augustine Kong; Kari Stefansson
Journal:  Nat Genet       Date:  2005-01-16       Impact factor: 38.330

Review 4.  Insights into recombination from population genetic variation.

Authors:  Garrett Hellenthal; Matthew Stephens
Journal:  Curr Opin Genet Dev       Date:  2006-10-16       Impact factor: 5.578

5.  Using allele sharing distance for detecting human population stratification.

Authors:  Xiaoyi Gao; Eden R Martin
Journal:  Hum Hered       Date:  2009-06-11       Impact factor: 0.444

6.  Molecular population genetics of the rp49 gene region in different chromosomal inversions of Drosophila subobscura.

Authors:  J Rozas; C Segarra; G Ribó; M Aguadé
Journal:  Genetics       Date:  1999-01       Impact factor: 4.562

7.  The estimation of the number and the length distribution of gene conversion tracts from population DNA sequence data.

Authors:  E Betrán; J Rozas; A Navarro; A Barbadilla
Journal:  Genetics       Date:  1997-05       Impact factor: 4.562

8.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

9.  Intense and highly localized gene conversion activity in human meiotic crossover hot spots.

Authors:  Alec J Jeffreys; Celia A May
Journal:  Nat Genet       Date:  2004-01-04       Impact factor: 38.330

10.  New HLA-DPB1 alleles generated by interallelic gene conversion detected by analysis of sperm.

Authors:  G Zangenberg; M M Huang; N Arnheim; H Erlich
Journal:  Nat Genet       Date:  1995-08       Impact factor: 38.330

View more
  2 in total

1.  Functional Impact and Evolution of a Novel Human Polymorphic Inversion That Disrupts a Gene and Creates a Fusion Transcript.

Authors:  Marta Puig; David Castellano; Lorena Pantano; Carla Giner-Delgado; David Izquierdo; Magdalena Gayà-Vidal; José Ignacio Lucas-Lledó; Tõnu Esko; Chikashi Terao; Fumihiko Matsuda; Mario Cáceres
Journal:  PLoS Genet       Date:  2015-10-01       Impact factor: 5.917

2.  Validation and genotyping of multiple human polymorphic inversions mediated by inverted repeats reveals a high degree of recurrence.

Authors:  Cristina Aguado; Magdalena Gayà-Vidal; Sergi Villatoro; Meritxell Oliva; David Izquierdo; Carla Giner-Delgado; Víctor Montalvo; Judit García-González; Alexander Martínez-Fundichely; Laia Capilla; Aurora Ruiz-Herrera; Xavier Estivill; Marta Puig; Mario Cáceres
Journal:  PLoS Genet       Date:  2014-03-20       Impact factor: 5.917

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.