Literature DB >> 16781103

Diversity of 26-locus Y-STR haplotypes in a Nepalese population sample: isolation and drift in the Himalayas.

Emma J Parkin1, Thirsa Kraayenbrink, Jean Robert M L Opgenort, George L van Driem, Nirmal Man Tuladhar, Peter de Knijff, Mark A Jobling.   

Abstract

Twenty-six Y-chromosomal short tandem repeat (STR) loci were amplified in a sample of 769 unrelated males from Nepal, using two multiplex polymerase chain reaction (PCR) assays. The 26 loci gave a discriminating power of 0.997, with 59% unique haplotypes, and the highest frequency haplotype occurring 12 times. We identified novel alleles at four loci, microvariants at a further two, and nine examples of amelogenin-Y deletions (1.2%). Comparison with a similarly sized Bhutanese sample typed with the same markers suggested histories of isolation and drift, with drift having a greater effect in Bhutan. Extended (11-locus) haplotypes for the Nepalese samples have been submitted to the Y-STR Haplotype Reference Database (YHRD).

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16781103      PMCID: PMC2627361          DOI: 10.1016/j.forsciint.2006.05.007

Source DB:  PubMed          Journal:  Forensic Sci Int        ISSN: 0379-0738            Impact factor:   2.395


Introduction

The analysis of multiple Y-chromosomal short tandem repeats (STRs) provides informative male-specific DNA profiles in forensic analysis. As well as possessing high discriminating power in distinguishing individuals, haplotypes defined by STRs can provide information about likely geographical origin, since they are often concentrated in particular populations or regions. Population databases of Y haplotypes [1] are increasing in size and coverage, greatly contributing to the utility of Y-chromosomal analysis in forensic casework. In this study we describe alleles at 26 Y-STRs, and properties of the haplotypes they define, in a large sample of a previously unrepresented population, that of Nepal in the Himalayas. Eleven-locus haplotypes have been submitted to the Y-STR Haplotype Reference Database (YHRD), and full data are available from the authors on request. Our report follows guidelines for the publication of population data [2]. Sampling and Y-chromosomal analysis of 769 Nepalese males was undertaken as part of a larger collaborative project [3] investigating genetic diversity in Himalayan populations within the framework of their cultural and linguistic diversity [4]. Here we describe our initial findings with Y-STRs, treating the Nepalese sample as a single population; future publications will explore genetic relationships between subpopulations of the Himalayas. The sample represents 15 distinct ethnolinguistic groups widely distributed throughout Nepal, with ∼75% of sampled individuals speaking languages belonging to the Tibeto-Burman family, and the remainder speaking Indo-European languages. In this study, we employ the same set of Y-STRs as that used recently to analyse 856 Bhutanese males [5]. This allows us to carry out a preliminary comparison of diversity and haplotype sharing between these two Himalayan samples.

Materials and methods

DNA samples

Seven hundred and sixty-nine Bhutanese males provided blood samples with informed consent, and DNA was extracted as described [3]. DNA samples from collections of the authors, including Y Chromosome Consortium (YCC) cell lines [6], were used as haplotype reference materials.

Y-STR multiplexes

Two PCR multiplexes (a 20plex [7] and a partially overlapping 14plex [5]) were used to type 26 Y-STRs, as follows: DYS19, DYS385a/b, DYS388, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS425, DYS426, DYS434, DYS435, DYS436, DYS437, DYS438, DYS439, DYS447, DYS448, DYS460, DYS461, DYS462, YCAIIa/b, and Y-GATA-H4.1. The eleven Y-STR markers in the European ‘extended haplotype’ (http://www.yhrd.org/) are indicated in bold. The 14plex includes the amelogenin sex test. Full details of the protocol are given by Parkin et al. [5].

Y-STR nomenclature

Allele nomenclature (explained fully in Parkin et al. [5]) was according to Butler et al. [7] and Bosch et al. [8], with the exception of DYS439, DYS448 and Y-GATA-H4.1, where nomenclature was changed for compatibility with ISFG recommendations [9]. Compared to Butler et al. [7], seven repeats were subtracted from DYS439, three subtracted from DYS448, and eight added to Y-GATA-H4.1.

Calculations

Gene diversity and haplotype diversity were calculated using Arlequin [10]. A median-joining network was constructed using Network 4.0 ([11] http://www.fluxus-engineering.com/sharenet.htm), and the weighting scheme described by Qamar et al. [12].

Results and discussion

Diversity of alleles

Tables 1 and 2 show the allele frequency distributions for all the Y-STRs studied. Diversities of individual STRs are comparable with those observed in a recently studied Bhutanese sample: DYS385 (when considered as a genotype, Table 2) is the most diverse marker within the Y-STR set, with a gene diversity (h) of 0.915, and the most polymorphic single-locus marker is DYS439 (h = 0.726).
Table 2

Frequencies of genotypes at DYS385 and YCAII

GenotypeDYS385YCAII
10–140.005
11–110.017
11–120.007
11–130.004
11–140.070
11–150.001
11–160.001
11–180.003
11–190.001
11–200.001
12–120.004
12–130.003
12–140.009
12–150.001
12–160.027
12–170.029
12–180.014
12–190.007
12–200.012
13–130.022
13–140.022
13–150.003
13–160.027
13–170.056
13–180.182
13–190.177
13–200.060
13–210.014
13–220.001
13–230.003
14–140.004
14–150.007
14–160.010
14–170.013
14–180.044
14–190.031
14–200.027
14–220.001
15–150.004
15–160.0090.009
15–170.007
15–180.013
15–190.0030.014
15–200.007
15–210.001
16–160.0010.008
16–170.0040.003
16–180.004
16–190.0030.010
16–200.003
16–220.001
17–170.0030.036
17–180.029
17–190.606
17–200.113
17–210.091
17–220.001
17–230.001
18–180.012
18–190.0010.025
18–200.001
19–190.0010.036
20–200.001
20–210.001
13–17.20.001
13–18.20.016



h0.9150.607
h(Bh)a0.9210.524

Comparative Bhutanese values from [5].

Previously unreported alleles (defined with reference to Butler [13], Parkin et al. [5] and STRBase, http://www.cstl.nist.gov/biotech/strbase/index.htm) were found at four loci, as follows: DYS426 (allele 13), DYS437 (allele 11), DYS439 (allele 15), DYS447 (alleles 17, 18 and 19). ‘Null’ alleles or multiple peaks were reproducibly obtained at a number of loci. For DYS448, three individuals carried null alleles, while one carried both alleles 20 and 21. For DYS461, one individual carried both alleles 13 and 14. As observed previously [5], DYS425 exhibits a relatively high frequency of various nulls and duplications. Microvariants (partial alleles) were observed at two loci (Tables 1 and 2) and confirmed in uniplex assays after initial detection in multiplexes. Those at DYS385 were not investigated further, but those at DYS447 were analysed by sequencing, and shown to result from a deletion of 1 bp within the pentanucleotide repeat array [5].

AMELY deletion chromosomes

Nine chromosomes showed absence of the amelogenin Y (AMELY) peak in electropherograms. Analysis of sequence-tagged sites revealed that these chromosomes carry interstitial deletions of Yp including the AMELY locus (data not shown); none showed null Y-STR alleles, however, which is consistent with the size and location of known AMELY deletions with respect to the position of Y-STR loci [14]. A previous study has found AMELY deletions at a frequency of ∼2% in India [15], so our finding of deletions at 1.2% frequency in Nepal is not unexpected; in contrast, however, none were found in our previous study of Bhutan [5]. These AMELY deletion chromosomes form part of a large set that is currently being characterised, and will be described fully elsewhere.

Diversity of haplotypes

Haplotype diversity (equivalent to power of discrimination, PD) was calculated, omitting chromosomes carrying null alleles and duplications. This provided a sample size of 741. For the full set of 26 Y-STRs, there are 437 unique haplotypes (59.0%), and PD is 0.9970. The corresponding values for the 20plex [7], extended (11-locus) haplotype and minimal (9-locus) haplotype are shown in Fig. 1.
Fig. 1

Haplotype diversity for (a) all 26 STRs and the 20plex, and (b) the extended and minimal haplotypes. Histograms show the frequency distributions of haplotypes present more than once in the dataset.

Fig. 1 also shows the distribution of haplotypes present more than once in the dataset. Despite the large number of loci used here, in the 741 males one 26-locus haplotype is shared by 12 individuals (Fig. 1a), and a further 13 haplotypes are shared by between 5 and 9 individuals; notably, all these common haplotypes are restricted to particular subpopulations, illustrating the influence of drift. Reduction to 11-locus extended haplotypes allows a global search within the YHRD (release 18): this fails to find matches for three of the six most common Nepalese extended haplotypes (frequency ≥10), consistent with isolation and drift.

Comparison of Y-STR datasets on Nepal and Bhutan

The availability of large Y-STR haplotype datasets on Nepalese and Bhutanese samples allows us to make comparisons between the frequencies and distributions of alleles and haplotypes in these two Himalayan populations. Allele distributions at individual loci are similar between the Nepalese and Bhutanese samples, but this gives little information about population relationships. Particular rare and distinctive alleles may carry more information, because they probably reflect identity-by-descent: a good example of this is the sharing of microvariants at DYS447 [5], but apart from this there is little evidence for specific inter-population sharing. Comparison of haplotype distributions reveals a striking difference between the two populations. The proportion of unique haplotypes in the Nepalese sample is significantly greater than that in the Bhutanese, for all four haplotype resolutions considered (Fig. 2). For example, for the extended haplotype there are 41.8% (±1.8%) unique haplotypes in Nepal, but only 23.3% (±1.5%) in Bhutan. This is explained by the presence of several common haplotypes at high frequency in Bhutan: in the Nepalese dataset, the most common extended haplotypes are each present in 13 individuals, while in the Bhutanese there are haplotypes present in 15, 16, 24 (two instances) and 27 individuals [5].
Fig. 2

Percentage of unique haplotypes in Nepal compared to Bhutan. Haplotypes containing null or duplicated alleles are omitted, giving total sample sizes of 741 and 802, respectively. The error bars represent plus or minus one binomial standard error.

There are no 26-locus haplotypes shared between Nepal and Bhutan, indicating an absence of very recent gene flow. However, forty extended haplotypes are shared between the two samples, and their relationships (omitting the bilocal marker DYS385) are illustrated in a median-joining network in Fig. 3. Most of them fall into one large cluster, with haplotypes linked by single mutational steps, probably representing a common Y-SNP haplogroup. Other shared haplotypes are more widely spread, and may represent several different haplogroups.
Fig. 3

Median-joining network of haplotypes shared between Nepal and Bhutan. Note that the 40 shared extended haplotypes described in the text are reduced to 28 when the bilocal Y-STR DYS385 is removed for network construction. Circles represent Y-STR haplotypes (based on DYS19, DYS389I, DYS389II-I, DYS390, DYS391, DYS392, DYS393, DYS438, DYS439), with area proportional to number of instances. Lines represent Y-STR mutational steps.

To ask if these shared extended haplotypes are more generally common and widespread, we sought matches for the six most predominant examples (combined frequency >10) within the YHRD. Three of the six haplotypes find a total of six exact matches, all within populations originating from China or the Indian subcontinent. We also find a total of 30 one-step mutational neighbours for five of the six haplotypes, all of Asian origin. One haplotype finds neither exact matches nor one-step neighbours. Thus, the common haplotypes shared between Nepal and Bhutan are Asian-specific, but not generally frequent.

Concluding remarks

Our study emphasises the discriminating power of high-resolution Y-STR typing, and provides the first substantial dataset on a Nepalese sample. The comparison of Nepalese and Bhutanese datasets reveals an interesting overall picture of isolation and drift within these Himalayan populations, with drift having a greater effect in Bhutan than Nepal. Haplotype sharing provides evidence of some gene flow between Nepal and Bhutan, or possibly of gene flow into both from some other population. Further light will be thrown on these relationships when Y-SNP data become available.
Table 1

Frequencies of alleles at 22 of the 26 Y-STRs

Allele19388390391392393425426434435436437438439447448460461462389I389II-IH4.1
70.0220.0010.004
80.0030.001
90.0530.0850.5110.018
100.5630.7760.0720.0030.0870.0010.1680.1510.2700.0080.0010.003
110.0040.1640.2610.0090.0010.8260.8700.9820.0010.0030.7190.2480.1760.1400.2170.014
120.0010.2890.0050.0590.6750.9480.1700.0160.0170.9780.0250.3900.0380.6450.6840.524
130.0440.0680.0010.0230.2110.0180.0010.0270.0210.0010.1920.1600.0960.2900.001
140.6440.0200.4970.1000.3860.0160.0270.0010.165
150.2410.0460.0490.0050.5440.0030.0040.168
160.0660.0080.0160.0650.0030.550
170.0040.0010.0010.0010.0030.0460.189
180.0030.0030.0250.0870.022
190.0050.2420.0050.280
200.6140.586
210.0050.0640.104
220.0510.0570.0030.008
230.4900.501
240.3210.144
250.1240.086
260.0080.100
270.0010.072
280.010
290.004
21.40.001
22.40.013
23.40.001
9–120.001
10–110.004
11–120.005
12–130.005
13–140.001
20–210.001
Null0.0170.004



ha0.5210.5930.6390.3680.6730.4900.0400.2890.2350.0360.0430.5510.4470.7260.7020.5530.6330.5360.4760.6140.6260.567
h(Bh)b0.6040.5180.5690.4210.5460.4420.1870.2440.2440.0460.0920.5530.4520.7130.6630.5900.6790.4340.3630.5920.5040.598

Calculation of gene diversity, h, excludes null alleles and duplications.

Comparative Bhutanese values from [5].

  12 in total

1.  Publication of population data of human polymorphisms.

Authors:  P Lincoln; A Carracedo
Journal:  Forensic Sci Int       Date:  2000-05-08       Impact factor: 2.395

2.  A nomenclature system for the tree of human Y-chromosomal binary haplogroups.

Authors: 
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

3.  Y-chromosomal DNA variation in Pakistan.

Authors:  Raheel Qamar; Qasim Ayub; Aisha Mohyuddin; Agnar Helgason; Kehkashan Mazhar; Atika Mansoor; Tatiana Zerjal; Chris Tyler-Smith; S Qasim Mehdi
Journal:  Am J Hum Genet       Date:  2002-03-15       Impact factor: 11.025

4.  Median-joining networks for inferring intraspecific phylogenies.

Authors:  H J Bandelt; P Forster; A Röhl
Journal:  Mol Biol Evol       Date:  1999-01       Impact factor: 16.240

5.  Is the amelogenin gene reliable for gender identification in forensic casework and prenatal diagnosis?

Authors:  K Thangaraj; A G Reddy; L Singh
Journal:  Int J Legal Med       Date:  2002-04       Impact factor: 2.686

Review 6.  Recent Developments in Y-Short Tandem Repeat and Y-Single Nucleotide Polymorphism Analysis.

Authors:  J M Butler
Journal:  Forensic Sci Rev       Date:  2003-07

7.  A large interstitial deletion encompassing the amelogenin gene on the short arm of the Y chromosome.

Authors:  Wanda Lattanzi; Marilena C Di Giacomo; Gennaro M Lenato; Guglielmina Chimienti; Gianfranco Voglino; Nicoletta Resta; Gabriella Pepe; Ginevra Guanti
Journal:  Hum Genet       Date:  2005-02-22       Impact factor: 4.132

8.  26-Locus Y-STR typing in a Bhutanese population sample.

Authors:  Emma J Parkin; Thirsa Kraayenbrink; George L van Driem; Karma Tshering Of Gaselô; Peter de Knijff; Mark A Jobling
Journal:  Forensic Sci Int       Date:  2005-11-14       Impact factor: 2.395

9.  High resolution Y chromosome typing: 19 STRs amplified in three multiplex reactions.

Authors:  Elena Bosch; Andrew C Lee; Francesc Calafell; Eduardo Arroyo; Peter Henneman; Peter de Knijff; Mark A Jobling
Journal:  Forensic Sci Int       Date:  2002-01-24       Impact factor: 2.395

10.  A novel multiplex for simultaneous amplification of 20 Y chromosome STR markers.

Authors:  John M Butler; Richard Schoske; Peter M Vallone; Margaret C Kline; Alan J Redd; Michael F Hammer
Journal:  Forensic Sci Int       Date:  2002-09-10       Impact factor: 2.395

View more
  21 in total

1.  Haplotype analysis of the polymorphic 17 YSTR markers in Kerala nontribal populations.

Authors:  Seema Nair Parvathy; Aswathy Geetha; Chippy Jagannath
Journal:  Mol Biol Rep       Date:  2012-02-05       Impact factor: 2.316

2.  Y chromosome interstitial deletion induced Y-STR allele dropout in AMELY-negative individuals.

Authors:  Yan Ma; Jin-Zhi Kuang; Ji Zhang; Gui-Min Wang; Yu-Jian Wang; Wei-Min Jin; Yi-Ping Hou
Journal:  Int J Legal Med       Date:  2012-06-06       Impact factor: 2.686

3.  Y-STR diversity in the Himalayas.

Authors:  Tenzin Gayden; Shilpa Chennakrishnaiah; Joel La Salvia; Sacha Jimenez; Maria Regueiro; Trisha Maloney; Patrice J Persad; Areej Bukhari; Annabel Perez; Oliver Stojkovic; Rene J Herrera
Journal:  Int J Legal Med       Date:  2010-07-21       Impact factor: 2.686

4.  The Himalayas as a directional barrier to gene flow.

Authors:  Tenzin Gayden; Alicia M Cadenas; Maria Regueiro; Nanda B Singh; Lev A Zhivotovsky; Peter A Underhill; Luigi L Cavalli-Sforza; Rene J Herrera
Journal:  Am J Hum Genet       Date:  2007-04-04       Impact factor: 11.025

5.  Molecular characterization of a polymorphic 3-Mb deletion at chromosome Yp11.2 containing the AMELY locus in Singapore and Malaysia populations.

Authors:  Rita Y Y Yong; Linda S H Gan; Yuet Meng Chang; Eric P H Yap
Journal:  Hum Genet       Date:  2007-06-23       Impact factor: 4.132

6.  Analysis of 23 Y-STR loci in Chinese Jieyang Han population.

Authors:  Tianshan Guan; Xuheng Song; Cheng Xiao; Huilin Sun; Xiaoying Yang; Chao Liu; Ling Chen
Journal:  Int J Legal Med       Date:  2019-02-18       Impact factor: 2.686

7.  Phylogenetic analysis and forensic evaluation among Rakhine, Marma, Hajong, and Manipuri tribes from four culturally defined regions of Bangladesh using 17 Y-chromosomal STRs.

Authors:  Mahamud Hasan; Abu Sufian; Pilu Momtaz; Ashish Kumar Mazumder; Jabedul Alam Khondaker; Saikat Bhattacharjee; Kanchan Chakma; Sharif Akhteruzzaman
Journal:  Int J Legal Med       Date:  2018-08-24       Impact factor: 2.686

8.  Phylogenetic and forensic studies of the Bangladeshi population using next-generation PowerPlex® Y23 STR marker system.

Authors:  Md Mahamud Hasan; Pilu Momtaz; Tania Hossain; Ashish Kumar Mazumder; Md Jabedul Alam Khondaker; Abu Sufian; Md Niaz Makhdum; Sharif Akhteruzzaman
Journal:  Int J Legal Med       Date:  2016-04-07       Impact factor: 2.686

9.  Null allele sequence structure at the DYS448 locus and implications for profile interpretation.

Authors:  Bruce Budowle; Xavier G Aranda; Robert E Lagace; Lori K Hennessy; John V Planz; Manuel Rodriguez; Arthur J Eisenberg
Journal:  Int J Legal Med       Date:  2008-06-26       Impact factor: 2.686

10.  Dynamic nature of the proximal AZFc region of the human Y chromosome: multiple independent deletion and duplication events revealed by microsatellite analysis.

Authors:  Patricia Balaresque; Georgina R Bowden; Emma J Parkin; Ghada A Omran; Evelyne Heyer; Lluis Quintana-Murci; Lutz Roewer; Mark Stoneking; Ivan Nasidze; Denise R Carvalho-Silva; Chris Tyler-Smith; Peter de Knijff; Mark A Jobling
Journal:  Hum Mutat       Date:  2008-10       Impact factor: 4.878

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.