Literature DB >> 27833167

A 30-InDel Assay for Genetic Variation and Population Structure Analysis of Chinese Tujia Group.

Chunmei Shen1,2, Bofeng Zhu3,4,5, Tianhua Yao6, Zhidan Li7, Yudang Zhang8, Jiangwei Yan9, Bo Wang10, Xiaohua Bie11, Fadao Tai2.   

Abstract

In the present study, thirty autosomal insertion and deletion polymorphic loci were simultaneously amplified and genotyped in a multiplex system, and their allelic frequencies as well as several forensic parameters were obtained in a sample of 236 unrelated healthy Tujia individuals. All the loci were in Hardy-Weinberg equilibrium after applying a Bonferroni correction and all pair-wise loci showed no significant linkage disequilibrium. These loci were observed to be relatively informative and discriminating, quite efficient for forensic applications. Allelic frequencies of 30 loci were compared between the Tujia group and other reference populations, and the results of analysis of molecular variance indicated the Tujia group showed the least significant differences with the Shanghai Han at one locus, and the most with Central Spanish population at 22 loci. We analyzed the population genetic structure by the principal component analysis, the clustering of STRUCTURE program and a Neighbor-Joining tree, and then evaluated the genetic relationships among Tujia and other 15 populations.

Entities:  

Mesh:

Year:  2016        PMID: 27833167      PMCID: PMC5104975          DOI: 10.1038/srep36842

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Short tandem repeats (STRs) have become popular DNA markers in forensic DNA labs for more than 20 years and have been proved to possess several benefits, which make them especially suitable to identify victims, perpetrators, missing persons, and for kinship testing and population genetic analysis12345. However, there were some potential limitations of STRs in forensic applications because of its relatively high mutation rate, long amplicon size, and the deficiency in the analysis of highly degraded DNA samples and complex kinship cases. In recent years, a novel genetic marker: insertion and deletion polymorphisms (InDels) dispersing through the human genome showed some advantages, such as short amplicon size, low mutation rate, and practicability of being genotyped in the present forensic DNA lab platforms678, which were useful for forensic DNA applications (Supplementary method for STR applications), population genetics9101112, and biogeographic ancestry analysis131415. Population genetic and forensic validation studies have been performed using the Qiagen Investigator DIPplex® reagent including 30 autosomal InDel loci plus amelogenin locus, and population data of Chinese Han, Tibetan, Uigur, Kazak, She, Xibe and Yi populations have been reported in previous studies910161718. In the present study, we firstly reported the population genetic data of 30 InDels in Chinese Tujia ethnic group, evaluated their usefulness in the field of forensic sciences, analyzed the interpopulation differentiations, and retraced the genetic background of the Tujia group by the population structure construction, principal component analysis, phylogenetic tree and some other analyses.

Materials and Methods

Ethical statement and population samples

Bloodstain samples were randomly collected from 236 unrelated healthy Tujia individuals in Enshi Tujia and Miao Autonomous Prefecture of Hubei province, China. The study was conducted in accordance with the human and ethical research principles of Xi’an Jiaotong University Health Science Center and approved by the ethics committee of Xi’an Jiaotong University Health Science Center. We have obtained written informed consent from all volunteers for the purpose of research. The investigation was conducted in order to ensure that any two individuals didn’t share a common ancestry within at least three previous generations; all individuals were born and lived in the same prefecture; and their ancestors married no any other ethnic people.

DNA extraction, co-amplification and genotyping

Genomic DNA was extracted from bloodstain cards by using the Chelex® method (Solarbio, Beijing, China) according to the manufacturer’s instructions19. About 0.5–1.0 ng genomic DNA was used for amplification with a 25 ul reaction volume. PCR amplification for 30 InDel loci and Amelogenin locus was performed in a single multiplex reaction using the DIPplex Investigator reagent (Qiagen, Hilden, Germany), which was prepared on a GeneAmp® PCR System 9700 thermal cycler (Applied Biosystems, Foster City, CA, USA) under the recommended reaction condition. PCR products of all loci were separated and detected by capillary electrophoresis on the ABI 3500 Genetic Analyzer (Applied Biosystems). Genotyping of InDel loci was analyzed using the BTO 550 (Qiagen) as internal lane standard and by GeneMapper® ID software v3.2 (Applied Biosystems). Experiments were carried out according to the kit control and the ISO 17025 standard in this study.

Statistical analyses

Hardy-Weinberg equilibrium (HWE), allelic frequencies and forensic statistical parameters of 30 InDels were calculated by the modified powerstat (version1.2) spreadsheet (Promega, Madison, WI, USA). Linkage disequilibrium (LD) analysis for all pair-wise InDel loci was performed using the SNPAnalyzer v2.0 (Istech, South Korea)20. Fst and p values for pairwise interpopulation comparisons were calculated based on allele frequencies of 30 InDels by analysis of molecular variance (AMOVA) performed with ARLEQUIN version 3.1 software (http://cmpg.unibe.ch/software/arlequin3). Principal component analysis (PCA) in two forms and phylogenetic reconstruction were employed in MATLAB 2007a (MathWorks Inc., USA), R statistical software v3.0.221 and genetic distance and phylogenetic analysis (DISPAN) program (http://pritch. bsd.uchicago.edu), respectively. The detailed population genetic structure was performed with the STRUCTURE program v2.2 (http://pritch.bsd.uchicago.edu.) to analyze the structure of Tujia and the other populations previously published based on the same 30 InDels.

Results and Discussion

Allele diversities within group

Probability values for Hardy-Weinberg equilibrium tests for 29 InDel loci ranged from 0.0669 (HLD40) to 0.9987 (HLD97), and p < 0.05 was only observed at the HLD88 locus (p = 0.0382). P values were adjusted after applying a Bonferroni correction for all 30 InDel loci analyzed and P > 0.00167 was considered statistically insignificant. Then, the genotype frequency data for all loci showed no deviations from HWE expectations in the sample of Tujia group. Allelic frequencies and forensic statistical parameters of 30 InDels based on the raw genotype (shown in Supplementary Table 1) were shown in Table 1. Allelic frequencies of deletion allele at the 30 InDel loci ranged from 0.0445 to 0.9089 in the group, with a mean value of 0.4939. The observed (HO) and expected heterozygosities (HE) ranged from 0.0890 (HLD118) to 0.5381(HLD92); and 0.0850 (HLD118) to 0.4985(HLD136), with a mean value of 0.4028 and 0.4073, respectively. Twenty-four InDel loci had power of discrimination (PD) values greater than 0.5, except the six loci: HLD39, HLD64, HLD81, HLD99, HLD111, and HLD118 loci. The values of the power of exclusion (PE), the matching probability (MP), the typical paternity index (TPI), and the polymorphic information content (PIC) ranged from 0.0067 to 0.2231, 0.3524 to 0.8379, 0.5488 to 1.0826 and 0.0814 to 0.3742, respectively. The lowest HO, HE, PIC, TPI, PD and PE were observed at HLD 118 locus, and this locus was also found with the lowest polymorphism in other previously studied groups10. The combined power of exclusion (CPE) and discrimination (CPD) at the 30 InDel loci in the Tujia group were 0.9860 and 0.9999999999761, respectively; combined matching probability (CMP) value of 30 InDels in the group was 2.3894 × 10−11, higher than that in our previous study which reached 1.10974 × 10−19 of 21 autosomal STRs in Tujia group11. According to our calculation, the value of CMP combining 30 InDels with 21 autosomal STRs reached 2.652 × 10−30. These data suggested that the panel of 30 InDel loci could be a valid supplement to the routine detection of autosomal STRs in forensic cases.
Table 1

The allelic frequencies and forensic efficiency parameters for 30 InDels in Chinese Tujia ethnic Group (n = 236).

HLDDIP(−)DIP(+)PDMPPICPETPIHOHEp
HLD6 (rs1610905)0.53180.46820.61640.38360.37400.19891.02610.51270.49800.6744
HLD39 (rs17878444)0.88350.11650.35190.64810.18470.03430.63780.21610.20590.7107
HLD40 (rs2307956)0.32630.67370.60280.39720.34300.10300.80820.38140.43960.0669
HLD45 (rs2307959)0.34750.65250.59370.40630.35060.15950.93650.46610.45350.7186
HLD48 (rs28369942)0.55720.44280.60720.39280.37170.20671.04420.52120.49350.4123
HLD56 (rs2308292)0.44920.55080.60610.39390.37240.21071.05360.52540.49480.3640
HLD58 (rs1610937)0.59750.40250.61010.38990.36530.18020.98330.49150.48100.7702
HLD64 (rs1610935)0.13980.86020.39970.60030.21160.04930.67820.26270.24060.4369
HLD67 (rs1305056)0.24580.75420.53730.46270.30200.08090.75640.33900.37070.3010
HLD70 (rs2307652)0.43640.56360.60800.39200.37090.20281.03510.51690.49190.4611
HLD77 (rs1611048)0.53600.46400.63230.36770.37370.16960.95930.47880.49740.5461
HLD81 (rs17879936)0.16950.83050.44510.55490.24190.05880.70240.28810.28150.8375
HLD83 (rs2308072)0.61860.38140.60860.39140.36050.16620.95160.47460.47180.9576
HLD84 (rs3081400)0.23940.76060.53070.46930.29790.08290.76130.34320.36420.4879
HLD88 (rs8190570)0.44700.55300.64760.35240.37220.13190.87410.42800.49440.0382
HLD92 (rs17174476)0.54450.45550.59980.40020.37300.22311.08260.53810.49600.2073
HLD93 (rs2307570)0.44070.55930.63320.36680.37150.15950.93650.46610.49300.3912
HLD97 (rs17238892)0.66100.33900.59470.40530.34770.14670.90770.44920.44810.9987
HLD99 (rs2308163)0.13560.86440.37230.62770.20690.02840.62110.19490.23440.1472
HLD101 (rs2307433)0.53390.46610.64420.35580.37380.14670.90770.44920.49770.1275
HLD111 (rs1305047)0.90890.09110.28090.71910.15190.01730.58710.14830.16560.4661
HLD114 (rs2307581)0.76480.23520.52620.47380.29500.08290.76130.34320.35970.5805
HLD118 (rs16438)0.04450.95550.16210.83790.08140.00670.54880.08900.08500.8353
HLD122 (rs8178524)0.76690.23310.52400.47600.29360.07300.73750.32200.35750.2461
HLD124 (rs6481)0.41740.58260.62130.37870.36810.16960.95930.47880.48630.7924
HLD125 (rs16388)0.64830.35170.59280.40720.35200.16620.95160.47460.45600.5874
HLD128 (rs2307924)0.65040.34960.59330.40670.35130.16280.94400.47030.45470.6518
HLD131 (rs1611001)0.66530.33470.58560.41440.34620.15950.93650.46610.44540.5411
HLD133 (rs2067235)0.63770.36230.59700.40300.35530.16960.95930.47880.46210.6273
HLD136 (rs16363)0.47250.52750.63700.36300.37420.16280.94400.47030.49850.3696

DIP(−), frequency of deletion allele; DIP(+), frequency of insertion allele; PD, power of discrimination; MP, matching probability; PIC, polymorphic information content; PE, power of exclusion; TPI, typical paternity index; HO, observed heterozygosity; HE, expected heterozygosity; p, probability values for Hardy-Weinberg equilibrium tests.

Linkage disequilibrium tests

Linkage disequilibrium tests of these pairwise InDels were analyzed using the SNPAnalyzer version 2.0 and obtained several indexes: LOD, r and |D’|. As shown in Supplementary Fig. 1, no strong linkage disequilibrium between two different InDels was observed in a total of 435 interclass correlation tests (data not shown) with the values of r less than 0.8, and no crimson box was coated by a thick black curve. The present LD tests suggested that 30 InDels were independent for the following statistical analyses, and also suited for forensic cases in the Tujia group.

Genetic divergences

Genetic distance is a measure method of the genetic divergence between different populations, used for understanding the origin of biodiversity and reconstructing the history of different ethnic groups22. We measured the Nei’s D distance by examining the differences between allelic frequencies at the same set of 30 InDel loci of different populations. D distances between the 16 groups with each other based on allelic frequencies of the 30 InDel loci were shown in Table 2. Short genetic distances were found between the Tujia group and Shanghai Han17, Guangdong Han18, South Korean23, Beijing Han9, Xibe10, She17, Tibetan9, and Yi16 groups; and further distances were observed between the Tujia group and Chinese Kazak9, Uigur9 groups; whereas the larger distances were estimated with Uruguayan24, Dane25, Central Spanish26, Basque26, and Hungarian populations27. Pairwise populations had small genetic distances, which indicated that they had close genetic relationships or shared a recent common ancestor.
Table 2

D distances between Tujia group and other populations based on allelic frequencies of the same set of 30 InDel loci.

PopulationsShanghai HanBeijing HanGuangdong HanTujiaTibetanSheSouth KoreanUigurKazakDaneBasqueCentral SpanishUruguayanHungarianYiXibeReferences
Shanghai Han*               Wang et al.17
Beijing Han0.0011*              Wei et al.9
Guangdong Han0.00060.0019*             Hong et al.18
Tujia0.00040.00130.0007*            Present study
Tibetan0.00380.00290.00550.0038*           Wei et al.9
She0.00190.00230.00150.00190.0065*          Wang et al.17
South Korean0.00080.00240.00170.00090.00380.0028*         Seong et al.23
Uigur0.01140.01000.01180.01250.00930.01330.0135*        Wei et al.9
Kazak0.00960.00830.01000.01040.00740.01120.01150.0013*       Wei et al.9
Dane0.02640.02510.02650.02720.02260.02750.02880.00830.0093*      Friis et al.25
Basque0.02700.02700.02680.02810.02580.02880.02870.00960.01110.0048*     Martin et al.26
Central Spanish0.02680.02620.02690.02770.02310.02850.02880.00690.00850.00300.0033*    Martin et al.26
Uruguayan0.02400.02300.02440.02500.01990.02550.02580.00570.00670.00390.00430.0023*   Saiz et al.24
Hungarian0.02710.02550.02750.02810.02220.02890.02950.00680.00840.00260.00450.00220.0021*  Kis et al.27
Yi0.00400.00540.00380.00400.00660.00510.00420.01630.01330.03150.03280.03230.02860.0038* Zhang et al.16
Xibe0.00150.00220.00230.00150.00370.00320.00160.00920.00680.02270.02360.02260.02030.00230.0052*Meng et al.10

InDel diversities among populations

Population differentiations for 30 InDels were compared between the Tujia group and other populations previously published based on AMOVA method (p < 0.05). As shown in Table 3, the AMOVA comparison results showed significant differences between the Tujia group and Shanghai Han, Beijing Han, Guangdong Han, She, Xibe, South Korean, Tibetan, Yi, Uigur, Kazak, Uruguayan, Hungarian, Basque, Dane, Central Spanish populations at 1, 3, 3, 4, 5, 7, 8, 9, 14, 14, 20, 20, 20, 21 and 22 loci, respectively. The present results demonstrated that the HLD125, HLD99, HLD67, HLD118 loci had relatively high level of genetic variation, with the significant differentiation between Tujia group and other 9, 10, 10 and 11 populations, respectively; while the least differentiation was obtained at the HLD92, HLD101, HLD124 loci with only one pair-wise population. Therefore, allele frequency data obtained at 30 InDels are very important and necessary for forensic application research of different populations.
Table 3

Pairwise Fst and p values between the Tujia group and other 15 populations based on AMOVA method.

LociShanghai HanBeijing HanGuangdong HanTibetanSheYiSouth KoreanUigurKazakXibeDaneBasqueCentral SpanishUruguayanHungarian
FSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTpFSTp
D77−0.00091.00000.00670.11630.00010.4467−0.00231.0000−0.00251.00000.03880.0000−0.00181.0000−0.00361.0000−0.00391.00000.00310.15250.03540.00200.02460.02250.00310.30300.00940.0547−0.00251.0000
D450.00170.2160−0.00361.0000−0.00111.0000−0.00120.7107−0.00291.0000−0.00321.00000.00700.0372−0.00341.00000.00650.1496−0.00131.00000.04020.00000.04320.00390.02440.01560.02980.00200.05230.0000
D131−0.00111.00000.00040.4555−0.00191.00000.05260.0000−0.00291.00000.00210.2708−0.00010.48680.04940.00000.02480.0108−0.00121.00000.06950.00000.00000.50440.05210.00100.07080.00000.11220.0000
D70−0.00151.00000.00230.2972−0.00181.00000.00140.34410.00620.10950.00120.36460.00840.0127−0.00150.7928−0.00271.00000.00360.16230.00420.19260.01030.10950.00050.44770.01630.0205−0.00100.7703
D6−0.00151.0000−0.00211.0000−0.00111.0000−0.00321.00000.00780.07230.01370.02740.00090.28640.00910.06160.02190.01470.00890.03420.00250.30210.00570.1760−0.00130.69600.02360.00000.00240.2473
D111−0.00121.0000−0.00381.0000−0.00080.8475−0.00381.00000.00340.2033−0.00221.00000.00070.35780.15520.00000.14230.00000.00320.14960.44170.00000.48240.00000.40510.00000.38090.00000.44120.0000
D58−0.00151.0000−0.00291.00000.00650.04890.00310.2405−0.00321.0000−0.00171.00000.00800.0284−0.00261.0000−0.00361.00000.02040.00200.04640.00200.09570.00000.03450.00290.00270.25020.03300.0000
D560.00050.3617−0.00150.81130.00480.08500.00850.0919−0.00291.00000.02980.0000−0.00070.80250.00160.2854−0.00120.64320.00250.21990.01610.03520.00390.23850.07710.0000−0.00241.00000.03480.0000
D1180.00170.19940.00180.34210.00850.0293−0.00271.00000.05810.00000.03150.0000−0.00141.00000.41230.00000.32460.00000.01170.00780.56700.00000.62640.00000.62000.00000.58300.00000.45850.0000
D92−0.00111.0000−0.00331.00000.00220.2033−0.00170.9267−0.00291.0000−0.00010.47900.00100.27270.00520.16130.01600.0156−0.00060.6647−0.00351.0000−0.00200.7439−0.00241.00000.00090.33140.00100.3109
D93−0.00121.0000−0.00321.0000−0.00211.00000.00580.15350.00970.04990.03340.00000.00300.14080.00900.0821−0.00150.73220.00570.09190.00020.44770.00600.21210.00090.3803−0.00231.00000.00050.3744
D99−0.00151.00000.02650.0078−0.00020.48970.04180.0049−0.00150.8328−0.00110.68230.01120.00880.16570.00000.17400.0000−0.00070.68430.22250.00000.15500.00000.19140.00000.15290.00000.19980.0000
D880.00140.19750.00360.24440.00240.16720.00750.1036−0.00271.0000−0.00261.00000.00090.35780.00710.1124−0.00160.7302−0.00110.84160.00950.10070.03990.00590.02850.01560.00060.4096−0.00151.0000
D101−0.00151.0000−0.00281.0000−0.00181.0000−0.00331.00000.01070.0635−0.00160.88560.00040.40270.02920.00490.00810.1075−0.00131.00000.01440.0557−0.00321.0000−0.00020.52200.00160.3118−0.00191.0000
D670.01040.01080.00370.18960.00080.32650.05910.0010−0.00060.54940.00300.25420.01060.01660.03160.00100.07290.00000.00170.27080.05640.00100.16820.00000.03510.00590.07110.00000.06300.0000
D83−0.00060.73310.02440.0098−0.00101.00000.00080.43300.01860.01270.02940.0020−0.00111.0000−0.00180.9247−0.00211.0000−0.00090.80350.05630.00100.05700.00000.03480.00680.00590.09870.00990.0303
D1140.00200.19750.00400.1818−0.00171.00000.00080.4458−0.00211.0000−0.00311.00000.00470.08210.05280.00000.08280.00000.01590.00680.04020.00100.01560.07920.13440.00000.06640.00000.02730.0000
D480.00530.0577−0.00080.59730.00430.0792−0.00351.00000.00530.14270.00820.07430.00700.0362−0.00050.5279−0.00351.00000.00040.36170.10260.00000.01120.10170.02130.01960.01240.0352−0.00020.5435
D1240.00190.1877−0.00271.00000.00820.02840.00670.13290.00670.1075−0.00281.0000−0.00030.55520.00530.1711−0.00381.0000−0.00211.0000−0.00261.00000.01570.05870.00430.20720.00430.17890.00420.1241
D122−0.00151.0000−0.00301.00000.00170.22870.05440.00000.02130.0098−0.00311.00000.00080.30990.04060.00200.03790.00100.00200.23070.04960.00000.02040.04300.06390.00000.08550.00000.13530.0000
D125−0.00141.00000.01610.02350.00000.50640.01550.03130.00190.31090.00890.0547−0.00161.00000.05110.00000.02410.00590.00460.11050.06980.00000.02700.01370.02230.01560.02060.00490.06480.0000
D640.00100.2669−0.00261.0000−0.00030.56400.00320.2219−0.00271.00000.00340.1730−0.00151.00000.09000.00000.08490.00000.02760.00000.20120.00000.28410.00000.18420.00000.09280.00000.22860.0000
D81−0.00131.0000−0.00231.00000.00220.2111−0.00321.00000.00150.35970.00860.05960.00600.03320.03850.00000.04840.0000−0.00171.00000.27730.00000.22840.00000.33330.00000.29850.00000.27660.0000
D136−0.00151.0000−0.00060.55030.00430.0899−0.00251.00000.00420.19360.05660.00000.00420.0811−0.00361.0000−0.00090.6012−0.00181.0000−0.00210.92280.05290.0000−0.00311.0000−0.00070.61390.00170.2825
D133−0.00151.0000−0.00331.0000−0.00151.00000.01320.03620.00280.25810.00010.4379−0.00131.00000.00340.25020.01250.05670.00180.24150.10980.00000.11810.00000.12720.00000.06100.00000.08440.0000
D97−0.00161.0000−0.00040.5200−0.00070.7459−0.00301.0000−0.00130.7527−0.00191.0000−0.00161.00000.00410.2160−0.00241.00000.00040.36170.07030.00000.13330.00000.06310.00000.05960.00000.04820.0000
D1280.00100.3089−0.00221.0000−0.00171.00000.00140.33720.00000.48090.09310.0000−0.00181.00000.01040.05960.00330.2796−0.00201.00000.01010.10070.04690.00290.01750.02440.01660.00780.03480.0000
D39−0.00111.0000−0.00120.68720.00060.37240.06670.0000−0.00110.7126−0.00050.6158−0.00111.00000.11720.00000.04230.00200.00340.13490.11090.00000.08860.00000.22820.00000.20370.00000.18370.0000
D400.00270.12510.00020.45750.00030.44090.03530.00100.00030.42820.01980.00980.00320.14960.03250.00290.03770.00200.00330.16520.12180.00000.10890.00000.13100.00000.07740.00000.10870.0000
D84−0.00101.00000.00140.37240.00210.19260.01030.07330.00660.1202−0.00070.5816−0.00171.00000.05290.00000.00460.15250.00280.17110.12430.00000.06990.00000.05340.00100.09820.00000.09160.0000

Principal component analyses

On the basis of the allelic frequencies at the same 30 InDels, PCA figures were constructed by MATLAB 2007a (MathWorks Inc., USA) and R statistical software v3.0.221 among the Tujia group and other 15 reference populations. As shown in Fig. 1a, the variance ratio contribution of the first principal component (PC) was about 77.87% of the total variation and the second accounted for 5.74%. In the PCA diagram, the 16 populations were divided into three relatively independent areas inconsistency with their languages family. Ethnic groups with similar language family basically spread closer. The results indicated that there were close relationships between the Tujia group and Chinese Han populations from different regions, as well as She and South Korean groups. Ya et al. studied the haplotypes of 17 Y-STR loci and preformed the multidimensional scaling plot which also showed the close relationship between Tujia and Han population28; and the similar result was observed in the PCA plot based on the allelic frequencies of HLA-DRB1 locus29. The relatively far genetic relationships between Tujia group and Kazak or Uigur group were observed in the PCA map constructed by mtDNA haplogroup frequencies30 and in the abovementioned HLA-DRB1 PCA plot29, respectively.
Figure 1

A PCA plot showing the genetic relationships.

(a) Tujia group and other 15 reference populations. (b) Tujia, central Asian, western Eurasian and other eastern Eurasian populations were analyzed at individual level.

The genetic relationships among Tujia, central Asian (Uigur and Kazak populations), western Eurasians (Hungarian, Dane, Basque and Central Spanish populations) and other eastern Eurasians (Shanghai Han, Beijing Han, Guangdong Han, She, Xibe, South Korean, Tibetan, and Yi populations) were also discerned with the aid of abovementioned InDel datasets at the individual level. Results of individual PCA were presented by the plots of the first two PCs (shown in Fig. 1b), which together accounted for 38.82% of the total variation in these populations. The first PC revealed an east-west geographic division within Eurasians. In concrete terms, all eastern Eurasians tended to cluster on the left of PCA plots, whereas western Eurasians formed a separate cluster on the right. The Tujia people were expectedly clustered within eastern Asian group.

Neighbor-joining phylogenetic reconstruction

We constructed a neighbor-joining (N-J) phylogenetic tree (shown in Fig. 2). The branch in the upper-left corner contained the nine East Asian populations including Tujia group; whereas in the other branch, Dane, Basque, Central Spanish, Uruguayan, and Hungarian populations were found in the lower-left corner. The Kazak and Uigur groups were in the middle of the above two branches. In previous study, the close relationship between Tujia group and Han population was observed in the N-J dendrogram based on the allelic frequencies of HLA-A locus31. The language of Tujia belongs to Tibeto-Burman language system, without written script. Tujias lived with other nationalities like Miao and Han, and many of them can speak Mandarin Chinese and write the Chinese characters. The tight genetic relationship between the Tujia and Han population in Hubei provience was observed based on fifteen STRs, and the present and previous studies indicated that broad genetic exchanges had occurred among them in history32.
Figure 2

A neighbor-joining phylogenetic tree constructed to analyze phylogenetic relationships from Tujia group and 15 reference populations.

Population STRUCTURE analyses

The STRUCTURE program was used to evaluate the genetic structure of Tujia and other 15 populations. As shown in Fig. 3, at K = 2, three clusters were highly visible and easily distinguishable basically by red, green and mixture of the two. When K = 2–7 (in Supplementary Fig. 2), the STRUCTURE analyses revealed three major clusters: the first subpopulation of Dane, Basque, Central Spanish, Uruguayan, and Hungarian populations, the second of Kazak and Uigur; the last one of nine East Asian populations including Tujia group. The results presented here were similar to that of the PCA plot and N-J tree. With the increase of K values, no further population structures were obtained. We should, just as a precaution, study more ancestry informative InDels in the future in order to subdivide the genetic structure of different ethnic groups in China, and to infer the population origin and ancestral components of an unknown individual.
Figure 3

Population STRUCTURE analysis of 16 populations at K = 2, which revealed three major clusters.

Conclusion

In summary, the population data here indicated the 30 InDels had high diversities within the studied group and genetic differentiations among different populations; and could be a useful supplement to the routine detection of autosomal STRs in forensic cases. The PCA plot, N-J tree and STRUCTURE analyses suggested the close relationships between Tujia and Han population in different regions. More ancestry informative InDels and SNPs should be selected and validated to clarify the Tujia ancestral origin.

Additional Information

How to cite this article: Shen, C. et al. A 30-InDel Assay for Genetic Variation and Population Structure Analysis of Chinese Tujia Group. Sci. Rep. 6, 36842; doi: 10.1038/srep36842 (2016). Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  29 in total

1.  Estimation of average heterozygosity and genetic distance from a small number of individuals.

Authors:  M Nei
Journal:  Genetics       Date:  1978-07       Impact factor: 4.562

2.  Selection of 29 highly informative InDel markers for human identification and paternity analysis in Chinese Han population by the SNPlex genotyping system.

Authors:  Chengtao Li; Suhua Zhang; Li Li; Jingzhong Chen; Yan Liu; Shumin Zhao
Journal:  Mol Biol Rep       Date:  2011-06-18       Impact factor: 2.316

3.  Developmental validation of the AGCU 21+1 STR kit: a novel multiplex assay for forensic application.

Authors:  Bo-Feng Zhu; Yu-Dang Zhang; Chun-Mei Shen; Wei-An Du; Wen-Juan Liu; Hao-Tian Meng; Hong-Dan Wang; Guang Yang; Rui Jin; Chun-Hua Yang; Jiang-Wei Yan; Xiao-Hua Bie
Journal:  Electrophoresis       Date:  2014-12-17       Impact factor: 3.535

4.  Forensic evaluation and population genetic study of 30 insertion/deletion polymorphisms in a Chinese Yi group.

Authors:  Yu-Dang Zhang; Chun-Mei Shen; Rui Jin; Ya-Ni Li; Bo Wang; Li-Xia Ma; Hao-Tian Meng; Jiang-Wei Yan; Hong- Dan Wang; Ze-Long Yang; Bo-Feng Zhu
Journal:  Electrophoresis       Date:  2015-04-20       Impact factor: 3.535

5.  Genetic analysis of 17 Y-chromosomal STR loci of Chinese Tujia ethnic group residing in Youyang Region of Southern China.

Authors:  Ya-Ran Yang; Yu-Ting Jing; Guo-Dong Zhang; Xiang-Dong Fang; Jiang-Wei Yan
Journal:  Leg Med (Tokyo)       Date:  2014-02-05       Impact factor: 1.376

6.  Human diallelic insertion/deletion polymorphisms.

Authors:  James L Weber; Donna David; Jeremy Heil; Ying Fan; Chengfeng Zhao; Gabor Marth
Journal:  Am J Hum Genet       Date:  2002-09-04       Impact factor: 11.025

7.  Characterization of 114 insertion/deletion (INDEL) polymorphisms, and selection for a global INDEL panel for human identification.

Authors:  Bobby L LaRue; Robert Lagacé; Chien-Wei Chang; Allison Holt; Lori Hennessy; Jianye Ge; Jonathan L King; Ranajit Chakraborty; Bruce Budowle
Journal:  Leg Med (Tokyo)       Date:  2013-11-01       Impact factor: 1.376

8.  The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes.

Authors:  Stephen B Montgomery; David L Goode; Erika Kvikstad; Cornelis A Albers; Zhengdong D Zhang; Xinmeng Jasmine Mu; Guruprasad Ananda; Bryan Howie; Konrad J Karczewski; Kevin S Smith; Vanessa Anaya; Rhea Richardson; Joe Davis; Daniel G MacArthur; Arend Sidow; Laurent Duret; Mark Gerstein; Kateryna D Makova; Jonathan Marchini; Gil McVean; Gerton Lunter
Journal:  Genome Res       Date:  2013-03-11       Impact factor: 9.043

9.  SNPAnalyzer 2.0: a web-based integrated workbench for linkage disequilibrium analysis and association analysis.

Authors:  Jinho Yoo; Youngbok Lee; Yujung Kim; Sun Young Rha; Yangseok Kim
Journal:  BMC Bioinformatics       Date:  2008-06-23       Impact factor: 3.169

10.  Genetic polymorphism analyses of 30 InDels in Chinese Xibe ethnic group and its population genetic differentiations with other groups.

Authors:  Hao-Tian Meng; Yu-Dang Zhang; Chun-Mei Shen; Guo-Lian Yuan; Chun-Hua Yang; Rui Jin; Jiang-Wei Yan; Hong-Dan Wang; Wen-Juan Liu; Hang Jing; Bo-Feng Zhu
Journal:  Sci Rep       Date:  2015-02-05       Impact factor: 4.379

View more
  11 in total

1.  Population genetics, diversity and forensic characteristics of Tai-Kadai-speaking Bouyei revealed by insertion/deletions markers.

Authors:  Guanglin He; Zheng Ren; Jianxin Guo; Fan Zhang; Xing Zou; Hongling Zhang; Qiyan Wang; Jingyan Ji; Meiqing Yang; Ziqian Zhang; Jing Zhang; Yilizhati Nabijiang; Jiang Huang; Chuan-Chao Wang
Journal:  Mol Genet Genomics       Date:  2019-06-13       Impact factor: 3.291

2.  Genetic diversity, structure and forensic characteristics of Hmong-Mien-speaking Miao revealed by autosomal insertion/deletion markers.

Authors:  Han Zhang; Guanglin He; Jianxin Guo; Zheng Ren; Hongling Zhang; Qiyan Wang; Jingyan Ji; Meiqing Yang; Jiang Huang; Chuan-Chao Wang
Journal:  Mol Genet Genomics       Date:  2019-07-16       Impact factor: 3.291

3.  Genetic distribution analyses and population background explorations of Gansu Yugur and Guizhou Miao groups via InDel markers.

Authors:  Chun-Hua Yang; Xiao-Ye Jin; Yu-Xin Guo; Wei Cui; Chong Chen; Hao-Tian Meng; Bo-Feng Zhu
Journal:  J Hum Genet       Date:  2019-04-03       Impact factor: 3.172

4.  Population genetic analysis of 30 insertion-deletion (INDEL) loci in a Qinghai Tibetan group using the Investigator DIPplex Kit.

Authors:  Hui Jian; Li Wang; Hui Wang; Xiaogang Bai; Meili Lv; Weibo Liang
Journal:  Int J Legal Med       Date:  2018-10-24       Impact factor: 2.686

5.  Genetic diversity analysis of forty-three insertion/deletion loci for forensic individual identification in Han Chinese from Beijing based on a novel panel.

Authors:  Congying Zhao; Jinlong Yang; Hui Xu; Shuyan Mei; Yating Fang; Qiong Lan; Yajun Deng; Bofeng Zhu
Journal:  J Zhejiang Univ Sci B       Date:  2022-03-15       Impact factor: 3.066

6.  Autosomal InDel polymorphisms for population genetic structure and differentiation analysis of Chinese Kazak ethnic group.

Authors:  Tingting Kong; Yahao Chen; Yuxin Guo; Yuanyuan Wei; Xiaoye Jin; Tong Xie; Yuling Mu; Qian Dong; Shaoqing Wen; Boyan Zhou; Li Zhang; Chunmei Shen; Bofeng Zhu
Journal:  Oncotarget       Date:  2017-05-12

7.  Population Genetic Diversity and Clustering Analysis for Chinese Dongxiang Group With 30 Autosomal InDel Loci Simultaneously Analyzed.

Authors:  Bofeng Zhu; Qiong Lan; Yuxin Guo; Tong Xie; Yating Fang; Xiaoye Jin; Wei Cui; Chong Chen; Yongsong Zhou; Xiaogang Li
Journal:  Front Genet       Date:  2018-08-02       Impact factor: 4.599

8.  The three-hybrid genetic composition of an Ecuadorian population using AIMs-InDels compared with autosomes, mitochondrial DNA and Y chromosome data.

Authors:  Ana Karina Zambrano; Aníbal Gaviria; Santiago Cobos-Navarrete; Carmen Gruezo; Cristina Rodríguez-Pollit; Isaac Armendáriz-Castillo; Jennyfer M García-Cárdenas; Santiago Guerrero; Andrés López-Cortés; Paola E Leone; Andy Pérez-Villa; Patricia Guevara-Ramírez; Verónica Yumiceba; Gisella Fiallos; Margarita Vela; César Paz-Y-Miño
Journal:  Sci Rep       Date:  2019-06-25       Impact factor: 4.379

9.  Autosomal DIPs for population genetic structure and differentiation analyses of Chinese Xinjiang Kyrgyz ethnic group.

Authors:  Yuxin Guo; Chong Chen; Xiaoye Jin; Wei Cui; Yuanyuan Wei; Hongdan Wang; Tingting Kong; Yuling Mu; Bofeng Zhu
Journal:  Sci Rep       Date:  2018-07-23       Impact factor: 4.379

10.  Genetic diversity and phylogenetic structure of four Tibeto-Burman-speaking populations in Tibetan-Yi corridor revealed by insertion/deletion polymorphisms.

Authors:  Xing Zou; Guanglin He; Mengge Wang; Liwen Huo; Xu Chen; Jing Liu; Shouyu Wang; Ziwei Ye; Fei Wang; Zheng Wang; Yiping Hou
Journal:  Mol Genet Genomic Med       Date:  2020-02-03       Impact factor: 2.183

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.