| Literature DB >> 34108995 |
Chunfang Luo1,2, Lizhong Duan3, Yanning Li1,4, Qiqian Xie1, Lingxiang Wang5, Kai Ru5, Shahid Nazir6, Muhammad Jawad6, Yifeng Zhao7, Fenfen Wang8, Zhengming Du8, Dehua Peng2, Shao-Qing Wen5, Pingming Qiu1, Haoliang Fan1,5,9.
Abstract
Guangdong province is situated in the south of China with a population size of 113.46 million. Hakka is officially recognized as a branch of Han Chinese, and She is the official minority group in mainland China. There are approximately 25 million Hakka people who mainly live in the East and North regions of China, while there are only 0.7 million She people. The genetic characterization and forensic parameters of these two groups are poorly defined (She) or still need to be explored (Hakka). In this study, we have genotyped 475 unrelated Guangdong males (260 Hakka and 215 She) with Promega PowerPlex® Y23 System. A total of 176 and 155 different alleles were observed across all 23 Y-STRs for Guangdong Hakka (with a range of allele frequencies from 0.0038 to 0.7423) and Guangdong She (0.0047-0.8605), respectively. The gene diversity ranged from 0.4877 to 0.9671 (Guangdong Hakka) and 0.3277-0.9526 (Guangdong She), while the haplotype diversities were 0.9994 and 0.9939 for Guangdong Hakka and Guangdong She, with discrimination capacity values of 0.8885 and 0.5674, respectively. With reference to geographical and linguistic scales, the phylogenetic analyses showed us that Guangdong Hakka has a close relationship with Southern Han, and the genetic pool of Guangdong Hakka was influenced by surrounding Han populations. The predominant haplogroups of the Guangdong She group were O2-M122 and O2a2a1a2-M7, while Guangdong She clustered with other Tibeto-Burman language-speaking populations (Guizhou Tujia and Hunan Tujia), which shows us that the Guangdong She group is one of the branches of Tibeto-Burman populations and the Huonie dialect of She languages may be a branch of Tibeto-Burman language families.Entities:
Keywords: Guangdong Hakka; Guangdong She; Y-STR; forensic characteristics; phylogenetic analyses
Year: 2021 PMID: 34108995 PMCID: PMC8181459 DOI: 10.3389/fgene.2021.676917
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Locations of population distributions and sampling information. (A,B) Geographic positions of Guangdong Hakka, Guangdong She, and 159 worldwide populations with nine language families (48,637 individuals in total). (C) Distributions of geographical dialects in Guangdong province and detailed sampling information of Guangdong Hakka and Guangdong She groups.
Forensic characteristics of 23 Y-STRs in Guangdong populations (Hakka, She, and Han).
| Number of observed haplotypes | She | Hakka | Hana |
| 1 | 65 | 208 | 320 |
| 2 | 43 | 18 | 10 |
| 3 | 6 | 4 | |
| 4 | 5 | 1 | |
| 5 | 1 | ||
| 6 | 1 | ||
| 15 | 1 | ||
| 122 | 231 | 330 | |
| FUH | 0.5328 | 0.9004 | 0.9697 |
| HD | 0.9939 | 0.9994 | 0.9997 |
| DC | 0.5674 | 0.8885 | 0.9706 |
| MP | 1.42E-02 | 4.91E-03 | 3.31E-03 |
FIGURE 2Principal component analysis (PCA) based on the frequencies of 23 Y-STRs among Guangdong Hakka, Guangdong She, and 159 worldwide human populations. (A) PCA from geographical scale. (B) PCA from linguistic scale.
FIGURE 3Principal component analysis (PCA) based on the 23 Y-STRs frequency profiles among Guangdong Hakka, Guangdong She, and 83 Chinese populations. (A) PCA from geographical scale. (B) PCA from linguistic scale.
FIGURE 4Multidimensional scaling (MDS) plots based on the frequencies of 23 Y-STRs among Guangdong Hakka, Guangdong She, and 159 worldwide human populations (including 83 Chinese populations). (A) Euclidean-based MDS plot in worldwide populations from geographical scale. (B) Euclidean-based MDS plot in worldwide populations from linguistic scale. (C) Manhattan-based MDS plot in worldwide populations from geographical scale. (D) Manhattan-based MDS plot in worldwide populations from linguistic scale. (E) Euclidean-based MDS plot in Chinese populations from geographical scale. (F) Euclidean-based MDS plot in Chinese populations from linguistic scale. (G) Manhattan-based MDS plot in Chinese populations from geographical scale. (H) Manhattan-based MDS plot in Chinese populations from the linguistic scale.
FIGURE 5Neighbor-joining phylogenetic tree between Guangdong Hakka and 13 Han Chinese populations (including three Northern Han and 10 Southern Han) based on the matrix of pairwise R values.
FIGURE 6Neighbor-joining phylogenetic tree between Guangdong She and 42 other minorities with five different language families based on pairwise R values.
FIGURE 7Principal component analysis based on Y-haplogroup frequencies between three She groups and 74 populations (4,195 individuals in total, which included 22 Hmong-Mien, 20 Tai-Kadai, 16 Southern Han, 11 Northern Han, and five Tibeto-Burman language-speaking populations mainly from Southeast and East Asia).