| Literature DB >> 24339953 |
Shuhui Liu1, Bo Huang, Haihua Huang, Xiaoyun Li, Guangcan Chen, Guohong Zhang, Wengting Lin, Dan Guo, Jie Wang, Zefeng Yu, Xi Liu, Min Su.
Abstract
The Taihang Mountain range of north-central China, the Southern region area of Fujian province, and the Chaoshan plain of Guangdong province are 3 major regions in China well known for their high incidence of esophageal cancer (EC). These areas also exhibit high incidences of gastric cardia cancer (GCC). The ancestors of the Chaoshanese, now the major inhabitants in the Chaoshan plain, were from north-central China. We hypothesized that EC and GCC patients in Chaoshan areas share a common ancestry with Taihang Mountain patients. We analyzed 16 East Asian-specific Y-chromosome biallelic markers (single nucleotide polymorphisms; Y-SNPs) and 6 Y-chromosome short tandem repeat (Y-STR) loci in 72 EC and 48 GCC patients from Chaoshan and 49 EC and 63 GCC patients from the Taihang Mountain range. We also compared data for 32 Chaoshan Hakka people and 24 members of the aboriginal She minority who live near the Chaoshan area. Analysis was by frequency distribution and principal component, correlation and hierarchical cluster analysis of Y-SNP. Chaoshan patients were closely related to Taihang Mountain patients, even though they are geographically distant. Y-STR analysis revealed that the 4 patient groups were more closely related with each other than with other groups. Network analysis of the haplogroup O3a3c1-M117 showed a high degree of patient-specific substructure. We suggest that EC and GCC patients from these 2 areas share a similar patrilineal genetic background, which may play an important role in the genetic factor of EC and GCC in these populations.Entities:
Mesh:
Year: 2013 PMID: 24339953 PMCID: PMC3858265 DOI: 10.1371/journal.pone.0081670
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Geographic distribution of the three studied EC and GCC high-risk populations and two low-risk population Hakka and She in Chaoshan area.
Arrows show the north-to-south migrations of Han inhabitants from north-central China according to historical records. 218BC, AD311 and AD669 are the three major time periods of north-to-south migrations.
Y-chromosome single nucleotide polymorphism (Y-SNP) haplogroup frequencies of the 6 studied populations (%).
| Halplogroup | Chaoshan | Chaoshan | Taihang | Taihang | Chaoshan | Chaoshan |
| EC | CC | Mountain EC | Mountain CC | Hakkas (%) | She (%) | |
| Patients (%) | patients (%) | patients (%) | patients (%) | n = 32 | n = 24 | |
| n = 72 | n = 48 | n = 49 | n = 63 | |||
|
| 0 | 0 | 16.33 | 9.52 | 6.25 | 0 |
|
| 0 | 0 | 0 | 1.59 | 0 | 0 |
|
| 1.39 | 0 | 2.04 | 0 | 0 | 0 |
|
| 4.17 | 0 | 0 | 0 | 0 | 0 |
|
| 1.39 | 12.5 | 0 | 1.59 | 0 | 0 |
|
| 8.33 | 10.42 | 2.04 | 6.35 | 0 | 4.17 |
|
| 15.28 | 18.75 | 26.53 | 23.81 | 31.25 | 29.17 |
|
| 2.78 | 0 | 2.04 | 0 | 0 | 0 |
|
| 5.56 | 4.17 | 16.33 | 23.81 | 6.25 | 0 |
|
| 22.22 | 37.5 | 24.49 | 15.87 | 3.13 | 8.33 |
|
| 0 | 2.08 | 0 | 0 | 0 | 20.83 |
|
| 16.67 | 14.58 | 2.04 | 3.17 | 43.75 | 20.83 |
|
| 16.67 | 0 | 0 | 6.35 | 6.25 | 4.17 |
|
| 5.56 | 0 | 0 | 0 | 3.13 | 0 |
|
| 0 | 0 | 4.08 | 4.76 | 0 | 0 |
|
| 0 | 0 | 4.08 | 3.17 | 0 | 12.5 |
Figure 23-D principal component maps of frequencies of Y-chromosome single nucleotide polymorphism (Y-SNP) in Chinese populations.
The smaller the distance between populations, the closer the relationship. We divided 26 populations into 3 clusters: 1) Cluster1 (Red circle): 4 patient groups and 3 populations at high risk of esophageal cancer (EC): CSEC: Chaoshan EC patients, CSCC: Chaoshan gastric cardia cancer (GCC) patients, CSHR: Chaoshan high-risk population; FJHR: Fujian high-risk population; THEC: Taihang Mountain EC patients; THCC: Taihang Mountain GCC patients; and THHR: Taihang Mountain high-risk population; 2) Cluster2 (Green circle) Chaoshan Hakka (CSKJ) and She population (CSSZ); 3) Cluster3 (Orange circle) Northern and southern Han populations. Northern Han populations: HeB: Hebei Han; LN: Liaoning Han; XJ: Xinjiang Han; NMG: Neimeng Han; HB: Hubei Han; HN: Henan Han; GS: Gansu Han; SX: Shanxi Han; SD: Shangdong Han. Southern Han populations: GD: Guangzhou Han; SH: Shanghai Han; ZJ: Zhejiang Han; AH: Anhui Han; JS: Jiangsu Han; HuN: Hunan Han; JX: Jiangxi Han; SC: Sichuan Han.
Correlation analysis of Y-chromosome SNP haplogroup frequencies in the studied populations and 3 high-risk populations and 17 Chinese Han populations.
| Esophageal cancer patients | Gastric cardia cancer patients | |||
| Chaoshan | Taihang Mountain | Chaoshan | Taihang Mountain | |
|
| 0.453 | |||
|
| 0.745 | 0.636 | ||
|
| 0.471 | 0.897 | 0.497 | |
|
| 0.771 | 0.827 | 0.828 | 0.770 |
|
| 0.618 | 0.730 | 0.720 | 0.614 |
|
| 0.434 | 0.691 | 0.334 | 0.830 |
|
| 0.550 | 0.313 | 0.362 | 0.345 |
|
| 0.411 | 0.332 | 0.404 | 0.287 |
|
| 0.188 | 0.605 | 0.273 | 0.773 |
|
| 0.104 | 0.540 | 0.229 | 0.709 |
|
| 0.122 | 0.443 | 0.308 | 0.522 |
|
| 0.175 | 0.505 | 0.29 | 0.519 |
|
| 0.35 | 0.478 | 0.226 | 0.625 |
|
| 0.195 | 0.418 | 0.264 | 0.6 |
|
| 0.283 | 0.377 | 0.343 | 0.484 |
|
| 0.201 | 0.515 | 0.236 | 0.727 |
|
| −0.7 | 0.165 | 0.057 | 0.426 |
|
| 0.156 | 0.303 | 0.188 | 0.532 |
|
| 0.385 | 0.472 | 0.263 | 0.723 |
|
| −0.071 | 0.385 | 0.08 | 0.520 |
|
| 0.073 | 0.334 | 0.133 | 0.471 |
|
| −0.087 | 0.571 | 0.031 | 0.695 |
|
| −0.005 | 0.399 | 0.139 | 0.638 |
|
| 0.055 | 0.518 | 0.186 | 0.719 |
|
| 0.235 | 0.449 | 0.121 | 0.722 |
P<0.01 level (2-tailed).
P<0.05 level (2-tailed).
Figure 3Dendrogram of Y-SNP data.
Shows the affinity between the studied populations, the high-risk population, Chinese Han and Chinese minority nationalities. Taihang: Taihang Mountain high-risk population; Chaoshan: Chaoshan high-risk population; Fujian: Fujian high-risk population. The other abbreviations are defined in the Methods and Figure 2.
Figure 4Neighbor-joining tree of genetic distance between patients, high-risk EC population and Chinese Han populations based on Y-chromosome short tandem repeat (Y-STR) data.
The 4 patient groups are close to each other and are clustered with the high-risk populations.
Figure 5Y-STR network of haplogroup O3a3c1-M117 for patients and high-risk populations belonging to cluster 1 in figure 2.
Most patient groups were generated from one node and clustered mainly in one area (circle). Circles represent lineages, area is proportional to frequency, and color indicates population of origin.
PCR primers and extension primers for 15-SNPs used in Sequenom genotyping.
| SNP_ID | WELL | 1st-PCRP | 2nd-PCRP | UEP_SEQ |
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
| ccGATTTTCCCCTGAGAGC |
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
|
|
|
| W1 |
|
| tTCCGTTTTTTGATGTGGAAATA |
|
| W2 |
|
|
|
|
| W2 |
|
|
|
|
| W2 |
|
|
|
Figure 6Phylogenetic diagram of 17 haplogroups in Chinese populations based on 16-chromosome biallelic markers drawn according to the non-recombining portion of the Y-chromosome haplogroup tree of East Asia.
The most recent markers defining the haplogroups are beside the branches.