| Literature DB >> 30086702 |
P Daca-Roszak1, M Swierniak2,3,4, R Jaksik5, T Tyszkiewicz2, M Oczko-Wojciechowska2, J Zebracka-Gala2, B Jarzab2, M Witt1, E Zietkiewicz6.
Abstract
BACKGROUND: Numerous studies have demonstrated significant differences in the expression level across continental human populations. Most of published results were performed on B-cell lines materials examined under specific laboratory conditions, without further validation in a primary biological material. The goal of our study was to identify mRNA markers characterized by a significant and stable difference in the gene expression profile in Caucasian and Chinese populations, both in the commercially available B-lymphocyte cell lines and in the primary samples of the peripheral blood.Entities:
Keywords: Classifier testing; Decision-tree; Gene expression study; Human population identification; Illumina platform; Population-specific mRNA markers; TLDA cards
Mesh:
Substances:
Year: 2018 PMID: 30086702 PMCID: PMC6081795 DOI: 10.1186/s12863-018-0663-2
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Fig. 1Study design
A set of transcripts differentiating CEU and CHB cell lines in Illumina expression microarray
| Genes with the higher expression in CHB | ||||
|---|---|---|---|---|
| Probe_ID | Symbol | FDR | Fold-change | |
| 5,270,541 |
| 2.09E-06 | 0.000526 | 2.48 |
| 6,290,228 |
| 6.36E-16 | 2.37E-12 | 2.47 |
| 4,830,202 |
| 9.78E-06 | 0.001106 | 2.37 |
| 7,400,193 |
| 4.72E-06 | 0.000734 | 1.77 |
| 6,420,168 |
| 0.0020744 | 0.044493 | 1.66 |
| 770,564 |
| 7.05E-06 | 0.000856 | 1.61 |
| 5,490,768 |
| 2.12E-06 | 0.000526 | 1.60 |
| 6,650,242 |
| 3.50E-05 | 0.002848 | 1.58 |
| 1,990,672 |
| 2.80E-06 | 0.000581 | 1.56 |
| 3,370,730 |
| 0.000695 | 0.021436 | 1.55 |
| 2,060,181 |
| 0.0002602 | 0.011694 | 1.53 |
| Genes with the lower expression in CHB | ||||
| Probe_ID | Symbol | FDR | Fold-change | |
| 5,420,450 |
| 1.02E-05 | 0.001123 | 0.41 |
| 3,850,168 |
| 0.0025063 | 0.049752 | 0.50 |
| 2,120,053 |
| 0.0001846 | 0.009186 | 0.57 |
| 3,310,520 |
| 7.85E-05 | 0.005053 | 0.61 |
| 6,020,692 |
| 2.42E-09 | 2.26E-06 | 0.61 |
| 6,290,189 |
| 1.29E-06 | 0.000402 | 0.64 |
| 4,830,632 |
| 8.44E-05 | 0.005248 | 0.64 |
| 3,370,075 |
| 1.63E-07 | 7.60E-05 | 0.65 |
| 7,650,669 |
| 1.14E-06 | 0.000387 | 0.65 |
a TLDA probe unavailable or unspecific
Validation of the population-differentiating transcripts on B-cell lines using TLDA cards
| Gene name | Fold change | |
|---|---|---|
| U Mann Whitney/t-test* | ||
| Genes with higher expression in Chinese | ||
| UTS2** | 25.77 |
|
| CHI3L2 | 1.10 | 0.75656 |
| C1ORF115 | 1.68 | 0.15560 |
| IFITM3 | 1.62 | 0.58920 |
| PLA2G4C | 1.39 | 0.16152 |
| CDC42EP5 | 1.13 | 0.75656 |
| Genes with higher expression in European | ||
| UGT2B7 | did not amplify | |
| CYP1B1 | 1.64 | 0.23404 |
| MOXD1 | 1.64 | 0.17702 |
| UGT2B17** | 3.23 |
|
| SLC7A7 | 2.17 |
|
| S1PR4 | 1.47 | 0.0960 |
| TBC1D4 | 1.19 | 0.08012 |
*p-values for genes: UTS2, UGT2B17, CHI3L2 and C1orf115 which did not fulfill the requirement of normal distribution were tested using U-Mann Whitney statistics; other genes, were tested with using the t-test
**Validated on blood samples (see Validation 2 section). Significant population differences (p < 0.05) are indicated in bold
Fig. 2Average ct values obtained in qRT-PCR reactions for two tested genes: UTS2 (a) and UGT2B17 (b). Each bar represents B-cell line from Caucasian (left panel) and Chinese population (right panel)
Fig. 3The normalized relative expression levels of UGT2B17 (a) and UTS2 (b) in the peripheral blood samples from Chinese (n = 29) and Caucasian (n = 37) males. Dots represent relative gene expression in the individual samples. The upper and lower edges of the boxes correspond to the first (Q1) and third (Q3) quartiles, respectively. The lines inside the boxes indicate the median expression values. The whiskers extend to the smallest and the largest observations within the 1.5-times interquartile range (IQR) from the box
Fig. 4Average ct values obtained in qRT-PCR reactions for two tested genes: UGT2B17 (a) and UTS2 (b). Each bar represents blood sample from Caucasian (left panel) and Chinese population (right panel)
Fig. 5A ROC curve and AUC parameter calculated for 3 different classifiers: decision tree (D.Tree; red line), support vector machines (SVM; blue line), and linear discriminate analysis LDA (green line). Results were obtained based on blood samples collected from Chinese (n = 29) and Caucasian (n = 37) populations