Yuan Yuan, Lei Tian, Dongsheng Lu, Shuhua Xu.
Abstract
In humans, lymphoblastoid cell lines (LCLs) from the CEPH/CEU (Centre d'Etude du Polymorphisme Humain - Utah) family resource have been used extensively to examine the genetics of gene expression levels. However, we noted that the CEU/CEPH cell lines were collected and transformed approximately thirty years ago, much earlier than the cell lines of the other populations studied, which we suspected could affect gene expression, data analysis, and the interpretation of results. In this study, by analyzing RNA sequencing data from CEU, three other European populations, and an African population, we systematically evaluated the potential confounding effect of LCL age on gene expression levels and patterns. Our results indicate that the gene expression profiles of CEU samples are biased by the older age of the CEU cell lines. Interestingly, most CEU-specific expression differences are associated with functions related to cell proliferation, and are more likely due to the older age of the cell lines than to intrinsic characteristics of the population. We suggest that results be interpreted with care when CEU LCLs are used for transcriptomic data analysis in future studies.
Year: 2015 PMID: 25609584 PMCID: PMC4302305 DOI: 10.1038/srep07960
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Population samples with both genotyping data and RNA-Seq data available
| Population | Sample size | Sex ratio (F:M) |
|---|---|---|
| CEU | 91 | 46:45 |
| GBR | 94 | 49:45 |
| FIN | 95 | 58:37 |
| TSI | 93 | 44:49 |
| YRI | 89 | 49:40 |
Figure 1. (A) Distribution of genetic differentiation (FST) and expression differentiation (VST) between each pair of the five populations (CEU/TSI/FIN/GBR/YRI). Asterisks mark population pairs that involve CEU and triangles mark pairs that do not. The four red markers in the upper panel are the pairs between YRI and the European populations, the blue markers in the bottom right panel are the pairs between CEU and the non-CEU European populations, and the gray markers in the bottom left panel are the pairs among the non-CEU European populations. The gray dashed line is the regression line between FST and VST for the 10 population pairs. The correlation between the mean VST values and mean FST values is shown above the plot.

(B) Distribution of genetic differentiation (FST) and expression differentiation (VST) between each pair of the four non-CEU populations (TSI/FIN/GBR/YRI). The three red markers in the upper panel are the pairs between YRI and the European populations, and the three gray markers in the bottom panel are the pairs among the non-CEU European populations. The gray dashed line is the regression line between FST and VST for the 6 population pairs. The correlation between the mean VST values and mean FST values is shown above the plot.

(C) Distribution of VST between each pair of European populations. The red solid, dashed and dotted lines represent the pairs between CEU and each of the three non-CEU European populations (TSI/FIN/GBR). The blue solid, dashed and dotted lines represent the pairs among the three non-CEU European populations (TSI/FIN/GBR).

(D) Venn diagram of DE genes. The yellow circle represents the number of DE genes between CEU and the non-CEU European populations (TSI/FIN/GBR), and the purple circle represents the number of DE genes between any two non-CEU European populations (TSI/FIN/GBR).
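The caption does not define VST. A minimal sketch follows, assuming the commonly used definition VST = (V_T − V_S) / V_T, where V_T is the variance of a gene's expression across both populations pooled and V_S is the size-weighted mean of the within-population variances. The function name `vst`, the use of population variance, and the sample values are illustrative assumptions, not the authors' implementation.

```python
from statistics import pvariance


def vst(expr_a, expr_b):
    """Expression differentiation for one gene between two populations.

    Assumed definition: V_ST = (V_T - V_S) / V_T, with V_T the variance of
    the pooled samples and V_S the size-weighted mean within-population
    variance. Population (not sample) variance is used here as an assumption.
    """
    pooled = list(expr_a) + list(expr_b)
    v_total = pvariance(pooled)
    if v_total == 0:
        # No variation at all, so there is no differentiation to measure.
        return 0.0
    n_a, n_b = len(expr_a), len(expr_b)
    v_within = (n_a * pvariance(expr_a) + n_b * pvariance(expr_b)) / (n_a + n_b)
    return (v_total - v_within) / v_total


# Hypothetical expression values for one gene in two populations.
pop_1 = [4.1, 3.9, 4.3, 4.0]
pop_2 = [5.2, 5.0, 5.4, 5.1]
print(vst(pop_1, pop_2))
```

Under this definition the value is 0 when the two populations have identical expression distributions and approaches 1 when almost all variance lies between populations.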
Functional annotation and enrichment analysis of 2,420 genes with differential expression between populations
| Gene Ontology category | #DE genes (a) | #Other genes (b) | Corrected p-value (c) |
|---|---|---|---|
| endoplasmic reticulum | 263 | 1025 | 1.68 × 10⁻⁹ |
| endoplasmic reticulum part | 199 | 741 | 2.82 × 10⁻⁸ |
| endomembrane system | 350 | 1471 | 5.12 × 10⁻⁸ |
| endoplasmic reticulum membrane | 174 | 637 | 1.38 × 10⁻⁷ |
| nuclear outer membrane-endoplasmic reticulum membrane network | 174 | 649 | 6.75 × 10⁻⁷ |
| Golgi membrane | 131 | 460 | 1.63 × 10⁻⁵ |
| organelle membrane | 434 | 2016 | 2.38 × 10⁻⁵ |
| membrane-bounded organelle | 1487 | 8001 | 1.43 × 10⁻⁴ |
| cellular_component | 2145 | 12036 | 1.91 × 10⁻⁴ |
| intracellular membrane-bounded organelle | 1482 | 7979 | 1.92 × 10⁻⁴ |
| cytoplasm | 1396 | 7495 | 3.15 × 10⁻⁴ |
| Golgi vesicle transport | 59 | 169 | 4.13 × 10⁻⁴ |
| Golgi apparatus part | 140 | 527 | 4.88 × 10⁻⁴ |
| organelle | 1590 | 8652 | 7.98 × 10⁻⁴ |
| Golgi apparatus | 223 | 936 | 1.09 × 10⁻³ |
| cell | 1962 | 10891 | 1.10 × 10⁻³ |
| cell part | 1962 | 10891 | 1.10 × 10⁻³ |
| intracellular organelle part | 1008 | 5289 | 1.11 × 10⁻³ |
| intracellular organelle | 1584 | 8631 | 1.41 × 10⁻³ |
| cytoplasmic part | 1045 | 5531 | 1.65 × 10⁻³ |
| intracellular | 1819 | 10034 | 1.73 × 10⁻³ |
| organelle part | 1015 | 5355 | 3.15 × 10⁻³ |
| intracellular part | 1787 | 9864 | 3.77 × 10⁻³ |
| cellular response to topologically incorrect protein | 36 | 91 | 6.13 × 10⁻³ |
| ER-nucleus signaling pathway | 38 | 98 | 7.31 × 10⁻³ |
(a) DE genes: number of genes with differential expression between populations in each GO category.
(b) Number of all other background genes within the relevant GO category.
(c) Bonferroni-corrected p-value.
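The source does not state which test produced the p-values in this table, only that they are Bonferroni corrected. As a rough sketch of how such a value could be obtained, the code below assumes a one-sided hypergeometric test, which is a common choice for GO enrichment. In terms of the table, the number of annotated genes in a category would be the sum of the two count columns; the size of the gene universe and the number of categories tested are not given in the source, so the example uses small, purely hypothetical numbers.

```python
from math import comb


def hypergeom_upper_tail(k, n_category, n_de, n_total):
    """P(X >= k) for a hypergeometric draw.

    n_total    : genes in the background universe (assumed known)
    n_category : genes in the universe annotated to the GO category
    n_de       : differentially expressed genes drawn from the universe
    k          : DE genes observed in the category
    """
    upper = min(n_category, n_de)
    favourable = sum(
        comb(n_category, i) * comb(n_total - n_category, n_de - i)
        for i in range(k, upper + 1)
    )
    return favourable / comb(n_total, n_de)


def bonferroni(p_value, n_tests):
    """Bonferroni correction: multiply by the number of tests, capped at 1."""
    return min(1.0, p_value * n_tests)


# Hypothetical toy numbers, not taken from the table.
raw_p = hypergeom_upper_tail(k=8, n_category=20, n_de=30, n_total=200)
print(bonferroni(raw_p, n_tests=50))
```

Exact integer arithmetic with `math.comb` avoids a dependency on SciPy; for genome-scale counts a library routine such as a survival function from a statistics package would be faster.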
Figure 2. Venn diagram of DE genes.
The large red circles represent the number of DE genes between CEU and YRI. The green circles represent the 2,420 genes described in Figure 1D. The blue circles represent the number of DE genes between YRI and each of the other non-CEU European populations, TSI, FIN and GBR, in panels (A)–(C), respectively.