| Literature DB >> 25419237 |
Barbara Hrdlickova1, Vinod Kumar1, Kartiek Kanduri2, Daria V Zhernakova1, Subhash Tripathi2, Juha Karjalainen1, Riikka J Lund2, Yang Li1, Ubaid Ullah2, Rutger Modderman1, Wayel Abdulahad3, Harri Lähdesmäki4, Lude Franke1, Riitta Lahesmaa2, Cisca Wijmenga1, Sebo Withoff1.
Abstract
BACKGROUND: Although genome-wide association studies (GWAS) have identified hundreds of variants associated with a risk for autoimmune and immune-related disorders (AID), our understanding of the disease mechanisms is still limited. In particular, more than 90% of the risk variants lie in non-coding regions, and almost 10% of these map to long non-coding RNA transcripts (lncRNAs). lncRNAs are known to show more cell-type specificity than protein-coding genes.Entities:
Year: 2014 PMID: 25419237 PMCID: PMC4240855 DOI: 10.1186/s13073-014-0088-0
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Overview of the nine autoimmune diseases (AIDs) included in this study
|
|
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| ||
|
| 29 | 29 | 338 | 123,953 | - | - | - | - | 50 | 14 | [ |
|
| 32 | 30 | 246 | 125,799 | - | - | 2 | 1,000,000 | 127 | 37 | [ |
|
| 97 | 95 | 984 | 139,174 | 2 | 172,626 | 2 | 1,000,000 | 254 | 107 | [ |
|
| 16 | 15 | 112 | 152,964 | - | - | 1 | 1,000,000 | 73 | 20 | [ |
|
| 18 | 16 | 143 | 176,717 | 1 | 859,903 | 2 | 1,000,000 | 90 | 38 | [ |
|
| 33a | 32 | 287 | 111,617 | - | - | 1 | 1,000,000 | 131 | 48 | [ |
|
| 12 | 12 | 81 | 227,273 | - | - | - | - | 50 | 17 | [ |
|
| 26 | 23 | 132 | 96,343 | - | - | 3 | 1,000,000 | 127 | 36 | [ |
|
| 22 | 22 | 198 | 127,674 | - | - | - | - | 48 | 24 | [ |
|
|
|
|
|
|
|
|
|
|
|
| |
CD, Crohn’s disease-specific; CeD, celiac disease; IBD (shared), inflammatory bowel disease shared by Crohn’s disease and ulcerative colitis; JIA, juvenile idiopathic arthritis; PBC, primary biliary cirrhosis; PS, psoriasis; PsCh, primary sclerosing cholangitis; RA, rheumatoid arthritis; UC, ulcerative colitis-specific. Each disease is characterized by the number of genome-wide significant SNPs associated with the disease phenotype, the number of disease loci, and the genes located inside them. We subdivided inflammatory bowel disease loci into Crohn’s disease-specific loci (CD), ulcerative colitis-specific loci (UC), and CD-UC shared loci (IBD shared) to reveal phenotype-specific features. aPsoriasis (PS) was associated with 33 genome-wide significant SNPs. After defining the PS disease loci, we discovered that one locus (chr.19: 10,745,764-10,894,728) is located within a bigger 1 Mb locus (chr.19: 9,963,118-10,963,118), resulting in 32 PS loci and making the total number of AID loci 284. In summary, the total number of AID-associated SNPs was 285 and these led to 284 AID loci. After all the disease loci annotation with genes from the reference dataset (Gencode version 14), we discovered 950 protein-coding genes in total but 186 of them were shared between at least two disease phenotypes (AID-shared coding genes) and 626 were unique across all diseases (AID-coding genes). For lncRNAs, the total number of genes was 341, with 61 shared (AID-shared lncRNAs) and 240 unique genes (AID-lncRNAs).
Figure 1Circular diagrams showing the genes shared between nine autoimmune diseases. The nine diseases are shown on the outer circle in colored bands, with their abbreviated names. The ribbons depicting the shared genes are colored according to the number of disease phenotypes they are shared by (grey, two AIDs; blue, three AIDs; red, four AIDs; black, five AIDs). (A) lncRNA genes and (B) protein-coding genes shown in this figure include genes specific to Crohn’s disease (CD), celiac disease (CeD), inflammatory bowel disease, shared by Crohn’s disease and ulcerative colitis (IBD shared), juvenile idiopathic arthritis (JIA), primary biliary cirrhosis (PBC), psoriasis (PS), primary sclerosing cholangitis (PsCh), rheumatoid arthritis (RA), and ulcerative colitis specific (UC) genes. These two plots are presented at higher resolution in Additional files 8 and 9).
Figure 2Proportion of genes expressed in different immune cells. (A) The number of lncRNA genes expressed (>2 RPKM) as a percentage of all lncRNA genes genome-wide (n = 12,933) or as a percentage of all lncRNAs located in autoimmune disease loci (n = 240 genes). (B) The data for the protein-coding genes genome-wide (n = 20,074) and the ones in AID loci (n = 626). Statistically significant enrichments (P-values) after Bonferroni correction for multiple testing are denoted by asterisks to show the different levels of significance (*α < 0.05; **α < 0.01; ***α < 0.005).
Figure 3Mean expression distribution of lncRNAs and protein-coding genes. (A) Comparison of lncRNA expression genome-wide (LncRNA Gencode, n = 12,933) with expression of lncRNAs located in AID loci (LncRNA AID, n = 240) and the expression of protein-coding genes genome-wide (Protein Gencode, n = 20,074) with the expression of protein-coding genes located in AID loci (Protein AID, n = 626) in seven populations of peripheral blood leukocytes. (B) Similar data for the T-helper cell populations derived from cord blood. Differences in the means of expression levels between the two groups (disease loci (AIDs) versus genome-wide (Gencode)) were tested for significance using the Wilcoxon rank-sum test.
Figure 4RNA sequencing analysis of gene expression in seven peripheral blood leukocyte populations and four T-helper cell populations from cord blood. The heat maps show the expression of all genes located in AID loci: (A) lncRNAs; (B) protein-coding genes and AID genes shared by at least two diseases: (C) lncRNAs; (D) protein-coding genes. Unsupervised hierarchical clustering analysis of gene expression profiles of all 11 cell types (granulocytes, monocytes, NK cells, B cells, memory T cells (both CD4+ and CD8+), naive CD4+and naive CD8+ T cells (cytotoxic T cells), precursor T-helper cells (ThP), primary T-helper cells (Th0), and polarized T cells (Th1, Th2)). Heat maps represent log10 intensity values. In the color scheme, saturated red indicates three-fold up-regulation, saturated green indicates three-fold down-regulation, and black indicates unchanged expression.
Figure 5An example of analyzing an autoimmune disease locus by pathway analysis approaches. (A) Expression levels of protein-coding transcripts (FAM213B, MMEL1) and lncRNA genes (RP3-395 M20.8, RP3-395 M20.7, RP3-395 M20.9, RP13-436 F16.1) located in the MMEL1 locus associated with four AIDs. The arrows pinpoint the data for RP3-395 M20.9. (B) Genes co-expressed with RP3-395 M20.9 are grouped in five differently colored segments corresponding to the pathways predicted by GeneNetwork. (C) The top 10 Gene Ontology (GO) biological processes predicted to be associated with the genes co-expressed with RP3-395 M20.9 are shown. (D) This schema shows a hypothetical mechanism of action of RP3-395 M20.9. The disease-associated SNP is located between protein-coding gene A (tumor necrosis factor receptor superfamily, member 14 (TNFRSF14, HVEM)) and lncRNA 1 (RP3-395 M20.9). This SNP only affects RP3-395 M20.9 directly. Two protein-coding genes (tumor necrosis factor beta/lymphotoxin alpha (TNFb/LTA) on chromosome 6, and UBASH3A on chromosome 21) and one lncRNA (LINC00158 on chromosome 21) are co-expressed with RP3-395 M20.9, which could be due to trans-regulation of these genes by RP3-395 M20.9. A hypothetical cis-effect of lncRNA 1 (RP3-395 M20.9) on protein-coding gene TNFRSF14/HVEM in the same locus on chromosome 1 is also mentioned.