Literature DB >> 30643196

POGLUT1, the putative effector gene driven by rs2293370 in primary biliary cholangitis susceptibility locus chromosome 3q13.33.

Yuki Hitomi1, Kazuko Ueno2,3, Yosuke Kawai1, Nao Nishida4, Kaname Kojima2,3, Minae Kawashima5, Yoshihiro Aiba6, Hitomi Nakamura6, Hiroshi Kouno7, Hirotaka Kouno7, Hajime Ohta7, Kazuhiro Sugi7, Toshiki Nikami7, Tsutomu Yamashita7, Shinji Katsushima7, Toshiki Komeda7, Keisuke Ario7, Atsushi Naganuma7, Masaaki Shimada7, Noboru Hirashima7, Kaname Yoshizawa7, Fujio Makita7, Kiyoshi Furuta7, Masahiro Kikuchi7, Noriaki Naeshiro7, Hironao Takahashi7, Yutaka Mano7, Haruhiro Yamashita7, Kouki Matsushita7, Seiji Tsunematsu7, Iwao Yabuuchi7, Hideo Nishimura7, Yusuke Shimada7, Kazuhiko Yamauchi7, Tatsuji Komatsu7, Rie Sugimoto7, Hironori Sakai7, Eiji Mita7, Masaharu Koda7, Yoko Nakamura7, Hiroshi Kamitsukasa7, Takeaki Sato7, Makoto Nakamuta7, Naohiko Masaki7, Hajime Takikawa8, Atsushi Tanaka8, Hiromasa Ohira9, Mikio Zeniya10, Masanori Abe11, Shuichi Kaneko12, Masao Honda12, Kuniaki Arai12, Teruko Arinaga-Hino13, Etsuko Hashimoto14, Makiko Taniai14, Takeji Umemura15, Satoru Joshita15, Kazuhiko Nakao16, Tatsuki Ichikawa16, Hidetaka Shibata16, Akinobu Takaki17, Satoshi Yamagiwa18, Masataka Seike19, Shotaro Sakisaka20, Yasuaki Takeyama20, Masaru Harada21, Michio Senju21, Osamu Yokosuka22, Tatsuo Kanda22, Yoshiyuki Ueno23, Hirotoshi Ebinuma24, Takashi Himoto25, Kazumoto Murata4, Shinji Shimoda26, Shinya Nagaoka6, Seigo Abiru6, Atsumasa Komori6,27, Kiyoshi Migita6,27, Masahiro Ito6,27, Hiroshi Yatsuhashi6,27, Yoshihiko Maehara28, Shinji Uemoto29, Norihiro Kokudo30, Masao Nagasaki2,3,31, Katsushi Tokunaga1, Minoru Nakamura32,33,34,35.   

Abstract

Primary biliary cholangitis (PBC) is a chronic and cholestatic autoimmune liver disease caused by the destruction of intrahepatic small bile ducts. Our previous genome-wide association study (GWAS) identified six susceptibility loci for PBC. Here, in order to further elucidate the genetic architecture of PBC, a GWAS was performed on an additional independent sample set, then a genome-wide meta-analysis with our previous GWAS was performed based on a whole-genome single nucleotide polymorphism (SNP) imputation analysis of a total of 4,045 Japanese individuals (2,060 cases and 1,985 healthy controls). A susceptibility locus on chromosome 3q13.33 (including ARHGAP31, TMEM39A, POGLUT1, TIMMDC1, and CD80) was previously identified both in the European and Chinese populations and was replicated in the Japanese population (OR = 0.7241, P = 3.5 × 10-9). Subsequent in silico and in vitro functional analyses identified rs2293370, previously reported as the top-hit SNP in this locus in the European population, as the primary functional SNP. Moreover, e-QTL analysis indicated that the effector gene of rs2293370 was Protein O-Glucosyltransferase 1 (POGLUT1) (P = 3.4 × 10-8). This is the first study to demonstrate that POGLUT1 and not CD80 is the effector gene regulated by the primary functional SNP rs2293370, and that increased expression of POGLUT1 might be involved in the pathogenesis of PBC.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30643196      PMCID: PMC6331557          DOI: 10.1038/s41598-018-36490-1

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Primary biliary cholangitis (PBC) is a chronic and progressive cholestatic liver disease characterized by chronic non-suppurative destructive cholangitis (CNSDC), ductopenia, interface hepatitis, fibrosis, and biliary cirrhosis[1,2]. The destruction of small bile ducts is considered to be mediated by autoimmune responses against biliary epithelial cells (BEC), including CD4+ T cells, CD8+ T cells, B cells, and natural killer (NK) cells[2-4]. The higher monozygotic/dizygotic (MZ/DZ) ratio and the higher estimated relative sibling risk (λs) in PBC patients as compared to unaffected individuals indicates the involvement of strong genetic factors in the development of PBC[5,6]. Previous genome-wide association studies (GWASs), ImmunoChip analyses, and subsequent meta-analyses in populations of European descent identified human leukocyte antigen (HLA) and 30 non-HLA susceptibility regions (nearest candidate genes from the top-hit SNPs in each locus: IL12RB2/DENND1B/C11orf53, YPEL5/LBH, IL1RL1/IL1RL2, STAT4/NAB1, SLC19A3/CCL20, PLCL2, TIMMDC1/TMEM39A, IL-12A/IL12A-AS1/IQCJ/SCHIP1, DGKQ, NFKB1/MANBA, IL7R/CAPSL, NUDT12/C5orf30, IL12B, OLIG3/TNFAIP3, ELMO1, IRF5/TNPO3, RPS6KA4, DDX6/CXCR5, TNFRSF1A, ATXN2/BRAP, DLEU1/BCMS, RAD51B, EXOC3L4, RMI2/CLEC16A, IRF8/FOXF1, ZPBP2/GSDMB/IKZF3, MAPT, TYK2, SPIB, and SYNGR1/PDGFB/RPL3) in PBC[7-14]. Additionally, Asian-specific susceptibility regions for PBC, including CD58, CD28/CTLA4, IL21-AS1, TNFSF15/TNFSF8, IL16, IL21R, CSNK2N2/CCDC113, and AATID3A, were reported in the Japanese and Chinese populations by means of GWAS and subsequent genome-wide meta-analysis with genome-wide SNP imputation (already identified PBC susceptibility loci including this study are shown in Table 1)[15-17]. Thus, the evidence reported to date indicates presence of shared and non-shared genetic susceptibility profiles behind the pathogenesis of PBC in European and Asian populations.
Table 1

Non-HLA PBC susceptibility loci in European, Chinese, and Japanese populations.

ChromosomeLocationMapped gene(s)Top-hit SNPMAFaORb P PopulationRef.
11p13.1 CD58 rs23007470.391.292.E-12Chinese [17]
1p31.3IL12RB2/DENND1B/C1orf53rs726785310.161.612.E-38European [12]
22p23.1YPEL5/LBHrs49521080.191.285.E-08European [14]
2q12.1IL1RL1/IL1RL2rs127121330.441.145.E-09European [14]
2q32.2STAT4/NAB1rs3024921, etc.0.050.729.E-25European [12]
rs101682660.321.314.E-14Chinese [17]
2q33.2CD28/CTLA4rs46753690.421.311.E-13Chinese [17]
2q36.3SLC19A3/CCL20rs49733410.331.222.E-10European [14]
33p24.3 PLCL2 rs13720720.371.202.E-08European [10]
3q13.33TIMMDC1/TMEM39Ars22933700.161.397.E-16European [12]
rs37324210.341.353.E-13Chinese [17]
rs57271503 0.340.723.E-09 Japanese This study
3q25.33IL-12A/IL12A-AS1/IQCJ/SCHIP1rs2366643, etc.0.380.623.E-35European [12]
rs5825370.291.332.E-11Chinese [17]
44p16.3 DGKQ rs117248040.441.229.E-12European [14]
4q24NFKB1/MANBArs76650900.491.268.E-14European [12]
rs15988560.491.262.E-10Chinese [17]
rs17033015 0.471.359.E-10 Japanese This study
4q27 IL21-AS1 rs9255500.351.314.E-13Chinese [17]
55p13.2IL7R/CAPSLrs68717480.281.302.E-13European [12]
rs12697352 0.190.682.E-09 Japanese This study
5q21.1NUDT12/C5orf30rs5262310.321.151.E-08European [14]
5q33.3 IL12B rs25468900.501.151E-10European [14]
66q23.3OLIG3/TNFAIP3rs69334040.171.181E-10European [14]
77p14.1 ELMO1 rs69744910.161.254.E-08European [10]
7q32.1IRF5/TNPO3rs104886310.101.595.E-23European [14]
99q32TNFSF15/TNFSF8rs49794670.371.531.E-29Chinese [17]
rs4979462 0.320.602.E-26 Japanese This study
1111q13.1 RPS6KA4 rs5381470.391.232.E-10European [10]
11q23.3DDX6/CXCR5rs800651070.191.397.E-16European [12]
rs778716180.161.403.E-13Chinese [17]
1212p13.31 TNFRSF1A rs18006930.430.747.E-19European [12]
rs41495760.141.374.E-09Chinese [17]
12q24.12ATXN2/BRAPrs110659870.371.193.E-08European [14]
1313q14.2DLEU1/BCMSrs95913250.051.631.E-10European [14]
1414q24.1 RAD51B rs9112630.331.292.E-11European [10]
14q32.32 EXOC3L4 rs22970670.221.396.E-19European [14]
1515q25.1 IL16 rs115562180.191.299.E-09Chinese [17]
1616p12.1 IL21R rs21895210.301.414.E-16Chinese [17]
16p13.13RMI2/CLEC16Ars1646019, etc.0.301.382.E-23European [12]
16q21CSNK2N2/CCDC113rs25503740.481.232.E-08Chinese [17]
16q24.1IRF8/FOXF1rs111174320.231.315.E-11European [10]
1717q21.1ZPBP2/GSDMB/IKZF3rs80673780.471.266.E-14European [12]
rs96357260.411.372.E-16Chinese [17]
rs4795395 0.351.424.E-12 Japanese This study
17q21.31 MAPT rs175648290.231.252.E-09European [12]
1919p13.2 TYK2 rs345364430.031.951.E-12European [12]
19p13.3 ARID3A rs104159760.471.304.E-12Chinese [17]
19q13.33 SPIB rs37455160.211.391.E-20European [14]
2222q13.1SYNGR1/PDGFB/RPL3rs22674070.241.291.E-13European [12]
rs1376030.131.373.E-08Chinese [17]

aMAF (minor allele frequency) in 1000 genome project in each population.

bOR (odds ratio).

Non-HLA PBC susceptibility loci in European, Chinese, and Japanese populations. aMAF (minor allele frequency) in 1000 genome project in each population. bOR (odds ratio). Thousands of genetic variations associated with susceptibility to human complex diseases have been identified by GWASs[18]. Among genes located near the “top-hit SNP” in each susceptibility locus, candidate genes with well-known functions are often selected as the “disease susceptibility genes”. The majority of SNPs that regulate gene expression are actually found in the vicinity of genes within 100 kb of the transcription start site (TSS)[19]. However, trans-acting expression quantitative trait loci (e-QTL) variations, whose target transcripts are separated by arbitrary distances, are believed to explain a substantial proportion of the heritable variation in gene expression[20]. For example, although FTO was reported as a susceptibility gene for obesity, the effector genes whose expression levels were influenced by the significantly associated SNPs were not FTO but Iroquois Homeobox 3 (IRX3) and IRX5[21]. In PBC, a locus on chromosome 17q12-21 (ORMDL3-GSDMB-ZPBP2-IKZF3) has been reported as a shared susceptibility locus in different populations. Although the top-hit SNP was located in the IKAROS family zinc finger 3 (IKZF3), in which the function of the protein product is related to the proliferation and differentiation of B cells, the effector gene in this locus was identified as ORMDL sphingolipid biosynthesis regulator 3 (ORMDL3), whose protein product regulates endoplasmic reticulum (ER)-mediated Ca2+ homeostasis and facilitates the unfolded-protein response (UPR)[22,23]. Therefore, understanding the contribution of susceptibility loci to the onset of diseases requires identification of the effector genes that are regulated by the primary functional variation located in the disease susceptibility loci. The present study aimed to further elucidate the genetic architecture of PBC in the Japanese population. To this end, we performed a GWAS and subsequent genome-wide meta-analysis based on a whole-genome SNP imputation analysis with previous GWAS[16]. The PBC susceptibility locus chromosome 3q13.33 (including ARHGAP31, TMEM39A, POGLUT1, TIMMDC1, and CD80) has been identified by GWAS as a PBC susceptibility locus in European and Chinese populations; consequently, this genome-wide meta-analysis involved replicating chromosome 3q13.33 in the Japanese population. Here, we show in silico and in vitro functional analyses and identify the effector gene and the primary functional SNP in the PBC susceptibility locus chromosome 3q13.33.

Results

GWAS and genome-wide meta-analysis

We genotyped an independent set of 1,148 samples (668 PBC cases and 480 healthy controls) using the Affymetrix Japonica V1 Array[24]. Thirty-four samples were excluded by Dish QC (<0.82) or overall call rate for a total of 20,000 SNPs (<0.97) and 13 samples were excluded because of cryptic relatives. A further 13 samples were located far from the JPT cluster drawn using the first and second components after PCA and were removed from further analysis (Supplementary Fig. 1A). We re-genotype called about 2,897 samples (1,392 PBC cases and 1,505 healthy controls) collected in the previous study[16]. Eighteen samples were excluded by Dish QC (<0.82) or overall call rate for a total of 20,000 SNPs (<0.97) and 15 samples were excluded because of cryptic relatives. Seventeen samples were located far from the HapMap JPT cluster drawn using the first and second components after PCA and were removed from further analysis (Supplementary Fig. 1B). A quantile-quantile plot of the distribution of test statistics for the comparison of allele frequencies in the PBC cases and healthy controls provided an inflation factor lambda value of 1.097 for all tested SNPs for the 1,148 entries in the current dataset and a value 1.061 for the 2,897 entries in the previous dataset (Supplementary Fig. 2). Genotype imputation and the association study were separately performed for the two datasets. The process of data cleaning and meta-analysis is summarized in Supplementary Fig. 3. Figure 1 shows a genome-wide view of the single-point association data based on allele frequencies after meta-analysis. The loci HLA, TNFSF15, IL7R, NFKB1/MANBA, and chromosome 17q12-21 showed significant associations with PBC, as reported in the previous GWAS performed on a Japanese population[16]. In addition to these regions, meta-analysis to combine the two datasets showed a significant association in chromosome 3q13.33 (Top hit SNP: rs57271503, OR = 0.7241, P = 3.5 × 10−9, Fig. 2), although this locus appears as evidence of no association with PBC from studies using each platform (Japonica and ASI, Supplementary Fig. 4).
Figure 1

Genome-wide Manhattan plot of the GWAS meta-analysis between two data sets from the ASI and Japonica platforms. Negative log 10 P-values of qualified SNPs are plotted against their genomic positions. The genome-wide significance line (red) is shown at 7.30 (−log10 P [5e-8]). The genome-wide suggestive line (blue) is shown at 5 (−log10 P [1e-5]).

Figure 2

Regional plot of association results and recombination rates for the region surrounding POGLUT1 (chromosome 3: 119,160,000-119,300,300). Each dot shows the P-value of each SNP after meta-analysis. The purple diamond represents the SNP with the minimum P-value in the region. Genetic recombination rates are shown with a blue line.

Genome-wide Manhattan plot of the GWAS meta-analysis between two data sets from the ASI and Japonica platforms. Negative log 10 P-values of qualified SNPs are plotted against their genomic positions. The genome-wide significance line (red) is shown at 7.30 (−log10 P [5e-8]). The genome-wide suggestive line (blue) is shown at 5 (−log10 P [1e-5]). Regional plot of association results and recombination rates for the region surrounding POGLUT1 (chromosome 3: 119,160,000-119,300,300). Each dot shows the P-value of each SNP after meta-analysis. The purple diamond represents the SNP with the minimum P-value in the region. Genetic recombination rates are shown with a blue line.

Identification of rs2293370 as the primary functional SNP in chromosome 3q13.33

Among the 29 SNPs whose P values were less than 1.0 × 10−6 upon genome-wide meta-analysis, SNPs located in the 3′-untranslated region (UTR) and synonymous substitutions were selected as potential candidates for primary functional variation in the chromosome 3q13.33 region (Table 2 and Fig. 2). Five of the 29 SNPs [rs57271503 and rs3830649 in the 3′UTR of CD80, rs2305249 in exon 11 of ARHGAP31 (P567P), rs1131265 in exon 3 of TIMMDC1 (V146V), and rs3732421 in the 3′UTR of TMEM39A] are located in the 3′UTR or synonymous substitutions but are unrelated to their own gene expression as determined by e-QTL analysis[25] (Supplementary Fig. 5) and thus were excluded from further analysis.
Table 2

29 SNPs associated with susceptibility to PBC in the Japanese population in chromosome 3q13.33 by high-density association mapping.

rs numberSNP location (Chr.3)GWAS/imputationAllele 1Allele 2P valueORRegulome DB scoreUCSC (Regulatory Motifs)Location
rs57271503119244593imputationGA3.48E-090.724No data×CD80 3′UTR
rs9855065119130141imputationGA3.57E-090.725No dataΔARHGAP31 intron 11
rs3830649119246385imputationGdel4.66E-090.727No data×CD80 3′UTR
rs2305249119128398imputationGA5.07E-090.7276×ARHGAP31 Exon 11 (P567P)
rs13092998119245044GWAS (Japonica)GT5.45E-090.728No data×CD80 intron 6
rs62264485119237798imputationCA6.00E-090.7286ΔTIMMDC1 intron 6
rs35264490119238753imputationAdel6.00E-090.728No dataΔTIMMDC1 intron 6
rs2293370 119219934 GWAS (ASI, Japonica) G A 6.08E-09 0.728 3a Δ TIMMDC1 intron 2
rs1463138119128634imputationTC6.34E-090.7516×ARHGAP31 intron 11
rs1131265119222456imputationGC6.57E-090.729No data×TIMMDC1 Exon 3 (V146V)
rs1463139119128628imputationAG6.70E-090.749No data×ARHGAP31 intron 11
rs3732421119150089imputationAG9.93E-090.7325×TMEM39A 3′UTR
rs7650774119205050imputationTC1.01E-080.731No data×POGLUT1 intron 6
rs12636784119174383imputationAG1.27E-080.7336×TMEM39A intron 3
rs4687853119130360imputationAG1.47E-080.7546×ARHGAP31 intron 11
rs9843355119228508imputationGA1.79E-080.7356×TIMMDC1 intron 4
rs1530687119114515imputationGA6.26E-080.7675ΔARHGAP31 intron 8
rs9831023119111762GWAS (ASI, Japonica)TC6.98E-080.7685×ARHGAP31 intron 7
rs9884090119116150imputationGA9.34E-080.7535×ARHGAP31 intron 8
rs1000198119113820imputationAC1.34E-070.7716×ARHGAP31 intron 8
rs11922594119125822imputationTC1.50E-070.7715×ARHGAP31 intron 10
rs6773050119123814GWAS(ASI)CT1.62E-070.7746×ARHGAP31 intron 10
rs12494314119122820imputationTC1.90E-070.758No data×ARHGAP31 intron 10
rs4279094119114693GWAS(ASI)AG2.12E-070.7744×ARHGAP31 intron 8
rs9846036119116064imputationAC2.12E-070.7745×ARHGAP31 intron 8
rs56008261 119114927 GWAS(Japonica) T C 2.61E-07 0.776 1b O ARHGAP31 intron 8
rs6776377119115556imputationTC2.81E-070.7765×ARHGAP31 intron 8
rs6787836119115567imputationAG5.85E-070.7825×ARHGAP31 intron 8
rs11715698119118497imputationAG6.61E-070.7885×ARHGAP31 intron 9
29 SNPs associated with susceptibility to PBC in the Japanese population in chromosome 3q13.33 by high-density association mapping. Two of the remaining 24 SNPs had RegulomeDB scores higher than 3 and these scores were supported by their location in DNase hyper-sensitivity clusters and the binding of transcription factor. Consequently, these two SNPs were selected as potential candidates[26] (Table 2; rs2293370 in intron 2 of TIMMDC1 and rs56008261 in intron 8 of ARHGAP31). Both SNPs were located in DNase I hyper-sensitivity clusters and in H3K27Ac markers in at least one cell type identified by the UCSC genome browser[27]. We performed electrophoretic mobility shift assays (EMSAs) to evaluate the effect of candidate SNPs that potentially regulate the binding affinity of transcription factors. A difference in mobility shift between the major allele and the minor allele was detected for rs2293370 in HepG2 (Fig. 3A) and Jurkat (Supplementary Fig. 6A) cells. The shifted band was abrogated by incubation with a 200× concentration of a non-labeled probe (competitor probe) (Fig. 3B and Supplementary Fig. 6B). In contrast, there was no difference in mobility shift for rs56008261 between the major allele and the minor allele (Fig. 3A and Supplementary Fig. 6A).
Figure 3

In vitro functional analysis of each candidate variation in chromosome 3q13.33. (A) EMSA of each candidate primary variation using biotin-labeled probes corresponding to the major and the minor allele, and nuclear extracts of HepG2 cells. rs2293370 was the only SNP to show a difference in mobility shift between the two alleles. (B) Competitor assay, using HepG2 nuclear extracts and a 200× concentration of unlabeled probe corresponding to either the C (i.e., PBC susceptibility) or T alleles of rs2293370. (C) Outline of reporter plasmid constructs. PCR fragments of intron 2 of TIMMDC1, containing rs2293370, were sub-cloned into the pGL4.23 vector. (D) Transcription was measured by cellular luciferase activity, 24 h after transfection of HepG2 cells. The luciferase activities of cells transfected with the PBC susceptibility allele (C allele) of rs2293370 were higher than those transfected with the T allele. Three independent experiments with triplicate measurements were performed for each assay, and data represent mean ± SD; *P < 0.05 (Student’s t-test). (E) Identification of transcription factors targeting the C allele of rs2293370. A super-shift was observed following incubation of HepG2 cell nuclear extracts with an anti-RUNX1 antibody. Three independent experiments were performed in each assay.

In vitro functional analysis of each candidate variation in chromosome 3q13.33. (A) EMSA of each candidate primary variation using biotin-labeled probes corresponding to the major and the minor allele, and nuclear extracts of HepG2 cells. rs2293370 was the only SNP to show a difference in mobility shift between the two alleles. (B) Competitor assay, using HepG2 nuclear extracts and a 200× concentration of unlabeled probe corresponding to either the C (i.e., PBC susceptibility) or T alleles of rs2293370. (C) Outline of reporter plasmid constructs. PCR fragments of intron 2 of TIMMDC1, containing rs2293370, were sub-cloned into the pGL4.23 vector. (D) Transcription was measured by cellular luciferase activity, 24 h after transfection of HepG2 cells. The luciferase activities of cells transfected with the PBC susceptibility allele (C allele) of rs2293370 were higher than those transfected with the T allele. Three independent experiments with triplicate measurements were performed for each assay, and data represent mean ± SD; *P < 0.05 (Student’s t-test). (E) Identification of transcription factors targeting the C allele of rs2293370. A super-shift was observed following incubation of HepG2 cell nuclear extracts with an anti-RUNX1 antibody. Three independent experiments were performed in each assay. Additionally, in order to deny the possibility of the existence of other variations with independent genetic contributions in chromosome 3q13.33, conditional logistic regression analysis was performed. When the rs2293370 was conditioned on, significant associations of other SNPs in chromosome 3q13.33 totally disappeared (Supplementary Fig. 7). These results indicated that rs2293370 is the primary functional SNP in chromosome 3q13.33.

Molecular features of disease susceptibility influenced by rs2293370

We performed luciferase reporter assays in HepG2 and Jurkat cells to determine the differences in transcription efficiency between the C (major allele, PBC-susceptibility) and T (minor allele, PBC-protective) alleles of rs2293370. Concordant with the result of EMSA, the luciferase activity of cells 24 h after transfection with a reporter construct containing the C allele of rs2293370 was significantly higher than that of cells containing the T allele for both cell lines (Fig. 3C,D and Supplementary Fig. 6C). Next, we performed in silico prediction of transcription factor binding using the TRANSFAC database[28] to identify the transcription factor responsible for the mobility shift associated with the C allele of rs2293370. The C allele of rs2293370, but not the T allele, was predicted to be involved in a binding motif of Runt-related transcription factors (Supplementary Fig. 8). Although the DMRT and Myb families also showed similar patterns, they are not expressed in HepG2 or Jurkat cells[29]. Of the Runt-related transcription factors, Runt-related transcription factor (RUNX1) -1, but not RUNX-2 and RUNX-3, was confirmed to be expressed in both HepG2 and Jurkat cells[29] (Supplementary Fig. 9). Consistent with the in silico prediction of transcription factor binding, the mobility shift associated with the C allele of rs2293370 was reduced by pre-incubation with an anti-RUNX1 antibody prior to electrophoresis (Fig. 3E). These results indicated that the PBC susceptibility allele of rs2293370 enhances transcription via binding with RUNX1.

The mRNA expression level of POGLUT1 is influenced by rs2293370

We used the GTEx portal database[25] to assess the influence of rs2293370 on endogenous gene expression by comparing the expression levels of all genes in the human genome for the different genotypes of rs2293370 in every organ whose expression level of POGLUT1 was above the threshold for detection. Individuals carrying the C allele (i.e., the PBC-susceptible allele) of rs2293370 showed a significantly higher level of expression of POGLUT1 in several organs (Fig. 4; statistical significance level: P < 0.05/47 organs = 0.00106). Other genes located in chromosome 3q13.33 (ARHGAP31, TMEM39A, POGLUT1, TIMMDC1, and CD80) showed no significant association between rs2293370 genotypes and gene expression (Supplementary Fig. 10).
Figure 4

rs2293370 genotypes are associated with differences in endogenous POGLUT1 expression levels. (A) The relationship between rs2293370 genotype and the endogenous expression of POGLUT1 was compared in all tissues registered in the GTEX database. The effect sizes of the rs2293370 minor allele (T allele; disease protective) in every organ are shown. Statistical significance levels before Bonferroni multiple comparison correction were P = 0.00116. *Pc < 0.05, **Pc < 0.005, and ***Pc < 0.0005, after Bonferroni multiple comparison correction. (B and C) Box plots showing the association between endogenous POGLUT1 expression and rs2293370 genotypes in (B). Transformed Fibroblast and (C) Whole Blood.

rs2293370 genotypes are associated with differences in endogenous POGLUT1 expression levels. (A) The relationship between rs2293370 genotype and the endogenous expression of POGLUT1 was compared in all tissues registered in the GTEX database. The effect sizes of the rs2293370 minor allele (T allele; disease protective) in every organ are shown. Statistical significance levels before Bonferroni multiple comparison correction were P = 0.00116. *Pc < 0.05, **Pc < 0.005, and ***Pc < 0.0005, after Bonferroni multiple comparison correction. (B and C) Box plots showing the association between endogenous POGLUT1 expression and rs2293370 genotypes in (B). Transformed Fibroblast and (C) Whole Blood.

Discussion

In the present study we identified chromosome 3q13.33, which includes the genes ARHGAP31, TMEM39A, POGLUT1, TIMMDC1, and CD80, as a PBC susceptibility locus in the Japanese population by genome-wide meta-analysis based on whole-genome SNP imputation analysis of two distinct data sets of Japanese PBC-GWAS. The role of chromosome 3q13.33 had previously been identified in European and Chinese populations. Consequently, rs2293370, which is located in intron 2 of TIMMDC1, was identified as the primary functional SNP for disease susceptibility to PBC in chromosome 3q13.33 by in silico and in vitro functional analyses. In addition, the disease protective allele of rs2293370 was shown to disrupt a RUNX1 binding site and was associated with significantly decreased POGLUT1 mRNA expression levels in tissues compared with individuals without this allele. The contribution of POGLUT1 to the pathogenesis of PBC has not been reported to date. Endoplasmic reticulum (ER)-localized protein O-glucosyltransferase 1, which is encoded by POGLUT1, adds glucose moieties to serine residues of the epidermal growth factor (EGF)-like domains of Notch family proteins[30,31]. Notch signaling is an evolutionally conserved cascade that includes four receptors (Notch 1–4) and five ligands [Jagged 1, Jagged 2, Delta-like ligand 1 (DLL1), DLL3 and DLL4]. Therefore, it might be possible that genetic polymorphisms affecting the expression levels of POGLUT1 influence the Notch signaling pathway by altering Notch glycosylation. The generation and development of diverse blood cell lineages and peripheral immune responses are regulated by this Notch signaling cascade, especially in hematopoiesis during T cell lineage commitment and maturation in the thymus, and during marginal zone B (MZB) cell development in the spleen[32]. Recently, dendritic cell (DC) homeostasis and the development of several lymphocyte subsets belonging to the innate immune system have been reported to be regulated by Notch[32]. Therefore, inappropriate immune responses against self-antigens could occur due to impaired regulation of Notch signaling. In experimental autoimmune encephalomyelitis (EAE) and non-obese diabetic (NOD) mice, which are mouse models for multiple sclerosis and type 1 diabetes (T1D), respectively, disease progression was impeded by the administration of blocking antibodies against Notch receptors or DLL4[32]. Therefore, higher endogenous levels of POGLUT1 caused by the PBC-susceptible allele of rs2293370 may induce excessive Notch signaling and inappropriate immune responses against self-antigens. Very importantly, Notch signaling is also involved in the development or formation of intrahepatic bile ducts. Mutations in JAG1 or Notch2 are known causes of Alagille syndrome, an autosomal dominant disease characterized by congenital cholangiopathy with jaundice and bile duct paucity[33-35]. POGLUT1 was shown to regulate the number of bile ducts around portal veins in a JAG1-dependent manner using JAG1+/− and POGLUT1+/− mice[36]. These results indicate that POGLUT1 might be involved in the mechanism of bile duct loss in PBC. However, a limitation of this study was that endogenous POGLUT1 expression levels in PBC patients were not examined in this study. Additional studies are warranted to improve our understanding of the relationship between PBC pathogenesis and POGLUT1. A more than 100-kb stretch of the genome is located in chromosome 3q13.33 that includes ARHGAP31, TMEM39A, POGLUT1, TIMMDC1, and CD80. In addition to the present study on the Japanese population, this locus, as represented by top-hit SNP rs2293370, has been reported as a susceptibility region to PBC in European and Chinese populations[10,14,17]. The present study identified rs2293370 as the primary functional SNP by in silico and in vitro functional analysis. Therefore, POGLUT1 is likely the effector gene for susceptibility to PBC in not only the Japanese population but also other populations. This locus has also been reported as a susceptibility region for celiac disease, multiple sclerosis, systemic lupus erythematosus, and vitiligo, represented by rs11712165 in ARHGAP31, rs2293370 in TIMMDC1, and rs6804441 and rs148136154 in CD80, respectively[37-40]. The LDs were not strong between these represented tag-SNPs and rs2293370, which was identified as the primary functional SNP in this study (r2 < 0.4 in 1000 genomes Asian, Supplementary Fig. 11). Regardless, rs2293370, whose effector gene is POGLUT1, may operate as a functional SNP in both PBC and multiple sclerosis. The present study, compared with the GWAS in the European descent, identified four out of 30 non-HLA loci (IL7R, NFKB1/MANBA, chromosome 17q12-21, and chromosome 3q13.33) as susceptibility loci in the Japanese population. Additionally, even though the p-values were below the genome-wide significance level (P < 5.0 × 10−8), the direction of OR was the same between a European descent population and the Japanese population in MMEL1, STAT4/STAT1, IL12A, CXCR5/DDX6, and SPIB (Table 1). These loci are likely candidates as shared susceptibility loci for the pathogenesis of PBC between European and Japanese populations. We are currently working to clarify the shared gene profiles between different populations by conducting SNP imputation and subsequent meta-analysis using GWAS data from an international collaboration involving the UK, Italy, the USA, Canada, China and Japan. In conclusion, genome-wide meta-analysis together with in silico/in vitro functional analyses identified the primary functional SNP rs2293370 and the effector gene POGLUT1. Chromosome 3q13.33 contains CD80, which encodes a well-known co-stimulatory signaling molecule necessary for antigen presentation from HLA class II to T cell receptor (TCR). CD80 had been assumed as a candidate effector gene in previous GWASs, whereas our approach identified POGLUT1 as the target transcript for disease susceptibility in this locus comprehensively without stereotypes. Of the PBC susceptibility genes identified in the Japanese population, we previously identified the primary functional SNPs of TNFSF15, PRKCB, and chromosome 17q12-21, and their effector genes[16,22,41], as well as chromosome 3q13.33 in the present study. Similar post-GWAS approaches for susceptibility genes are needed to further clarify the molecular mechanisms of disease development.

Materials and Methods

Subjects

DNA samples for GWAS using the Japonica array platform were collected from 1,148 individuals (668 PBC cases and 480 healthy controls). The demographics of the PBC cases and controls are shown in Supplementary Table 1. Written informed consent was obtained from all participants. The protocol of this study was approved by the committee on research ethics and genetically modified organisms of the Graduate School of Medicine, The University of Tokyo, by the ethics committees of Nagasaki Medical Center, and by the ethics committees of Tohoku Medical Megabank Organization, Tohoku University. All experiments were performed in accordance with relevant guidelines and regulations.

Genotyping, quality control, and genotype imputation

Genotyping was performed using the Japonica V1 array (1,148 Japanese individuals in the present study; Affymetrix Japan). Genotype calling was performed with the apt-probeset-genotype program in Affymetrix Power Tools ver. 1.18.2 (Thermo Fisher Scientific Inc., Waltham, MA). Sample quality control was conducted by following the manufacturer’s recommendation (dish QC > 0.82 and sample call rate > 97%). Clustering of each SNP was evaluated by the Ps classification function in the SNPolisher package (version 1.5.2, Thermo Fisher Scientific Inc.). SNPs that were assigned “recommended” by the Ps classification function were used for downstream analyses. SNPs that satisfied the following criteria were used for genotype imputation: a call rate > 99.0%, a Hardy-Weinberg Equilibrium (HWE) p-value > 0.0001, and a minor allele frequency (MAF) > 0.5%. Pre-phasing was conducted with EAGLE v2.3.2[42] with default settings. Genotype imputation was conducted with IMPUTE4 v1.0[43] using a phased reference panel of 2,049 Japanese individuals from a prospective, general population cohort study performed by the Tohoku Medical Megabank Organization (ToMMo)[44,45]. These procedures were conducted using default settings. Cryptic relatives were excluded using PRIMUS[46] with default settings. In addition, principal component analysis (PCA) was performed using East Asian samples from the International 1000 Genome Project (104 JPT, 103 CHB, 93 CHS, 91 CDX, and 99 KHV samples) in addition to the case and control samples. PCA identified outliers to be excluded using the Smirnov-Grubbs test with a Bonferroni corrected p-value < 0.05. We had previously analyzed the data (2,897 Japanese individuals) using the Axiom Genome-Wide ASI 1 Array[16] and re-analyzed the data using the above-mentioned procedures.

Association analysis and meta-analysis

Association analysis was performed with PLINK (version 1.9) in each dataset (i.e., 2,897 ASI array data and 1,148 Japonica array data). The following options were used for PLINK: a call rate > 97.0%, a HWE p-value > 0.000001, a minor allele frequency (MAF) > 0.1%, and a logistic regression model. Meta-analysis was performed using PLINK with the meta-analysis option after excluding duplicates between the two datasets. The fixed-effects meta-analysis p-value was used.

Databases

The probability that a candidate functional variation might influence transcription regulation was evaluated using the RegulomeDB database (http://www.regulomedb.org/index)[26] and the UCSC genome browser (http://genome.ucsc.edu/index.html)[27]. Transcription factor binding was predicted using TRANSFAC Professional (QIAGEN, Valencia CA; http://www.gene-regulation.com/pub/databases.html)[28]. Gene expression levels in each cell line and the correlation between the genotypes of candidate SNPs and gene expression were examined using GeneCards (http://www.genecards.org/) and the GTEx portal database (http://gtexportal.org/home/), respectively[25,29]. P values < 0.05, after adjustment for multiple testing (Bonferroni correction), were regarded as statistically significant.

Electrophoretic mobility shift assay (EMSA)

EMSA was performed using a LightShift Chemiluminescent EMSA Kit (Thermo-Fisher Scientific) and biotin-labeled double-stranded oligonucleotide probes corresponding to each major and minor allele (Supplementary Table 2), according to the manufacturer’s instructions. These oligonucleotide probes (10 fmol/μL) and a nuclear extract (2.5 μg/mL) of HepG2 or Jurkat cells (Nuclear Extract Kit; Active Motif, Carlsbad, CA) were incubated together for 30 min at 25 °C. The super-shift assay was performed by incubating the biotin-labeled probe with the nuclear extracts for 30 min at 25 °C, before subsequently incubating these complexes with Anti-RUNX1/AML1 antibody - ChIP Grade (ab23980) (Abcam, Cambridge, UK) for 30 min at 25 °C. Each assay was independently performed three times.

Luciferase reporter assay

Amplicons of part of the 2nd intron of TIMMDC1, which contain each allele of rs2293370, were obtained from human genomic DNA using specific PCR primers (Supplementary Table 3) and were then subcloned into the luciferase reporter pGL4.23 (luc2/minP) vector (Promega, Madison, WI). pGL4.23 constructs (500 ng) of each allele and 50 ng of the pGL4.74 (hRluc/TK) vector as an internal control were transfected into Jurkat and HepG2 cells using Lipofectamine 3000 (Thermo-Fisher Scientific). The Dual-Luciferase Reporter Assay system (Promega) was used to measure luciferase activity. Differences in relative luciferase activity were compared between the major and minor alleles of each SNP using Student’s t-test. P values < 0.05 were regarded as statistically significant. Each figure shows representative data from experiments performed independently three times. The data in the figures represent the mean ± standard deviation of triplicate assays in a single experiment. Supplementary data
  45 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

2.  The asthma-associated ORMDL3 gene product regulates endoplasmic reticulum-mediated calcium signaling and cellular stress.

Authors:  Gerard Cantero-Recasens; César Fandos; Fanny Rubio-Moscardo; Miguel A Valverde; Rubén Vicente
Journal:  Hum Mol Genet       Date:  2010-01-01       Impact factor: 6.150

3.  PRIMUS: rapid reconstruction of pedigrees from genome-wide estimates of identity by descent.

Authors:  Jeffrey Staples; Dandi Qiao; Michael H Cho; Edwin K Silverman; Deborah A Nickerson; Jennifer E Below
Journal:  Am J Hum Genet       Date:  2014-10-30       Impact factor: 11.025

Review 4.  Regulation of innate and adaptive immunity by Notch.

Authors:  Freddy Radtke; H Robson MacDonald; Fabienne Tacchini-Cottier
Journal:  Nat Rev Immunol       Date:  2013-05-13       Impact factor: 53.106

5.  Human primary biliary cirrhosis-susceptible allele of rs4979462 enhances TNFSF15 expression by binding NF-1.

Authors:  Yuki Hitomi; Minae Kawashima; Yoshihiro Aiba; Nao Nishida; Mika Matsuhashi; Hitoshi Okazaki; Minoru Nakamura; Katsushi Tokunaga
Journal:  Hum Genet       Date:  2015-04-22       Impact factor: 4.132

Review 6.  From expression QTLs to personalized transcriptomics.

Authors:  Stephen B Montgomery; Emmanouil T Dermitzakis
Journal:  Nat Rev Genet       Date:  2011-03-09       Impact factor: 53.242

7.  Variants at IRF5-TNPO3, 17q12-21 and MMEL1 are associated with primary biliary cirrhosis.

Authors:  Gideon M Hirschfield; Xiangdong Liu; Younghun Han; Ivan P Gorlov; Yan Lu; Chun Xu; Yue Lu; Wei Chen; Brian D Juran; Catalina Coltescu; Andrew L Mason; Piotr Milkiewicz; Robert P Myers; Joseph A Odin; Velimir A Luketic; Danute Speiciene; Catherine Vincent; Cynthia Levy; Peter K Gregersen; Jinyi Zhang; E Jenny Heathcote; Konstantinos N Lazaridis; Christopher I Amos; Katherine A Siminovitch
Journal:  Nat Genet       Date:  2010-07-18       Impact factor: 38.330

8.  Primary biliary cirrhosis in monozygotic and dizygotic twins: genetics, epigenetics, and environment.

Authors:  Carlo Selmi; Marlyn J Mayo; Nancy Bach; Hiromi Ishibashi; Pietro Invernizzi; Robert G Gish; Stuart C Gordon; Harlan I Wright; Bruce Zweiban; Mauro Podda; M Eric Gershwin
Journal:  Gastroenterology       Date:  2004-08       Impact factor: 22.682

9.  Japonica array: improved genotype imputation by designing a population-specific SNP array with 1070 Japanese individuals.

Authors:  Yosuke Kawai; Takahiro Mimori; Kaname Kojima; Naoki Nariai; Inaho Danjoh; Rumiko Saito; Jun Yasuda; Masayuki Yamamoto; Masao Nagasaki
Journal:  J Hum Genet       Date:  2015-06-25       Impact factor: 3.172

10.  Dense fine-mapping study identifies new susceptibility loci for primary biliary cirrhosis.

Authors:  Jimmy Z Liu; Mohamed A Almarri; Daniel J Gaffney; George F Mells; Luke Jostins; Heather J Cordell; Samantha J Ducker; Darren B Day; Michael A Heneghan; James M Neuberger; Peter T Donaldson; Andrew J Bathgate; Andrew Burroughs; Mervyn H Davies; David E Jones; Graeme J Alexander; Jeffrey C Barrett; Richard N Sandford; Carl A Anderson
Journal:  Nat Genet       Date:  2012-09-09       Impact factor: 38.330

View more
  9 in total

1.  Other Types of Glycosylation.

Authors:  Yohei Tsukamoto; Hideyuki Takeuchi
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 3.650

2.  rs2013278 in the multiple immunological-trait susceptibility locus CD28 regulates the production of non-functional splicing isoforms.

Authors:  Yuki Hitomi; Yoshihiro Aiba; Kazuko Ueno; Nao Nishida; Yosuke Kawai; Minae Kawashima; Makoto Tsuiji; Chisato Iwabuchi; Sanami Takada; Noriko Miyake; Masao Nagasaki; Katsushi Tokunaga; Minoru Nakamura
Journal:  Hum Genomics       Date:  2022-10-21       Impact factor: 6.481

3.  Integrated GWAS and mRNA Microarray Analysis Identified IFNG and CD40L as the Central Upstream Regulators in Primary Biliary Cholangitis.

Authors:  Kazuko Ueno; Yoshihiro Aiba; Yuki Hitomi; Shinji Shimoda; Hitomi Nakamura; Olivier Gervais; Yosuke Kawai; Minae Kawashima; Nao Nishida; Seik-Soon Kohn; Kaname Kojima; Shinji Katsushima; Atsushi Naganuma; Kazuhiro Sugi; Tatsuji Komatsu; Tomohiko Mannami; Kouki Matsushita; Kaname Yoshizawa; Fujio Makita; Toshiki Nikami; Hideo Nishimura; Hiroshi Kouno; Hirotaka Kouno; Hajime Ohta; Takuya Komura; Satoru Tsuruta; Kazuhiko Yamauchi; Tatsuro Kobata; Amane Kitasato; Tamotsu Kuroki; Seigo Abiru; Shinya Nagaoka; Atsumasa Komori; Hiroshi Yatsuhashi; Kiyoshi Migita; Hiromasa Ohira; Atsushi Tanaka; Hajime Takikawa; Masao Nagasaki; Katsushi Tokunaga; Minoru Nakamura
Journal:  Hepatol Commun       Date:  2020-03-15

4.  Xylosyl Extension of O-Glucose Glycans on the Extracellular Domain of NOTCH1 and NOTCH2 Regulates Notch Cell Surface Trafficking.

Authors:  Yusuke Urata; Wataru Saiki; Yohei Tsukamoto; Hiroaki Sago; Hideharu Hibi; Tetsuya Okajima; Hideyuki Takeuchi
Journal:  Cells       Date:  2020-05-14       Impact factor: 6.600

Review 5.  Structure, function, and pathology of protein O-glucosyltransferases.

Authors:  Muhammad Zubair Mehboob; Minglin Lang
Journal:  Cell Death Dis       Date:  2021-01-12       Impact factor: 8.469

6.  rs1944919 on chromosome 11q23.1 and its effector genes COLCA1/COLCA2 confer susceptibility to primary biliary cholangitis.

Authors:  Yuki Hitomi; Yoshihiro Aiba; Yosuke Kawai; Kaname Kojima; Kazuko Ueno; Nao Nishida; Minae Kawashima; Olivier Gervais; Seik-Soon Khor; Masao Nagasaki; Katsushi Tokunaga; Minoru Nakamura; Makoto Tsuiji
Journal:  Sci Rep       Date:  2021-02-25       Impact factor: 4.379

Review 7.  Genomics of Human Fibrotic Diseases: Disordered Wound Healing Response.

Authors:  Rivka C Stone; Vivien Chen; Jamie Burgess; Sukhmani Pannu; Marjana Tomic-Canic
Journal:  Int J Mol Sci       Date:  2020-11-14       Impact factor: 5.923

8.  Regional heritability mapping identifies several novel loci (STAT4, ULK4, and KCNH5) for primary biliary cholangitis in the Japanese population.

Authors:  Olivier Gervais; Kazuko Ueno; Yosuke Kawai; Yuki Hitomi; Yoshihiro Aiba; Mayumi Ueta; Minoru Nakamura; Katsushi Tokunaga; Masao Nagasaki
Journal:  Eur J Hum Genet       Date:  2021-04-09       Impact factor: 4.246

9.  APASL clinical practice guidance: the diagnosis and management of patients with primary biliary cholangitis.

Authors:  Hong You; Xiong Ma; Cumali Efe; Guiqiang Wang; Sook-Hyang Jeong; Kazumichi Abe; Weijia Duan; Sha Chen; Yuanyuan Kong; Dong Zhang; Lai Wei; Fu-Sheng Wang; Han-Chieh Lin; Jin Mo Yang; Tawesak Tanwandee; Rino A Gani; Diana A Payawal; Barjesh C Sharma; Jinlin Hou; Osamu Yokosuka; A Kadir Dokmeci; Darrell Crawford; Jia-Horng Kao; Teerha Piratvisuth; Dong Jin Suh; Laurentius A Lesmana; Jose Sollano; George Lau; Shiv K Sarin; Masao Omata; Atsushi Tanaka; Jidong Jia
Journal:  Hepatol Int       Date:  2022-02-04       Impact factor: 6.047

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.