| Literature DB >> 31616042 |
Anja Barešić1, Alexander Jolyon Nash1,2, Tarik Dahoun1,2,3, Oliver Howes1,2, Boris Lenhard4,5,6.
Abstract
Recent genome-wide association studies have identified numerous loci associated with neuropsychiatric disorders. The majority of these are in non-coding regions, and are commonly assigned to the nearest gene along the genome. However, this approach neglects the three-dimensional organisation of the genome, and the fact that the genome contains arrays of extremely conserved non-coding elements termed genomic regulatory blocks (GRBs), which can be utilized to detect genes under long-range developmental regulation. Here we review a GRB-based approach to assign loci in non-coding regions to potential target genes, and apply it to reanalyse the results of one of the largest schizophrenia GWAS (SWG PGC, 2014). We further apply this approach to GWAS data from two related neuropsychiatric disorders-autism spectrum disorder and bipolar disorder-to show that it is applicable to developmental disorders in general. We find that disease-associated SNPs are overrepresented in GRBs and that the GRB model is a powerful tool for linking these SNPs to their correct target genes under long-range regulation. Our analysis identifies novel genes not previously implicated in schizophrenia and corroborates a number of predicted targets from the original study. The results are available as an online resource in which the genomic context and the strength of enhancer-promoter associations can be browsed for each schizophrenia-associated SNP.Entities:
Mesh:
Year: 2019 PMID: 31616042 PMCID: PMC6906185 DOI: 10.1038/s41380-019-0518-x
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Fig. 1Long-range gene regulation. a An enhancer can target a gene over large genomic distances. b A genomic regulatory block (GRB) spanning a 1.9 Mb region of the human genome. This region displays high levels of non-coding conservation between the human and four vertebrate genomes, reflecting evolution over ~435 million years. The non-coding conservation peaks around IRX3, the predicted target gene in this GRB. The black vertical line and blue rectangle mark the genomic position of the obesity-associated SNPs (rs1421085 and rs9939609) from the Ragvin et al. study, and the linkage disequilibrium block around it, respectively. Each SNP falls within an enhancer that is capable of activating the expression of the IRX3 gene, shown in red. c The GRB model: conserved non-coding elements (shown in green) within a GRB contact the target gene’s promoter (shown in red) through looping and regulate its transcriptional activity in multiple contexts. SNPs in the linkage disequilibrium with regulatory elements involved in long-range gene regulation are often erroneously assigned to the nearest bystander gene, associating the wrong gene to the disease phenotype
A table of loci from the PGC study [42] overlapping a genomic regulatory block, with target genes assigned to each locus by the GWAS study and by the GRB method
| Locus coordinates | GWAS SNPs | GWAS genes | GRB target genes | GRB by stander genes |
|---|---|---|---|---|
| chr5:60499143–60843543 | rs4391122 | ZSWIM6 | SMIM15 | NDUFAF2,ZSWIM6,RPL3P6,C5orf64,RNU6-913P,RN7SKP157 |
| chr2:200715237–200848037 | chr2_200825237_I | AC073043.2 C2orf47 C2orf69 TYW5 | SATB2 | PLCL1,RNU7-147P,SATB2-AS1,SEPHS1P6,FTCDNL1 |
| chr7:110843815–111205915 | rs13240464 | IMMP2L | LRRN3 | IMMP2L,DOCK4,DOCK4-AS1 |
| chr11:130714610–130749330 | rs10791097 | SNX19 | ADAMTS15,NTM,OPCML,SPATA19,IGSF9B | ADAMTS8,BAK1P2,C11orf44,PPP1R10P1,SNX19,RN7SL167P,RNU6ATAC12P,NTM-IT,RNU6-1182P,OPCML-IT2,OPCML-IT1,MIR4697,JAM3 |
| chrX:21193266–21570266 | rs1378559 | CNKSR2 | KLHL34 | CNKSR2,SMPX |
| chr18:52747686–53200117 | chr18_52749216_D, rs78322266, rs9636107 | TCF4 | CCDC68 | RAB27B,MAP1LC3P,RNA5SP459,TCF4,MIR4529,RPL21P126 |
| chr3:180588843–181205585 | chr3_180594593_I, rs9841616 | CCDC39 DNAJC19 FXR1 | SOX2 | FXR1,DNAJC19,SOX2-OT,RNU6-4P,FAUP2,RPL7AP25,RN7SL703P,RNA5SP150,RN7SKP265,RPL7L1P8 |
| chr2:57943593–58502192 | rs11682175, rs75575209 | FANCL VRK2 | BCL11A | VRK2,FANCL,EIF3FP3,LINC01122,RNU6-508P,RNA5SP94,RNU1-32P,MIR4432,RN7SL361P,RNU6-612P,ATP1B3P1,PAPOLG |
| chr18:53453389–53804154 | rs72934570, rs715170 | TCF4* | CCDC68 | RAB27B,MAP1LC3P,RNA5SP459,TCF4,MIR4529,RPL21P126 |
| chr3:2532786–2561686 | rs17194490 | CNTN4 | CNTN6 | RN7SL120P,RPL23AP39,RPL21P17,RN7SKP144,CNTN4,CNTN4-AS2,DNAJC19P4,CNTN4-AS1,IL5RA |
| chr3:52541105–52903405 | rs2535627 | GLT8D1 GNL3 ITIH1 ITIH3 | SEMA3G,TNNC1,NT5DC2 | PHF7,NISCH,STAB1,SMIM4,PBRM1,RNU6-856P,RNU6ATAC16P |
| chrX:68377126–68379036 | rs5937157 | PJA1 | EFNB1,FAM155B | STARD8,ACTR3P2,SERBP1P1,PJA1,HMGN1P35,LINC00269,CYCSP43,EDA |
| chr17:2095899–2220799 | rs4523957 | SGSM2 SMG6 SRR TSR1 | SCARF1,RILP,TLCD2,SERPINF2,RTN4RL1,HIC1,MNT | SLC43A2,RN7SL105P,PRPF8,MIR22HG,WDR81,SERPINF1,SMYD4,RPA1,DPH1,OVCA2,MIR132,MIR212,SMG6,RN7SL624P,SRR,HNRNPA1P16,TSR1,SNORD91B,SNORD91A,SGSM2,METTL16 |
| chr15:61831663–61909663 | rs12903146 | VPS14C* | RORA | NARG2,CYCSP38,RNA5SP397 |
| chr22:42315744–42689414 | rs1023500, rs6002655 | CENPM CYP2D6 FAM109B NAGA NDUFA6 SEPT3 SHISA8 SMDT1 SREBF2 TCF20 TNFRSF13C WBP2NL | NFAM1 | TCF20 |
| chr2:146416922–146441832 | chr2_146436222_I | ARHGAP15,ZEB2 | KYNU,MTND6P11,MTND5P24,MTND4P22,MTND3P9,GTDC1,ZEB2-AS1,TEX41,RPL6P5,RNU7-2P,RPL17P12,PABPC1P2 | |
| chr1:243503719–244002945 | rs10803138, rs77149735, rs14403, chr1_243881945_I | AKT3 SDCCAG8 | ZBTB18 | SDCCAG8,MIR4677,AKT3,FABP7P1,AKT3-IT1,RN7SL148P |
| chr3:17221366–17888266 | rs4330281 | TBC1D5 | SATB1 | TBC1D5,PDCL3P3,RAD23BP1,RNU6-138P |
| chr8:60475469–60954469 | rs6984242 | CA8* | TOX | RNU4-50P,RNA5SP267,SLC2A13P1,CA8 |
| chr14:30189985–30190316 | rs2068012 | PRKD1 | FOXG1 | C14orf23,RNU6-864P,RNU11-5P,PRKD1,RNU6-1234P |
| chr4:23366403–23443403 | rs215411 | MIR548AJ2* | PPARGC1A,DHX15,SOD3 | GBA3,CDC42P6,RFPL4AP3,MIR573,RN7SL16P,ATP5LP3,HNRNPA1P65,CCDC149,LGI2 |
| chr7:110034393–110106693 | rs211829 | IMMP2L* | LRRN3 | IMMP2L,DOCK4,DOCK4-AS1 |
| chr7:131539263–131567263 | rs7801375 | PODXL* | PLXNA4,LRGUK | CHCHD3,EXOC4,COX5BP3,SLC35B4 |
| chr1:177247821–177300821 | rs6670165 | FAM5B | ASTN1,BRINP2,SEC16B | PAPPA2,PTP4A1P7,MIR488,RASAL2-AS1,RASAL2 |
| chr1:207912183–208024083 | rs7523273 | C1orf132 CD46 CR1L | PLXNA2 | C1orf132,CD34,RPS26P13,ATP5G2P1,MIR205HG |
| chr12:92243186–92258286 | rs4240748 | C12orf79* | BTG1 | C12orf79,RPL21P106 |
| chr12:103559855–103616655 | rs10860964 | C12orf42 | PAH,ASCL1 | IGF1,LINC00485,RNU7-184P,C12orf42 |
Only a subset with completely novel target genes by the GRB method proposed is presented here; for the full set of loci from the PGC dataset, see Supplementary Table S1
*Denotes nearest non-overlapping gene to the locus, as defined in the Supplementary Table S3 in [42]
Fig. 2Long-range regulation in neurodevelopmental GWAS loci. a LD blocks that do not overlap GRBs are shown in grey. Loci in which the predicted GRB target gene was identified as schizophrenia associated in the original GWAS are shown in light red, and the loci in which the GRB model provides novel target gene predictions are shown in dark red. b The distribution of overlaps with the genomic regulatory blocks of the same number of random regions. P-value calculated based on N = 10,000 randomisations
Fig. 3A schematic view of four genomic regulatory blocks. Arrows represent genes, with their orientation indicated by the direction of the arrow. The lower plot in each example is a matrix of log expression values (in TPM) for each enhancer–promoter pair in that GRB. The distribution of expression in tissues where the enhancer is inactive or active is shown in grey and pink, respectively, and the number of samples where each enhancer is (in)active is given under the enhancer label. The black line represents the median expression for each distribution. Significant differences between the medians of the two categories are marked with *p < 0.05, **p < 0.01 and ***p < 0.001 (permutation test, see Supplementary methods). Note, the schematics are not to scale and merely represent gene order (for to-scale plots, visit http://scz.genereg.net/)