| Literature DB >> 33208108 |
Ju Hun Choi1, Taegun Kim1, Junghyun Jung2, Jong Wha J Joo3.
Abstract
BACKGROUND: Regulatory hotspots are genetic variations that may regulate the expression levels of many genes. It has been of great interest to find those hotspots utilizing expression quantitative trait locus (eQTL) analysis. However, it has been reported that many of the findings are spurious hotspots induced by various unknown confounding factors. Recently, methods utilizing complicated statistical models have been developed that successfully identify genuine hotspots. Next-generation Intersample Correlation Emended (NICE) is one of the methods that show high sensitivity and low false-discovery rate in finding regulatory hotspots. Even though the methods successfully find genuine hotspots, they have not been widely used due to their non-user-friendly interfaces and complex running processes. Furthermore, most of the methods are impractical due to their prohibitively high computational complexity.Entities:
Keywords: Google cloud compute engine (GCE); PLINK; Parallel processing; Regulatory hotspot; VCF (variant call format); Web-based tool; eQTL
Mesh:
Year: 2020 PMID: 33208108 PMCID: PMC7677835 DOI: 10.1186/s12864-020-07012-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1NICER website composition. Sequence of instructions to use NICER. (1) Select input data format among PLINK, VCF, and NICE format. (2) Select number of threads to use in the analysis. (3) Enter an email address. (4) Upload required input files. (5) Click submit button to start the analysis. Result file will be provided via email. Users can choose to run the analysis on either the NICER server (A) or GCE (B)
Fig. 2Time performance comparison between NICE and NICER. Using different numbers of threads, the x-axis corresponds to the number of SNPs analyzed and the y-axis corresponds the running time in hours. The yeast dataset is used for the test. The blue, orange, and gray bars show the performance of NICE, NICER using five processes, and NICER using 10 processes. The performance of the rightmost three bars are extrapolated from the results of 1000 SNPs
Fig. 3Hotspot identification of the yeast dataset. a The eQTL map showing the significant eQTLs across the whole genome. The x-axis corresponds to the SNP positions, and the y-axis corresponds to the gene positions. b The number of eQTLs (linkages) plotted in genome location. The numbers show the identified eQTL hotspots (Table 1). The dashed red line represents the threshold, and the asterisks indicate the previously identified hotspots [14–20]
Results of hotspots identified by NICER using the yeast dataset. Putative regulators identified by previous studies are denoted in boldface [14–20]
| eQTL hotspot | eQTL hotspot location | Putative regulator |
|---|---|---|
| 1* | chrII:117298_A/G | |
| 2 | chrII:466295_A/G | TKL2, ALG1, YBR109W-A, YBR116C |
| 3 | chrII:498623_C/T | CKS1, VMA2, MEO1, YBR126W-B, CCZ1 |
| 4 | chrII:586961_C/T | UMP1, YPC1, EHT1, RPS6B, SMP1, FZO1 |
| 5 | chrIV:266824_T/C | TMA17, QRI1, MSS2, QRI7, NSE4, TRM3 |
| 6 | chrIV:476833_C/T | GCV1, DAS2 |
| 7* | chrIV:637663_A/T | YDR098C-B, |
| 8* | chrVI:96802_G/A | |
| 9* | chrVII:362516_A/T | RPL7A, MPC1, HNM1, |
| 10* | chrVIII:114337_G/A | |
| 11* | chrVIII:173637_G/T | DAP2, YHR033W, SLT2, |
| 12* | chrVIII:463786_G/A | |
| 13 | chrIX:216335_C/T | SPO22, THS1, SER33, YIL077C, SEC28 |
| 14 | chrX:489283_A/G | RAD26, CPR7, RAV1, RBH2 |
| 15 | chrX:568363_G/A | MOG1, OPI3, HAM1, HOC1, NPA3, CDC11 |
| 16* | chrXI:232567_T/C | AAT1, HAP4, |
| 17* | chrXII:514845_A/C | YLR179C, TFS1, VTA1, SAM1, TOS4, |
| 18* | chrXII:650260_C/T | |
| 19* | chrXIII:46211_C/T | BUL2, |
| 20 | chrXIII:110807_T/C | CPR3, HMG1, YML079W, WAR1, YML082W, TDA9, DUS1 |
| 21 | chrXIII:333449_G/A | RCH1, IMP2, MIH1, FAR8 |
| 22 | chrXIII:409642_T/A | AVO2 |
| 23* | chrXIV:467219_A/G | |
| 24* | chrXIV:571166_T/C | HHT2, SIW14, |
| 25* | chrXV:159827_A/G | |
| 26* | chrXV:177015_A/G | |
| 27* | chrXV:520870_G/A | RAS1, PIN2, |
| 28* | chrXV:552069_C/T | |
| 29 | chrXV:610420_C/A | SPP2, RPB2, PNO1, ELG1, MDM32 |