| Literature DB >> 25887316 |
Juan P Steibel1, Heng Wang2,3, Ping-Shou Zhong4.
Abstract
BACKGROUND: Allelic specific expression (ASE) increases our understanding of the genetic control of gene expression and its links to phenotypic variation. ASE testing is implemented through binomial or beta-binomial tests of sequence read counts of alternative alleles at a cSNP of interest in heterozygous individuals. This requires prior ascertainment of the cSNP genotypes for all individuals. To meet the needs, we propose hidden Markov methods to call SNPs from next generation RNA sequence data when ASE possibly exists.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25887316 PMCID: PMC4351697 DOI: 10.1186/s12859-015-0479-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
EFDR (homozygotes called heterozygotes) in blocks of 10 SNP using HMM-ASE and HMM-NASE
|
|
| |||
|---|---|---|---|---|
|
|
|
| ||
| (8,0.5) | ASE | 0.0167 (0.017) | 0.0117 (0.010) | 0.0308 (0.020) |
| NASE | 0.0033 (0.007) | 0.0050 (0.006) | 0.0071 (0.004) | |
| (8,0.8) | ASE | 0.0000 (0.000) | 0.0150 (0.007) | 0.0133 (0.005) |
| NASE | 0.0017 (0.005) | 0.0050 (0.006) | 0.0029 (0.004) | |
| (16,0.5) | ASE | 0.0100 (0.009) | 0.0000 (0.000) | 0.0058 (0.004) |
| NASE | 0.0017 (0.005) | 0.0008 (0.003) | 0.0000 (0.000) | |
| (16,0.8) | ASE | 0.0033 (0.008) | 0.0017 (0.004) | 0.0033 (0.004) |
| NASE | 0.0000 (0.000) | 0.0008 (0.003) | 0.0004 (0.001) | |
| (24,0.5) | ASE | 0.0000 (0.000) | 0.0033 (0.005) | 0.0008 (0.002) |
| NASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) | |
| (24,0.8) | ASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0033 (0.004) |
| NASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) | |
| (32,0.5) | ASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0058 (0.006) |
| NASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) | |
| (32,0.8) | ASE | 0.0000 (0.000) | 0.0017 (0.004) | 0.0000 (0.000) |
| NASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) | |
| (56,0.5) | ASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0008 (0.002) |
| NASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) | |
| (56,0.8) | ASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| NASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) | |
Standard deviations are included in parenthesis.
EFNR (heterozygotes called homozygotes) in blocks of 10 SNP using HMM-ASE and HMM-NASE
|
|
| |||
|---|---|---|---|---|
|
|
|
| ||
| (8,0.5) | ASE | 0.0367 (0.018) | 0.0217 (0.013) | 0.0242 (0.028) |
| NASE | 0.0833 (0.038) | 0.0733 (0.019) | 0.0675 (0.015) | |
| (8,0.8) | ASE | 0.0100 (0.009) | 0.0033 (0.005) | 0.0075 (0.005) |
| NASE | 0.0450 (0.046) | 0.0533 (0.026) | 0.0700 (0.020) | |
| (16,0.5) | ASE | 0.0133 (0.014) | 0.0067 (0.004) | 0.0075 (0.004) |
| NASE | 0.0567 (0.036) | 0.0433 (0.025) | 0.0479 (0.016) | |
| (16,0.8) | ASE | 0.0067 (0.015) | 0.0067 (0.011) | 0.0033 (0.004) |
| NASE | 0.0467 (0.046) | 0.0367 (0.023) | 0.0396 (0.011) | |
| (24,0.5) | ASE | 0.0033 (0.008) | 0.0000 (0.000) | 0.0017 (0.002) |
| NASE | 0.0233 (0.025) | 0.0308 (0.020) | 0.0379 (0.014) | |
| (24,0.8) | ASE | 0.0033 (0.008) | 0.0000 (0.000) | 0.0042 (0.007) |
| NASE | 0.0383 (0.022) | 0.0392 (0.026) | 0.0296 (0.016) | |
| (32,0.5) | ASE | 0.0000 (0.000) | 0.0033 (0.005) | 0.0000 (0.000) |
| NASE | 0.0300 (0.019) | 0.0375 (0.021) | 0.0396 (0.019) | |
| (32,0.8) | ASE | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| NASE | 0.0417 (0.043) | 0.0208 (0.013) | 0.0346 (0.018) | |
| (56,0.5) | ASE | 0.0033 (0.008) | 0.0000 (0.000) | 0.0000 (0.000) |
| NASE | 0.0233 (0.014) | 0.0275 (0.013) | 0.0288 (0.009) | |
| (56,0.8) | ASE | 0.0033 (0.008) | 0.0000 (0.000) | 0.0000 (0.000) |
| NASE | 0.0200 (0.019) | 0.0200 (0.015) | 0.0196 (0.009) | |
Standard deviations are included in parentheses.
Figure 1A comparison of the EFDR and EFNR for the HMM-ASE vs HMM-NASE for with LD= and 0.8.
The EFDR (homozygous called heterozygous) and EFNR (heterozygous called homozygous) with 100 SNPs for HMM-ASE
|
|
| ||
|---|---|---|---|
|
|
|
| |
|
| |||
| (8,0.5) | 0.0163 (0.009) | 0.0163 (0.003) | 0.0230 (0.004) |
| (8,0.8) | 0.0067 (0.003) | 0.0050 (0.002) | 0.0068 (0.003) |
| (16,0.5) | 0.0050 (0.002) | 0.0053 (0.001) | 0.0068 (0.001) |
| (16,0.8) | 0.0033 (0.002) | 0.0025 (0.001) | 0.0030 (0.001) |
| (24,0.5) | 0.0027 (0.002) | 0.0017 (0.001) | 0.0030 (0.002) |
| (24,0.8) | 0.0017 (0.002) | 0.0002 (0.000) | 0.0009 (0.001) |
| (32,0.5) | 0.0017 (0.002) | 0.0008 (0.001) | 0.0008 (0.001) |
| (32,0.8) | 0.0007 (0.001) | 0.0008 (0.001) | 0.0008 (0.000) |
| (56,0.5) | 0.0007 (0.001) | 0.0000 (0.000) | 0.0008 (0.000) |
| (56,0.8) | 0.0003 (0.001) | 0.0002 (0.000) | 0.0002 (0.000) |
|
| |||
| (8,0.5) | 0.0230 (0.003) | 0.0218 (0.007) | 0.0138 (0.003) |
| (8,0.8) | 0.0113 (0.005) | 0.0088 (0.004) | 0.0071 (0.003) |
| (16,0.5) | 0.0080 (0.004) | 0.0063 (0.003) | 0.0047 (0.002) |
| (16,0.8) | 0.0033 (0.001) | 0.0032 (0.002) | 0.0028 (0.001) |
| (24,0.5) | 0.0017 (0.002) | 0.0028 (0.002) | 0.0022 (0.001) |
| (24,0.8) | 0.0020 (0.001) | 0.0017 (0.001) | 0.0010 (0.001) |
| (32,0.5) | 0.0013 (0.001) | 0.0005 (0.001) | 0.0011 (0.001) |
| (32,0.8) | 0.0003 (0.001) | 0.0005 (0.001) | 0.0004 (0.000) |
| (56,0.5) | 0.0003 (0.001) | 0.0005 (0.001) | 0.0001 (0.000) |
| (56,0.8) | 0.0003 (0.001) | 0.0002 (0.000) | 0.0000 (0.000) |
Their standard deviations are shown in the parentheses.
The EFDR (homozygous called heterozygous) and EFNR (heterozygous called homozygous) with 100 SNPs for HMM-NASE
|
|
| ||
|---|---|---|---|
|
|
|
| |
|
| |||
| (8,0.5) | 0.0053 (0.005) | 0.0035 (0.001) | 0.0045 (0.001) |
| (8,0.8) | 0.0025 (0.002) | 0.0015 (0.001) | 0.0020 (0.001) |
| (16,0.5) | 0.0005 (0.001) | 0.0003 (0.001) | 0.0005 (0.000) |
| (16,0.8) | 0.0002 (0.001) | 0.0002 (0.000) | 0.0001 (0.000) |
| (24,0.5) | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| (24,0.8) | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| (32,0.5) | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| (32,0.8) | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| (56,0.5) | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
| (56,0.8) | 0.0000 (0.000) | 0.0000 (0.000) | 0.0000 (0.000) |
|
| |||
| (8,0.5) | 0.0728 (0.015) | 0.0740 (0.009) | 0.0701 (0.005) |
| (8,0.8) | 0.0533 (0.011) | 0.0498 (0.014) | 0.0520 (0.005) |
| (16,0.5) | 0.0557 (0.012) | 0.0466 (0.009) | 0.0467 (0.005) |
| (16,0.8) | 0.0358 (0.010) | 0.0364 (0.008) | 0.0401 (0.009) |
| (24,0.5) | 0.0412 (0.011) | 0.0398 (0.007) | 0.0407 (0.003) |
| (24,0.8) | 0.0347 (0.006) | 0.0298 (0.008) | 0.0305 (0.004) |
| (32,0.5) | 0.0280 (0.007) | 0.0338 (0.004) | 0.0313 (0.005) |
| (32,0.8) | 0.0292 (0.010) | 0.0304 (0.007) | 0.0285 (0.003) |
| (56,0.5) | 0.0257 (0.007) | 0.0268 (0.004) | 0.0265 (0.003) |
| (56,0.8) | 0.0203 (0.008) | 0.0218 (0.003) | 0.0213 (0.002) |
Their standard deviations shown in the parenthesis.
The EFDR (homozygous called heterozygous) and EFNR (heterozygous called homozygous) with 100 SNPs using the haplotype structures from real data
|
|
| |||
|---|---|---|---|---|
|
|
|
| ||
| EFDR | ||||
| 8 | ASE | 0.0784 (0.0234) | 0.0729 (0.0093) | 0.0752 (0.0089) |
| NASE | 0.0235 (0.0150) | 0.0276 (0.0119) | 0.0267 (0.0111) | |
| 16 | ASE | 0.0328 (0.0099) | 0.0338 (0.0088) | 0.0331 (0.0086) |
| NASE | 0.0016 (0.0027) | 0.0028 (0.0036) | 0.0019 (0.0017) | |
| 24 | ASE | 0.0153 (0.0070) | 0.0137 (0.0064) | 0.0164 (0.0054) |
| NASE | 0.0012 (0.0026) | 0.0008 (0.0013) | 0.0004 (0.0006) | |
| 32 | ASE | 0.0118 (0.0087) | 0.0110 (0.0031) | 0.0076 (0.0033) |
| NASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0001 (0.0003) | |
| 56 | ASE | 0.0022 (0.0033) | 0.0030 (0.0028) | 0.0019 (0.0011) |
| NASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) | |
|
| ||||
| 8 | ASE | 0.0418 (0.0169) | 0.0479 (0.0129) | 0.0502 (0.0098) |
| NASE | 0.2161 (0.0533) | 0.2084 (0.0434) | 0.2075 (0.0410) | |
| 16 | ASE | 0.0239 (0.0095) | 0.0214 (0.0109) | 0.0211 (0.0056) |
| NASE | 0.1888 (0.0517) | 0.1986 (0.0455) | 0.1762 (0.0360) | |
| 24 | ASE | 0.0161 (0.0077) | 0.0164 (0.0093) | 0.0143 (0.0032) |
| NASE | 0.1523 (0.0584) | 0.1788 (0.0503) | 0.1577 (0.0246) | |
| 32 | ASE | 0.0060 (0.0071) | 0.0079 (0.0063) | 0.0088 (0.0036) |
| NASE | 0.1434 (0.0839) | 0.1367 (0.0507) | 0.1368 (0.0227) | |
| 56 | ASE | 0.0040 (0.0057) | 0.0031 (0.0028) | 0.0025 (0.0012) |
| NASE | 0.1339 (0.1005) | 0.1178 (0.0348) | 0.1332 (0.0287) | |
Their standard deviations shown in the parenthesis.
The EFDR (homozygous called heterozygous) and EFNR (heterozygous called homozygous) with 100 SNPs using the haplotype structures from real data
|
|
| |||
|---|---|---|---|---|
|
|
|
| ||
|
| ||||
| 8 | ASE | 0.0222 (0.0100) | 0.0264 (0.0091) | 0.0243 (0.0038) |
| NASE | 0.0250 (0.0064) | 0.0313 (0.0103) | 0.0297 (0.0060) | |
| 16 | ASE | 0.0026 (0.0032) | 0.0023 (0.0020) | 0.0043 (0.0033) |
| NASE | 0.0023 (0.0025) | 0.0023 (0.0027) | 0.0040 (0.0033) | |
| 24 | ASE | 0.0006 (0.0013) | 0.0003 (0.0008) | 0.0006 (0.0005) |
| NASE | 0.0006 (0.0013) | 0.0000 (0.0000) | 0.0003 (0.0005) | |
| 32 | ASE | 0.0000 (0.0000) | 0.0002 (0.0006) | 0.0004 (0.0005) |
| NASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) | |
| 56 | ASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) |
| NASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) | |
|
| ||||
| 8 | ASE | 0.0375 (0.0224) | 0.0295 (0.0090) | 0.0259 (0.0072) |
| NASE | 0.0322 (0.0193) | 0.0251 (0.0080) | 0.0219 (0.0048) | |
| 16 | ASE | 0.0028 (0.0032) | 0.0030 (0.0024) | 0.0033 (0.0020) |
| NASE | 0.0037 (0.0043) | 0.0027 (0.0021) | 0.0028 (0.0019) | |
| 24 | ASE | 0.0002 (0.0008) | 0.0000 (0.0000) | 0.0006 (0.0006) |
| NASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0006 (0.0007) | |
| 32 | ASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) |
| NASE | 0.0003 (0.0011) | 0.0002 (0.0005) | 0.0000 (0.0000) | |
| 56 | ASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) |
| NASE | 0.0000 (0.0000) | 0.0000 (0.0000) | 0.0000 (0.0000) | |
The data were generated without ASE, namely, all the heterozygous genes have ASE ratio 0.5. Their standard deviations shown in the parenthesis.
Contingency tables of genotype calling with five methods (columns) versus actual genotypes (rows)
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
| genotype | He | Ho | NC | He | Ho | NC | He | Ho | NC |
| (Reads =0) | (Reads =0) | ||||||||
| He | 449 | 78 | 64 |
| 0 | 20 |
| 1 | 20 |
| Ho | 0 | 886 | 83 | 9 |
| 46 | 2 |
| 46 |
|
|
|
| |||||||
| genotype | He | Ho | NC | He | Ho | NC | |||
| He |
| 9 | 6 |
| 8 | 0 | |||
| Ho | 4 |
| 8 | 6 |
| 0 | |||
Values in bold represent counts of correct calls. The other values are incorrect calls or Non-called (NC).
Figure 2Sensitivity (proportion of Heterozygotes genotypes called by each program) as a function of total read count.
Figure 3Estimated minor allele frequency using genotypes from DNA chip or from called cSNP.
Figure 4log-ASE ratio of fitting beta-binomial model on counts from Heterozygous individuals. Heterozygous status was either taken from chip data (x-axis) or from genotypes called from sequence data (y-axis).
Figure 5(p-value) of fitting beta-binomial model on counts from Heterozygous individuals. Heterozygous status was either taken from chip data (x-axis) or from genotypes called from sequence data (y-axis).