| Literature DB >> 23968248 |
Ryan M Smith1, Amy Webb, Audrey C Papp, Leslie C Newman, Samuel K Handelman, Adam Suhy, Roshan Mascarenhas, John Oberdick, Wolfgang Sadee.
Abstract
BACKGROUND: Measuring allelic RNA expression ratios is a powerful approach for detecting cis-acting regulatory variants, RNA editing, loss of heterozygosity in cancer, copy number variation, and allele-specific epigenetic gene silencing. Whole transcriptome RNA sequencing (RNA-Seq) has emerged as a genome-wide tool for identifying allelic expression imbalance (AEI), but numerous factors bias allelic RNA ratio measurements. Here, we compare RNA-Seq allelic ratios measured in nine different human brain regions with a highly sensitive and accurate SNaPshot measure of allelic RNA ratios, identifying factors affecting reliable allelic ratio measurement. Accounting for these factors, we subsequently surveyed the variability of RNA editing across brain regions and across individuals.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23968248 PMCID: PMC3765493 DOI: 10.1186/1471-2164-14-571
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Tissue characteristics and mapping statistics for 9 brain regions
| BA10 (frontopolar cortex) | 5.9 | 36,449,577 | 30,223,143 (83%) | 30,222,857 (83%) | 30,314,666 (83%) |
| BA22 (superior temporal cortex) | 7.1 | 34,975,593 | 29,506,035 (84%) | 29,505,770 (84%) | 29,594,225 (84%) |
| BA24 (anterior cingulate cortex) | 6.5 | 43,992,783 | 36,562,760 (83%) | 36,562,530 (83%) | 36,698,983 (83%) |
| Insular cortex | 5.0 | 38,872,288 | 32,777,310 (84%) | 32,777,059 (84%) | 32,879,072 (84%) |
| Amygdala | 6.9 | 41,172,320 | 34,304,449 (83%) | 34,304,246 (83%) | 34,398,572 (83%) |
| Hippocampus | 6.7 | 48,277,234 | 40,944,589 (85%) | 40,944,279 (85%) | 41,012,249 (85%) |
| Putamen | 6.8 | 30,881,648 | 25,861,872 (84%) | 25,861,690 (84%) | 25,958,094 (84%) |
| Cerebellum | 6.9 | 39,047,957 | 32,918,344 (84%) | 32,918,093 (84%) | 32,938,662 (84%) |
| Raphe nuclei | 6.9 | 41,231,369 | 33,963,780 (82%) | 33,963,585 (82%) | 34,091,575 (83%) |
aRIN RNA Integrity Number, measured with Agilent Bioanalyzer 2100.
bAllele-switching only conducted on a limited number of SNPs listed in Additional file 1: Table S1.
Figure 1Comparison of allelic RNA expression ratios for standard versus alternative mapping methods. On average, the IUPAC and Allele-Switching mapping methods resulted in similar reduction of allelic RNA expression ratios (solid and dashed lines, respectively) as compared to standard mapping.
Figure 2Gene-wise allelic RNA expression ratio comparisons measured by RNA-Seq (Standard, IUPAC, or Allele-Switch) versus SNaPshot (NuGen or GSP). The two alternative mapping methods (IUPAC and Allele-Switch) were similar and more highly correlated with SNaPshot allelic ratio measures for both NuGen cDNA (A) and GSP cDNA (B), whereas Standard mapping was much less correlated (dotted line).
Figure 3Standard versus IUPAC-aligned allelic RNA expression ratios in all 9 brain regions. IUPAC alignment systematically reduces allelic ratios by approximately 0.3-fold per fold change in Standard alignment allelic ratios (i.e. a 2-fold Standard allelic ratio corresponds to a 1.4-fold IUPAC allelic ratio). Inset: Following IUPAC alignment, fewer SNPs are above any given allelic ratio threshold compared to Standard alignment, suggesting uniform reduction of allelic ratios across the spectrum of allelic values.
Tissue characteristics and mapping statistics for 14 dorsolateral prefrontal cortex samples
| DLPFC1 | 32 | AA | M | + | + | 18 | 7.3 | SOLiD 5500 | 94,129,888 | 71,736,257 (76%) |
| DLPFC2 | 49 | C/H | M | + | + | 15 | 6.6 | SOLiD 5500 | 98,880,442 | 73,446,303 (74%) |
| DLPFC3d | 20 | AA | M | - | - | 12 | 6.7 | SOLiD 5500 | 92,458,538 | 69,487,318 (75%) |
| DLPFC4 | 34 | C | M | - | + | 11.5 | 6.8 | SOLiD 5500 | 243,400,656 | 104,099,874 (43%) |
| DLPFC5 | 39 | C | M | + | - | 12 | 8.2 | SOLiD 4 | 49,940,133 | 37,200,670 (74%) |
| DLPFC6 | 25 | AA | M | + | - | 20.5 | 8.2 | SOLiD 4 | 48,399,474 | 37,741,267 (78%) |
| DLPFC7 | 42 | C | F | + | - | 14 | 7.1 | SOLiD 4 | 46,956,017 | 35,661,219 (76%) |
| DLPFC8 | 28 | C | M | - | - | 11 | 9.1 | SOLiD 4 | 53,947,324 | 39,909,693 (74%) |
| DLPFC9 | 35 | C | M | - | - | 12 | 8.3 | SOLiD 4 | 60,022,719 | 44,510,882 (74%) |
| DLPFC10 | 28 | C | M | + | - | 8 | 8.1 | SOLiD 4 | 258,721,328 | 200,550,177 (78%) |
| DLPFC11 | 35 | C | F | + | + | 11 | 6.4 | SOLiD 4 | 48,215,511 | 37,349,598 (77%) |
| DLPFC12 | 39 | C | M | - | + | 11 | 7.7 | SOLiD 4 | 290,364,386 | 227,982,821 (79%) |
| DLPFC13 | 34 | C | M | - | - | 22 | 7.3 | SOLiD 4 | 94,036,538 | 56,435,396 (60%) |
| DLPFC14 | 32 | B/H | F | - | - | 16 | 9.4 | SOLiD 4 | 48,174,311 | 37,197,681 (77%) |
aRace/Ethnicity: AA African American, C Caucasian, H Hispanic.
bPMI Post-mortem interval, in hours.
cRIN RNA Integrity Number, measured by Agilent Bioanalyzer 2100.
dDLPFC originates from same sample as the 9 brain regions in Table 1.
Genes with AEI ≥1.5-fold following stringent correction
| 6 | 2 | 1.69 | 2.46 | protein-coding mRNA | rs9645690 (5.94 × 10-3) | |
| 5 | 2 | 1.62 | 1.70 | pseudogene associated | rs12779247 (8.53 × 10-8)* | |
| 8 | 2 | 1.72 | 2.20 | protein-coding mRNA | rs2668193 (2.77 × 10-3) | |
| 3 | 2 | 1.50 | 2.00 | protein-coding mRNA | rs4932145 (4.61 × 10-9)* | |
| 4 | 2 | 1.57 | 1.97 | protein-coding mRNA | rs8136336 (1.03 × 10-3) | |
| 8 | 2 | 1.96 | 2.16 | protein-coding mRNA | rs3753036 (2.31 × 10-3) | |
| 4 | 2 | 1.66 | 2.06 | protein-coding mRNA | rs684856 (3.79 × 10-4)* | |
| 6 | 2 | 1.54 | 2.27 | protein-coding mRNA | rs10849038 (4.98 × 10-3) | |
| 6 | 2 | 1.53 | 1.61 | protein-coding mRNA | rs2374735 (2.83 × 10-18)* | |
| 9 | 2 | 1.59 | 1.60 | pseudogene associated | rs17477326 (5.35 × 10-4)* | |
| 6 | 2 | 1.67 | 1.67 | pseudogene associated | rs12470730 (1.67 × 10-2) | |
| 8 | 2 | 1.93 | 2.01 | protein-coding mRNA | rs11071185 (8.14 × 10-4)* | |
| 5 | 2 | 1.61 | 1.77 | pseudogene associated | rs606490 (5.66 × 10-4)* | |
| 7 | 2 | 1.67 | 1.70 | protein-coding mRNA | rs7542414 (1.29 × 10-3) | |
| 7 | 2 | 1.85 | 1.85 | protein-coding mRNA | rs11866002 (4.24 × 10-7)* | |
| 6 | 2 | 1.57 | 1.72 | homology with DOCK3 | rs29465 (4.47 × 10-3) | |
| 3 | 2 | 1.70 | 2.00 | protein-coding mRNA | rs352602 (1.04 × 10-4)* | |
| 5 | 2 | 1.62 | 1.68 | homology with other GABRB family members | N/A | |
| 8 | 2 | 2.22 | 2.29 | protein-coding mRNA | rs13029532 (4.23 × 10-2) | |
| 10 | 3 | 1.51 | 2.70 | ambiguity in genome assembly | rs11761839 (1.61 × 10-2) | |
| 10 | 3 | 1.67 | 2.67 | protein-coding mRNA | rs9873910 (1.43 × 10-4)* | |
| 2 | 2 | 1.70 | 2.39 | protein-coding mRNA | rs12940454 (1.89 × 10-3) | |
| 4 | 2 | 1.52 | 1.64 | protein-coding mRNA | N/A | |
| 4 | 2 | 1.83 | 2.16 | protein-coding mRNA | rs2249950 (9.68 × 10-3) | |
| 4 | 2 | 1.62 | 2.50 | long intergenic ncRNA | N/A | |
| 10 | 2 | 1.77 | 2.10 | protein-coding mRNA | rs1323565 (1.91 × 10-4)* | |
| 3 | 2 | 2.01 | 2.18 | pseudogene associated | N/A | |
| 7 | 2 | 1.54 | 1.79 | protein-coding mRNA | rs1927405 (1.01 × 10-2) | |
| 2 | 2 | 1.59 | 3.09 | pseudogene associated | rs1543116 (4.88 × 10-3) | |
| 3 | 2 | 1.56 | 3.21 | pseudogene associated | rs17377643 (9.41 × 10-3) | |
| 9 | 2 | 1.53 | 2.00 | protein-coding mRNA | rs10483610 (3.61 × 10-3) | |
| 2 | 2 | 1.72 | 1.75 | pseudogene associated | rs1127231 (2.02x10-3) | |
| 10 | 2 | 1.65 | 2.22 | protein-coding mRNA | rs10840166 (1.17 × 10-3) | |
| 5 | 2 | 1.71 | 1.98 | protein-coding mRNA | N/A | |
| 5 | 2 | 1.58 | 2.62 | protein-coding mRNA | N/A | |
| 10 | 2 | 1.61 | 1.66 | pseudogene associated | rs2002273 (2.18 × 10-1) | |
| 2 | 2 | 2.87 | 3.23 | imprinted and high homology with multiple genes and known RNA editing | rs2732020 (4.86 × 10-3) | |
| 9 | 3 | 2.54 | 5.98 | pseudogene associated | N/A | |
| 4 | 2 | 1.70 | 2.00 | protein-coding mRNA | rs11139519 (4.86 × 10-2) | |
| 4 | 2 | 1.70 | 2.28 | protein-coding mRNA | rs2545680 (8.60 × 10-14)* | |
| 4 | 2 | 1.52 | 1.59 | pseudogene associated | rs9951407 (1.18 × 10-2) | |
| 2 | 2 | 1.67 | 2.12 | protein-coding mRNA | rs2484639 (8.14 × 10-6)* | |
| 14 | 5 | 4.42 | 10.24 | pseudogene associated | N/A | |
| 3 | 2 | 1.61 | 1.62 | protein-coding mRNA | rs1979736 (4.36 × 10-4)* | |
| 3 | 3 | 1.79 | 2.40 | protein-coding mRNA | rs4280427 (2.08 × 10-2) | |
| 7 | 2 | 1.61 | 1.68 | protein-coding mRNA | rs7894516 (6.53 × 10-2) | |
| 3 | 2 | 1.50 | 1.52 | protein-coding mRNA and ncRNA | rs700085 (1.50 × 10-2) | |
| 4 | 2 | 1.81 | 1.87 | protein-coding mRNA | rs1954713 (5.07 × 10-2) | |
| 5 | 2 | 1.61 | 1.93 | pseudogene associated | rs6765337 (5.48 × 10-3) | |
| 7 | 2 | 1.51 | 2.25 | protein-coding mRNA | rs12912505 (1.03 × 10-1) | |
| 9 | 2 | 1.51 | 1.54 | pseudogene associated | rs11869614 (1.03 × 10-1) | |
| 3 | 2 | 1.53 | 1.92 | pseudogene associated | rs730050 (1.53 × 10-2) |
aCorrection was performed by subtracting 2 standard deviations of the within-gene allelic ratio from the uncorrected allelic ratio.
bCorrected allelic ratio values reported.
cPseudogene associations noted from NCBI Gene database. Homology noted where >50% coverage and >50% similarity with other annotated transcripts by Basic Local Alignment Search Tool (BLAST).
dMost significantly-associated SNP with any probe for the corresponding gene reported in BrainCloud [1]. N/A = no data available for gene. * = significant cis-eQTL association.
RNA editing across brain regions and across individuals
| chr2:37327702 | intergenic | 8 | 11.4 | 59.8 (8.0) | 8 | 22.2 | 32.0 (8.7) | |
| chr8:11702542 | 3′UTR | 10 | 16.8 | 99.4 (0.6) | 8 | 11.0 | 87.3 (4.0) | |
| chrX:73499965 | exonic (ncRNA) | 9 | 62 | 31.8 (2.8) | 10 | 116.3 | 27.1 (2.2) | |
| chr4:158257875 | exonic (non-synonymous) | 10 | 32.3 | 88.9 (4.6) | 13 | 56.0 | 97.5 (0.6) | |
| chr4:158257879 | exonic (synonymous) | 8 | 35.9 | 21.0 (2.7) | 8 | 75.5 | 16.4 (0.8) | |
| chr6:102337689 | exonic (non-synonymous) | 10 | 19.2 | 44.5 (3.9) | 13 | 18.9 | 37.2 (3.8) | |
| chr6:102337702 | exonic (non-synonymous) | 9 | 18.2 | 62.0 (3.7) | 13 | 17.1 | 63.9 (5.7) | |
| chr17:22021971 | intergenic | 10 | 1595.7 | 98.0 (0.1) | 10 | 1796.7 | 96.9 (0.5) | |
| chr15:25227816 | exonic (ncRNA) | 9 | 11.6 | 67.8 (8.2) | 13 | 29.6 | 79.4 (2.0) | |
| chr15:25227838 | exonic (ncRNA) | 7 | 12.1 | 43.5 (5.3) | 11 | 26.1 | 35.4 (2.6) | |
| chr15:25227854 | exonic (ncRNA) | 5 | 14.4 | 25.4 (5.6) | 8 | 26.6 | 22.1 (2.8) | |
| chr9:131071533 | 3′UTR | 5 | 11.2 | 54.6 (9.9) | 11 | 15.1 | 59.8 (4.6) | |
aAccording to GRCh37/hg19.
b10 regions from a single individual.
cOnly the DLPFC from 13 different individuals.