| Literature DB >> 26597164 |
Amy Webb1,2, Audrey C Papp3, Amanda Curtis4, Leslie C Newman5, Maciej Pietrzak6,7, Michal Seweryn8, Samuel K Handelman9, Grzegorz A Rempala10, Daqing Wang11, Erica Graziosa12, Rachel F Tyndale13, Caryn Lerman14, John R Kelsoe15,16, Deborah C Mash17, Wolfgang Sadee18,19,20,21.
Abstract
BACKGROUND: We used RNA sequencing to analyze transcript profiles of ten autopsy brain regions from ten subjects. RNA sequencing techniques were designed to detect both coding and non-coding RNA, splice isoform composition, and allelic expression. Brain regions were selected from five subjects with a documented history of smoking and five non-smokers. Paired-end RNA sequencing was performed on SOLiD instruments to a depth of >40 million reads, using linearly amplified, ribosomally depleted RNA. Sequencing libraries were prepared with both poly-dT and random hexamer primers to detect all RNA classes, including long non-coding (lncRNA), intronic and intergenic transcripts, and transcripts lacking poly-A tails, providing additional data not previously available. The study was designed to generate a database of the complete transcriptomes in brain region for gene network analyses and discovery of regulatory variants.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26597164 PMCID: PMC4657279 DOI: 10.1186/s12864-015-2207-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Read alignment across genomic regions. Presents the percentage of aligned reads falling within genomic regions of different types--exonic, intronic, and intergenic as annotated by GENCODE and lncipedia; any reads aligning to the mitochondrial chromosome; and ribosomal reads filtered by during alignment (18S, 28S, and 5.8S only)
Fig. 2Brain region specificity of RNA classes. Presented is the percentage of different RNA types, as annotated in GENCODE/lncipedia, detectable across brain regions. Detectability is defined as FPKM > 2 in 2+ samples. This includes 10,680 protein coding genes, 242 pseudogenes, and 945 noncoding. A higher percentage of protein coding RNAs are detectable across all 10 regions compared to non-coding RNAs and pseudogenes
Fig. 3Number of detectable RNAs at different FPKM cutoffs. The average number of detectable protein coding and non-coding RNAs is shown at different expression cutoff levels. This illustrates the working pool of RNAs available depending of expression cutoff
Differentially expression between tissues
| BA22 | BA24 | BA46 | Insula | Amygdala | Hippocampus | Post. putamen | Cerebellum | Raphae | |
|---|---|---|---|---|---|---|---|---|---|
| 4 | 11 | 872 | 114 | 1281 | 1612 | 1801 | 4315 | 2876 | BA10 |
| 43 | 62 | 119 | 1761 | 1895 | 1891 | 3577 | 2791 | BA22 | |
| 106 | 0 | 1123 | 1655 | 2252 | 4778 | 3548 | BA24 | ||
| 342 | 1114 | 1794 | 1193 | 2828 | 2176 | BA46 | |||
| 521 | 1322 | 1820 | 4415 | 3155 | Insula | ||||
| 251 | 797 | 3606 | 2256 | Amygdala | |||||
| 827 | 3206 | 1714 | Hippocampus | ||||||
| 0 | 402 | Post. putamen | |||||||
| 51 | Cerebellum |
The number of genes are listed found to be differentially expressed (FDR ≤ 0.05) between pairwise tissue comparisons. FDR correction was made for genes included in analysis based on detectability
Enriched GO terms for differentially expressed smoking related genes in BA46
| Gene | Gene information | GO term |
|---|---|---|
| APOLD1 | Apolipoprotein L domain containing 1 | Response to abiotic stimulus |
| ATP13A4 | ATPase type 13A4 | Transmembrane transport |
| ATP1A2 | ATPase, Na+/K+ transporting, alpha 2 polypeptide | Behavior / response to abiotic stimulus / transmembrane transport |
| CIART | Circadian associated repressor of transcription | Behavior |
| GJA1 | Gap junction protein, alpha 1, 43 kDa | Behavior / response to abiotic stimulus / transmembrane transport |
| GPR37 | G protein-coupled receptor 37 (endothelin receptor type B-like) | Behavior |
| HIF3A | Hypoxia inducible factor 3, alpha subunit | Response to abiotic stimulus |
| IGFBP7 | Insulin-like growth factor binding protein 7 | Response to abiotic stimulus |
| MLC1 | Megalencephalic leukoencephalopathy with subcortical cysts 1 | Response to abiotic stimulus / transmembrane transport |
| PLOD2 | Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 | Response to abiotic stimulus |
| PREX2 | Phosphatidylinositol-3,4,5-trisphosphate-dependent Rac exchange factor 2 | Behavior |
| PTN | Pleiotrophin | Behavior |
| S1PR1 | Sphingosine-1-phosphate receptor 1 | Behavior |
| SDC2 | Syndecan 2 | Response to abiotic stimulus |
| SDC4 | Syndecan 4 | Response to abiotic stimulus |
| SLC14A1 | Urea transporter,Kidd blood group | Transmembrane transport |
| SLC1A2 | Glial high affinity glutamate transporter | Behavior / response to abiotic stimulus / transmembrane transport |
| SLC1A3 | Glial high affinity glutamate transporter | Behavior / response to abiotic stimulus / transmembrane transport |
| SLC4A4 | Sodium bicarbonate cotransporter | Transmembrane transport |
| SLC5A11 | Sodium/inositol cotransporter | Transmembrane transport |
| SLC7A11 | Anionic amino acid transporter light chain, xc- system | Transmembrane transport |
| SLCO1C1 | Solute carrier organic anion transporter | Transmembrane transport |
| TP53BP2 | Tumor protein p53 binding protein 2 | Response to abiotic stimulus |
| VEGFA | Vascular endothelial growth factor A | Behavior / response to abiotic stimulus |
Two GO terms were enriched from genes with significant (FDR ≤ 0.1) differential expression between smokers and nonsmokers in BA46
Genes with the most isoform variation between brain regions
| Ranking | Gene | Isoform ranking score | Gencode ID | RNA type |
|---|---|---|---|---|
| 1 | GATS | 9.0 | ENSG00000160844.6 | Protein |
| 2 | NRXN3 | 9.0 | ENSG00000021645.13 | Protein |
| 3 | R3HDM1 | 9.0 | ENSG00000048991.12 | Protein |
| 4 | RTN4 | 9.0 | ENSG00000115310.13 | Protein |
| 5 | BRWD1 | 8.7 | ENSG00000185658.9 | Protein |
| 6 | MLIP | 8.7 | ENSG00000146147.10 | Protein |
| 7 | PART1 | 8.7 | ENSG00000152931.7 | Noncoding |
| 8 | PIK3R1 | 8.5 | ENSG00000145675.10 | Protein |
| 9 | POLR1D | 8.5 | ENSG00000186184.11 | Protein |
| 10 | PVRL3 | 8.3 | ENSG00000177707.6 | Protein |
| 11 | SEPT8 | 8.3 | ENSG00000164402.9 | Protein |
| 12 | CEP85L | 8.0 | ENSG00000111860.9 | Protein |
| 13 | DLG2 | 8.0 | ENSG00000150672.12 | Protein |
| 14 | ELMO1 | 8.0 | ENSG00000155849.11 | Protein |
| 15 | INPP5F | 8.0 | ENSG00000198825.7 | Protein |
| 16 | MAGI1 | 8.0 | ENSG00000151276.18 | Protein |
| 17 | SYNPO | 8.0 | ENSG00000171992.8 | Protein |
| 18 | WDR47 | 8.0 | ENSG00000085433.11 | Protein |
| 19 | KALRN | 7.8 | ENSG00000160145.10 | Protein |
| 20 | GAS7 | 7.7 | ENSG00000007237.13 | Protein |
| 21 | GPM6B | 7.7 | ENSG00000046653.10 | Protein |
| 22 | ANTXR1 | 7.5 | ENSG00000169604.15 | Protein |
| 23 | ATP5S | 7.5 | ENSG00000125375.10 | Protein |
| 24 | DGKG | 7.5 | ENSG00000058866.10 | Protein |
| 25 | DYNC1I2 | 7.5 | ENSG00000077380.11 | Protein |
| 26 | NTRK2 | 7.5 | ENSG00000148053.11 | Protein |
| 27 | PTER | 7.5 | ENSG00000165983.10 | Protein |
| 28 | SLC29A2 | 7.5 | ENSG00000174669.7 | Protein |
| 29 | WNK1 | 7.5 | ENSG00000060237.12 | Protein |
The top scoring genes are listed with variable isoform usage among brain regions. Order determined by comparing the average expression of each tissue to the average expression across all tissues and individuals. Genes with more tissue variation rank higher
Fig. 4NRXN3 isoform representation across brain region. NRXN3 was found to be a gene with extreme differences in isoform representation. The top panel shows 5 isoforms annotated by RefSeq and the middle panel focuses on the difference between the major isoforms (Dup0 and Dup5). The bottom panel shows the representation of all isoforms as a fraction of the whole gene expression, combining Dup1 and Dup2 into “minor isoforms”. Dup0 is the major isoform in all Broadmann’s areas, insula, amygdala, and hippocampus. Dup5 is the major isoform in cerebellum and raphae nuclei. Samples from the posterior putamen are mixed
Top 20 AEI per tissue
| Amygdala | BA10 | BA22 | BA24 | BA46 | Cerebellum | Hippocampus | Insula | Post. putamen | Raphae nucleus | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | SNHG14 | BCAP29 | C7orf41 | ATP6V1G2 | SLC8A1 | ERV3-1 | RP11-785H5.1 | CD24P4 | CCT5 | GSTA4 |
| 2 | STON2 | LMO7 | ZNF91 | ATP6V1G2-DDX39B | RCAN2 | RP11-862 L9.3 | RP11-785H5.2 | TTTY14 | lnc-SNURF-3 | NDRG3 |
| 3 | lnc-SNURF-3 | NHP2L1 | NHP2L1 | SYNJ1 | SPTBN1 | CTD-2353 F22.1 | DNAJA4 | PILRB | SNHG14 | RALGAPB |
| 4 | AFTPH | NUDT5 | ZNF391 | TXN2 | LPHN3 | NMNAT2 | PDE1A | AC005592.2 | PDXP | RBM26 |
| 5 | UBE3A | RAB21 | NGEF | PRPF8 | FAM212B | GPRIN3 | LANCL2 | FGF1 | SH3BP1 | GPRIN3 |
| 6 | PEG3 | UBE3A | SENP2 | FAM120A | PSD3 | CHN1 | DCP2 | MAP1LC3B | SPP1 | SOGA1 |
| 7 | ZIM2 | IL6ST | LRRC6 | NHP2L1 | AHSA1 | SEPT3 | C9orf72 | POLR3F | ZEB1 | NALCN |
| 8 | RP11-746 M1.1 | UFM1 | LPAR1 | YWHAB | SERINC1 | UBR3 | USP47 | RPL21P3 | PYGB | FUT9 |
| 9 | CELSR2 | PDGFRA | RPS20 | TAOK1 | ANKS1B | GAS7 | KIAA1549 | CCDC103 | YWHAG | RIMS2 |
| 10 | AP1S1 | ZEB1 | ENO4 | POMP | LINGO1 | PLK2 | NEFL | FAM187A | PBX3 | NHP2L1 |
| 11 | NHP2L1 | NEO1 | KIAA1598 | RP11-269G24.3 | ANK2 | EPB41L1 | PRRC2C | GFAP | MANBAL | NMNAT2 |
| 12 | OPA1 | TAF2 | NECAB2 | TANC2 | AL391357.1 | NEDD4L | STARD13 | ARHGAP32 | SV2B | SEPT3 |
| 13 | WDFY3 | AL391152.1 | FAM107A | IGFBP5 | DDOST | MAP2 | CROCCP3 | PCM1 | SETD6 | PPP3CA |
| 14 | MEF2A | CNGB3 | CYFIP1 | PPFIBP1 | PINK1 | YWHAG | EPHA7 | PDE8B | KIF5C | USP24 |
| 15 | LHFPL3 | CPNE3 | ARHGAP32 | SPHKAP | PINK1-AS | lnc-GALNT2-1 | AC010127.3 | NRXN3 | ECHS1 | RP11-981G7.1 |
| 16 | RPS6KA2 | ATF6 | SCP2 | MYO5A | CTSB | AL691479.1 | SCN1A | EIF2AK4 | QKI | GAS7 |
| 17 | CDC14B | NAV2-AS1 | ANKS1B | GAS7 | FMN2 | CASC7 | C1orf226 | PRDM2 | FAM13C | MAPK9 |
| 18 | PRICKLE2 | TNS3 | PPP1R12B | PRKCB | DST | MBP | RP11-565P22.6 | WDR41 | ACIN1 | RP11-463C8.4 |
| 19 | RP11-129B22.1 | ADNP2 | SLC6A1 | RP11-862 L9.3 | -- | NCL | AJAP1 | ARCN1 | FDFT1 | ZNF91 |
| 20 | ARHGAP32 | EXOC5 | APBA1 | PCDH9 | -- | MAP1A | SRPK2 | TPM1 | MOCS1 | APOL2 |
After restricting the list of genes with AEI to those with >30 reads per SNP averaged across gene, the list for each region is sorted by average AEI fold-change
Genes with AEI in 2 samples in a region
| Region | Gene | AEI ratio Avg ± S.D. | Avg SNPs per gene per sample | GTex eQTL | PheGenI eQTL | PheGenI association |
|---|---|---|---|---|---|---|
| BA10 | AKAP12 | 3.7 ± 0.6 | 6 | |||
| BA22 | LPAR1 | 3.3 ± 0.7 | 3 | Whole blood | Brain cerebellum | Alcoholism |
| BA46 | PSD3 | 3.4 ± 0.7 | 2 | Brain pons | Memory | |
| Cerebellum | GNAS | 2.8 ± 0.4 | 2 | Liver, lymphoblastoid | Brain waves | |
| Hippocampus | SRPK2 | 3.6 ± 0.7 | 2 | Whole blood, esophagus | Lymphoblastoid | |
| Raphaenucleus | PDE4DIP | 3.4 ± 0.8 | 3 |
Genes with allelic expression imbalance in 2 subjects in the same brain region after stringent filtering of 30 reads of coverage. Listed is the average and standard deviation of the AEI fold change for each gene and the average number of SNPs contributing to the AEI signal for each gene per sample. Genes were checked against public databases for eQTLs and SNP associations