| Literature DB >> 30610612 |
Silvia Zucchelli1,2, Stefania Fedele1, Paolo Vatta1,3, Raffaella Calligaris1,4, Peter Heutink5,6,7,8, Patrizia Rizzu5,9, Masayoshi Itoh7,8,10, Francesca Persichetti2, Claudio Santoro2, Hideya Kawaji7,8,10,11, Timo Lassmann7,8,12,13, Yoshihide Hayashizaki8,10, Piero Carninci7,8,14, Alistair R R Forrest7,8,15, Stefano Gustincich16,17.
Abstract
Natural antisense transcripts are common features of mammalian genes providing additional regulatory layers of gene expression. A comprehensive description of antisense transcription in loci associated to familial neurodegenerative diseases may identify key players in gene regulation and provide tools for manipulating gene expression. We take advantage of the FANTOM5 sequencing datasets that represent the largest collection to date of genome-wide promoter usage in almost 2000 human samples. Transcription start sites (TSSs) are mapped at high resolution by the use of a modified protocol of cap analysis of gene expression (CAGE) for high-throughput single molecule next-generation sequencing with Helicos (hCAGE). Here we present the analysis of antisense transcription at 17 loci associated to hereditary Alzheimer's disease, Frontotemporal Dementia, Parkinson's disease, Amyotrophic Lateral Sclerosis, and Huntington's disease. We focused our analysis on libraries derived from brain tissues and primary cells. We also screened libraries from total blood and blood cell populations in the quest for peripheral biomarkers of neurodegenerative diseases. We identified 63 robust promoters in antisense orientation to genes associated to familial neurodegeneration. When applying a less stringent cutoff, this number increases to over 400. A subset of these promoters represents alternative TSSs for 24 FANTOM5 annotated long noncoding RNA (lncRNA) genes, in antisense orientation to 13 of the loci analyzed here, while the remaining contribute to the expression of additional transcript variants. Intersection with GWAS studies, sample ontology, and dynamic expression reveals association to specific genetic traits as well as cell and tissue types, not limited to neurodegenerative diseases. Antisense transcription was validated for a subset of genes, including those encoding for Microtubule-Associated Protein Tau, α-synuclein, Parkinsonism-associated deglycase DJ-1, and Leucin-Rich Repeat Kinase 2. This work provides evidence for the existence of additional regulatory mechanisms of the expression of neurodegenerative disease-causing genes by previously not-annotated and/or not-validated antisense long noncoding RNAs.Entities:
Keywords: Antisense transcription; Long noncoding RNA; Neurodegenerative diseases
Year: 2019 PMID: 30610612 PMCID: PMC6614138 DOI: 10.1007/s12035-018-1465-2
Source DB: PubMed Journal: Mol Neurobiol ISSN: 0893-7648 Impact factor: 5.590
List of neurodegenerative disease-associated genes recruited for this study. Gene name, gene symbol, disease symbol, and inheritance are shown
| Gene name | Gene symbol | Disease | Inheritance |
|---|---|---|---|
| Amyloid beta precursor protein |
| AD | Autosomal dominant |
| Presenilin 1 |
| AD | Autosomal dominant |
| Presenilin 2 |
| AD | Autosomal dominant |
| Chromosome 9 open reading frame 72 |
| FTD | Autosomal dominant |
| Granulin precursor |
| FTD | Autosomal dominant |
| Microtubule-associated protein tau |
| FTD | Autosomal dominant |
| Synuclein alpha |
| PD | Autosomal dominant |
| Parkin RBR E3 ubiquitin protein ligase |
| PD | Autosomal recessive |
| PTEN-induced putative kinase 1 |
| PD | Autosomal recessive |
| Parkinsonism associated deglycase (DJ-1) |
| PD | Autosomal recessive |
| Leucine-rich repeat kinase 2 |
| PD | Autosomal dominant |
| VPS35, retromer complex component |
| PD | Autosomal dominant |
| FUS RNA-binding protein |
| ALS | Autosomal dominant |
| Superoxide dismutase 1 |
| ALS | Autosomal dominant |
| TAR DNA-binding protein |
| ALS | Autosomal dominant |
| Ubiquilin 2 |
| ALS | Autosomal dominant |
| Huntingtin |
| HD | Autosomal dominant |
AD, Alzheimer’s disease; FTD, frontotemporal dementia; PD, Parkinson’s disease; ALS, amyotrophic lateral sclerosis; HD, Huntington’s disease
Catalog of FANTOM5 promoters in loci associated to hereditary neurodegenerative diseases. For each gene (gene symbol), the total number of robust TSSs (TSS DPI robust) and CAT clusters (CAT CAGE cluster robust) is indicated. The total number of promoters identified in this study is also shown (at the bottom)
| Disease | Gene symbol | TSS DPI robust | CAT CAGE cluster robust |
|---|---|---|---|
| AD |
| 12 | 5 |
| AD |
| 5 | 2 |
| AD |
| 1 | 0 |
| FTD |
| 4 | 1 |
| FTD |
| 0 | 0 |
| FTD |
| 8 | 6 |
| PD |
| 10 | 8 |
| PD |
| 12 | 6 |
| PD |
| 2 | 6 |
| PD |
| 2 | 0 |
| PD |
| 4 | 4 |
| PD |
| 0 | 1 |
| ALS |
| 2 | 6 |
| ALS |
| 0 | 6 |
| ALS |
| 0 | 2 |
| ALS |
| 1 | 2 |
| HD |
| 0 | 1 |
| Total | 63 | 56 |
List of FANTOM CAT annotated antisense lncRNA genes identified in loci associated to hereditary neurodegenerative diseases. FANTOM CAT nomenclature (AS lncRNA gene) and annotation (AS lncRNA category) are indicated. TSS nomenclature used in this study is also included (AS TSS nomenclature)
| Disease | Gene name | FANTOM CAT | This work | |
|---|---|---|---|---|
| AS lncRNA gene | AS lncRNA category | AS TSS nomenclature | ||
| AD |
|
| p-lncRNA divergent | APP-5′AS |
| AD |
|
| Others-short ncRNA | APP-int2AS |
| AD |
|
| Others- lncRNA divergent | APP-int3AS |
| AD |
|
| Others-short ncRNA | APP-3′AS |
| AD |
|
| p-lncRNA divergent | PSEN1-5′AS |
| AD |
|
| Others-short ncRNA | PSEN1-intAS |
| FTD |
|
| p-lncRNA divergent | C9orf-5′AS (AS3)* |
| FTD |
|
| Others-uncertain coding | C9orf-3′AS |
| FTD |
|
| p-lncRNA divergent | MAPT-5′AS |
| FTD |
|
| Others-lncRNA antisense | MAPT-int1AS |
| FTD |
|
| Others-lncRNA antisense | MAPT-int2AS |
| FTD |
|
| e-lncRNA antisense | MAPT-int3AS |
| PD |
|
| Others-lncRNA divergent | SNCA-int1AS |
| PD |
|
| Others-short ncRNA | SNCA-int2AS |
| PD |
|
| e-lncRNA antisense | SNCA-3′AS2 |
| PD |
|
| e-lncRNA antisense | PRKN-int1AS |
| PD |
|
| Others-lncRNA antisense | PRKN-int2AS |
| PD |
|
| Others-lncRNA antisense | PRKN-int3AS |
| PD |
|
| e-lncRNA antisense | PRKN-int4AS |
| PD |
|
| Others-uncertain coding | PRKN-int5AS |
| PD |
|
| Others-lncRNA antisense | PINK1-AS* |
| PD |
|
| p-lncRNA divergent | LRRK2-5′AS |
| PD |
|
| Others-short ncRNA | VPS35-AS |
| ALS |
|
| p-lncRNA divergent | SOD1-5′AS |
| ALS |
|
| e-lncRNA antisense | SOD1-int2AS |
| ALS |
|
| p-lncRNA divergent | FUS-5′AS |
| ALS |
|
| e-lncRNA divergent | FUS-int1AS |
| ALS |
|
| e-lncRNA divergent | FUS-int2AS |
| ALS |
|
| p-lncRNA divergent | TARDBP-AS |
| ALS |
|
| p-lncRNA divergent | UBQLN2-int1AS |
| ALS |
|
| Others-short ncRNA | UBQLN2-int2AS |
| HD |
|
| p-lncRNA divergent | HTT-AS* |
*AS lncRNAs in the literature
Fig. 1FANTOM CAT annotation of antisense lncRNAs at loci of genes associated to familial forms of neurodegenerative diseases. a Numbers and categories of antisense lncRNAs based on FANTOM CAT annotation: in blue, promoter-associated lncRNAs (p-lncRNAs); in green, enhancer lncRNAs (e-lncRNAs); in yellow, unidentified classification of lncRNAs (others-lncRNAs). Gene symbols are shown on the y-axis. Numbers of FANTOM CAT antisense lncRNAs are indicated on the x-axis. b Distribution of FANTOM CAT categories of lncRNAs (color coding as in a across loci associated to neurodegenerative diseases. c Enrichment of GWAS and PICS SNPs at lncRNA genes that are located antisense to selected neurodegenerative diseases-associated genes using FANTOM CAT. Disease types are grouped in the indicated categories (brain, immune, metabolic, cardiovascular, and others). Number of disease traits is plot for each antisense lncRNA gene (x-axis at the bottom). Relative protein-coding gene (Gene) and associated causative neurodegenerative disease (Disease) are shown (y-axis). d Analysis of sample ontology enrichment using FANTOM CAT. Number of enriched sample ontologies is shown for each antisense lncRNA gene. Causative gene and neurodegenerative disease type are also indicated. Ontology types are grouped in the indicated categories (Nervous System, Immune System, Pancreas/Liver/Adipose Tissues, Cardiovascular, Others)
Fig. 2Antisense transcription in AD-associated APP locus. a Zenbu genome browser view of gene locus for human APP. Genes and transcripts are color-coded according to their orientation in the genome (+ strand, green; − strand, purple). Green arrows highlight tags of antisense transcription. Annotated UCSC transcripts are shown, with exon (thick lines) and intron (thin lines) boundaries. Antisense lncRNAs annotated in public databases are present within UCSC transcripts. FANTOM CAT functional annotation of coding and noncoding genes is shown. FANTOM5 promoters (robust DPI and robust CAT clusters) are indicated as small arrowheads, color-coded according to the direction of transcription. Arrows highlight APP antisense TSSs with their nomenclature. hCAGE data in infected monocyte-derived macrophages after infection (blood) and cerebrospinal fluid (brain) libraries are shown, where antisense expression is more evident. b Zoomed-in image of antisense transcription at the 5′ end of the APP locus. Monocyte-derived macrophages, medial frontal gyrus, and cerebellum libraries have been selected to better show alternative TSS usage. AP000230 antisense p-lnRNA gene is indicated. c Zoomed-in Zenbu genome browser view of APP 3′ region. FANTOM CAT annotation of antisense genes is shown. Antisense TSSs are indicated as in a and b
Fig. 3Antisense transcription in FTD-associated MAPT. a Zenbu genome browser view of gene locus for human MAPT. FANTOM CAT assembly indicates functional categories of antisense lncRNA genes. Annotated (UCSC transcripts) and newly identified (FANTOM CAT transcripts) transcripts in sense (green) and antisense (purple) orientation are shown. FANTOM5 promoters (F5 promoters) are indicated by green and purple arrowheads, according to their orientation on the genome. A series of FANTOM5 libraries was selected to highlight the complexity of antisense transcription at this locus and the selectivity of alternative TSS usage in different cell types. iPS cells at d0 and d18 of differentiation to neurons indicate dynamic regulation of MAPT-int3AS expression. b Zoomed-in image of MAPT locus at its 5′ end, with genomic position of primers used for validation. c PCR validation of MAPT-5′AS expression in human neuroblastoma cell line (SH-SY5Y) and in human blood (Blood)
Fig. 4Antisense transcription at SNCA gene locus. a Zenbu genome browser view of PD-associated SNCA locus. α-Synuclein is expressed from the minus strand of the genome (purple arrow). Annotated (UCSC) and FANTOM5-derived (FANTOM CAT) assembly of genes and transcripts is shown. A series of green arrows indicates antisense transcription in pooled FANTOM5 libraries (F5 ALL). b Zoomed-in image of the genomic region at the 5′ end of SNCA. Transcripts for α-synuclein (purple) and antisense lncRNAs (green) are indicated. hCAGE peaks indicate expression of SNCA-5′AS, ENST00000501215, and SNCA-int1AS (green arrows). Sets of primers (set 1 to set 5) used for PCR validation are shown. PCR amplicons encompassing the region of interest are shown at the bottom (black bars). c PCR validation of α-synuclein antisense transcripts in the blood. Validated annotated (ENST00000501215) and newly discovered antisense transcripts (SNCA-5′AS and SNCA-int1AS) are indicated at the bottom. d Zoomed view of FANTOM5 data at the 3′ of SNCA locus. FANTOM CAT antisense genes are highlighted in bold. Annotated α-synuclein transcripts (purple) and antisense lncRNAs (green) are shown. Tracks with hCAGE data are selected to show usage of alternative TSSs in different samples. e Overlay of single nucleotide polymorphisms (SNPs) reported for the SNCA with FANTOM CAT annotation of antisense lncRNAs in this region. Disease association is shown (PD, Parkinson’s disease; DLB, dementia with Lewy bodies; ND, not detected). Reference to the original study is indicated
Fig. 5Antisense transcription in LRRK2 and PARK7 loci associated to hereditary PD. a Zenbu genome browser view of hCAGE data in brain and blood libraries in PARK7 locus. No annotated transcripts and no FANTOM CAT antisense gene assembly are present at this gene. FANTOM5 promoters (F5 promoters) are indicated by green and purple arrowheads, according to their orientation on the genome. Purple arrows show tags for DJ1-5′AS and DJ1-intAS transcripts. Zoomed image show the magnification of AS TSSs and the genomic position of primers used for PCR validation (expected PCR fragment is indicated, black bars). b Validation of DJ1-5′AS expression in human neuroblastoma cells (SH-SY5Y) and in human whole blood (blood) by RT-PCR. Arrow indicates DJ1-5′AS-specific band (verified upon cloning and sequencing). c Antisense transcription in LLRK2 locus. LRRK2 protein-coding and antisense p-lncRNA (AC079630.4) genes are highlighted in bold. Annotated (UCSC) and FANTOM5 (FANTOM CAT) transcripts are shown. Zenbu track shows sense/antisense transcription in pooled FANTOM5 libraries (F5 ALL). Indent shows zoom-in of the 5′ region. Purple arrows highlight TSSs for LRRK-bAS, LRRK2-5′AS, and LRRK2-int1AS in selected brain and blood libraries. Sets of PCR products are shown in black. d Validation of LRRK2-5′AS expression in whole blood by RT-PCR. Primers used for validation and amplicons are indicated. White arrows indicate PCR bands corresponding to LRRK2-5′AS, as verified by cloning and sequencing
Fig. 6Quantitative expression of S/AS pairs for LRRK2, SNCA, MAPT, and PARK7/DJ-1 in human tissues. qRT-PCR was performed in duplicate on a panel of 20 human tissues derived from three independent subjects. Tissue types are indicated on top, genes on the right. Heat map graphical representation of rescaled normalized fold expression using the ΔΔCt method. For each gene, expression in the brain is used as a reference and set to 1. Value bar shows color coding (red–green) of normalized expression.