| Literature DB >> 36131690 |
Jack N G Marshall1, Alexander Fröhlich1, Li Li1,2, Abigail L Pfaff3,4, Ben Middlehurst1, Thomas P Spargo5,6,7, Alfredo Iacoangeli5,6,7, Bing Lang2, Ammar Al-Chalabi6,8, Sulev Koks3,4, Vivien J Bubb1, John P Quinn1.
Abstract
We describe the characterisation of a variable number tandem repeat (VNTR) domain within intron 1 of the amyotrophic lateral sclerosis (ALS) risk gene CFAP410 (Cilia and flagella associated protein 410) (previously known as C21orf2), providing insight into how this domain could support differential gene expression and thus be a modulator of ALS progression or risk. We demonstrated the VNTR was functional in a reporter gene assay in the HEK293 cell line, exhibiting both the properties of an activator domain and a transcriptional start site, and that the differential expression was directed by distinct repeat number in the VNTR. These properties embedded in the VNTR demonstrated the potential for this VNTR to modulate CFAP410 expression. We extrapolated these findings in silico by utilisation of tagging SNPs for the two most common VNTR alleles to establish a correlation with endogenous gene expression. Consistent with in vitro data, CFAP410 isoform expression was found to be variable in the brain. Furthermore, although the number of matched controls was low, there was evidence for one specific isoform being correlated with lower expression in those with ALS. To address if the genotype of the VNTR was associated with ALS risk, we characterised the variation of the CFAP410 VNTR in ALS cases and matched controls by PCR analysis of the VNTR length, defining eight alleles of the VNTR. No significant difference was observed between cases and controls, we noted, however, the cohort was unlikely to contain sufficient power to enable any firm conclusion to be drawn from this analysis. This data demonstrated that the VNTR domain has the potential to modulate CFAP410 expression as a regulatory element that could play a role in its tissue-specific and stimulus-inducible regulation that could impact the mechanism by which CFAP410 is involved in ALS.Entities:
Keywords: ALS; CFAP410; VNTR; gene expression; transcriptional regulation
Year: 2022 PMID: 36131690 PMCID: PMC9484465 DOI: 10.3389/fnmol.2022.954928
Source DB: PubMed Journal: Front Mol Neurosci ISSN: 1662-5099 Impact factor: 6.261
Figure 1Location of multiple transcripts at the CFAP410 gene locus relative to the intron 1 VNTR. (A) The four main isoforms of CFAP410 overlaid with the Simple Tandem Repeats track and histone marks from the ENCODE database. The ALS GWAS SNP (rs75087725) and identified tagging SNP (rs56212056) are also shown. The VNTR identified in this study is boxed in red. (B) CFAP410 VNTR location overlaid with RefSeq genes from NCBI and predicted transcripts from Ensembl database. (C) PCR amplification and gel electrophoresis of the VNTR within the CFAP410 locus. This region was confirmed to be polymorphic and eight alleles were identified (numbered 1–8 to reflect increasing length). Variant 7 was found only in a single ALS case. All samples were run at 100 V on 2% agarose for 3.5 h. All samples shown are from the cohort of ALS cases and controls that are genotyped in Table 1.
Allele and genotype frequencies of CFAP410 VNTR in MNDA cohort.
|
| |||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||
|
|
|
|
| ||||
| 1 | 2 | 0.20 | 5 | 0.51 | 7 | −0.30 | 0.67 |
| 2 | 3 | 0.31 | 2 | 0.20 | 5 | 0.10 | 0.50 |
| 3 | 10 | 1.02 | 11 | 1.12 | 21 | −0.10 | 1.00 |
| 4 | 224 | 22.86 | 220 | 22.31 | 444 | 0.54 | 0.93 |
| 5 | 730 | 74.49 | 737 | 74.75 | 1,467 | −0.26 | 0.75 |
| 6 | 5 | 0.51 | 6 | 0.61 | 11 | −0.10 | 1.00 |
| 7 | 1 | 0.10 | 0 | 0 | 1 | 0.10 | 0.50 |
| 8 | 5 | 0.51 | 5 | 0.51 | 10 | 0.00 | 0.50 |
| Total | 980 | 100.00 | 986 | 100.00 | 1,966 | 0.00 | N/A |
|
| |||||||
|
|
|
|
|
|
| ||
|
|
|
|
| ||||
| 1.4 | 1 | 0.20 | 1 | 0.20 | 2 | 0.00 | 1.00 |
| 1.5 | 0 | 0.00 | 4 | 0.81 | 4 | −0.81 | 0.22 |
| 2.2 | 1 | 0.20 | 0 | 0.00 | 1 | 0.20 | 1.00 |
| 2.4 | 1 | 0.20 | 0 | 0.00 | 1 | 0.20 | 0.50 |
| 2.5 | 0 | 0.00 | 2 | 0.41 | 2 | −0.41 | 0.50 |
| 3.4 | 1 | 0.20 | 1 | 0.20 | 2 | 0.00 | 1.00 |
| 3.5 | 9 | 1.84 | 10 | 2.03 | 19 | −0.19 | 0.71 |
| 4.4 | 28 | 5.71 | 27 | 5.48 | 55 | 0.24 | 0.57 |
| 4.5 | 161 | 32.86 | 159 | 32.25 | 320 | 0.61 | 0.59 |
| 4.6 | 3 | 0.61 | 3 | 0.61 | 6 | 0.00 | 0.50 |
| 4.8 | 1 | 0.20 | 2 | 0.41 | 3 | −0.20 | 1.00 |
| 5.5 | 277 | 56.53 | 278 | 56.39 | 555 | 0.14 | 0.76 |
| 5.6 | 2 | 0.41 | 3 | 0.61 | 5 | −0.20 | 0.61 |
| 5.7 | 1 | 0.20 | 0 | 0.00 | 1 | 0.20 | 0.50 |
| 1.8 | 1 | 0.20 | 0 | 0.00 | 1 | 0.20 | 1.00 |
| 5.8 | 3 | 0.61 | 3 | 0.61 | 6 | 0.00 | 1.00 |
| Total | 490 | 100.00 | 493 | 100.00 | 983 | 0.00 | N/A |
.
Figure 2Differential expression of CFAP410 in Case and Controls in the CNS using the NYG transcriptomic data. Expression analysis of CFAP410 isoforms in ALS and control subjects. RNAseq data from the NYGC ALS cohort were used to compare expression of CFAP410 isoforms ENST00000339818 (A), ENST00000325223 (B), ENST00000397956 (C), ENST00000462742 (D) and ENST00000496321 (E). Wilcoxon test was applied to demonstrate statistical significance indicated as asterisks. **P ≤ 0.01.
Figure 3The CFAP410 VNTR shows functional properties in the pGL3-P vector in the HEK293 cell line. (A) Schematic for VNTR containing constructs and pGL3 vectors (Promega) used in the luciferase assay. (B) The fold activity of alleles 4 and 5 of the CFAP410 VNTR in the endogenous (forward) orientation within the pGL3P vector, normalised to the internal control Renilla luciferase (Biological replicate n = 3, technical replicate per assay n = 4). The promoter-less vector (pGL3-B) was included as a negative control. T-test was used to compare VNTR-containing constructs to SV40 unmodified vector (pGL3-P) and to compare all VNTR containing constructs against each other. ***P = <0.001.
Figure 4The CFAP410 VNTR shows promoter activity in the pGL3-B vector in HEK293. (A) Schematic for VNTR containing constructs and pGL3 vectors (Promega) used in luciferase assay. (B) The fold activity of the CFAP410 VNTR in the endogenous and reverse orientation within the pGL3-B vector normalised to the internal control Renilla luciferase (Biological replicate n = 3, technical replicate per assay n = 4). The pGL3-P vector was included as a positive control as this contains an SV40 promoter. T-test was used to compare VNTR containing constructs to each other (allele 4 vs. allele 5) and promoterless vector alone (pGL3-B). T-test was also used to compare VNTR containing constructs to the SV40 promoter vector (pGL3-P). ***P = <0.001.
Figure 5The expression of CFAP410 isoforms is associated with tagging the SNP rs56212056 genotype. (A) CFAP410 isoform expression using RNAseq data from the NYGC Target ALS cohort. (B–F) Association of three different SNP genotypes (AA, GA, and GG) with an expression of isoforms ENST00000339818, ENST00000325223, ENST00000397956Q24, ENST00000462742, and ENST00000496321 was analysed based on the correlation with allele 4 (A) or allele 5 (G) using transcriptomic data from the NYGC ALS cohort. Wilcoxon test was applied to demonstrate statistical significance indicated as asterisks. *P = < 0.05, **P = < 0.01, ***P = < 0.001, ****P = < 0.0001, ns > 0.05.