Literature DB >> 17522092

Merging mouse transcriptome analyses with Parkinson's disease linkage studies.

Daniel Gherbassi1, Lavinia Bhatt, Sandrine Thuret, Horst H Simon.   

Abstract

The hallmark of Parkinson's disease (PD OMIM #168600) is the degeneration of the nigral dopaminergic system affecting approximately 1% of the human population older than 65. In pursuit of genetic factors contributing to PD, linkage and association studies identified several susceptibility genes. The majority of these genes are expressed by the dopamine-producing neurons in the substantia nigra. We, therefore, propose expression by these neurons as a selection criterion, to narrow down, in a rational manner, the number of candidate genes in orphan PD loci, where no mutation has been associated thus far. We determined the corresponding human chromosome locations of 1435 murine cDNA fragments obtained from murine expression analyses of nigral dopaminergic neurons and combined these data with human linkage studies. These fragments represent 19 genes within orphan OMIM PD loci. We used the same approach for independent association studies and determined the genes in neighborhood to the peaks with the highest LOD score value. Our approach did not make any assumptions about disease mechanisms, but it, nevertheless, revealed alpha-synuclein, NR4A2 (Nurr1), and the tau genes, which had previously been associated to PD. Furthermore, our transcriptome analysis identified several classes of candidate genes for PD mutations and may also provide insight into the molecular pathways active in nigral dopaminergic neurons.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17522092      PMCID: PMC2779897          DOI: 10.1093/dnares/dsm007

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


Introduction

The neuropathological hallmark of Parkinson's disease (PD) is the progressive degeneration of dopaminergic (DA) neurons in the substantia nigra pars compacta (SNpc), affecting about 1–2% of the human population older than 65 years.[1] It is characterized by the clinical symptoms of resting tremor, muscular rigidity, postural instability, a positive response to the administration of l-DOPA, and the presence of cytoplasmic inclusions in postmortem brains, Lewy Bodies.[2] Despite its mostly sporadic onset and a high discordance rate in monozygotic twins,[3] several human linkage studies had been initiated to determine susceptibility genes for this disease.[4] In the Online Mendelian Inheritance in Man (OMIM) database, 13 PD loci have been recorded: PARK1,[5] PARK2,[6-9] PARK3,[10] PARK4,[11,12] PARK5,[13] PARK6,[14,15] PARK7,[16,17] PARK8,[18] PARK9,[19,20] PARK10,[21] PARK11,[22,23] PaRK12,[23,24] and PARK13.[25] Furthermore, genome-wide analyses of multiplex PD families provided evidence for linkage to regions on different chromosomes.[21,22,24,26-29] The PARK loci are sometimes larger than 10 Mb and can contain hundreds of genes. In case of the genome-wide linkage studies for a complex, multifactorial disease such as PD, the regions with high LOD scores are rarely smaller than 20 cM.[29] The differences among independent studies and the size of the suggested susceptibility regions make the searches for the underlying mutations irremediably a time-consuming process. For several PARK loci, the searches have been successful. Mutations in α-synuclein (PARK1 and PARK4), DJ-1 (PARK7), parkin (PARK2), PINK1 (PTEN-induced putative kinase) (PARK6), LRRK2 (leucine-rich repeat kinase 2) (PARK8), UCHL1 (ubiquitin carboxy-terminal-hydrolase-L1) (PARK5), and ATP13A2 (ATPase type 13A2) (PARK9) have been identified.[5,30-37] Other studies have revealed the cytoskeletal protein tau (MAPT)[36,38] and the ligand-independent nuclear receptor NR4A2[30,39,40] (Nurr1) as susceptibility genes. Although the definite role in PD of many of these genes is still discussed and controversial (especially for NR4A2 and UCHL1) and the known mutations account for less than 10% of all PD cases, the investigation into the functions of the underlying genes has generated an insight into the fundamental disease pathogenesis. For example, α-synuclein and parkin turned out to be major protein components of Lewy bodies in sporadic PD.[41] Mutations in parkin, UCHL1, and DJ-1 suggest that abnormal protein folding and protein degradation through the ubiquitin-proteasome system is an important factor in the etiology of the disease.[42,43] PINK1 may be involved in the phosphorylation of mitochondrial proteins in response to cellular stress, thus protecting against mitochondrial dysfunction.[35] Interestingly, mitochondria are also the site, where the known neurotoxins for DA neurons operate, suggesting that their malfunctioning could be a major contributor to PD pathogenesis.[44] Current or future searches for the underlying mutations in the remaining orphan Parkinson loci could be accelerated and widened to promoter regions and to haplotype variations, if the number of candidate genes is narrowed down by other criteria. At least seven out of the nine PD-associated genes are expressed by nigral DA neurons,[45-50] with different expression levels and specificity. These are α-synuclein, NR4A2, parkin,[46] PINK1, tau, UCHL1, and LRRK1 (http://www.brain-map.org). For this reason, we propose expression (specific or non-specific) by mesDA neurons as a selection criterion to identify candidate genes in those PD loci where the underlying gene is still unknown (orphan). Such an approach does not make any presumption with respect to disease mechanisms. Conceptually, the same method was applied on five large PD loci using serial analysis of gene expression for a comparative expression analysis of SNpc and adjacent mesencephalon in postmortem brains.[51] As cell-specific expression in mouse and human is very similar, we took three murine expression studies which employed fluorescent-activated cell sorting (FACS) and two unrelated subtractive methods for the identification of genes expressed by mesDA neurons.[52-54] We collected the cDNA sequences of these expression analyses from public databases, determined the underlying genes and the corresponding gene ontology annotations [Gene Ontology (GO)] to obtain insight into their function. Then, we established their genetic locations and their syntenic positions on the human genome. Finally, we combined these data with existing human PD linkage studies.[5-11,13-24,26-29,55,56]

Material and methods

Transcriptome analysis

All nucleotide sequences used in this study are publicly available at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Nucleotide and derived from three expression analyses in mouse: (i) Barrett et al.[52] published 779 sequences (Accession Nos.: BE824469–BE824504, BE824506–BE824519, BE824521–BE824561, BE824563–BE824823, BE824825–BE825045, BE825047–BE825132, CK338036–CK338155). (ii) Stewart et al.[53,57,58] published 496 cDNA sequences (Accession Nos.: AA008736, W33210–W33212, W33214–W33289, W35421–W35480, W36130–W36269, W39787–W40005, W40007–W40008, W40010–W40023, W45732). (iii) We published 160 sequences (Accession Nos.: CO436137–CO436293).[54] Each nucleotide sequence was employed for a nucleotide-nucleotide BLAST (blastn) (basic local alignment search tool) on the nr database (non-redundant) (http://www.ncbi.nlm.nih.gov/BLAST/) and on the mouse genome (http://www.ncbi.nlm.nih.gov/genome/seq/MmBlast.html). We then recorded those alignments with the highest scores, the lowest e-values, and highest number of hits in a single locus. BLAST results were categorized into four groups: (1) no significant alignments on mouse genome (None), (2) significant alignments with mitochondrial DNA (Mitochondrial Genes), (3) multiple high-scoring alignments on mouse genome (Multiple Hits) for ambiguous results, and (4) significant alignments on mouse genome for single hits or otherwise unambiguous results (Table 1). The latter group was further subdivided into: ‘Genes’, ‘ESTs’, and ‘genomic Sequences’. The group ‘Genes’ comprises the results with high-scoring alignments in exons of single genes. In some cases, where the alignment lay in the region after the last exon or, according to the chromosome map view, in an intron of a given gene, we termed it also ‘Gene’, if the hit was in a UniGene cluster which was linked to the gene in the locus. With those alignments that we were unable to associate to a gene, we performed a blastn on the MmEST database. If we could associate the sequence to a previously described EST, we termed it ‘EST’; otherwise, it was termed ‘Genomic Sequence’.
Table 1

BLAST results on mouse genome

No significant alignments on mouse genome262None
Significant alignments with mitochondrial genes104Mitochondrial genes
Multiple high-scoring alignments on mouse genome19Multiple hits
Significant alignments on mouse genome1050Genes (940)
Annotated genes (793)
Hypothetical genes (147)
ESTs (47)
Genomic sequences (63)

cDNA sequences are separated into four different categories based on the types of alignments generated. Alignments on the mouse genome were subdivided into Genes, ESTs, and genomic sequences. For the category ‘Genes’, we differentiated further between ‘annotated’ and ‘hypothetical’ depending on the gene RefSeq status recorded at NCBI.72

BLAST results on mouse genome cDNA sequences are separated into four different categories based on the types of alignments generated. Alignments on the mouse genome were subdivided into Genes, ESTs, and genomic sequences. For the category ‘Genes’, we differentiated further between ‘annotated’ and ‘hypothetical’ depending on the gene RefSeq status recorded at NCBI.72 For all the ‘Annotated Genes’, ‘Hypothetical Genes’, and mitochondrial genes, the following data were collected from the locus link feature (http://www.ncbi.nlm.nih.gov/LocusLink this was replaced by http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene during the course of this study): the gene name, gene symbol, accession number, Gene ID, and the MGI link number, if available. The latter provides a relational link to the GO library and the information related to ‘biological processes’, ‘cellular components’, and ‘molecular functions’. For all cDNA sequences categorized by ‘Significant Alignments on Mouse Genome’, we also registered the exact chromosomal position in kilobases (starting from the top of the short arm).

Mapping the murine cDNA sequences to the human genome

For most of the murine genes, a human homolog has already been determined, normally carrying the same name and symbol. This information is registered on the Entrez Gene page together with the cytogenetic locations. When this information did not exist, we used the mouse protein sequence of the identified gene for a translated BLAST (tblastn), or the nucleotide sequence of the cDNA fragment or the GenBank accession number of the corresponding gene for a blastn on the human genome. We registered the position in kilobases on the chromosome and verified each position on the human genome by comparing the neighboring genes to those in the mouse genome and recorded the human position only if the neighboring genes also matched. When the cytogenetic position on the human genome was determined, we compared this information with the positions of the recorded PARK loci. We aligned the human chromosome map view with the map for ‘morbid/disease’, described in OMIM (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM). When the genes, or the estimated human locations, and the cytogenetic disease locations co-localized, we called the gene a PD candidate gene. For the loci suggested by genome-wide studies, we selected those genes, which were situated ± 3 Mb from the chromosome marker (single nucleotide polymorphism (SNP)) with the highest LOD score (Table. 5). We are aware that this approach reduces the numbers of genes in an arbitrary manner. However, if preferred, the range can be widened with the provided data (see Supplementary Data) in order to more accurately consider asymmetry or size of each specific linkage peak.
Table 5

Association studies not recorded at OMIM

Cytogenetic locationGenetic markerMbCM MarshfieldLOD score
Bertoli-Avella (03)2719p13.13D19S22112.6362.26
19p13.13D19S84013.738
DeStefano (01)289q34.11D9S1825123.31361.3
10q22.1GATA121A0870.2881.07
DeStefano (02)569q32D9S930110.61201.86
20q11.2D20S47837.9541.82
21q21D21S205227.7242.21
Hicks (02)555q23.3D5S666120–1371351.6
Li (02)2110q25.3D10S1237116.11342.62
6p21.1D6S101741.7631.88
5q15D5S146296.4105
5q21.1Peak1001081.65
5q21.3D5S1453105–109115
17p13.1D17S130310.8241.93
Martinez (04)292p12–q22D2S2216881111.24
2p11–q12Peak1021172.04
2q12D2S1601071231.77
5q23D5S471117.51301.05
6p12D6S25756801.37
6q11–q13Peak69–73851.41
6q14D6S460∼82901.14
7p22D7S531351.51
11q14D11S417589.9911.6
19q13.3D19S90253.6731.05
Pankratz (03)24Xq22.3DXS8055113.4713.1
10q11.2D10S19651.570.02.3
Scott (01)265q31.1D5S816135.41392.39
17p11.2D17S92114.5361.92
Two-point and multipoint LOD17q11.2D17S129332.7562.28
17p11.2D17S92114.5362.02
17q11.2D17S129332.7562.62
9q33.1−10 cM117.8130
Multipoint LOD9q33.3D9S301 66 cM126.31402.59
9q34.2+10 cM132.3150
3q13.32D3S2460118.71351.62

For each individual study, the highest LOD scores with the associated genetic markers are listed. In these studies, the peak positions and the flanking genetic markers were given in centiMorgan on the Marshfield genetic map. We determined, when possible, the exact position in Mb on the corresponding chromosome. The average distance between the two adjacent genetic markers in each study varied between 5 and 11 cM.

The entire data set was collected and processed using the database program, Filemaker Pro 7.0. The latest update was in February 2007. This database is available upon request.

Results

We obtained 1435 sequences from three independent studies, which had the original aim to identify genes expressed by mesDA neurons. Barrett et al.[52] had isolated DA neurons from E13 ventral midbrain by FACS. This library contains genes expressed by mesDA neurons with a preference for abundant genes. The other two studies used subtractive methods to enrich for rare RNA transcripts expressed by mesDA neurons. Stewart et al.[53,57,58] had created a single-stranded directional cDNA library from substantia nigra of 8-week-old mice subtracted with a cDNA library from cerebellum. We had used a PCR-based differential display method[54] employing cDNA from engrailed-1/2 double-mutant and wild-type ventral midbrain during the embryonic stages when mesDA neurons disappear in the mutants.[59,60] The amplified sequences were compared to the expression profile of adult olfactory bulb, a source of DA neurons unrelated to those in the ventral midbrain. Only differentially expressed cDNA fragments were isolated and sequenced. As the original sequence analyses of the former two studies had been performed when a smaller nucleotide data set was available and in order to update our own expression analysis, we subjected the sequence data from all three screens to new BLAST searches and determined their association to genes and published ESTs, and their location on the mouse genome. The 1435 cDNA fragments generated 1050 unambiguous murine genomic hits, 19 ambiguous multiple hits, and 104 alignments with mitochondrial DNA. Two hundred and sixty-two cDNA sequences produced no significant alignments (see Table 1 for definitions and the entire analysis, and Table 2 for the individual libraries).
Table 2

Classification of BLAST results from each library

Total analysisBarrett52Stewart53Thuret54
A. Number of unique alignments per individual library
Genesa42315021877
Hyp. genesb80233919
ESTsc3216123
Genomic4415821
Mitochondria11822
Multiple hitsd14644
Nonee185671118
Totalf789285394134
B. Total number of fragmentsg
Genesa79340329397
Hyp. genesb147715521
ESTsc4628155
Genomic62301121
Mitochondria10410022
Multiple hitsd19946
Nonee2621381168
Total1435779496160

aAnnotated mouse genes.

bHypothetical genes determined by EST clustering or predicted by automated computational genome analysis with a large open reading frame.

cExpressed sequencing tags.

dUnderlying gene not identifiable, due to multiple alignments with low e-values.

eNo hit in mouse and human genome.

fNumber of unique alignments. Five hundred and seventy-nine unique tags were on the mouse genome (excluding mitochondria).

gNumber of fragments that represent genes, hypothetical genes, ESTs, genomic sequences, multiple alignments, and mitochondrial genes, listed per individual library.

Classification of BLAST results from each library aAnnotated mouse genes. bHypothetical genes determined by EST clustering or predicted by automated computational genome analysis with a large open reading frame. cExpressed sequencing tags. dUnderlying gene not identifiable, due to multiple alignments with low e-values. eNo hit in mouse and human genome. fNumber of unique alignments. Five hundred and seventy-nine unique tags were on the mouse genome (excluding mitochondria). gNumber of fragments that represent genes, hypothetical genes, ESTs, genomic sequences, multiple alignments, and mitochondrial genes, listed per individual library. Out of 1050 cDNA fragments, which generated unambiguous alignments on the mouse genome, 1020 were in gene loci. Most of them aligned to exons of those genes (72.6%; 741 of 1020). Out these 1020 cDNA fragments, 181 (17.8%) lay 3′ to the last annotated exon, suggesting that substantial amounts of mRNAs isolated from brain tissue are longer at their 3′ end than mRNAs from other tissues (Table 3). Finally, 9.6% (98 of 1020) of the alignments lay in regions designated as introns, suggesting that they are parts of unrecorded splice variants, possibly specific for mesDA neurons.
Table 3

Alignments in relation to gene loci

TotalGenomic sequencesESTsGenes
In gene lociOnly in last exon471471
In last and other exon(s)132132a
Not in last exon138138
After 3′ end181616159
Intron98401048
Subtotal10204626948
Outside gene loci301713
Total10506339948

Genomic alignments were divided into three groups: ‘ESTs’ (3.7%), ‘genomic sequences’ (6.0%), and ‘genes’ (90.3%). Majority of the cDNA fragments that aligned with genes are aligned with the last exon. A significant number of the cDNAs aligned with the region 3′ to the last exon. See Material and Methods for details.

aForty-four hits are in genes with only one exon.

Alignments in relation to gene loci Genomic alignments were divided into three groups: ‘ESTs’ (3.7%), ‘genomic sequences’ (6.0%), and ‘genes’ (90.3%). Majority of the cDNA fragments that aligned with genes are aligned with the last exon. A significant number of the cDNAs aligned with the region 3′ to the last exon. See Material and Methods for details. aForty-four hits are in genes with only one exon. The 1050 cDNA fragments represented 503 genes (423 annotated and 80 hypothetical genes), 32 ESTs, and 44 unique genomic hits with no otherwise described ESTs. Additionally, the 104 sequences that aligned to the mitochondrial DNA represented 11 mitochondrial genes (Table 2). To these cDNA sequences, we associated the corresponding MGI numbers, if available. This provided us with insight into their molecular function, the cellular locations of the proteins, and the associated biological process (see Supplementary Data for the entire transcriptome analysis). Several protein classes were over-represented, like, for example, those, which take part in mitochondria-related processes, in fatty acid chain metabolism, in ubiquitination, in the MAPK signaling pathways, or which are chaperones. Some of these molecular pathways were previously linked to the death of mesDA neurons, to PD, and other human neurodegenerative disorders. The majority of the mutations, which are associated to PD, is in genes that are expressed in mesDA neurons. We, therefore, joined these expression analyses with human PD linkage and association studies,[5-11,13-24,26-29,55,56] where no mutation has been associated thus far. For each unique mouse cDNA sequencing tag, we determined its human homolog and the corresponding cytogenetic and physical positions on the human chromosomes. We verified each locus on the human genome by identifying the neighboring genes on the mouse genome and recorded the human position only if the adjacent genes were the same. We then determined whether these positions were within OMIM (Table 4) and other suggestive (non-OMIM) PD loci (Table 5). In case of the OMIM orphan PD loci, we projected on the human chromosome view the map for ‘morbid diseases’. In case of non-OMIM loci, we identified the genes ± 3 Mb to the SNP marker with the highest LOD score. Totally, we linked the mouse transcriptome analyses to 569 unique locations on the human genome. Nineteen of these are within orphan PARK loci (Table 6) and 51 in non-OMIM PD loci (Table 7).
Table 4

PARK loci

LocusOMIM identifierGeneCytogenetic locationFrom (kb)To (kb)MbNumber of genes
PARK1163890SNCA4q21.1-4q21.3
PARK2602544Parkin6q25.3-6q26
PARK36024042p13.3-2p13.168.07575.3077.2106
PARK46055434p15.33-4p15.113.42437.32423.960
PARK5191342UCHL14p14
PARK6605909PINK11p36.33-1p35.1
PARK7602533DJ11p36.23-1p36.22
PARK860706012q11.2-12q13.1327.90855.63727.7351
PARK9606693ATP13A21p36.33-1p36.11
PARK106068521p33-1p32.247.65155.3807.776
PARK116076882q36.1-2q37.3219.844243.41623.6216
PARK12300557Xq21-q2575.950129.90040.0356
PARK136102972p13.1-2p11.275.45084.1308.739
601828NR4A22q22.1-2q23.3
603779SNCAIP5q23.1-q23.3
260540MAPT17q21.1

Genomic location of PARK loci as recorded in the OMIM databank. For seven of the PARK loci, the mutated genes were identified. The number of genes is the current GenBank estimation of all annotated and predicted genes in the corresponding PARK locus. For the PARK10 locus, we used the narrow definition 1p33-1p32.2 as determined by the two genetic markers D1S2134 and D1S200, and not the entire shorter arm of chromosome 1 (1p) which contains 1232 genes.21

Table 6

Candidate genes in Orphan PARK loci

No. of cDNA fragments aligning with the geneMouse IDHuman IDSymbolHuman gene namePositionLocus
1NM_146169XM_376062KIAA1155KIAA1155 protein2p13.3Park3
1NM_008717NM_014497ZFMLZinc finger, matrin-like2p13.2–p13.1Park3
1NM_183138XM_371501MGC22014cDNA sequence BC0374322p13.1Park3
3NM_080555NM_003713PPAP2BPhosphatidic acid phosphatase type 2B1p32Park10
1AA819910EstimatedFAF1In locus of Fas-associated factor 11p33Park10
6NM_009129NM_003469SCG2Secretogranin II2q35–q36PARK11
3AK052241NM_005544IRS1Insulin receptor substrate 12q36PARK11
1NM_152915NM_139072DNERDelta/notch-like EGF-related receptor2q37.1PARK11
1NM_008440NM_004321KIF1AKinesin family member 1A2q37.3PARK11
2NM_024197NM_004544NDUFA10NADH dehydrogenase (ubiquinone) 1 alpha subcomplex 102q37.3PARK11
1NM_025437NM_001412EIF1AXEukaryotic translation initiation factor 1A, X-linkedXp22.13PARK12
2NM_019768NM_012286MORF4L2Mortality factor 4 like 2Xq22PARK12
3NM_011123NM_000533PLP1Proteolipid protein 1Xq22PARK12
3NM_013898NM_004085TIMM8ATranslocase of inner mitochondrial membrane 8 homolog aXq22.1PARK12
3NM_016783NM_006667PGRMC1Progesterone receptor membrane component 1Xq22–q24PARK12
7NM_030688estimatedIL1RAPL2After 3′ of interleukin 1 receptor accessory protein-like 2Xq22.2–q22.3PARK12
1NM_133196NM_001325CSTF2Cleavage stimulation factor, 3′ pre-RNA, subunit 2Xq22.1PARK12
1NM_025893NM_173798ZCCHC12Zinc finger, CCHC domain containing 12Xq24PARK12
2NM_172782NM_018698NXT2Nuclear transport factor 2-like export factor 2Xq23PARK12
Table 7

Candidate genes for non-OMIM PARK loci

GenBank IDHuman IDSymbolHuman locationIn kbbGene name
Scott (01) D3S246026
5NM_008083NM_002045GAP433q13.1–13.2116700Growth-associated protein 43
3BB626331ESTLsamp3q13.2–q21117200Limbic system-associated membrane protein
2NM_177093XM_057296LRRC583q13.33121300Leucine-rich repeat containing 58
2NM_008047NM_007085FSTl3q13.32–q13.3121460follistatin-like 1
Martinez (04) D5S47129
1XM_283496NM_005509DMXL15q22118600Dmx-like 1
1GenomicEstimatedFEM1C5q22114939fem-1 homolog c
3NM_152809NM_004384CSNK1G35q23123000Casein kinase 1, gamma 3
Li (02) D5S1462 D5S145321
1NM_172827ESTLNPEP5q1596440Leucyl/cystinyl aminopeptidase
Hicks (02). Scott (01) D5S666. D5S81626,55
1NM_173753NM_001008738FNIP15q31.1131060Folliculin interacting protein 1
1NM_144823NM_015256ACSL65q31131400Acyl-CoA synthetase long-chain family member
1NM_033144XM_034872SEPT85q31132180Septin 8
1AK011363NM_003337UBE2B5q23–q31133800Ubiquitin-conjugating enzyme E2B, RAD6 homology
Scott (01) D5S81626
1NM_029518NM_016604JMJD1B5q31137810Jumonji domain containing 1B
3NM_010771NM_018834MATR35q31.3138730Matrin 3
Li (02) D6S101721
1NM_025365NM_013397C6ORF496p21.3141800Chromosome 6 open reading frame 49
1NM_020493NM_003131SRF6p21.143200Serum response factor (c-fos serum response element-binding transcription factor)
5NM_008302NM_007355HSP90AB16p1244300Heat shock protein 90 kDa alpha (cytosolic), class B member 1
Martinez (04) D6S257 D6S46029
1GenomicEstimated6q12–q1372500
8NM_010106NM_001402EEF1A16q14.174224Eukaryotic translation elongation factor 1 alpha 1
Martinez (04) D7S53129
1NM_028469NM_032350MGC112577p22.3850Hypothetical protein MGC11257
1NM_010302NM_007353GNA127p22–p212510Guanine nucleotide binding protein (G protein) alpha 12 
6NM_007393NM_001101ACTB7p15–p125300Actin beta
1NM_026050NM_032706MGC129667p22.26110Hypothetical protein MGC12966
1NM_009007NM_006908RAC17p226170ras-related C3 botulinum toxin substrate 1 (rho family, small GTP binding protein Rac)
DeStefano (01) D9S1825,56 Scott (01) D9S30126
7NM_026434NM_033117RBM189q34.11120400RNA binding motif protein 18
2NM_022310NM_005347HSPA59q33–q34.1123370Heat shock 70 kD protein 5
1NM_025709NM_015635GAPVD19q34.11123450GTPase-activating protein and VPS9 domains 1
1NM_172661XM_497080KIAA05159q34.1129650KIAA0515 gene
DeStefano (01) GATA121A0856
1NM_183295NM_015634KIAA127910q22.170100KIAA1279 gene
Martinez (04) D11S417529
1NM_025844NM_012124CHORDC111q14.389650Cysteine and histidine-rich domain (CHORD)-containing. zinc-binding protein 1
Li (02) D10S123921
1NM_172523NM_003054VMAT210q25118680Solute carrier family 18
Li (02) D17S130321
1NM_018768NM_004853STX817p129350Syntaxin 8
Scott (01) D17S921, D17S129326
1NM_011664NM_018955UBB17p12–p11.216470Ubiquitin B
1NM_011480NM_004176SREBF117p11.217950Sterol regulatory element binding factor 1
1XM_110937NM_145809USP3217p11.218621Ubiquitin-specific protease 32
1NM_026389NM_015584POLDIP217q11.226800Polymerase delta interacting protein 2
1NM_174852NM_020889PHF1217q11.127400PHD finger protein 12
1NM_010897NM_000267NF117q11.229700Neurofibromatosis 1
1NM_010161NM_014210EVI2A17q11.229800Ecotropic viral integration site 2A
1NM_010716NM_002311LIG317q11.2–q1233450Ligase III, DNA, ATP-dependent
Bertoli-Avella (03) D19S22127
2NM_008319NM_003259ICAM519p13.210260Intercellular adhesion molecule 5, telencephalin
16NM_016742NM_007065CDC3719p13.210370Cell division cycle 37 homolog (S. cerevisiae)-like
1NM_145624NM_016264ZNF4419p13.212200Zinc finger protein 44
1NM_010906NM_002501NFIX19p13.313030Nuclear factor I/X
1NM_183097Estimated19p13.1314060Progestin and adipoQ receptor family member
DeStefano (02) D20S47856
1BQ927659Estimated20q11.2–q1235330
1NM_013865NM_022477NDRG320q11.21–q11.2336000n-myc downstream regulated 3
1NM_010658NM_005461MAFB20q11.2–q13.140000v-maf musculoaponeurotic fibrosarcoma oncogene family. protein B
2NM_021464NM_007050PTPRT20q12–q1340500Protein tyrosine phosphatase. receptor type T
DeStefano (02) D21S205256
2NM_11782EstimatedADAMTS521q21.227170A disintegrin-like and metalloprotease (reprolysin-type) with thrombospondin type 1 motif 5 (aggrecanase-2′) 3′
Pankratz (03) DXS805524
1NM_016783NM_006667PGRMC1Xq24116713Progesterone receptor membrane component 1

Listed genes are situated ± 3 Mb to peak with the highest LOD score, except for D10S196 where we used ± 8Mb.

aNumber of cDNA fragments aligning with the gene.

bkb from the top of the short arm of the chromosome.

cHuman chromosome location was estimated by comparing the flanking regions of mouse and man.

PARK loci Genomic location of PARK loci as recorded in the OMIM databank. For seven of the PARK loci, the mutated genes were identified. The number of genes is the current GenBank estimation of all annotated and predicted genes in the corresponding PARK locus. For the PARK10 locus, we used the narrow definition 1p33-1p32.2 as determined by the two genetic markers D1S2134 and D1S200, and not the entire shorter arm of chromosome 1 (1p) which contains 1232 genes.21 Association studies not recorded at OMIM For each individual study, the highest LOD scores with the associated genetic markers are listed. In these studies, the peak positions and the flanking genetic markers were given in centiMorgan on the Marshfield genetic map. We determined, when possible, the exact position in Mb on the corresponding chromosome. The average distance between the two adjacent genetic markers in each study varied between 5 and 11 cM. Candidate genes in Orphan PARK loci Candidate genes for non-OMIM PARK loci Listed genes are situated ± 3 Mb to peak with the highest LOD score, except for D10S196 where we used ± 8Mb. aNumber of cDNA fragments aligning with the gene. bkb from the top of the short arm of the chromosome. cHuman chromosome location was estimated by comparing the flanking regions of mouse and man. The experimental design of the three different transcriptome analyses, we used for our study, were such that they included both highly and rarely expressed transcripts. Our analysis confirmed the complementary nature of the three screens. Only 7.2% (104 out of 1435) of the cDNA sequences of these libraries represent genes, hypothetical genes, or EST clusters, which are found in more than one of them (Table 8). Moreover, the libraries also contained two cDNA fragments for α-synuclein, three for NR4A2, and one for the tau genes. Mutations in all three genes have been previously associated to PD.[5,30,36] Assuming that all 30 000 genes in the human genome[61] were equally likely detected, the probability to identify three of nine PD susceptibility genes by chance out of a pool of 569 was less than 3.4 × 10 −3. If we exclude the controversial NR4A2 and UCHL1, the probability was less than 1.5 × 10−2.
Table 8

cDNA library comparison

Barrett52Stewart53Thuret54
Barrett45 (22)11 (2)
Stewart35 (22)5 (4)
Thuret3 (2)5 (4)

Of 1435, 104 (7.2%) cDNA fragments overlap with sequences also present in one other library. This number includes not only fragments that align with each other, but also those which align with the same annotated gene, hypothetical gene, mitochondrial gene, EST, or genomic position. These overlapping 104 cDNA fragments represent 28 of 781 (3.6%) unique tags (Table 2).

cDNA library comparison Of 1435, 104 (7.2%) cDNA fragments overlap with sequences also present in one other library. This number includes not only fragments that align with each other, but also those which align with the same annotated gene, hypothetical gene, mitochondrial gene, EST, or genomic position. These overlapping 104 cDNA fragments represent 28 of 781 (3.6%) unique tags (Table 2).

Discussion

The entire human and mouse genome sequences have been available for more than 3 years.[61,62] Therefore, the chromosomal locations of most genes have been determined and as a consequence also those genes within a given disease locus. In order to identify potential PD susceptibility genes, we projected the sequence data of three murine transcriptome studies for mesDA neurons onto the human genome and compared them with previously identified PD loci. We determine the human homologs of 1435 murine cDNA fragments which corresponded to 579 unique mouse chromosomal locations; 423 annotated genes, 80 hypothetical genes, 32 ESTs, and 44 genomic locations, which are not linked to any genes or otherwise reported cDNA sequences. Of the 569 unique locations on the human genome, 19 were positioned in OMIM PARK loci and 51 within genomic regions that have a weaker linkage to PD, which are not recorded in the OMIM database and need further confirmation. Multiple studies are on the way to determine the underlying mutations of orphan PARK loci[63]; however, the length of putative regulatory regions of most gene, their unpredictable position, and the common presence of SNPs have thus far restricted such studies to nucleotide variation in the coding region and in 5′ and 3′ UTR. Disparities in the promoter–enhancer–silencer regions were only the aim if the targeted gene had been previously linked to PD.[64,65] A nucleotide variation in the α-synuclein promoter, for example, was associated to the disease.[12,66] Variability on the level of gene expression is far more common than nucleotide variations which alter protein sequences[67] and it is believed that these haplotype variations determine individual traits and predispositions for common diseases such as PD. Narrowing down the number of candidate genes in identified loci in a rational manner may encourage the inclusion of the promoter regions in future studies aiming to identify mutations associated to PD. Among the candidate genes that we found, the most interesting is VMAT2 (vesicular monoamine transporter 2) (10q25). Reduced expression of VMAT2 could be correlated with a higher sensitivity to environmental factors. For example, VMAT2 heterozygote mice (+/ − ) are remarkably more sensitive than wild-type to the neurotoxin 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine.[68,69] Furthermore, we identified two genes in the ubiquitination pathway, Ube2b [ubiquitin-conjugating enzyme E2B, RAD6 homology (S. cerevisiae)] and Ubb (Ubiquitin B, member of the HSP90 family) and Hspa5 (heat shock 70 kDa protein 5, member of the HSP70 family). Finally, 26 mitochondrial genes encoded by nuclear DNA are present in our transcriptome analysis. Of these, an unexpected high proportion of genes, namely four, are located within orphan OMIM PARK loci. There is increasing evidence that impairment of mitochondrial functions and oxidative stress are contributing factors to PD[70] supported by the recent finding of a mutation in PINK1.[35] Furthermore, the functional deficiencies induced by several of the other PD mutations seem to converge onto the mitochondria.[71] Our finding confirms a central role of the mitochondria in PD and suggests the possibility that a misregulation of some of these four mitochondrial genes may be a contributing factor for the disease. We conclude that our transcriptome analysis, along with being applicable for the identification of PD candidate genes, may also be a useful tool for future genome-wide association studies with newer resources, such as HapMap (http://www.hapmap.org/), where tagSNPs can be chosen close to loci of genes expressed by mesDA neurons. Furthermore, new GO annotations are constantly added and with time it may turn out that many of the identified genes are part of shared metabolic pathways. Our data set may give new insight into ligand/receptor interactions and/or intracellular signaling pathways acting in mesDA neurons, allowing novel studies into the molecular etiology of PD.
  72 in total

1.  The ubiquitin pathway in Parkinson's disease.

Authors:  E Leroy; R Boyer; G Auburger; B Leube; G Ulm; E Mezey; G Harta; M J Brownstein; S Jonnalagada; T Chernova; A Dehejia; C Lavedan; T Gasser; P J Steinbach; K D Wilkinson; M H Polymeropoulos
Journal:  Nature       Date:  1998-10-01       Impact factor: 49.962

2.  Mutations in the parkin gene cause autosomal recessive juvenile parkinsonism.

Authors:  T Kitada; S Asakawa; N Hattori; H Matsumine; Y Yamamura; S Minoshima; M Yokochi; Y Mizuno; N Shimizu
Journal:  Nature       Date:  1998-04-09       Impact factor: 49.962

Review 3.  Genetics of Parkinson's disease.

Authors:  R L Nussbaum; M H Polymeropoulos
Journal:  Hum Mol Genet       Date:  1997       Impact factor: 6.150

4.  Refinement of the gene locus for autosomal recessive juvenile parkinsonism (AR-JP) on chromosome 6q25.2-27 and identification of markers exhibiting linkage disequilibrium.

Authors:  M Saito; H Matsumine; H Tanaka; A Ishikawa; S Shimoda-Matsubayashi; A A Schäffer; Y Mizuno; S Tsuji
Journal:  J Hum Genet       Date:  1998       Impact factor: 3.172

Review 5.  Etiology and pathogenesis of Parkinson's disease.

Authors:  C W Olanow; W G Tatton
Journal:  Annu Rev Neurosci       Date:  1999       Impact factor: 12.449

6.  A susceptibility locus for Parkinson's disease maps to chromosome 2p13.

Authors:  T Gasser; B Müller-Myhsok; Z K Wszolek; R Oehlmann; D B Calne; V Bonifati; B Bereznai; E Fabrizio; P Vieregge; R D Horstmann
Journal:  Nat Genet       Date:  1998-03       Impact factor: 38.330

7.  The tau gene haplotype h1 confers a susceptibility to Parkinson's disease.

Authors:  Jun Zhang; Yiqing Song; Honglei Chen; Dongsheng Fan
Journal:  Eur Neurol       Date:  2004-12-27       Impact factor: 1.710

8.  Chromosome 6-linked autosomal recessive early-onset Parkinsonism: linkage in European and Algerian families, extension of the clinical spectrum, and evidence of a small homozygous deletion in one family. The French Parkinson's Disease Genetics Study Group, and the European Consortium on Genetic Susceptibility in Parkinson's Disease.

Authors:  J Tassin; A Dürr; T de Broucker; N Abbas; V Bonifati; G De Michele; A M Bonnet; E Broussolle; P Pollak; M Vidailhet; M De Mari; R Marconi; S Medjbeur; A Filla; G Meco; Y Agid; A Brice
Journal:  Am J Hum Genet       Date:  1998-07       Impact factor: 11.025

9.  Autosomal recessive juvenile parkinsonism maps to 6q25.2-q27 in four ethnic groups: detailed genetic mapping of the linked region.

Authors:  A C Jones; Y Yamamura; L Almasy; S Bohlega; B Elibol; J Hubble; S Kuzuhara; M Uchida; T Yanagi; D E Weeks; T G Nygaard
Journal:  Am J Hum Genet       Date:  1998-07       Impact factor: 11.025

10.  Loss of function mutations in the gene encoding Omi/HtrA2 in Parkinson's disease.

Authors:  Karsten M Strauss; L Miguel Martins; Helene Plun-Favreau; Frank P Marx; Sabine Kautzmann; Daniela Berg; Thomas Gasser; Zbginiew Wszolek; Thomas Müller; Antje Bornemann; Hartwig Wolburg; Julian Downward; Olaf Riess; Jörg B Schulz; Rejko Krüger
Journal:  Hum Mol Genet       Date:  2005-06-16       Impact factor: 6.150

View more
  2 in total

1.  Parkinson's disease candidate gene prioritization based on expression profile of midbrain dopaminergic neurons.

Authors:  Shahrooz Vahedi; Mehrnoosh Rajabian; Arman Misaghian; Daniel Grbec; Horst H Simon; Kambiz N Alavian
Journal:  J Biomed Sci       Date:  2010-08-17       Impact factor: 8.410

2.  Fas-associated factor 1 and Parkinson's disease.

Authors:  Ranjita Betarbet; Leah R Anderson; Marla Gearing; Tiffany R Hodges; Jason J Fritz; James J Lah; Allan I Levey
Journal:  Neurobiol Dis       Date:  2008-05-29       Impact factor: 5.996

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.