| Literature DB >> 17522092 |
Daniel Gherbassi1, Lavinia Bhatt, Sandrine Thuret, Horst H Simon.
Abstract
The hallmark of Parkinson's disease (PD OMIM #168600) is the degeneration of the nigral dopaminergic system affecting approximately 1% of the human population older than 65. In pursuit of genetic factors contributing to PD, linkage and association studies identified several susceptibility genes. The majority of these genes are expressed by the dopamine-producing neurons in the substantia nigra. We, therefore, propose expression by these neurons as a selection criterion, to narrow down, in a rational manner, the number of candidate genes in orphan PD loci, where no mutation has been associated thus far. We determined the corresponding human chromosome locations of 1435 murine cDNA fragments obtained from murine expression analyses of nigral dopaminergic neurons and combined these data with human linkage studies. These fragments represent 19 genes within orphan OMIM PD loci. We used the same approach for independent association studies and determined the genes in neighborhood to the peaks with the highest LOD score value. Our approach did not make any assumptions about disease mechanisms, but it, nevertheless, revealed alpha-synuclein, NR4A2 (Nurr1), and the tau genes, which had previously been associated to PD. Furthermore, our transcriptome analysis identified several classes of candidate genes for PD mutations and may also provide insight into the molecular pathways active in nigral dopaminergic neurons.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17522092 PMCID: PMC2779897 DOI: 10.1093/dnares/dsm007
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
BLAST results on mouse genome
| No significant alignments on mouse genome | 262 | None |
| Significant alignments with mitochondrial genes | 104 | Mitochondrial genes |
| Multiple high-scoring alignments on mouse genome | 19 | Multiple hits |
| Significant alignments on mouse genome | 1050 | Genes (940) |
| Annotated genes (793) | ||
| Hypothetical genes (147) | ||
| ESTs (47) | ||
| Genomic sequences (63) |
cDNA sequences are separated into four different categories based on the types of alignments generated. Alignments on the mouse genome were subdivided into Genes, ESTs, and genomic sequences. For the category ‘Genes’, we differentiated further between ‘annotated’ and ‘hypothetical’ depending on the gene RefSeq status recorded at NCBI.72
Association studies not recorded at OMIM
| Cytogenetic location | Genetic marker | Mb | CM Marshfield | LOD score | |
|---|---|---|---|---|---|
| Bertoli-Avella (03)27 | 19p13.13 | D19S221 | 12.6 | 36 | 2.26 |
| 19p13.13 | D19S840 | 13.7 | 38 | ||
| DeStefano (01)28 | 9q34.11 | D9S1825 | 123.3 | 136 | 1.3 |
| 10q22.1 | GATA121A08 | 70.2 | 88 | 1.07 | |
| DeStefano (02)56 | 9q32 | D9S930 | 110.6 | 120 | 1.86 |
| 20q11.2 | D20S478 | 37.9 | 54 | 1.82 | |
| 21q21 | D21S2052 | 27.7 | 24 | 2.21 | |
| Hicks (02)55 | 5q23.3 | D5S666 | 120–137 | 135 | 1.6 |
| Li (02)21 | 10q25.3 | D10S1237 | 116.1 | 134 | 2.62 |
| 6p21.1 | D6S1017 | 41.7 | 63 | 1.88 | |
| 5q15 | D5S1462 | 96.4 | 105 | ||
| 5q21.1 | Peak | 100 | 108 | 1.65 | |
| 5q21.3 | D5S1453 | 105–109 | 115 | ||
| 17p13.1 | D17S1303 | 10.8 | 24 | 1.93 | |
| Martinez (04)29 | 2p12–q22 | D2S2216 | 88 | 111 | 1.24 |
| 2p11–q12 | Peak | 102 | 117 | 2.04 | |
| 2q12 | D2S160 | 107 | 123 | 1.77 | |
| 5q23 | D5S471 | 117.5 | 130 | 1.05 | |
| 6p12 | D6S257 | 56 | 80 | 1.37 | |
| 6q11–q13 | Peak | 69–73 | 85 | 1.41 | |
| 6q14 | D6S460 | ∼82 | 90 | 1.14 | |
| 7p22 | D7S531 | 3 | 5 | 1.51 | |
| 11q14 | D11S4175 | 89.9 | 91 | 1.6 | |
| 19q13.3 | D19S902 | 53.6 | 73 | 1.05 | |
| Pankratz (03)24 | Xq22.3 | DXS8055 | 113.4 | 71 | 3.1 |
| 10q11.2 | D10S196 | 51.5 | 70.0 | 2.3 | |
| Scott (01)26 | 5q31.1 | D5S816 | 135.4 | 139 | 2.39 |
| 17p11.2 | D17S921 | 14.5 | 36 | 1.92 | |
| Two-point and multipoint LOD | 17q11.2 | D17S1293 | 32.7 | 56 | 2.28 |
| 17p11.2 | D17S921 | 14.5 | 36 | 2.02 | |
| 17q11.2 | D17S1293 | 32.7 | 56 | 2.62 | |
| 9q33.1 | −10 cM | 117.8 | 130 | ||
| Multipoint LOD | 9q33.3 | D9S301 66 cM | 126.3 | 140 | 2.59 |
| 9q34.2 | +10 cM | 132.3 | 150 | ||
| 3q13.32 | D3S2460 | 118.7 | 135 | 1.62 |
For each individual study, the highest LOD scores with the associated genetic markers are listed. In these studies, the peak positions and the flanking genetic markers were given in centiMorgan on the Marshfield genetic map. We determined, when possible, the exact position in Mb on the corresponding chromosome. The average distance between the two adjacent genetic markers in each study varied between 5 and 11 cM.
Classification of BLAST results from each library
| Total analysis | Barrett52 | Stewart53 | Thuret54 | |
|---|---|---|---|---|
| A. Number of unique alignments per individual library | ||||
| Genesa | 423 | 150 | 218 | 77 |
| Hyp. genesb | 80 | 23 | 39 | 19 |
| ESTsc | 32 | 16 | 12 | 3 |
| Genomic | 44 | 15 | 8 | 21 |
| Mitochondria | 11 | 8 | 2 | 2 |
| Multiple hitsd | 14 | 6 | 4 | 4 |
| Nonee | 185 | 67 | 111 | 8 |
| Totalf | 789 | 285 | 394 | 134 |
| B. Total number of fragmentsg | ||||
| Genesa | 793 | 403 | 293 | 97 |
| Hyp. genesb | 147 | 71 | 55 | 21 |
| ESTsc | 46 | 28 | 15 | 5 |
| Genomic | 62 | 30 | 11 | 21 |
| Mitochondria | 104 | 100 | 2 | 2 |
| Multiple hitsd | 19 | 9 | 4 | 6 |
| Nonee | 262 | 138 | 116 | 8 |
| Total | 1435 | 779 | 496 | 160 |
aAnnotated mouse genes.
bHypothetical genes determined by EST clustering or predicted by automated computational genome analysis with a large open reading frame.
cExpressed sequencing tags.
dUnderlying gene not identifiable, due to multiple alignments with low e-values.
eNo hit in mouse and human genome.
fNumber of unique alignments. Five hundred and seventy-nine unique tags were on the mouse genome (excluding mitochondria).
gNumber of fragments that represent genes, hypothetical genes, ESTs, genomic sequences, multiple alignments, and mitochondrial genes, listed per individual library.
Alignments in relation to gene loci
| Total | Genomic sequences | ESTs | Genes | ||
|---|---|---|---|---|---|
| In gene loci | Only in last exon | 471 | 471 | ||
| In last and other exon(s) | 132 | 132a | |||
| Not in last exon | 138 | 138 | |||
| After 3′ end | 181 | 6 | 16 | 159 | |
| Intron | 98 | 40 | 10 | 48 | |
| Subtotal | 1020 | 46 | 26 | 948 | |
| Outside gene loci | 30 | 17 | 13 | ||
| Total | 1050 | 63 | 39 | 948 |
Genomic alignments were divided into three groups: ‘ESTs’ (3.7%), ‘genomic sequences’ (6.0%), and ‘genes’ (90.3%). Majority of the cDNA fragments that aligned with genes are aligned with the last exon. A significant number of the cDNAs aligned with the region 3′ to the last exon. See Material and Methods for details.
aForty-four hits are in genes with only one exon.
PARK loci
| Locus | OMIM identifier | Gene | Cytogenetic location | From (kb) | To (kb) | Mb | Number of genes |
|---|---|---|---|---|---|---|---|
| PARK1 | 163890 | SNCA | 4q21.1-4q21.3 | ||||
| PARK2 | 602544 | Parkin | 6q25.3-6q26 | ||||
| PARK3 | 602404 | 2p13.3-2p13.1 | 68.075 | 75.307 | 7.2 | 106 | |
| PARK4 | 605543 | 4p15.33-4p15.1 | 13.424 | 37.324 | 23.9 | 60 | |
| PARK5 | 191342 | UCHL1 | 4p14 | ||||
| PARK6 | 605909 | PINK1 | 1p36.33-1p35.1 | ||||
| PARK7 | 602533 | DJ1 | 1p36.23-1p36.22 | ||||
| PARK8 | 607060 | 12q11.2-12q13.13 | 27.908 | 55.637 | 27.7 | 351 | |
| PARK9 | 606693 | ATP13A2 | 1p36.33-1p36.11 | ||||
| PARK10 | 606852 | 1p33-1p32.2 | 47.651 | 55.380 | 7.7 | 76 | |
| PARK11 | 607688 | 2q36.1-2q37.3 | 219.844 | 243.416 | 23.6 | 216 | |
| PARK12 | 300557 | Xq21-q25 | 75.950 | 129.900 | 40.0 | 356 | |
| PARK13 | 610297 | 2p13.1-2p11.2 | 75.450 | 84.130 | 8.7 | 39 | |
| 601828 | NR4A2 | 2q22.1-2q23.3 | |||||
| 603779 | SNCAIP | 5q23.1-q23.3 | |||||
| 260540 | MAPT | 17q21.1 |
Genomic location of PARK loci as recorded in the OMIM databank. For seven of the PARK loci, the mutated genes were identified. The number of genes is the current GenBank estimation of all annotated and predicted genes in the corresponding PARK locus. For the PARK10 locus, we used the narrow definition 1p33-1p32.2 as determined by the two genetic markers D1S2134 and D1S200, and not the entire shorter arm of chromosome 1 (1p) which contains 1232 genes.21
Candidate genes in Orphan PARK loci
| No. of cDNA fragments aligning with the gene | Mouse ID | Human ID | Symbol | Human gene name | Position | Locus |
|---|---|---|---|---|---|---|
| 1 | NM_146169 | XM_376062 | KIAA1155 | 2p13.3 | Park3 | |
| 1 | NM_008717 | NM_014497 | ZFML | 2p13.2–p13.1 | Park3 | |
| 1 | NM_183138 | XM_371501 | MGC22014 | 2p13.1 | Park3 | |
| 3 | NM_080555 | NM_003713 | PPAP2B | 1p32 | Park10 | |
| 1 | AA819910 | Estimated | FAF1 | 1p33 | Park10 | |
| 6 | NM_009129 | NM_003469 | SCG2 | 2q35–q36 | PARK11 | |
| 3 | AK052241 | NM_005544 | IRS1 | 2q36 | PARK11 | |
| 1 | NM_152915 | NM_139072 | DNER | 2q37.1 | PARK11 | |
| 1 | NM_008440 | NM_004321 | KIF1A | 2q37.3 | PARK11 | |
| 2 | NM_024197 | NM_004544 | NDUFA10 | 2q37.3 | PARK11 | |
| 1 | NM_025437 | NM_001412 | EIF1AX | Xp22.13 | PARK12 | |
| 2 | NM_019768 | NM_012286 | MORF4L2 | Xq22 | PARK12 | |
| 3 | NM_011123 | NM_000533 | PLP1 | Xq22 | PARK12 | |
| 3 | NM_013898 | NM_004085 | TIMM8A | Xq22.1 | PARK12 | |
| 3 | NM_016783 | NM_006667 | PGRMC1 | Xq22–q24 | PARK12 | |
| 7 | NM_030688 | estimated | IL1RAPL2 | Xq22.2–q22.3 | PARK12 | |
| 1 | NM_133196 | NM_001325 | CSTF2 | Xq22.1 | PARK12 | |
| 1 | NM_025893 | NM_173798 | ZCCHC12 | Xq24 | PARK12 | |
| 2 | NM_172782 | NM_018698 | NXT2 | Xq23 | PARK12 |
Candidate genes for non-OMIM PARK loci
| GenBank ID | Human ID | Symbol | Human location | In kbb | Gene name | |
|---|---|---|---|---|---|---|
| Scott (01) D3S246026 | ||||||
| 5 | NM_008083 | NM_002045 | GAP43 | 3q13.1–13.2 | 116700 | |
| 3 | BB626331 | EST | Lsamp | 3q13.2–q21 | 117200 | |
| 2 | NM_177093 | XM_057296 | LRRC58 | 3q13.33 | 121300 | |
| 2 | NM_008047 | NM_007085 | FSTl | 3q13.32–q13.3 | 121460 | |
| Martinez (04) D5S47129 | ||||||
| 1 | XM_283496 | NM_005509 | DMXL1 | 5q22 | 118600 | |
| 1 | Genomic | Estimated | FEM1C | 5q22 | 114939 | fem-1 homolog c |
| 3 | NM_152809 | NM_004384 | CSNK1G3 | 5q23 | 123000 | |
| Li (02) D5S1462 D5S145321 | ||||||
| 1 | NM_172827 | EST | LNPEP | 5q15 | 96440 | |
| Hicks (02). Scott (01) D5S666. D5S81626,55 | ||||||
| 1 | NM_173753 | NM_001008738 | FNIP1 | 5q31.1 | 131060 | |
| 1 | NM_144823 | NM_015256 | ACSL6 | 5q31 | 131400 | |
| 1 | NM_033144 | XM_034872 | SEPT8 | 5q31 | 132180 | |
| 1 | AK011363 | NM_003337 | UBE2B | 5q23–q31 | 133800 | |
| Scott (01) D5S81626 | ||||||
| 1 | NM_029518 | NM_016604 | JMJD1B | 5q31 | 137810 | |
| 3 | NM_010771 | NM_018834 | MATR3 | 5q31.3 | 138730 | |
| Li (02) D6S101721 | ||||||
| 1 | NM_025365 | NM_013397 | C6ORF49 | 6p21.31 | 41800 | |
| 1 | NM_020493 | NM_003131 | SRF | 6p21.1 | 43200 | |
| 5 | NM_008302 | NM_007355 | HSP90AB1 | 6p12 | 44300 | |
| Martinez (04) D6S257 D6S46029 | ||||||
| 1 | Genomic | Estimated | 6q12–q13 | 72500 | ||
| 8 | NM_010106 | NM_001402 | EEF1A1 | 6q14.1 | 74224 | |
| Martinez (04) D7S53129 | ||||||
| 1 | NM_028469 | NM_032350 | MGC11257 | 7p22.3 | 850 | |
| 1 | NM_010302 | NM_007353 | GNA12 | 7p22–p21 | 2510 | |
| 6 | NM_007393 | NM_001101 | ACTB | 7p15–p12 | 5300 | |
| 1 | NM_026050 | NM_032706 | MGC12966 | 7p22.2 | 6110 | |
| 1 | NM_009007 | NM_006908 | RAC1 | 7p22 | 6170 | |
| DeStefano (01) D9S1825,56 Scott (01) D9S30126 | ||||||
| 7 | NM_026434 | NM_033117 | RBM18 | 9q34.11 | 120400 | |
| 2 | NM_022310 | NM_005347 | HSPA5 | 9q33–q34.1 | 123370 | |
| 1 | NM_025709 | NM_015635 | GAPVD1 | 9q34.11 | 123450 | |
| 1 | NM_172661 | XM_497080 | KIAA0515 | 9q34.1 | 129650 | |
| DeStefano (01) GATA121A0856 | ||||||
| 1 | NM_183295 | NM_015634 | KIAA1279 | 10q22.1 | 70100 | |
| Martinez (04) D11S417529 | ||||||
| 1 | NM_025844 | NM_012124 | CHORDC1 | 11q14.3 | 89650 | Cysteine and histidine-rich domain (CHORD)-containing. zinc-binding protein 1 |
| Li (02) D10S123921 | ||||||
| 1 | NM_172523 | NM_003054 | VMAT2 | 10q25 | 118680 | Solute carrier family 18 |
| Li (02) D17S130321 | ||||||
| 1 | NM_018768 | NM_004853 | STX8 | 17p12 | 9350 | Syntaxin 8 |
| Scott (01) D17S921, D17S129326 | ||||||
| 1 | NM_011664 | NM_018955 | UBB | 17p12–p11.2 | 16470 | |
| 1 | NM_011480 | NM_004176 | SREBF1 | 17p11.2 | 17950 | |
| 1 | XM_110937 | NM_145809 | USP32 | 17p11.2 | 18621 | |
| 1 | NM_026389 | NM_015584 | POLDIP2 | 17q11.2 | 26800 | |
| 1 | NM_174852 | NM_020889 | PHF12 | 17q11.1 | 27400 | |
| 1 | NM_010897 | NM_000267 | NF1 | 17q11.2 | 29700 | |
| 1 | NM_010161 | NM_014210 | EVI2A | 17q11.2 | 29800 | |
| 1 | NM_010716 | NM_002311 | LIG3 | 17q11.2–q12 | 33450 | |
| Bertoli-Avella (03) D19S22127 | ||||||
| 2 | NM_008319 | NM_003259 | ICAM5 | 19p13.2 | 10260 | |
| 16 | NM_016742 | NM_007065 | CDC37 | 19p13.2 | 10370 | |
| 1 | NM_145624 | NM_016264 | ZNF44 | 19p13.2 | 12200 | |
| 1 | NM_010906 | NM_002501 | NFIX | 19p13.3 | 13030 | |
| 1 | NM_183097 | Estimated | 19p13.13 | 14060 | ||
| DeStefano (02) D20S47856 | ||||||
| 1 | BQ927659 | Estimated | 20q11.2–q12 | 35330 | ||
| 1 | NM_013865 | NM_022477 | NDRG3 | 20q11.21–q11.23 | 36000 | |
| 1 | NM_010658 | NM_005461 | MAFB | 20q11.2–q13.1 | 40000 | |
| 2 | NM_021464 | NM_007050 | PTPRT | 20q12–q13 | 40500 | |
| DeStefano (02) D21S205256 | ||||||
| 2 | NM_11782 | Estimated | ADAMTS5 | 21q21.2 | 27170 | |
| Pankratz (03) DXS805524 | ||||||
| 1 | NM_016783 | NM_006667 | PGRMC1 | Xq24 | 116713 | |
Listed genes are situated ± 3 Mb to peak with the highest LOD score, except for D10S196 where we used ± 8Mb.
aNumber of cDNA fragments aligning with the gene.
bkb from the top of the short arm of the chromosome.
cHuman chromosome location was estimated by comparing the flanking regions of mouse and man.
cDNA library comparison
| Barrett52 | Stewart53 | Thuret54 | |
|---|---|---|---|
| Barrett | 45 (22) | 11 (2) | |
| Stewart | 35 (22) | 5 (4) | |
| Thuret | 3 (2) | 5 (4) |
Of 1435, 104 (7.2%) cDNA fragments overlap with sequences also present in one other library. This number includes not only fragments that align with each other, but also those which align with the same annotated gene, hypothetical gene, mitochondrial gene, EST, or genomic position. These overlapping 104 cDNA fragments represent 28 of 781 (3.6%) unique tags (Table 2).