| Literature DB >> 26370122 |
Flavio Pazos Obregón1, Cecilia Papalardo2, Sebastián Castro3, Gustavo Guerberoff4, Rafael Cantera5,6.
Abstract
BACKGROUND: Assembly and function of neuronal synapses require the coordinated expression of a yet undetermined set of genes. Although roughly a thousand genes are expected to be important for this function in Drosophila melanogaster, just a few hundreds of them are known so far.Entities:
Mesh:
Year: 2015 PMID: 26370122 PMCID: PMC4570697 DOI: 10.1186/s12864-015-1888-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Transcription profiles of the Drosophila genes to be classified and of the training set. a–b Temporal expression profiles of the 13,642 Drosophila genes that show transcription level above zero in at least one sample during embryonic, larval or pupal stages. Graphs were constructed with absolute values (FPKM, a) and with the same values after normalization between 0 and 1 (b, see Methods). c Transcription profiles of the 92 “synaptic genes” of the training set after normalization of the original values between 1 and 0. A clear correspondence is observed between what is expected for the mean transcription profile of Drosophila synaptic genes and the actual expression profiles of the genes of our training set. Along the life cycle of Drosophila a first wave of massive synaptogenesis takes place during the second half of embryonic life (samples 7–12), and a second wave occurs in the pupa (samples 19–24), when the synapses of the adult brain are being formed. Between these two waves of synaptogenesis a period of massive synapse disassembly takes place. The yellow line corresponds to the mean expression levels of the 92 synaptic genes and matches these three features. d Transcription profiles of the 397 “non-synaptic genes” of the training set, after normalization of the original values between 1 and 0. The green line corresponds to the mean expression levels of the 397 non-synaptic genes. (Original values published by Graveley et al. [32] and adapted as explained in Methods)
Fig. 2Misclassification error rates of the three adjusted models after 10-fold cross validation. Box plots of misclassification error rates of the three adjusted models: Random Forest (RF), Support Vector Machine (SVM) and k-Nearest Neighbors (k-NN), estimated by 10-fold cross validation as described in the text. In each box plot the black horizontal line represents the median value and the points outside the box correspond to values over or lower to 1.5 times interquartile range than the third or first quartile respectively
Fig. 3Number of genes classified as synaptic by each method as the classification threshold increases. Each column corresponds to the threshold that a gene’s estimated probability of being synaptic must exceed to be labeled as synaptic. Each row corresponds to one of the adjusted models or to one of their combinations. The last row shows the number of genes classified as synaptic by the three models. In the bottom panel the color areas of the Venn diagrams are proportional to the number of genes that they represent. The 13,153 genes to be classified are represented by the black circles and the number of genes classified as synaptic by each model, or combinations thereof, are represented in agreement with the color code shown to the left
Fig. 4Enrichment in selected synapse-related BP GO terms as the classification threshold increases. a–d show the enrichment of the catalogues generated by each classifier, in a representative selection of GO terms of relevance for the biological function “synapse”. For all classifiers, elevation of the threshold resulted in increased enrichment. This increase in functional enrichment is accentuated in the consensus catalogues, showing how the combined methods improved the three individual classifiers. a k-NN catalogues, b Random Forest catalogues, c SVM catalogues, d catalogues of genes classified as synaptic by the three classifiers. All enrichment values have an associated p-value lower than 10−4 and a FDR q-value lower than 10−3
Fig. 5Relation between percentages of genes with tissue-specific differential expression by tissue and classification thresholds. a–d show the relationship between tissue-specific differential expression, classification methods, and classification thresholds. Regardless of the classification method, all catalogues are enriched in genes expressed in the CNS at much higher levels than in tissues with fewer or none synapses. Notice that increasing the threshold did not result in an increment of the proportion of genes differentially expressed in tissues other than the CNS. a k-NN catalogues, b Random Forest catalogues, c SVM catalogues, d consensus catalogues. The threshold values are shown in the horizontal axis, with 0 corresponding to the initial set of genes to be classified. Each colored line corresponds to one of the five analyzed tissues according to the color code shown at the bottom
Putative synaptic genes with no synapse-related annotation in Drosophila and homologues annotated as synaptic in Homo sapiens
| Annotation symbol | Gene symbol | Mean prob. | Human homologue | HGI | GO term (in Homo sapiens) |
|---|---|---|---|---|---|
| CG7392 | Cka | 1.00 | STRN | ENSG00000115808 | CC: postsynaptic membrane |
| CG9634 | NA | 1.00 | MME | ENSG00000196549 | CC: synaptic vesicle |
| CG8529 | Dyb | 0.99 | DTNA | ENSG00000134769 | BP: synaptic transmission |
| CG7023 | Usp12–46 | 0.99 | USP46 | ENSG00000109189 | BP: regulation of synaptic transmission |
| CG16973 | msn | 0.99 | MINK1 | ENSG00000141503 | BP: synaptic transmission |
| CG6593 | Pp1alpha-96A | 0.99 | PPP1CA | ENSG00000172531 | CC: dendritic spine |
| CG15112 | ena | 0.99 | ENAH | ENSG00000154380 | CC: synapse |
| CG10545 | Gbeta13F | 0.99 | GNB2 | ENSG00000172354 | BP: synaptic transmission |
| CG14991 | Fit1 | 0.99 | FERMT1 | ENSG00000101311 | CC: synapse |
| CG1651 | Ank | 0.99 | ANK2 | ENSG00000145362 | CC: postsynaptic membrane |
| CG32717 | sdt | 0.99 | MPP4 | ENSG00000082126 | CC: presynaptic membrane |
| CG5248 | loco | 0.99 | RGS14 | ENSG00000169220 | BP: long-term synaptic potentiation |
| CG7147 | kuz | 0.99 | ADAM10 | ENSG00000137845 | CC: postsynaptic density |
| CG10566 | NA | 0.99 | ICA1 | ENSG00000003147 | CC: synaptic vesicle membrane |
| CG32264 | NA | 0.99 | PHACTR1 | ENSG00000112137 | CC: synapse |
| CG13830 | NA | 0.99 | SPOCK2 | ENSG00000107742 | BP: synapse assembly |
| CG10637 | Nak | 0.98 | AAK1 | ENSG00000115977 | CC: terminal bouton (of the axon) |
| CG3269 | Rab2 | 0.98 | RAB14 | ENSG00000119396 | BP: neurotransmitter secretion |
| CG10011 | NA | 0.98 | ANK2 | ENSG00000145362 | CC: postsynaptic membrane |
| CG8440 | Lis-1 | 0.98 | PAFAH1B1 | ENSG00000007168 | BP: synaptic transmission |
| CG5650 | Pp1-87B | 0.98 | PPP1CA | ENSG00000172531 | CC: dendritic spine |
| CG10579 | Eip63E | 0.98 | CDK16 | ENSG00000102225 | CC: synaptic vesicle |
| CG10538 | CdGAPr | 0.98 | ARHGAP32 | ENSG00000134909 | CC: postsynaptic membrane |
| CG7535 | GluClalpha | 0.98 | CHRNA5 | ENSG00000169684 | BP: synaptic transmission |
| CG8726 | NA | 0.98 | PXK | ENSG00000168297 | BP: regulation of synaptic transmission |
| CG30389 | NA | 0.98 | TMEM57 | ENSG00000204178 | CC: synapse part |
| CG7546 | NA | 0.98 | BAG6 | ENSG00000204463 | BP: synaptonemal complex assembly |
| CG1506 | Ac3 | 0.98 | ADCY3 | ENSG00000138031 | BP: synaptic transmission |
| CG6214 | MRP | 0.98 | ABCC8 | ENSG00000006071 | BP: synaptic transmission |
| CG4574 | Plc21C | 0.98 | PLCB1 | ENSG00000182621 | BP: synaptic transmission |
| CG11734 | HERC2 | 0.98 | HERC1 | ENSG00000103657 | MF: neurotrans:Na symporter activity |
| CG42829 | CadN2 | 0.97 | CDH2 | ENSG00000170558 | CC: synapse |
| CG6383 | crb | 0.97 | DNER | ENSG00000187957 | BP: synapse assembly |
| CG1862 | Ephrin | 0.97 | EFNB1 | ENSG00000090776 | CC: synapse |
| CG7100 | Cadherin-N | 0.97 | CDH1 | ENSG00000039068 | BP: synapse assembly |
| CG8948 | Graf | 0.97 | OPHN1 | ENSG00000079482 | BP: synaptic vesicle endocytosis |
| CG9361 | Task7 | 0.97 | KCNK3 | ENSG00000171303 | BP: synaptic transmission |
| CG6998 | ctp | 0.97 | DYNLL2 | ENSG00000121083 | BP: synaptic target recognition |
| CG18455 | Optix | 0.97 | SIX1 | ENSG00000126778 | BP: reg. of synaptic growth at nj |
| CG5912 | arr | 0.96 | LRP6 | ENSG00000070018 | BP: synaptic transmission CC: synapse |
| CG8261 | Ggamma1 | 0.96 | GNG10 | ENSG00000242616 | BP: synaptic transmission |
| CG2849 | Rala | 0.96 | RIT2 | ENSG00000152214 | BP: synaptic transmission |
| CG32217 | Su(Tpl) | 0.96 | MARVELD2 | ENSG00000152939 | MF: neurotrans:Na symporter activity |
| CG17336 | Lcch3 | 0.96 | GABRB1 | ENSG00000163288 | BP: synaptic transmission |
| CG15274 | GABA-B-R1 | 0.96 | GABBR1 | ENSG00000204681 | BP: synaptic transmission |
| CG4244 | Su(dx) | 0.96 | NEDD4 | ENSG00000069869 | BP: regulation of synapse organization |
| CG16757 | Spn | 0.96 | PPP1R9A | ENSG00000158528 | CC: synapse |
| CG4625 | Dhap-at | 0.96 | GNPAT | ENSG00000116906 | BP: synapse assembly |
| CG32434 | siz | 0.96 | IQSEC3 | ENSG00000120645 | CC: inhibitory synapse |
| CG9491 | Gef26 | 0.95 | RAPGEF2 | ENSG00000109756 | BP: regulation of synaptic plasticity |
| CG42314 | PMCA | 0.95 | ATP2B2 | ENSG00000157087 | BP: regulation of synaptic plasticity |
| CG7223 | htl | 0.95 | FGFR2 | ENSG00000066468 | BP: synaptic vesicle transport |
| CG9375 | Ras 85D | 0.95 | HRAS | ENSG00000174775 | BP: long-term synaptic plasticity |
| CG30388 | Magi | 0.95 | MAGI2 | ENSG00000187391 | CC: synapse |
| CG11958 | Cnx99A | 0.95 | CANX | ENSG00000127022 | BP: synaptic vesicle endocytosis |
| CG9985 | sktl | 0.94 | PIP5K1C | ENSG00000186111 | BP: synaptic vesicle exo and endocitosis |
| CG8726 | NA | 0.94 | KCNK18 | ENSG00000186795 | BP: synaptic transmission |
| CG7641 | Nca | 0.94 | NCALD | ENSG00000104490 | BP. synaptic transmission |
| CG8394 | VGAT | 0.94 | SLC32A1 | ENSG00000101438 | BP: synaptic transmission |
| CG3585 | Rbcn-3A | 0.94 | DMXL2 | ENSG00000104093 | CC: synaptic vesicle |
| CG7558 | Arp3 | 0.94 | ACTR3 | ENSG00000115091 | CC: excitatory synapse |
| CG14145 | Blos2 | 0.94 | BLOC1S2 | ENSG00000196072 | BP: synaptic vesicle transport |
| CG1407 | NA | 0.94 | ZDHHC15 | ENSG00000102383 | BP: synaptic vesicle maturation |
| CG31196 | 14-3-3epsilon | 0.94 | YWHAE | ENSG00000128245 | BP: regulation of synaptic plasticity |
| CG16928 | mre11 | 0.94 | MRE11A | ENSG00000020922 | CC: synapsis |
| CG8705 | pnut | 0.92 | SEPT5 | ENSG00000184702 | BP: synaptic vesicle targeting |
The table shows the list of Drosophila genes belonging to our final catalogue of putative synaptic genes that don’t have any synapse related GO term annotation but have at least 1 homologue in Homo sapiens already annotated with some synapse related GO term. The first 2 columns show the Annotation Symbol as well as the Symbol of these 66 Drosophila genes. The third column shows the mean between the classification probabilities given by the three learning algorithms to each gene. The following columns show the symbol and gene identifier of the corresponding human homologue (or 1 of them when there are more than 1) that is already annotated with at least 1 synapse related GO term in humans, an annotation that is shown in the last column of the table
Genes in Wilhelm et al. [53] list that are neither in our training set nor annotated as synaptic in Drosophila
| Rat protein | Drosophila homologue | Initial catalogue | Final catalogue |
|---|---|---|---|
| VGlut 1/2 | CG10069 | Yes | Yes |
| Calmodulin | CG8472 | Yes | No |
| NSF | CG31495 | Yes | No |
| AP-2 mu2 | CG10637 | Yes | Yes |
| SGIP1 | CG8176 | Yes | Yes |
| endophilin II | CG9834 | Yes | Yes |
| Hsc70 | CG8937 | No | No |
| PIPK Ig | CG9985 | Yes | Yes |
| Vti1a | CG3279 | Yes | No |
| VAMP4 | CG1599 | Yes | No |
| Calbindin | CG6702 | No | No |
The first column shows the names of the rat proteins whose Drosophila homologues are neither in our training set nor already annotated as synaptic genes in the fly. The second column shows the name of the corresponding Drosophila homologue. The third column indicates whether these Drosophila genes were classified as synaptic with a probability higher than 0.5 by the 3 algorithms. The fourth column indicates if the genes were also classified as synaptic by the 3 algorithms with a probability above 0.9