| Literature DB >> 28638238 |
Larissa Catharina1, Carlyle Ribeiro Lima2,3, Alexander Franca4, Ana Carolina Ramos Guimarães4, Marcelo Alves-Ferreira1, Pierre Tuffery3, Philippe Derreumaux2, Nicolas Carels1.
Abstract
We present an approach for detecting enzymes that are specific of Leishmania major compared with Homo sapiens and provide targets that may assist research in drug development. This approach is based on traditional techniques of sequence homology comparison by similarity search and Markov modeling; it integrates the characterization of enzymatic functionality, secondary and tertiary protein structures, protein domain architecture, and metabolic environment. From 67 enzymes represented by 42 enzymatic activities classified by AnEnPi (Analogous Enzymes Pipeline) as specific for L major compared with H sapiens, only 40 (23 Enzyme Commission [EC] numbers) could actually be considered as strictly specific of L major and 27 enzymes (19 EC numbers) were disregarded for having ambiguous homologies or analogies with H sapiens. Among the 40 strictly specific enzymes, we identified sterol 24-C-methyltransferase, pyruvate phosphate dikinase, trypanothione synthetase, and RNA-editing ligase as 4 essential enzymes for L major that may serve as targets for drug development.Entities:
Keywords: AnEnPi; Leishmaniasis; genomics; metabolism; sequence homology; specific enzymes
Year: 2017 PMID: 28638238 PMCID: PMC5470852 DOI: 10.1177/1177932217712471
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Figure 1Flowchart of strictly specific enzyme depuration.
Figure 2Homologous hit search algorithm for the classification of false positives of parasite-specific genes in humans. (A) The 11 possibilities of Ensembl protein associations (S2-E2) that one may obtain with a human genomic region that is homologous (tBLASTn) to a parasite protein query (S1-E1). S and E are for the beginning (start) and end of a tBLASTn homology or human gene coordinates. The Boolean description of each association between a tBLASTn hit in a human genome region and the Ensembl proteins annotated in that region is given on the left and right sides of panel A. “&” is used here in its Boolean sense, ie, a logical AND. Human genes for the Ensembl protein that are eventually compatible with a parasite’s homologous counterpart (tBLASTn) are modeled by thin lines. Human genes for Ensembl proteins in the same genomic region as a parasite’s homologous counterpart, but which are not compatible with it, are represented by dashed lines. (B): The decision tree for TRUE and FALSE associations of the human genes for the Ensembl proteins with a human genomic region that had a homology (tBLASTn) hit with a parasite protein.
Figure 3Search scheme for false positives of parasite-specific genes in human. The (A) homologous hit of Leishmania major and human proteins is compared with (B) the human gene structure obtained by tBLASTn search and with (C) the CCDS annotation from Havana.
Sequence materials of some important human parasites for comparison with the strictly specific enzymes of Leishmania major.
Figure 4Flowchart of the sequence depuration process in Leishmania major compared with Homo sapiens. Nrd stands for nonredundant.
Homo sapiens proteins retrieved from nr using the DNA stretch of the subject, which corresponded to the homologous region between the Leishmania major proteins and the human genome, as a query.
EC number annotations of the human proteins that are homologous to the Leishmania major enzymes classified as functionally specific.
Putative functional and strictly specific enzymes of Leishmania major that share ECs.
Strictly specific enzymes of Leishmania major after hidden Markov model filtering.
Distribution of strictly specific enzymes in human parasites.
Figure 5Phylogenetic tree of REL1 in Metakinetoplastina as obtained by maximum likelihood. Scale bar: 0.05 substitutions per site.