Literature DB >> 25606461

Comparative in-silico genome analysis of Leishmania (Leishmania) donovani: A step towards its species specificity.

Satheesh Kumar S1, Gokulasuriyan R K1, Monidipa Ghosh1.   

Abstract

Comparative genome analysis of recently sequenced Leishmania (L.) donovani was unexplored so far. The present study deals with the complete scanning of L. (L.) donovani genome revealing its interspecies variations. 60 distinctly present genes in L. (L.) donovani were identified when the whole genome was compared with Leishmania (L.) infantum. Similarly 72, 159, and 265 species specific genes were identified in L. (L.) donovani when compared to Leishmania (L.) major, Leishmania (L.) mexicana and Leishmania (Viannia) braziliensis respectively. The cross comparison of L. (L.) donovani in parallel with the other sequenced species of leishmanial led to the identification of 55 genes which are highly specific and expressed exclusively in L. (L.) donovani. We found mainly the discrepancies of surface proteins such as amastins, proteases, and peptidases. Also 415 repeat containing proteins in L. (L.) donovani and their differential distribution in other leishmanial species were identified which might have a potential role during pathogenesis. The genes identified can be evaluated as drug targets for anti-leishmanial treatment, exploring the scope for extensive future investigations.

Entities:  

Keywords:  A2 gene family; Amastin; Apical Membrane Antigen 1; Species specific genes; Visceral leishmaniasis

Year:  2014        PMID: 25606461      PMCID: PMC4287845          DOI: 10.1016/j.mgene.2014.10.003

Source DB:  PubMed          Journal:  Meta Gene        ISSN: 2214-5400


Introduction

Leishmaniasis is a vector-borne parasitic disease caused by obligate intracellular protozoa of the genus Leishmania. Leishmaniasis, an endemic disease of tropical and subtropical regions is the second-largest parasitic killer in the world (after malaria), responsible for an estimated 2 million cases each year and 350 million people at risk worldwide clearly imposing a major health problem globally except Australia and Antarctica (Desjeux, 2001). Approximately 88 countries are named to be infected with this sand fly borne disease. Although leishmaniasis is represented by at least 20 leishmanial species, the disease spectrum is generally categorized into 1) cutaneous leishmaniasis, 2) mucocutaneous leishmaniasis and 3) visceral leishmaniasis, depending upon the tropism of the disease and the species causing the infection. Visceral leishmaniasis which is the most severe and deadly form is caused by the old world species Leishmania (L.) donovani and Leishmania (L.) infantum (Africa, Asia, Europe) and the new world species L. (L.) infantum (South America). India, Bangladesh, Nepal, Sudan, and Brazil have been reported to have more than 90% of the world's cases of visceral leishmaniasis with an estimated incidence of 500,000 new cases and 60,000 deaths each year (News, 2006). The severity of the disease is further augmented by drug resistance and drug failure, particularly in L. (L.) donovani strains of India and Nepal, which has been recently documented (Downing et al., 2011). So the raising future risks by this neglected tropical disease among the neglected populations of the world made WHO to include leishmaniasis among the six major diseases targeted for intense research and control. Genome comparison of Leishmania (L.) major, L. (L.) infantum, Leishmania (Viannia) braziliensis showed great conservation of synteny and identifies only a small number of genes (approx 200) which are differentially distributed (Toledo et al., 2010, Peacock et al., 2007). These species specific genes may be a key factor for difference in pathogenesis between the species. Visceral leishmaniasis poses a fatality rate of greater than 100% within two years if untreated (Peacock et al., 2007). Since the genome information of L. (L.) donovani is very recent, so far, research on visceral leishmaniasis was mostly dependent on L. (L.) infantum genome information (Chappuis et al., 2007). Also most of the drugs commonly used to treat visceral leishmaniasis are toxic and exert unacceptable side effects (Khan et al., 2010). Though many control programs have been organized to control visceral leishmaniasis in Indian sub-continent, the success rate is severely compromised by the developing resistant strains of L. (L.) donovani at least in parts of India, particularly North Bihar and West Bengal. The diverse intra-strain genetic variability and drug resistance developed by the most severe L. (L.) donovani in Indian subcontinent and Nepal imposed the recent concern in the field of visceral leishmaniasis treatments (Pourshafie et al., 2004). So the treatment and control of leishmaniasis caused by L. (L.) donovani indeed require novel drug alternatives. Although the outcome of infections and presentation of disease depend on many factors such as host immune response, host genetic variability, vector, and protozoan species (Guerin et al., 2002), it is obviously the specific genes of the species which determines the spectrum and severity of the disease. In view of the obvious clinical importance of this human pathogen, a genomic approach is highly desirable and may give insight into the complex mechanisms of pathogenesis. Here we report the comparative genome analysis of L. (L.) donovani with the other sequenced leishmanial species. This study therefore frameworks the experimental verification of few significant genes, consistent with independent existence, to set an avenue of genomic aspect of drug targeting to overcome the current problems in an effective way.

Results

Genome information of L. (L.) donovani

Despite many years of evolution, the genome content of L. (L.) donovani was greatly conserved with the other four leishmanial species except few specific genes. The lack of vast diversity among the leishmanial species over estimated 20–100 Ma of evolution may be due to lack of some machinery that causes diversity in eukaryotes and probably lack of transposable elements in the leishmanial species might be the cause (Khan et al., 2010). Though the presence of retrotransposons and RNAi machinery in Trypanosoma brucei and L. braziliensis was clearly reported, the evolutionary loss of these elements in leishmanial species preserves their genome content (Bringaud et al., 2006, Villanueva et al., 1991). Similarly the L. (L.) donovani genome lacks the retrotransposons and RNAi machinery. L. (L.) donovani contains 36 chromosomes and has a haploid genome size of 32.4 Mb. L. (L.) donovani genome encodes 8032 proteins, out of which 42.84% proteins were found to be functional homologs of other leishmanial species, 56.34% proteins lack functional assignment and the remaining ~ 0.8% proteins were exclusive to L. (L.) donovani. Approximately 5.6% genes were identified to code for repeat containing proteins which were conserved and probably might play a huge role in pathogenesis which was discussed elsewhere. The summary of L. (L.) donovani genome information was given in Supplementary Table S1.

Comparative analysis of L. (L.) donovani with the other sequenced leishmanial species

L. (L.) donovani species specific genes

Though L. (L.) donovani disease tropism differs greatly from other leishmanial species, till now no specific genes were reported for difference in disease presentation, except for the A2 gene family which was reported to be involved in the survival of the parasite in visceral organs (Zhang et al., 2003). Keeping this gap area in mind, we compared the proteome of L. (L.) donovani with the other four leishmanial species which identifies 55 gene coding proteins which were specific and expressed exclusively in L. (L.) donovani. The list of species specific genes of L. (L.) donovani which has been ascribed putative function was given in Table 1. A total list of L. (L.) donovani species specific genes were given in Supplementary Table S2. Out of the 55 L. (L.) donovani specific genes, only 36 genes were assigned putative function based on homology searches and Gene Ontology. The remaining genes encode for hypothetical proteins with conserved domain or unknown function which requires experimental documentation. Signal peptides were also detected for five specific L. (L.) donovani proteins which may have antigenic role in leishmanial pathogenesis. Among the 36 genes which encode proteins with putative function, few proteins were membrane related proteins, of which the important proteins being amastin like surface protein, lipophosphoglycan biosynthetic protein and phosphoglycan 1,3 galactosyltransferase which might have prime roles in pathogenesis, though the way or the mechanism it differs from other species was not clearly understood. Also 6 ribosomal genes specific to L. (L.) donovani were reported, though the exact involvement of the product of these genes in pathogenesis was unknown. In addition 2 specific genes (gene IDs: LdBPK_020010, LdBPK_260590) of L. (L.) donovani encode peptidases and heat shock proteins which were well known to have been involved directly or indirectly in pathogenesis. Also 3 other genes (gene IDs: LdBPK_252230, LdBPK_271710, LdBPK_040310) involved in sugar metabolism were identified which might possibly be involved in virulence and pathogenesis (Loughman and Caparon, 2006, Moyrand et al., 2007). Though the relationship between sugar metabolism and virulence remains greatly undefined, the novel proteins encoded by these genes might be involved in sugar metabolism of L. (L.) donovani and may influence the virulence in an unknown manner.
Table 1

Species specific L. (L.) donovani proteins.

FunctionsL. (L.) donovaniaProtein ID of L. (L.) donovaniL. (L.) infantumL. (L.) majorL. (Viannia) braziliensisL. (L.) mexicana
60s ribosomal, putativeLdBPK_150220CBZ32839.1
Histone H3, putativeLdBPK_160600CBZ33039.1
Paraflagellar rod protein 2CLdBPK_161520CBZ33130.1
Elongation factor 1-alphaLdBPK_170200CBZ33167.1
ATG8/AUT7/APG8/PAZ2, putativeLdBPK_190850CBZ33555.1
Glycerol uptake protein, putativeLdBPK_191310CBZ33599.1
Aminoacylase, putativeLdBPK_201730CBZ33807.1
Cornichon homolog (Drosophila), isoformLdBPK_240080CBZ34396.1
CMP-sialic acid transporter, putativeLdBPK_240350CBZ34423.1
Eukaryotic initiation factor 5a, putativeLdBPK_250760CBZ34706.1
Succinyl-CoA synthetase α subunitLdBPK_252230CBZ34853.1
Aminopeptidase P1, putativeLdBPK_020010CBZ34885.1
10 kDa heat shock protein, putativeLdBPK_260590CBZ34948.1
Phosphoenolpyruvate carboxykinaseLdBPK_271710CBZ35328.1
Heat-shock protein hsp70, putativeLdBPK_282960CBZ35699.1
AMA1 protein, putativeLdBPK_301490CBZ36170.1
PolyubiquitinLdBPK_090950CBZ36500.1
Glutaminyl-peptide cyclotransferaseLdBPK_312030CBZ36603.1
Phosphoglycan β 1,3 galactosyltransferaseLdBPK_210010CBZ36730.1
Ribosomal protein L3, putativeLdBPK_323330CBZ37061.1
40S ribosomal protein S3, putativeLdBPK_151010CBZ37261.1
Amastin-like surface protein, putativeLdBPK_342650CBZ37742.1
Lipophosphoglycan biosynthetic proteinLdBPK_044281CBZ37906.1
40S ribosomal protein S6, putativeLdBPK_212150CBZ38085.1
60S ribosomal protein L22, putativeLdBPK_364640CBZ38901.1
Beta-fructofuranosidase, putativeLdBPK_040310CBZ31370.1
Metalloendopeptidase OMA1LdBPK_041090CBZ31447.1
ATPase alpha subunitLdBPK_050500CBZ31513.1
ATP-binding cassette protein subfamily GLdBPK_060100CBZ31594.1
U box domain-containing proteinLdBPK_070110CBZ31730.1
Amastin-like proteinLdBPK_080760CBZ31918.1
Translation initiation factor EIF-2B gammaLdBPK_091140CBZ32073.1
Cathepsin L-like proteaseLdBPK_080950CBZ31936.1
Histone H3LdBPK_101050CBZ32222.1
60S ribosomal protein L6, putativeLdBPK_151060CBZ37264.1
Flagellar radial spoke protein-likeLdBPK_290690CBZ35793.1

The IDs represent the GeneDb IDs of L. (L.) donovani.

Interestingly two novel and specific genes of L. (L.) donovani that encode 1) Apical Membrane Antigen 1 (AMA1) (gene ID: LdBPK_301490) and 2) cathepsin L like protease (gene ID: LdBPK_080950) were identified and its involvement in pathogenesis was discussed as follows.

Apical Membrane Antigen 1 (AMA1), highly suspected protein in parasite interaction and invasion

Apical Membrane Antigen 1 was documented as a protein directly involved in invasion of apicomplexon parasites into the host (Tonkin et al., 2011). In plasmodium and toxoplasma, the AMA1 proteins were secreted from microneme of rhoptries and it was targeted to the apical membrane where it gets integrated with the parasite plasma membrane. Earlier studies with plasmodium clearly showed that the integrated AMA1 forms a complex with RON2 which in turn helps the parasite to attach with the host cell to promote invasion. The involvement of AMA1 in signaling and parasite replication was also documented in toxoplasma (Santos et al., 2011). In conjunction, three AMA1 genes were identified in L. (L.) donovani out of which two were conserved in all leishmanial species and the third one was specific to L. (L.) donovani. The homology search showed very less sequence similarity between AMA1 from plasmodium and leishmanial species. The absence of microneme and RON2 in Leishmania species clearly indicates the absence of this mechanism in leishmanial species. Surprisingly the Gene Ontology (GO) studies of specific AMA1 (gene ID: LdBPK_301490) reported in L. (L.) donovani shows cholesterol binding ability. The importance of host membrane cholesterol in L. (L.) donovani infection was already well documented (Pucadyil and Chattopadhyay, 2006). In L. (L.) donovani, cholesterol is required for binding and internalization of the parasite inside a host cell (Pucadyil et al., 2004, Tewary et al., 2006). Similar phenomenon was also documented in L. (L.) infantum (Rodriguez et al., 2006). Also the L. (L.) donovani specific AMA1 is leucine rich protein and the significance of leucine residues in interacting with host cell membrane was already documented in many organisms including leishmanial species (Kedzierski et al., 2004). Also signal peptide was detected which possibly confers that the protein was secretory. All these clues together made us to hypothesize that in L. (L.) donovani, AMA1 is secreted from an unknown organelle into the apical membrane of L. (L.) donovani where it interacts with the cholesterol of host membrane through leucine residues and helps in parasite internalization. The species specific expression of this protein in L. (L.) donovani indicates the probable role of this protein in visceralization, which needs future experimentations. The ongoing experimental studies in our laboratory regarding AMA1 will be a huge hope in the future for anti-leishmanial therapy.

Cathepsin L like protease

Cathepsin L like protease, a type of lysosomal endopeptidases, has been already identified as class of drug targets which participate in many essential biological processes of the parasite such as embryogenesis, molting, and immune evasion (Lustigman et al., 2004, Dalton et al., 2003). Cathepsin L like protease is expressed in both promastigote and amastigote of leishmanial species (Sakanari et al., 1997), and probably the novel gene (gene ID: LdBPK_080950) from L. (L.) donovani which encodes cathepsin L like protease may assist in evading the host immune system.

Differential gene distribution in leishmanial species

Further the comparative analysis establishes few genes specific between the L. (L.) donovani and other leishmanial species. L. (L.) donovani was found to be closest with L. (L.) infantum and there were only 60 genes which were distinctly observed between L. (L.) donovani and L. (L.) infantum; 72 L. (L.) donovani specific genes were distinguished while comparing with L. (L.) major; 159 specific genes were found distinct while comparing with Leishmania (L.) mexicana; 265 specific genes were found distinct while comparing with L. (Viannia) braziliensis. The comparison was depicted in Fig. 1. The lists of few important genes and its protein product from L. (L.) donovani which share homology with some leishmanial species but not conserved with the other leishmanial species were given in Table 2. The full list of such genes including those which encode hypothetical proteins was given in Supplementary Table S3. The Gene Ontology (GO) for the L. (L.) donovani genes which were observed in few leishmanial species (~ 83 genes which encode functional proteins) showed that these genes mainly encode lipid containing membrane proteins, possibly involved in binding. Also some genes encode proteins which contain the enzymatic activity, like kinase proteins were expected to be involved in signal transduction or sugar metabolism. The Gene Ontology (GO) results were depicted in Fig. 2.
Fig. 1

Comparative analysis of L. (L.) donovani with four other leishmanial species. Individual circles represent the individual leishmanial species. The numbers in the main part of the individual circles represent the total number of protein coding genes of individual leishmanial species and the numbers after the slash represent the genes encoding the number of proteins which are distinct between L. (L.) donovani and any of the other four leishmanial species. The portion which was shared by two circles was used to represent the genes encoding the proteins which were homologous between any two leishmanial species. L. (L.) donovani, L. (L.) infantum, L. (L.) major, L. (L.) mexicana, and L. (Viannia) braziliensis are represented as L. donovani, L. infantum, L. major, L. mexicana and L. braziliensis respectively.

Table 2

Comparative analysis of L. (L.) donovani with the other four leishmanial species.

FunctionsL. (L.) donovaniaProtein ID of L. (L.) donovaniL. (L.) infantumaL. (L.) majoraL. (Viannia) braziliensisaL. (L.) mexicanaa
Protein kinase, putativeLdBPK_220370CBZ34082.1LinJ.22.0370LmjF.22.0130
RNA-binding protein, putative, UPB1LdBPK_250500CBZ34680.1LinJ.25.0500
Kinetoplast-associated protein-like proteinLdBPK_270240CBZ35185.1LinJ.27.0250LmjF.27.0745LbrM.27.0030
3′-Nucleotidase/nuclease, putativeLdBPK_312370CBZ36636.1LinJ.31.2370LmjF.31.2300LbrM.31.3140
Amastin-like surface protein, putativeLdBPK_342660CBZ37743.1LinJ.34.2660LbrM.34.3890
Ribosomal protein S11 homologLdBPK_211790CBZ33991.1
RNA-editing complex protein MP81, putativeLdBPK_020380CBZ31208.1LinJ.02.0380LmjF02.0100
Oxidoreductase-like proteinLdBPK_020700CBZ31240.1LinJ.02.0700LmjF.27.2650
GP63, leishmanolysin, metallo-peptidaseLdBPK_100520CBZ32170.1LinJ.10.0520LmjF.10.0425
40S ribosomal protein S4, putativeLdBPK_131130CBZ32612.1LinJ.13.1130LbrM.13.1160
U1 small nuclear ribonucleoproteinLdBPK_161690CBZ33146.1LinJ.16.1690LbrM.16.1700
ATPase subunit 9, putativeLdBPK_260450CBZ34934.1LinJ.26.0450LmjF.26.0460
BET1-like protein, putativeLdBPK_290650CBZ35789.1LinJ.29.0650
Amastin, putativeLdBPK_093030CBZ35869.1LinJ.29.3030LmjF.34.1080
Amastin, putativeLdBPK_291450CBZ35870.1LinJ.29.3030LmjF.34.1080
Calcium-binding protein, putativeLdBPK_301300CBZ36151.1LinJ.30.1300LmjF.30.1240LbrM.30.1300
AAA family ATPase-like proteinLdBPK_302520CBZ36273.1LinJ.30.2520LmjF.30.2500LbrM.30.2930
Tryparedoxin-like proteinLdBPK_312010CBZ36601.1LinJ.31.2010
Helicase-like protein, DNA repairLdBPK_321670CBZ36898.1LinJ.32.1670LmjF.32.1630LbrM.32.1680
Amastin-like surface protein, putativeLdBPK_341720CBZ37650.1LinJ.34.1720LbrM.34.1870
Endonuclease/exonuclease/phosphatase,LdBPK_361210CBZ38559.1LinJ.36.1210LmjF.36.1395LbrM.35.0700
Mitochondrial carrier protein, putativeLdBPK_367030CBZ39138.1LinJ.36.7030LmjF.36.6785
Cytochrome b5-like proteinLdBPK_091580CBZ32116.1LinJ.09.1580LmjF.09.1500LmxM.09.1490
ATP-binding cassette protein subfamily A,LdBPK_111220CBZ32385.1LinJ.11.1230LmjF.11.1230LmxM.11.1240
Alg9-like mannosyltransferase, putativeLdBPK_120145CBZ32416.1LinJ.12.0140LmjF.12.0160LmxM.12.0160
Nucleotide sugar transporter, putativeLdBPK_150900CBZ32906.1LinJ.15.0900LmjF.15.0840LmxM.15.0840
Product = P-type H-ATPase, putativeLdBPK_181490CBZ33456.1LinJ.18.1510LmjF.18.1510LbrM.18.1690LmxM.18.1520
Product = 4-coumarate:coa ligase-like proteinLdBPK_190950CBZ33564.1LinJ.19.0960LmjF.19.0985LmxM.19.0995
Product = 4-coumarate:coa ligase-like proteinLdBPK_190960CBZ33565.1LinJ.19.0960LmjF.19.0985LmxM.19.0995
Calmodulin, putativeLdBPK_131060CBZ32605.1LinJ.13.1060LmjF.13.1160LmxM.13.1160
EF hand-like proteinLdBPK_131490CBZ32647.1LinJ.13.1490LmjF.13.1450LmxM.13.1450
Fatty acyl CoA synthetase 2, putativeLdBPK_010530CBZ31140.1LinJ.01.0530LmjF.01.0500LmxM.01.0500
Calpain-like cysteine peptidase, putativeLdBPK_201210CBZ33756.1LinJ.20.1210LmjF.20.1180LbrM.20.2800LmxM.20.1180
Pumilio protein 9, putativeLdBPK_201420CBZ33777.1LinJ.20.1420LmjF.20.1475LbrM.20.4700LmxM.20.1370a
Cyclase associated proteinLdBPK_211110CBZ33923.1LinJ.21.1110LmjF.21.0891LbrM.21.0880LmxM.21.0891
Hypothetical protein, conservedLdBPK_211770CBZ33989.1LinJ.21.1770LmjF.21.1530LmxM.21.1530
Ring finger protein 138LdBPK_220070CBZ34052.1LinJ.22.0070LmjF.22.0190LmxM.22.0190
Protein–tyrosine phosphatase-like proteinLdBPK_220120CBZ34057.1LinJ.22.0120LmjF.22.0250LbrM.22.0190LmxM.22.0250
Methylenetetrahydrofolate dehydrogenaseLdBPK_220220CBZ34067.1LinJ.22.0220LmjF.22.0340LbrM.22.0240LmxM.22.0340
A2 proteinLdBPK_220670CBZ34112.1LinJ.22.0670LmxM.22.0692
40S ribosomal protein L14, putativeLdBPK_221410CBZ34182.1LinJ.22.1410LmjF.22.1680LbrM.22.1410
Ubiquitin-activating enzyme, putativeLdBPK_240470CBZ34435.1LinJ.24.0470LmjF.24.0460LbrM.24.0710LmxM.24.0460
Acylphosphatase, putativeLdBPK_252040CBZ34834.1LinJ.25.2040LmxM.25.1960
Tagatose-6-phosphate kinase-like proteinLdBPK_252550CBZ34886.1LinJ.25.2550LmjF.25.2440LmxM.25.2440
ATPase beta subunit, putativeLdBPK_052580CBZ34888.1LinJ.25.1210LmjF.25.1170LmxM.25.1170
Ghistone H1 likeLdBPK_271511CBZ35309.1LinJ.27.1511LmjF.27.1605LmxM.27.1605
Tuzin like protein, putativeLdBPK_093020CBZ35871.1LinJ.29.3020LmjF.34.1590LmxM.33.1830
Histone H2A, putativeLdBPK_291850CBZ35910.1LinJ.29.1850LmxM.21.0915
Histone H2A, putativeLdBPK_291860CBZ35911.1LinJ.29.1870LmxM.21.0915
Histone H2A, putativeLdBPK_291870CBZ35912.1LinJ.29.1870LmxM.21.0915
Poly(A) polymerase, putativeLdBPK_292710CBZ35996.1LinJ.29.2710LmjF.29.2600LbrM.29.2910LmxM.08_29.2600
Phosphoglycan β 1,2 arabinosyltransferaseLdBPK_020190CBZ31189.1LinJ.02.0190LmjF.02.0180LbrM.02.0030LmxM.33.0510
Ribosomal RNA processing protein, putativeLdBPK_020290CBZ31199.1LinJ.02.0290LmjF.02.0320LbrM.02.0060LmxM.02.0320
Amino acid permease 3LdBPK_310900CBZ36492.1LinJ.31.0910LmjF.31.0870LmxM.30.0880
Sodium stibogluconate resistance protein,LdBPK_073400CBZ36733.1LinJ.31.0950LmjF.31.0950
Tuzin-like proteinLdBPK_341160CBZ37596.1LinJ.34.1160LmjF.34.2800LbrM.34.0500LmxM.36.1280
Serine acetyltransferaseLdBPK_342710CBZ37748.1LinJ.34.2710LmjF.34.2850LmxM.33.2850
RNA editing associated helicase 2, putativeLdBPK_343010CBZ37778.1LinJ.34.3010LbrM.34.3540LmxM.34.2680
Unc104-like kinesin, putativeLdBPK_344090CBZ37886.1LinJ.34.4090LmjF.34.3952LbrM.34.4700
Phosphoglycan beta 1,3 galactosyltransferase 6LdBPK_360010CBZ38439.1LinJ.25.2570LmjF.35.0010LmxM.25.2460
Xylulokinase, putativeLdBPK_360280CBZ38466.1LinJ.36.0280LmjF.36.0260LmxM.36.0260
Aminoalcohol phosphotransferase, putativeLdBPK_030970CBZ31323.1LinJ.03.0970LmjF.36.5900LmxM.03.0821
Peptide deformylase 2 metalloprotease-likeLdBPK_040820CBZ31420.1LinJ.04.0820LmjF.04.0820LmxM.04.0820
Cytochrome b5-like proteinLdBPK_070940CBZ31808.1LinJ.07.0940LmjF.07.0810LbrM.07.0965LmxM.07.0810
Serine peptidase, clan SF, family S26ALdBPK_080460CBZ31890.1LinJ.08.0460LmjF.08.0450LmxM.08.0450
Ecotin, putativeLdBPK_150530CBZ32870.1LinJ.15.0530LmjF.15.0510LbrM.15.0540
CFAS, putativeLdBPK_080560CBZ31900.1LinJ08.0560LbrM.08.0590LmxM.08.0545
Kinesin, putativeLdBPK_161550CBZ33133.1LinJ.16.1550LbrM.16.1530
Uridine kinase-like proteinLdBPK_312560CBZ36654.1LinJ.31.2560LmjF.31.2785LbrM.31.3370
G-actin binding protein, putativeLdBPK_365830CBZ39019.1LinJ.36.5830LmjF.36.5590LbrM.35.5860
Cytochrome b5-like proteinLdBPK_091580CBZ32116.1LinJ.09.1580LmjF.09.1500

The IDs represent the GeneDb IDs of the corresponding leishmanial species. Bold highlighted — pseudogenes.

Fig. 2

Gene Ontology predicted for the genes differentially distributed between L. (L.) donovani and four other leishmanial species. The pie chart shows the most represented functions under three categories: A) biological process, B) molecular function and C) cellular component. The biological process predicted for these proteins is mainly involved in lipid biosynthesis, signal transduction and carbohydrate metabolism. The molecular function shows that these proteins are involved greatly in binding and some proteins contain enzymatic activity and hence involved in signaling pathways. Cellular component shows that these proteins are mainly membrane proteins. NA indicates that the GO predictions were not available.

A specific gene (gene ID: LdBPK_290650) found to be conserved between L. (L.) donovani and L. (L.) infantum but was completely absent in other leishmanial species encodes a putative BET1-like protein. Though the exact function of this protein is not known, its close similarity with BET1 shows that the protein might participate in vesicular transport and may function as SNARE in docking of ER-derived vesicles with the cis-golgi membrane. The possible involvement of this protein in pathogenesis needs future experimental investigations. The other gene (gene ID: LdBPK_161550) from L. (L.) donovani which was found homologous to L. (L.) infantum and L. (Viannia) braziliensis but absent in L. (L.) major and L. (L.) mexicana encodes a kinesin protein which may be involved in flagellar movement inside the host cell. Though leishmanial genome encodes many conserved kinesin molecules involved in flagellar locomotion, the importance of this specific kinesin needs further studies. Another gene (gene ID: LdBPK_367030) which was found in L. (L.) donovani and L. (L.) infantum but has become a pseudogene or absent in other leishmanial species encodes a putative mitochondrial carrier protein (Agcp2438-like protein) which is a mitochondrial transmembrane protein involved in transport, probably playing a key role inside the oxidative environment of macrophages (Dolezal et al., 2012). The important surface proteins of leishmanial species amastin, amastin-like surface protein, cysteine protease B (CPB), lipophosphoglycan LPG3 and the leishmanolysin GP63, were clearly reported for their potential role in parasite virulence (Rochette et al., 2005, Mottram et al., 2004, Vinet et al., 2009, Joshi et al., 2002). Though we found no obvious difference in the gene structure of lipophosphoglycan LPG3 (gene IDs: LdBPK_044280 and LdBPK_044281) in L. (L.) donovani in comparison with the other leishmanial species, we do found the discrete gene pattern of amastin, amastin like surface protein, leishmolysin and cysteine protease in L. (L.) donovani, which was discussed below. The important surface protein of leishmanial species leishmolysin GP63, a metalloprotease ubiquitously distributed in trypanosomatids, plays a myriad of functions and was found conserved in all leishmanial species including Herpetomonas samuelpessoai, an insect trypanosomatid (Pereira et al., 2010).We found that a molecule of GP63(gene ID: LdBPK_100520), leishmanolysin metallo-peptidase, clan MA (M), and family M8 (gene ID: LdBPK_100520) specific in L. (L.) donovani and L. (L.) infantum, released from the surface by proteolysis might participate in different stages of the parasite life cycle (Elias et al., 2006). The enzyme has a physiological function in the promastigote stage of these parasites (Schneider and Glaser, 1993) which indicates its role in the initial stage of parasite infection and its apparent role of interaction with macrophages in visceral organs needs further experimentations. Two genes which encode amastin like surface proteins (L. (L.) donovani gene IDs: LdBPK_341720, LdBPK_342660) located at chromosome 34 found to be specific to L. (L.) donovani and L. (L.) infantum probably indicates the involvement of this protein in visceralization. The comparative studies also identified two other significant gene families of pathogenesis 1) amastin gene family and 2) A2 gene family. The L. (L.) donovani specific amastin genes being the important pathogenesis factor was compared with the other leishmanial species and discussed in the following sections. Also, A2 protein encoded by A2 gene family, the only documented protein to be directly involved in visceralization of the L. (L.) donovani was detailed in later sections.

L. (L.) donovani specific amastins and its gene locations with respect to other leishmanial species

Amastin, encoded by a large gene family was a transmembrane glycoprotein initially documented in amastigote stage of trypanosomatid parasites and subsequently documented as surface proteins expressed more in leishmanial species than trypanosoma species (Jackson, 2010). The exact function of amastin and tuzin, the gene family found to be associated with amastin of unknown functions is yet to be classified. The significance of amastin gene family in pathogenesis of leishmanial species has been partially reported in earlier studies (Rochette et al., 2005). Amastin proteins of leishmanial species were mainly found in chromosome 8, 24, 28, 29, 30 and 34 of which chromosome 34 was most represented and chromosomes 28 and 29 were the least represented. Chromosome 34 contains mainly the amastin genes (6 copies) which were specific for visceralization causing protozoans. One amastin gene was found to be specific for L. (L.) donovani, 2 copies of amastin genes were found conserved in L. (L.) major, L. (L.) infantum, L. (L.) mexicana and L. (L.) donovani, one amastin gene was found present in all leishmanial species except L. (L.) major and one amastin gene was found conserved in all leishmanial species. Interestingly chromosome 8 contains 5 amastin gene locations out of which 3 were L. (L.) donovani specific amastin genes, one is found in L. (L.) donovani and L. (L.) infantum and one found in all leishmanial species except L. (Viannia) braziliensis where it was absent. This clearly indicates that the amastin gene concentrated on chromosome 8 might be the visceralization factor of L. (L.) donovani. No L. (L.) donovani specific amastin gene family was detected in chromosomes 24, 28 and 30 during our analysis and these amastin genes were well conserved in all leishmanial species except L. braziliensis where the amastins were absent. Also a gene from chromosome 29 encodes a specific amastin protein which was found absent in L. (L.) major and L. (Viannia) braziliensis. In total L. (L.) donovani contains 4 specific amastin genes (gene IDs: LdBPK_341700, LdBPK_080710, LdBPK_080780, LdBPK_080790) the importance of this specific gene expression in virulence and pathogenesis needs future investigations. Altogether the comparative study shows that amastin gene family was represented more in L. (L.) donovani and less in L. (Viannia) braziliensis. The probable explanation could be the possible involvement of this protein in visceralization and extreme evolutionary diversification of L. (Viannia) braziliensis and L. (L.) donovani. The complete comparison and chromosomal location of amastin gene family from leishmanial species were picturized in Fig. 3.
Fig. 3

Comparison of amastin gene family in five different Leishmanial species. The straight lines indicate the chromosomes and the boxes denote the particular amastin gene. The numbers are given above and below to differentiate the chromosomal locations of the gene in the forward and reverse strands. Different colors are used to differentiate the amastin gene conserved or differentiated among the leishmanial species. The box indicates the amastin genes conserved in all leishmanial species. The box indicates the amastin gene conserved between L. (L.) donovani and L. (L.) infantum but absent in other three leishmanial species. The box indicates the amastin gene conserved between L. (L.) donovani, L. (L.) infantum and L. (L.) mexicana but absent in other two leishmanial species. The box indicates the genes conserved in all leishmanial species except L. braziliensis. The box indicates the amastin gene expressed specifically in L. (L.) donovani and absent in remaining four leishmanial species. The box indicates the amastin gene conserved in all leishmanial species except L. (L.) major. The numbers over the boxes indicate the exact chromosomal location of amastin genes in L. (L.) donovani genome.

Analysis of A2 gene family with respect to copy number variations

L. (L.) donovani disease tropism differs from other leishmanial species depending mainly on visceralization. The gene mainly involved in the phenomena was the A2 gene family with its role in survival of the parasite in visceral organs. Studying the evolutionary pattern of this important gene may give insight into the difference in its mode of expression, leading to difference in mechanism of pathogenesis of L. (L.) donovani from other leishmanial species. The research on L. (L.) donovani till now presents only one specific gene family A2 which was directly involved in disease tropism (Zhang and Matlashewski, 2001). In contrast, L. (L.) major contained only A2 pseudogenes (Zhang et al., 2003) and were completely absent in L. (Viannia) braziliensis. The function of A2 protein in L. (L.) donovani might be to relieve the stress in visceral organs following infection (McCall and Matlashewski, 2010). To study the evolution, A2 protein sequences of L. (L.) donovani were collected from literature (Oliveira et al., 2011). HMMER and Blastp result revealed that the A2 protein had clear homology with Stage Specific S Antigen (SSSA) of other leishmanial species. A2 gene family was specifically expressed in the amastigote stage inside host macrophage, though the exact function of the protein coded by this gene family was unclear (Charest and Matlashewski, 1994). The multiple sequence alignment showed that the A2 protein sequence was conserved in all leishmanial species though the length of the protein was 5 fold less represented in L. (L.) donovani, may be due to deletion events. The multiple sequence alignment was given in (Supplementary Fig. S1). The clear homology and exact length with S-antigen homolog indicates that in course of evolution, part of A2 protein might have got lost in other leishmanial species, leaving the SSSA part to play a role in pathogenesis. So the important role of SSSA in L. (L.) donovani remains to be uncovered in future. Also significant homology found between the A2 protein sequence of L. (L.) donovani species and uncharacterized protein sequences of Streptomyces ambofaciens and Thermobispora bispora indicates that these proteins might have originated from common ancestor and as evolution progresses these proteins diverged depending on the host as well as the environment and formed a significant protein to play a key role in pathogenesis. The evolutionary closeness was further confirmed by phylogenetic analysis (data not shown). Moreover A2 protein from L. (L.) mexicana, yet not declared as non-functional protein coded by a pseudogene, was very much conserved with the uncharacterized protein sequences of Streptomyces ambofaciens and Thermobispora bispora indicating that it was preserved throughout the evolution but the exact role of these protein in L. (L.) mexicana pathogenesis remains unclear. The copy number of A2 gene also plays a significant role in pathogenesis of particular leishmanial species. A2 gene family has become a pseudogene in L. (L.) major genome (Zhang et al., 2003) and was completely absent in L. (Viannia) braziliensis. A2 family genes that are abundantly expressed in amastigote stage of leishmanial species known to be the primary factor for visceralization and virulence. Previous studies demonstrate the crucial role of A2 genes (Zhang and Matlashewski, 2001). A2 gene family present in chromosome 22, consists of a 5′A2 rel, 3′A2 rel, internal A2 rel and A2 genes and organized as 5′A2rel, A2 gene, internal A2 rel and 3′A2 rel (Zhang et al., 2003, Zhang and Matlashewski, 2001). All the available genes in A2 family consisting of 5′A2rel, 3′A2rel, internal A2rel and A2 gene sequences from leishmanial genomes were retrieved from NCBI GenBank and GeneDb. A2 gene sequence was not available for L. (L.) major and L. (Viannia) braziliensis in the databases while 5′A2rel, 3′A2rel and internal A2rel gene sequences were not available for the species other than L. (L.) donovani and L. (L.) infantum. To understand the organization of A2 gene family within the leishmanial species, A2 genes were searched in the genome of L. (L.) donovani using ACT (Carver et al., 2005). The results of respective A2 gene against respective genome shows that, in L. (L.) infantum single copy of A2 gene is present but in L. (L.) mexicana two copies of A2 gene are present which are adjacent to each other with very little sequence differences. In L. (L.) donovani there are multiple copies of partial A2 gene scattered throughout the chromosome 22. The location of A2 genes on the chromosome 22 of four leishmanial species are depicted in (Fig. 4). But the presence of two A2 genes and its role in L. (L.) mexicana remains unclear. L. (L.) donovani genome evolved with 5 copies of A2 gene as a result of duplication remains the most severe form of visceral leishmaniasis. The copy number difference of A2 gene in different leishmanial species was depicted in (Supplementary Fig. S2).
Fig. 4

Location of A2 gene family in chromosome 22 of four leishmanial species.

Chromosomal location of A2 gene in four leishmanial species is shown, except for L. (Viannia) braziliensis where A2 gene is completely absent. A, B, C and D represent the chromosomal locations of L. (L.) infantum, L. (L.) mexicana, L. (L.) major and L. (L.) donovani respectively. Different grayscale representations as given in the figure are shown to differentiate and locate the 5′A2rel, A2 gene, internal A2 rel and 3′A2 rel on chromosome 22 of four leishmanial species.

L. (L.) donovani genes with high-variation

Cysteine peptidases were considered to be the important molecule in leishmanial pathogenesis (Vinet et al., 2009). A gene (gene ID: LdBPK_201210) encoding calpain-like cysteine peptidase, putative in L. (L.) donovani was identified and interestingly the comparison of this protein with the other leishmanial proteins showed that it contains repeat motifs at different locations strengthening the anticipation of this protein to have a vital role in parasite survival. Also 11 other proteins recently documented as proteins under positive selection in L. (L.) donovani (Downing et al., 2011) were compared and listed in (Supplementary Table S4).

Repeat analysis identifies huge differences in gene coding for surface proteins between L. (L.) donovani and other leishmanial species

Amino acid repeat-containing proteins have a broad range of functions and their identification was of relevance to many experimental biologists. The involvement of these proteins in immune evasion had been shown in protozoan parasites such as the kinetoplastid and Plasmodium species, probably by influencing virulence and pathogenicity (Goto et al., 2010). Leishmanial proteomes were enriched with amino acid repeats, approximately 3–4% proteins were repeat containing proteins and the probable role of these proteins might be to mediate host parasite interaction (Kedzierski et al., 2004, Depledge et al., 2007). Identification of repeat-containing proteins provides researchers with a defined subset of proteins which can be analyzed by expression profiling and functional characterization, thereby facilitating study of pathogenicity and virulence factors in the parasitic protozoa. Total repeat containing protein sequences were collected from the nearby species of L. (L.) donovani from Repseq database (Depledge et al., 2007). The comparative analysis of number and type of repeat containing protein sequences in 4 leishmanial species are illustrated here (Fig. 5). Many virulence factors were already reported in leishmanial species in which surface proteins were of special interest. Comparative analysis of genes encoding the repeat containing surface proteins and other repeat proteins known to be involved in virulence was given in (Table 3). Among the surface proteins, surface antigen proteins and Proteophosphoglycan were well represented repeat proteins of leishmanial species. In L. (L.) donovani, two surface proteins (protein IDs: CBZ33953.1 and CBZ31356.1) from chromosome 12 and 4 were duplicated many times in other leishmanial species. Also calpain like cysteine peptidases from L. (L.) donovani (protein ID: CBZ35211.1), were represented more in other leishmanial species. The genes which encode other repeat containing proteins which were functional homologs or hypothetical proteins of other leishmanial species were also compared and tabulated. The most common type of motif found in repeat proteins was Single Repeat Regions (SRR). The complete list of comparative analysis of genes encoding repeat proteins was given in (Supplementary Table S5, Supplementary Table S6). The frequency of occurrence of particular amino acid was important for a repeat containing protein to be involved in pathogenesis directly or indirectly. Alanine the common amino acid was occurring more in repeat containing proteins also, though there was no relation reported between alanine and pathogenesis. Leucine was represented more in these proteins and the significance of this amino acid in virulence or pathogenesis of many organisms was well documented in literature (Kedzierski et al., 2004). Also frequency of serine was found more, though the exact involvement of this amino acid in pathogenesis was unknown. The amino acid composition of the repeat containing proteins was given in (Supplementary Fig. S3).
Fig. 5

Analysis of L. (L.) donovani proteins containing repeats. L. (L.) donovani proteome contains 415 repeat containing proteins out of which are 293 functional proteins and 122 are hypothetical. Similarly L. (L.) major contains 317 repeat containing proteins out of which are functional proteins and are hypothetical. L. (L.) infantum contains 261 repeat containing proteins out of which are functional proteins and are hypothetical. L. (Viannia) braziliensis contains 248 repeat containing proteins out of which are functional proteins and are hypothetical. The blue bars represent the total number of repeat proteins, red bars represent the functional proteins and green bars represent the hypothetical proteins. L. (L.) donovani, L. (L.) infantum, L. (L.) major, and L. (Viannia) braziliensis are represented as L. donovani, L. infantum, L. major and L. braziliensis respectively.

Table 3

Repeat analysis of L. (L.) donovani.a

L. (L.) donovaniaL. (L.) majoraL. (L.) infantumaL. (Viannia) braziliensisaLengthTypeFunction
LdBPK_211410LmjF12.0730LinJ12.0690477SRRSurface antigen protein 2
LdBPK_211410LmjF12.0740LinJ12_v4.0668477SRRSurface antigen protein
LdBPK_211410LmjF12.0755LinJ12_v4.0663477SRRSurface antigen protein 2
LdBPK_211410LmjF12.0760LinJ12_v4.0665477SRRSurface antigen protein 2
LdBPK_311480LmjF12.0765LinJ12.0690LbrM12.0750412SRRSurface antigen protein 2
LdBPK_211410LmjF12.0830LinJ12_v4.0668477SRRSurface antigen protein 2
LdBPK_211410LmjF12.0850LinJ12_v4.0668477SRRSurface antigen protein 2
LdBPK_311480LmjF12.0870LinJ12_v4.0668412SRRSurface antigen protein 2
LdBPK_211410LmjF21.1170LinJ21.1410LbrM21.1370477SRRSurface antigen-like protein
LdBPK_311490LmjF31.1450LinJ31.1490302SRRSurface membrane protein gp46-protein
LdBPK_040200LmjF04.0210LinJ04.0200285SRRSurface antigen-like protein
LdBPK_211410LmjF12.0860477SRRSurface antigen protein
LdBPK_211410LmjF12.0910477SRRPromastigote surface antigen protein
LdBPK_211410LmjF12.0920477SRRPromastigote surface antigen protein
LdBPK_211410LmjF12.0960477SRRSurface antigen protein 2
LdBPK_211410LmjF12.0990477SRRSurface antigen protein
LdBPK_312750LmjF12.1005670SRRSurface antigen protein 2
LdBPK_211410LmjF12.1020477SRRSurface antigen protein
LdBPK_211410LmjF12.1040477SRRSurface antigen protein
LdBPK_211410LmjF12.1060477SRRSurface antigen protein
LdBPK_211410LmjF12.1070477SRRSurface antigen protein 2
LdBPK_211410LmjF12.1090477SRRSurface antigen protein
LdBPK_311490LmjF31.1460302SRRSurface membrane protein gp46 protein
LdBPK_363500LinJ36.35002533SAARHypothetical transmembrane protein
LdBPK_322420LmjF32.2270LinJ32.2420LbrM32.2500341SRRMembrane associated protein-like protein
LdBPK_040170LbrM04.0670349SRRSurface antigen-like protein
LdBPK_040170LbrM04.1260349SRRSurface antigen-like protein
LdBPK_040170LbrM04.1270349SRRSurface antigen-like protein
LdBPK_040170LbrM04.1340349SRRSurface antigen-like protein
LdBPK_050240LmjF05.0240LinJ05.0240433SRRViscerotropic leishmaniasis antigen
LdBPK_040430LmjF20.1180LinJ20.1210855SRRCalpain-like cysteine peptidase
LdBPK_270510LmjF27.0490LinJ.27.0500LbrM.27.06005550SRRCalpain-like cysteine peptidase
LdBPK_270510LmjF27.0500LinJ27.0500LbrM27.06005550SRRCalpain-like cysteine peptidase
LdBPK_270510LbrM27.21405550SRRCalpain-like cysteine peptidase
LdBPK_270510LbrM28.21005550SRRCalpain-like cysteine peptidase
LdBPK_350490LmjF35.0500LinJ35.0490LbrM34.0520453SRRProteophosphoglycan ppg3
LdBPK_350500LinJ35.0500392SRRProteophosphoglycan ppg3
LdBPK_350490LinJ35.0530453SRRProteophosphoglycan 5
LdBPK_311480LinJ35.0540413SRRProteophosphoglycan 5
LdBPK_020200LbrM02.0240993TRPhosphoglycan beta 1
LdBPK_060810LbrM34.05303343SRRProteophosphoglycan ppg4
LdBPK_080630LbrM34.05402883SRRProteophosphoglycan ppg4
LdBPK_350500LbrM34.0550392SRRProteophosphoglycan ppg3
LdBPK_280600LinJ10.0520566TRGP63-3

The IDs represent the GeneDb IDs of the corresponding leishmanial species. Bold highlighted — least homologous/pseudogenes.

Discussion

The aim of this study is to compare and reveal the species specific differences of L. (L.) donovani with some other important leishmanial species. The difference between L. (L.) donovani and other leishmanial species at the genome level is completely studied. Though genome of leishmanial species are highly conserved as reviewed in the introduction, few important differences were identified in the genome of L. (L.) donovani. The comparative genome analysis of L. (L.) donovani with the other leishmanial species identified 55 species specific genes. Since the proteins encoded by these genes are species specific, the importance of these proteins in disease representation, disease progression, pathogenesis and virulence can be evaluated. To read the functions of these genes, Gene Ontology was done using Amigo tool available at GO database. Major portions of the functionally specific proteins fall into two classes 1) Ribosomal protein and 2) Surface/surface-like proteins. Though the specificity of ribosomal proteins was not urged, the probable involvement of surface proteins in L. (L.) donovani disease spectrum is revealed in our study. In addition some genes involved in sugar metabolism was also identified and the probable role of these proteins in pathogenesis was discussed. Importantly a gene which encodes cathepsin like cysteine protease with its established role in pathogenesis was identified as specific gene of L. (L.) donovani which can be verified as important drug target in future. Also one other gene which encodes Apical Membrane Antigen 1 (AMA1), though less research was done in leishmanial species regarding this protein and the involvement of this protein in pathogenesis is completely undefined, the involvement of this protein in virulence and pathogenesis in organisms such as plasmodium and Toxoplasma gondii attracts the future interest to investigate the importance of this protein in leishmanial pathogenesis. We hypothesize the specific AMA1 of L. (L.) donovani might follow a similar mechanism documented in apicomplexon parasites to invade the host cells though the mediators involved in host–parasite interaction are RON2 in the case of apicomplexon parasites and may be cholesterol in the case of L. (L.) donovani. Matching our expectation, many genes which are found conserved with L. (L.) infantum showed less homology with remaining three leishmanial species. Apart from this, many genes were identified that are found in L. (L.) donovani and few leishmanial species but absent or encode non functional proteins in other leishmanial species. The comparative analysis list contains genes which encode many surface proteins noticeably amastins, amastin like surface proteins, peptidases etc., and these proteins are highly linked with pathogenesis, which may be experimentally verified further for complete understanding of distinct mechanism of L. (L.) donovani pathogenesis. As amastins have significant role in pathogenesis of any leishmanial species, the amastin comparison was done among all the five leishmanial species which identified few L. (L.) donovani specific amastin gene locations. Amastin gene family is represented more in L. (L.) donovani though the exact cause is not known. Also few genes which encode proteins that are not surface proteins but linked to pathogenesis directly or indirectly are discussed to their relevance. In addition, we identified a gene encoding a repeat containing protein, calpain like cysteine peptidases was identified and this particular protein is conserved in all leishmanial species except in L. (Viannia) braziliensis where possibility of pseudogene presence was reported. Totally 12 genes which were under evolutionary selection were compared. Unexpectedly none of these genes were found in L. (Viannia) braziliensis. It is inferred that the possibility of deletion events as the evolution of L. (Viannia) braziliensis is fast and diverged unlike other leishmanial species because of the presence of transposable elements. Previously the comparative analysis of three leishmanial species L. (L.) major, L. (L.) infantum and L. (Viannia) braziliensis identified few genes which are specific to each leishmanial species (Peacock et al., 2007). The availability of genome sequence of two new leishmanial species L. (L.) donovani and L. (L.) mexicana and the complete comparative study of five species of leishmanial in the present study eliminated few genes reported as species specific genes for three leishmanial species L. (L.) major, L. (L.) infantum and L. (Viannia) braziliensis by Peacock et al. [data not shown], as these genes are identified in L. (L.) donovani. The only specific gene family till now documented to be responsible for visceralization, the A2 gene family was analyzed for its evolutionary divergence. A2 gene family encodes A2 protein which was highly homologous to Stage Specific S Antigen of other leishmanial species, though the exact relevance of this stage specific expression is completely not understood. Also A2 gene copy number difference between leishmanial species was reported which clearly identified high copy number in L. (L.) donovani and L. (L.) infantum. The presence of A2 protein in L. (L.) mexicana was ambiguous which needs future verifications. Specific amino acid repeats play a direct role in virulence from prokaryotes to eukaryotes. Complete L. (L.) donovani repeat containing proteins were identified and compared with the other leishmanial species in the present study. Altogether the current study shows the complete analysis of the recent draft genome of L. (L.) donovani. The genes identified as L. (L.) donovani species specific can be experimented in the future to explore the complexity of L. (L.) donovani genome information and probably if some of these genes are established to be involved in pathogenesis, it can be a clear target for anti-leishmanial therapy.

Materials and methods

Data collection

The genome sequence of L. (L.) donovani and all other sequenced leishmanial species including L. (L.) infantum, L. (L.) major, L. (Viannia) braziliensis and L. (L.) mexicana was retrieved from NCBI GenBank and GeneDb (Benson et al., 2011, Logan-Klumpler et al., 2012). The total protein sequences of all other four leishmanial species were collected from Trityp database (Aslett et al., 2010).

Comparative analysis

The total protein sequence of L. (L.) donovani was simultaneously searched against the proteomes of other leishmanial species using HMMER package (Eddy, 2009) and Psi-blast (Altschul et al., 1997) and it was further confirmed by Bioedit local blastp package (Hall, 1999). The matches of L. (L.) donovani which was showing greater than 30% homology or e-value lesser than e− 05 were eliminated using in-house Perl script and the remaining proteins coded by L. (L.) donovani genes which are completely non-homologous in all leishmanial species were considered species specific genes of L. (L.) donovani. The stringent e value is set to completely eliminate any homologous genes. The identified species specific genes were supposed to length criteria of greater than 90 codons and the genes which code proteins less than 30 amino acids were manually discarded, as it is very difficult to justify the proteins which are coded by less than 90 codons. Further L. (L.) donovani genes were individually compared with the other four leishmanial species and the genes which were present between L. (L.) donovani and any leishmanial species but absent in other leishmanial species were listed. All the possible combinations were done between L. (L.) donovani and other four leishmanial species and compared. In addition, the proteins encoded by these genes were further annotated by transferring functions of already assembled and annotated L. (L.) major, L. (L.) infantum and L. (Viannia) braziliensis proteins using blastp searches. Also annotations of L. (L.) donovani specific genes were done by searching the GO database using Amigo package (Carbon et al., 2009). Functionally homologous genes which show specificity between L. (L.) donovani and other species were also functionally annotated by Gene Ontology using Amigo tool. In addition, genes which were continuously under change (i.e.) genes under positive selection were detected using Psi-blast and blastp matches. The genes meeting these criteria were retrieved using in house Perl scripts. The amastin gene sequences from different leishmanial species were collected and HMMER searches identified the homologous amastin genes in L. (L.) donovani. The identified amastin genes were compared and localized by searching all the leishmanial genome in parallel using ACT software (Carver et al., 2005).

Multiple sequence alignment (MSA)

A2 protein sequences and gene sequences were collected from NCBI. Blastp using NCBI blast (Altschul et al., 1990) and Fasta (Lipman and Pearson, 1985) format of those gene sequences identified the nearby homologous sequences from different species. Multiple sequence alignment was done using MUSCLE (Edgar, 2004). Also the location of the A2 gene family was identified by searching all the leishmanial species using Artemis software (Carver et al., 2005).

Repeat analysis

The entire L. (L.) donovani proteins were analyzed for the presence of amino acid repeats using Repseq database (Depledge et al., 2007). The entire repeat sequences from L. (L.) infantum, L. (L.) major, and L. (Viannia) braziliensis and Trypanosoma species were retrieved from Repseq database and checking the presence of the homologous sequence in L. (L.) donovani using HMMR, Psi-Blast and Blastp revealed the most probable repeat sequence in L. (L.) donovani. Further the proteins identified as species specific were also checked for the presence of repeats. Altogether this will reveal the complete repeat containing proteins of L. (L.) donovani. The following are the supplementary data related to this article

Supplementary Fig. 1.

Multiple sequence alignment of A2 proteins.

Supplementary Fig. 2.

Copy number variations of A2 protein in leishmanial species.

Supplementary Fig. 3.

Amino acid distribution in repeat containing proteins of L. (L.) donovani. Supplementary Table S1. L. (L.) donovani genome content. Supplementary Table S4. L. (L.) donovani genes with high-variations.

Supplementary Table S2

List of L. (L.) donovani species specific proteins (Including hypothetical proteins).

Supplementary Table S3

List of functional and hypothetical homologs.

Supplementary Table S5

Repeat analysis of L. (L.) donovani (functional homologs).

Supplementary Table S6

Repeat analysis of L. (L.) donovani proteome (hypothetical proteins only).
  47 in total

1.  Host cell invasion by apicomplexan parasites: insights from the co-structure of AMA1 with a RON2 peptide.

Authors:  Michelle L Tonkin; Magali Roques; Mauld H Lamarque; Martine Pugnière; Dominique Douguet; Joanna Crawford; Maryse Lebrun; Martin J Boulanger
Journal:  Science       Date:  2011-07-22       Impact factor: 47.728

2.  Evolution of non-LTR retrotransposons in the trypanosomatid genomes: Leishmania major has lost the active elements.

Authors:  Frédéric Bringaud; Elodie Ghedin; Gaëlle Blandin; Daniella C Bartholomeu; Elisabet Caler; Mariano J Levin; Théo Baltz; Najib M El-Sayed
Journal:  Mol Biochem Parasitol       Date:  2005-10-10       Impact factor: 1.759

3.  Whole genome sequencing of multiple Leishmania donovani clinical isolates provides insights into population structure and mechanisms of drug resistance.

Authors:  Tim Downing; Hideo Imamura; Saskia Decuypere; Taane G Clark; Graham H Coombs; James A Cotton; James D Hilley; Simonne de Doncker; Ilse Maes; Jeremy C Mottram; Mike A Quail; Suman Rijal; Mandy Sanders; Gabriele Schönian; Olivia Stark; Shyam Sundar; Manu Vanaerschot; Christiane Hertz-Fowler; Jean-Claude Dujardin; Matthew Berriman
Journal:  Genome Res       Date:  2011-10-28       Impact factor: 9.043

4.  Cholesterol: a potential therapeutic target in Leishmania infection?

Authors:  Thomas J Pucadyil; Amitabha Chattopadhyay
Journal:  Trends Parasitol       Date:  2006-12-20

5.  Upregulated expression of B-cell antigen family tandem repeat proteins by Leishmania amastigotes.

Authors:  Yasuyuki Goto; Darrick Carter; Jeffrey Guderian; Noboru Inoue; Shin-Ichiro Kawazu; Steven G Reed
Journal:  Infect Immun       Date:  2010-02-16       Impact factor: 3.441

Review 6.  Cysteine peptidases as virulence factors of Leishmania.

Authors:  Jeremy C Mottram; Graham H Coombs; James Alexander
Journal:  Curr Opin Microbiol       Date:  2004-08       Impact factor: 7.934

Review 7.  Fasciola hepatica cathepsin L-like proteases: biology, function, and potential in the development of first generation liver fluke vaccines.

Authors:  John P Dalton; Sandra O Neill; Colin Stack; Peter Collins; Alan Walshe; Mary Sekiya; Sean Doyle; Grace Mulcahy; Deborah Hoyle; Eric Khaznadji; Nathalie Moiré; Gerard Brennan; Angela Mousley; Natalia Kreshchenko; Aaron G Maule; Sheila M Donnelly
Journal:  Int J Parasitol       Date:  2003-09-30       Impact factor: 3.981

8.  Cloning of S-adenosyl-L-methionine:C-24-Delta-sterol-methyltransferase (ERG6) from Leishmania donovani and characterization of mRNAs in wild-type and amphotericin B-Resistant promastigotes.

Authors:  Mohammad Pourshafie; Stanislas Morand; Alain Virion; Michaelle Rakotomanga; Corinne Dupuy; Philippe M Loiseau
Journal:  Antimicrob Agents Chemother       Date:  2004-07       Impact factor: 5.191

9.  Evaluation of rK-39 strip test using urine for diagnosis of visceral leishmaniasis in an endemic area in Bangladesh.

Authors:  Md Gulam Musawwir Khan; Mohammad Shafiul Alam; Milka Patracia Podder; Makoto Itoh; Kazi M Jamil; Rashidul Haque; Yukiko Wagatsuma
Journal:  Parasit Vectors       Date:  2010-11-26       Impact factor: 3.876

10.  RepSeq--a database of amino acid repeats present in lower eukaryotic pathogens.

Authors:  Daniel P Depledge; Ryan P J Lower; Deborah F Smith
Journal:  BMC Bioinformatics       Date:  2007-04-11       Impact factor: 3.169

View more
  8 in total

1.  Comparative genomics of canine-isolated Leishmania (Leishmania) amazonensis from an endemic focus of visceral leishmaniasis in Governador Valadares, southeastern Brazil.

Authors:  Hugo O Valdivia; Laila V Almeida; Bruno M Roatt; João Luís Reis-Cunha; Agnes Antônia Sampaio Pereira; Celia Gontijo; Ricardo Toshio Fujiwara; Alexandre B Reis; Mandy J Sanders; James A Cotton; Daniella C Bartholomeu
Journal:  Sci Rep       Date:  2017-01-16       Impact factor: 4.379

2.  Novel Antigen Detection Assay to Monitor Therapeutic Efficacy of Visceral Leishmaniasis.

Authors:  Claudia Abeijon; Om Prakash Singh; Jaya Chakravarty; Shyam Sundar; Antonio Campos-Neto
Journal:  Am J Trop Med Hyg       Date:  2016-08-01       Impact factor: 2.345

3.  A Novel Niosomal Combination of Selenium Coupled with Glucantime against Leishmania tropica.

Authors:  Mahshid Mostafavi; Payam Khazaeli; Iraj Sharifi; Saeedeh Farajzadeh; Hamid Sharifi; Alireza Keyhani; Maryam Hakimi Parizi; Sina Kakooei
Journal:  Korean J Parasitol       Date:  2019-02-26       Impact factor: 1.341

4.  Antileishmanial activity and immune modulatory effects of benzoxonium chloride and its entrapped forms in niosome on Leishmania tropica.

Authors:  Maryam Hakimi Parizi; Abbas Pardakhty; Iraj Sharifi; Saeedeh Farajzadeh; Mohammad Hossein Daie Parizi; Hamid Sharifi; Ali Reza Keyhani; Mahshid Mostafavi; Mehdi Bamorovat; Daryoush Ghaffari
Journal:  J Parasit Dis       Date:  2019-03-29

5.  An experimental challenge model for Leishmania donovani in beagle dogs, showing a similar pattern of parasite burden in the peripheral blood and liver.

Authors:  Hiroya Konno; Nozomu Yokoyama; Yu Tamura; Keisuke Aoshima; Ryo Nakao; Mitsuyoshi Takiguchi; Ken Katakura
Journal:  Parasitol Res       Date:  2022-10-12       Impact factor: 2.383

6.  Improved orthologous databases to ease protozoan targets inference.

Authors:  Nelson Kotowski; Rodrigo Jardim; Alberto M R Dávila
Journal:  Parasit Vectors       Date:  2015-09-29       Impact factor: 3.876

7.  Genomic insights into virulence mechanisms of Leishmania donovani: evidence from an atypical strain.

Authors:  Sumudu R Samarasinghe; Nilakshi Samaranayake; Udeshika L Kariyawasam; Yamuna D Siriwardana; Hideo Imamura; Nadira D Karunaweera
Journal:  BMC Genomics       Date:  2018-11-28       Impact factor: 3.969

8.  Genomic and Transcriptomic Analysis for Identification of Genes and Interlinked Pathways Mediating Artemisinin Resistance in Leishmania donovani.

Authors:  Sushmita Ghosh; Aditya Verma; Vinay Kumar; Dibyabhaba Pradhan; Angamuthu Selvapandiyan; Poonam Salotra; Ruchi Singh
Journal:  Genes (Basel)       Date:  2020-11-17       Impact factor: 4.096

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.