| Literature DB >> 35453714 |
Pornchai Anuntasomboon1,2, Suradej Siripattanapipong3, Sasimanas Unajak4, Kiattawee Choowongkomon4, Richard Burchmore5, Saovanee Leelayoova6, Mathirut Mungthin6, Teerasak E-Kobon1,2.
Abstract
(1) Background: Autochthonous leishmaniasis, a sandfly-borne disease caused by the protozoan parasites Leishmania orientalis (formerly named Leishmania siamensis) and Leishmania martiniquensis, has been reported for immunocompromised and immunocompetent patients in the southern province of Thailand. Apart from the recent genomes of the northern isolates, limited information is known on the emergence and genetics of these parasites. (2)Entities:
Keywords: Leishmania martiniquensis; Leishmania orientalis; Leishmania siamensis; bioinformatics; comparative genomics; leishmaniasis
Year: 2022 PMID: 35453714 PMCID: PMC9031872 DOI: 10.3390/biology11040515
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Basic characteristics of draft genomes of L. orientalis isolate PCM2 and L. martiniquensis isolate PCM3 from the southern part of Thailand.
| Genome Characteristics | ||
|---|---|---|
| Contig number | 4399 | 42 |
| Total length | 30.01 Mbp | 32.39 Mbp |
| N50 | 14.03 Kbp | 1.05 Mbp |
| L50 | 651 | 11 |
| Number of coding genes | 8989 | 9577 |
| %GC | 59.02 | 59.92 |
Figure 1Alignment of genomic contigs from L. orientalis southern isolate PCM2 and L. martiniquensis southern isolate PCM3 compared to those of the northern isolates LSCM1 and LSCM4 in Thailand. Colored shades represented the conserved synteny, and the bar height indicates the level of sequence identity. Lines show associated conserved syntenic blocks across the compared isolates.
Figure 2Comparison of protein sequences of the southern isolates of L. orientalis PCM2 and L. martiniquensis PCM3 in Thailand with the northern isolates and other 14 Leishmania species. The protein identity scores were hierarchically clustered into two clusters of Leishmania species (depicted by the column dendrogram) and six groups of proteins (represented by the row dendrogram). Color scale represents the level of sequence identity scores. Numbers labeled on the right of the heatmap indicate the group numbers (1 to 6).
Comparison of the number of uniquely assigned GO terms (biological process, cellular component, molecular function, and enzymatic function) to Leishmanial proteins and percentage of protein annotation in the six groups clustered according to the protein sequence similarity between the southern isolates of L. orientalis (PCM2) and L. martiniquensis (PCM3) and the northern ones (LSCM1 and LSCM4), as well as other Leishmania species.
| Protein Groups | Number of Uniquely Assigned GO Terms | Total Number of Proteins in Each Group | |||
|---|---|---|---|---|---|
| Biological Process | Cellular Component | Molecular Function | Enzymatic Function | ||
| 1 | 43 (13.58) | 23 (20.09) | 59 (16.47) | 30 (8.96) | 692 (21.24%) |
| 2 | 762 (53.81) | 378 (72.07) | 629 (54.85) | 474 (26.57) | 5371 (66.15%) |
| 3 | 119 (21.30) | 71 (36.10) | 125 (23.82) | 61 (10.07) | 953 (33.26%) |
| 4 | 77 (37.96) | 64 (57.55) | 65 (36.33) | 24 (11.84) | 245 (44.49%) |
| 5 | 136 (37.69) | 88 (64.04) | 109 (40.38) | 58 (14.81) | 520 (51.35%) |
| 6 | 5 (10.64) | 3 (23.40) | 6 (17.02) | 3 (8.51) | 47 (19.15%) |
| Total | 7828 (56.23%) | ||||
Comparison of the annotated proteins that were found only in L. orientalis and L. martiniquensis. The compared numbers are depicted as the proteins uniquely identified in either of the two isolates or both. The “-” sign indicates the absence of the unique proteins in that isolate.
| Protein Cluster | Number of Unique Proteins Only Found in | |||||
|---|---|---|---|---|---|---|
|
|
| |||||
| Unique to PCM2 | Unique to LSCM4 | Found in Both Isolates | Unique to PCM3 | Unique to LSCM1 | Found in Both Isolates | |
| 1 | 3 | - | 28 | 7 | - | - |
| 2 | 47 | - | - | 67 | - | - |
| 3 | 3 | - | - | 21 | - | - |
| 4 | - | - | - | - | - | - |
| 5 | - | - | - | - | - | - |
| 6 | - | - | - | 2 | - | - |
| Total | 53 | - | 28 | 97 | - | - |
Unique proteins present in the proteomes of L. orientalis isolate PCM2 and L. martiniquensis isolate PCM3. The “+” in the cluster column represents the presence of the protein in L. orientalis isolate PCM2.
| No | Gene ID | Cluster | Protein Name |
|---|---|---|---|
| 1 | k59_7101.g4222 | 1 | Amastin surface glycofamily protein |
| 2 | k59_5884.g7963 | 1 | D-amastin |
| 3 | k59_5466.g3144 | 1 | Putative malic enzyme |
| 4 | k59_1669.g769 | 1+ | Receptor-type adenylate cyclase a |
| 5 | k59_10223.g2223 | 2+ | Aminoacyl-tRNA editing domain containing protein, putative |
| 6 | k59_6373.g6990 | 2 | Aminoacyl-tRNA editing domain containing protein, putative |
| 7 | k59_6105.g6831 | 2 | Ankyrin repeat family protein |
| 8 | k59_2596.g5206 | 2+ | Antiviral helicase |
| 9 | k59_4730.g7538 | 2 | ATP-dependent DNA helicase |
| 10 | k59_1985.g3871 | 2+ | Beta-lactamase, putative |
| 11 | k59_7875.g3627 | 2+ | C3H1-type domain-containing protein |
| 12 | k59_10386.g8210 | 2 | Cactus-binding C-terminus of cactin protein, putative |
| 13 | k59_3265.g2893 | 2 | Centromere/microtubule binding protein cbf5, putative |
| 14 | k59_10658.g8866 | 2+ | Chromatin assembly factor 1 subunit A, putative |
| 15 | k59_10459.g1059 | 2+ | CS domain-containing protein |
| 16 | k59_3314.g4482 | 2 | Cyclin dependent kinase-binding protein, putative |
| 17 | k59_5445.g2737 | 2 | D-3-phosphoglycerate dehydrogenase-like protein |
| 18 | k59_4821.g8945 | 2 | Endoplasmic Reticulum-Golgi Intermediate Compartment (ERGIC) |
| 19 | k59_2500.g2688 | 2 | Enkurin domain-containing protein |
| 20 | k59_10260.g4158 | 2+ | Essential nuclear protein 1 |
| 21 | k59_8350.g1377 | 2+ | Exosome-associated protein 1 |
| 22 | k59_9227.g4203 | 2 | Farnesyltransferase alpha subunit |
| 23 | k59_6707.g85 | 2+ | GB1/RHD3-type G domain-containing protein |
| 24 | k59_6854.g4013 | 2+ | Glucose-6-phosphate isomerase (Fragment) |
| 25 | k59_199.g4824 | 2+ | Glycosyltransferase (GlcNAc), putative |
| 26 | k59_10459.g1058 | 2 | GOLD domain-containing protein |
| 27 | k59_4463.g6091 | 2+ | Guanine nucleotide-binding protein subunit beta-like protein |
| 28 | k59_1181.g5237 | 2+ | H(+)-exporting diphosphatase |
| 29 | k59_9085.g8550 | 2 | Heat shock protein DnaJ |
| 30 | k59_457.g6730 | 2+ | HECT domain-containing protein |
| 31 | k59_1395.g2084 | 2 | Intraflagellar transport protein 22 |
| 32 | k59_8474.g5391 | 2 | Intraflagellar transport protein D4 |
| 33 | k59_9119.g816 | 2 | MIZ/SP-RING zinc finger family protein |
| 34 | k59_7389.g5456 | 2 | mRNA processing protein, putative |
| 35 | k59_5206.g5478 | 2 | MRP-L46 domain-containing protein |
| 36 | k59_8917.g2151 | 2 | MYND zinc finger (ZnF) domain-like protein |
| 37 | k59_10044.g5171 | 2+ | NLE domain-containing protein |
| 38 | k59_7676.g6287 | 2 | NTF2 domain-containing protein |
| 39 | k59_5657.g997 | 2+ | Nuclease-related domain containing protein, putative |
| 40 | k59_5439.g2470 | 2+ | PAB-dependent poly(A)-specific ribonuclease subunit 3 |
| 41 | k59_660.g3683 | 2 | Palmitoyltransferase |
| 42 | k59_2880.g6470 | 2 | PHD domain-containing protein |
| 43 | k59_5927.g1098 | 2 | Phosphorylated CTD interacting factor 1 WW domain containing protein |
| 44 | k59_5927.g1098 | 2 | Phosphorylated CTD interacting factor 1 WW domain containing protein |
| 45 | k59_10222.g2044 | 2+ | Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region |
| 46 | k59_9477.g3999 | 2+ | PRK domain-containing protein |
| 47 | k59_8589.g1275 | 2 | Protein disulfide isomerase, putative |
| 48 | k59_5413.g1718 | 2 | Putative DNA repair protein RAD2 |
| 49 | k59_9374.g604 | 2+ | Putative flagellar radial spoke protein-like |
| 50 | k59_10153.g8013 | 2 | Putative Gamma-soluble NSF attachment protein (SNAP-gamma) |
| 51 | k59_2626.g6421 | 2 | Putative mitochondrial RNA binding complex 1 subunit |
| 52 | k59_7006.g1237 | 2+ | Putative prefoldin subunit 2 |
| 53 | k59_2660.g7703 | 2+ | Putative pre-mRNA branch site protein p14 |
| 54 | k59_10051.g5820 | 2 | Putative tRNA pseudouridine synthase A |
| 55 | k59_2665.g8021 | 2+ | Putative unspecified product |
| 56 | k59_2453.g835 | 2+ | Putative variant surface glycoprotein |
| 57 | k59_1240.g7057 | 2+ | Pyroglutamyl-peptidase I (PGP), putative |
| 58 | k59_10051.g5821 | 2 | Queuosine salvage protein |
| 59 | k59_1202.g6035 | 2 | Rab3 GTPase-activating protein catalytic subunit |
| 60 | k59_7391.g5855 | 2 | Rab-GTPase-TBC domain containing protein, putative |
| 61 | k59_3567.g2504 | 2 | Related to elongation factor-2 kinase efk-1b isoform-like protein |
| 62 | k59_1109.g2256 | 2+ | RNA recognition motif family protein |
| 63 | k59_4042.g1153 | 2+ | Roadblock/LC7 domain containing protein, putative |
| 64 | k59_3916.g6656 | 2+ | Secretory carrier membrane protein 3 |
| 65 | k59_3762.g682 | 2 | SNARE associated Golgi protein, putative |
| 66 | k59_1957.g3007 | 2+ | Sperm-tail PG-rich repeat family protein |
| 67 | k59_5678.g1713 | 2+ | Succinate dehydrogenase assembly factor 2, mitochondrial |
| 68 | k59_5098.g435 | 2+ | TFIIS central domain-containing protein |
| 69 | k59_8533.g7240 | 2+ | Translation initiation factor |
| 70 | k59_3506.g770 | 2+ | U5 snRNP-specific 40 kDa protein, putative |
| 71 | k59_6432.g50 | 2+ | WD domain, G-beta repeat family protein |
| 72 | k59_6901.g5645 | 3 | Alpha-1,3/1,6-mannosyltransferase ALG2 |
| 73 | k59_4782.g8504 | 3 | ATP-dependent DNA helicase |
| 74 | k59_6737.g659 | 3 | ATP-dependent helicase, putative |
| 75 | k59_7325.g3419 | 3 | N-terminal region of Chorein, a TM vesicle-mediated sorter family protein |
| 76 | k59_7204.g7945 | 3+ | Putative trypanothione synthetase |
| 77 | k59_5425.g1966 | 3 | RNA binding protein-like protein |
| 78 | k59_2661.g7832 | 3 | SAT domain-containing protein |