Adelino Soares Lima Neto, Osvaldo Pompílio de Melo Neto, Carlos Henrique Nery Costa.
Abstract
This study describes the application of the LongSAGE methodology to the gene expression profile of Leishmania infantum chagasi promastigotes. A tag library created with the LongSAGE method consisted of 14,208 tags of 17 bases, of which 8,427 (59.3%) were distinct. BLAST searches of the 1,645 most abundant tags showed that 12.8% of them identified the coding sequences of genes, while 82% (1,349/1,645) identified one or more genomic sequences that did not correspond to open reading frames. Only 5.2% (84/1,645) of the tags did not align to any position in the L. infantum genome. The UTR size of Leishmania and the lack of CATG sites in some transcripts were decisive for the generation of tags in these regions. Additional analysis will allow a better understanding of the expression profile and the discovery of key genes in this life cycle.
Year: 2012 PMID: 22570533 PMCID: PMC3336188 DOI: 10.1155/2012/673458
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
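The abstract notes that tags depend on CATG sites and that transcripts lacking such a site produce no tag. A minimal sketch of that idea follows, assuming the usual SAGE convention that the tag starts at the 3'-most CATG site and, for a long tag, keeps the 17 bases that follow it. The function name `extract_long_tag` and the toy transcript are hypothetical, and returning no tag when fewer than 17 bases follow the site is a simplification, not the authors' pipeline.

```python
from typing import Optional

ANCHOR = "CATG"   # anchoring site named in the abstract
TAG_BASES = 17    # bases kept downstream of the anchor (per the abstract)


def extract_long_tag(transcript: str) -> Optional[str]:
    """Return CATG plus the next 17 bases from the 3'-most CATG site.

    Returns None when the transcript has no CATG site or when fewer than
    17 bases follow the last site (a simplifying assumption).
    """
    seq = transcript.upper()
    pos = seq.rfind(ANCHOR)              # 3'-most anchor site
    if pos == -1:
        return None                      # no CATG: the transcript yields no tag
    tag = seq[pos:pos + len(ANCHOR) + TAG_BASES]
    if len(tag) < len(ANCHOR) + TAG_BASES:
        return None                      # site too close to the 3' end
    return tag


# Hypothetical transcript with two CATG sites; only the last one is used.
toy_transcript = "GGATCCCATGAAACCCATGCGCGCGATGGTGCCCCCAAAA"
print(extract_long_tag(toy_transcript))
print(extract_long_tag("AAAATTTTGGGG"))  # no CATG site
```

The 21-base result (4 anchor bases plus 17 tag bases) matches the long-tag length used in the tables of this record.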
Summary of the serial analysis of gene expression in L. i. chagasi promastigotes.
| Copy number | Number of sequenced tags (%) | Number of unique identified tags (%) |
|---|---|---|
| 1 | 6,782 (47.7) | 6,782 (80.5) |
| 2–4 | 3,242 (22.8) | 1,331 (15.8) |
| 5–19 | 2,288 (16.1) | 272 (3.2) |
| 20–99 | 1,331 (9.4) | 39 (0.46) |
| ≥100 | 565 (4.0) | 3 (0.04) |
| Total | 14,208 (100.0) | 8,427 (100.0) |
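The summary above groups tags by copy number and reports, for each class, both the total sequenced tags and the number of distinct tags. The sketch below shows one way such a tally could be computed from a mapping of tag to count. The function name `summarize_copy_numbers` and the toy library are hypothetical; only the class boundaries come from the table.

```python
from collections import Counter

# Copy-number classes from the table: (label, low, high), bounds inclusive.
BINS = [
    ("1", 1, 1),
    ("2-4", 2, 4),
    ("5-19", 5, 19),
    ("20-99", 20, 99),
    (">=100", 100, None),
]


def summarize_copy_numbers(tag_counts):
    """Return {label: (sequenced_tags, unique_tags)} for each class."""
    summary = {label: [0, 0] for label, _, _ in BINS}
    for count in tag_counts.values():
        for label, low, high in BINS:
            if count >= low and (high is None or count <= high):
                summary[label][0] += count  # every copy is a sequenced tag
                summary[label][1] += 1      # each distinct tag counts once
                break
    return {label: tuple(pair) for label, pair in summary.items()}


# Hypothetical toy library, not data from the study.
toy_library = Counter({"TAG_A": 125, "TAG_B": 22, "TAG_C": 3, "TAG_D": 1, "TAG_E": 1})
for label, (sequenced, unique) in summarize_copy_numbers(toy_library).items():
    print(label, sequenced, unique)
```

As a consistency check on the table itself, the per-class values sum to the reported totals: 6,782 + 3,242 + 2,288 + 1,331 + 565 = 14,208 sequenced tags and 6,782 + 1,331 + 272 + 39 + 3 = 8,427 distinct tags.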
Summary of BLAST search of the 50 most expressed tags of L. i. chagasi as long or short tags. Columns give the number of distinct genes matching each tag.
| Tag type | 1 gene | 2 genes | 3 genes | 4 genes | 6 genes |
|---|---|---|---|---|---|
| Long tag (21 bp) | 49 | 1 | 0 | 0 | 0 |
| Short tag (14 bp) | 30 | 8 | 4 | 6 | 2 |
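The comparison above shows that short tags match several genes more often than long tags do. A small sketch of why is given below, under two assumptions: the short tag is the first 14 bases of the long tag, and matching is a plain forward-strand substring search rather than BLAST. The function `genes_matching` and both gene sequences are invented for illustration; only the long tag is taken from the library.

```python
SHORT_LEN = 14  # short-tag length from the table (CATG plus 10 bases)


def genes_matching(tag, genes):
    """Return the sorted IDs of genes whose sequence contains the tag."""
    return sorted(gid for gid, seq in genes.items() if tag in seq.upper())


# Hypothetical sequences: geneA carries the full long tag, geneB shares
# only its first 15 bases.
genes = {
    "geneA": "TTTCATGCGACTTAGACCGTGAGGAAA",
    "geneB": "GGGCATGCGACTTAGACCTTTTTTCCC",
}

long_tag = "CATGCGACTTAGACCGTGAGG"   # 21-base long tag from the library
short_tag = long_tag[:SHORT_LEN]     # assumed truncation to a short tag

print(genes_matching(long_tag, genes))   # unambiguous
print(genes_matching(short_tag, genes))  # ambiguous
```

The extra 7 bases of the long tag are what separate the two toy genes here, which mirrors the pattern in the table: 49 of 50 long tags matched a single gene, against 30 of 50 short tags.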
Results of conventional BLAST alignments of 1,645 tags with two or more copies in the L. i. chagasi library per alignment region and distance from the nearest ORF.
| Tags/genes | 5′ UTR (%) | 3′ UTR (%) | Total (%) |
|---|---|---|---|
| Different tags aligned at | 690 (42.0) | 659 (40.0) | 1,349 (82.0) |
| Genes identified at | | | |
| Up to 1 kb | 624 (38.0) | 632 (38.4) | 1,256 (76.4) |
| Between 1 and 2 kb | 120 (7.2) | 116 (7.0) | 236 (14.3) |
| More than 2 kb | 41 (2.5) | 44 (2.7) | 85 (5.2) |
| Total | 785 (47.8) | 792 (48.1) | 1,577 (95.9) |
| Mean distance from ORF | 718 | 718 | - |
| Range (bp) | 5 to 14,647 | 1 to 8,161 | - |
Most abundant and annotated transcript and corresponding tags identified in the library of L. i. chagasi promastigotes.
| Rank | Long tag | Frequency | Gene/protein | Gene ID |
|---|---|---|---|---|
| 1 | CATGCGCGCGATGGTGCCCCC | 125 | elongation factor 1 alpha | LinJ.17.0100 |
| 2 | CATGCGACTTAGACCGTGAGG | 91 | histone H1 | LinJ.27.1070 |
| 3 | CATGGGCGCACGGCGGCGCCG | 68 | heat shock protein 83-1 | LinJ.33.0350 |
| 4 | CATGGGCTTCCTCGAGTCCGC | 60 | histone H3 | LinJ.10.1050 |
| 5 | CATGGAGGAGGGCGAGTTCTC | 57 | Leishmania infantum JPCM5 alpha tubulin | LinJ.13.0330 |
| 6 | CATGCTCCATAAAGGAGAGAC | 53 | heat-shock protein hsp70, putative | LinJ.28.3000 |
| 7 | CATGCGGTACTGCGTTCCAGA | 47 | ribosomal protein L23, putative | LinJ.35.3840 |
| 8 | CATGGCGGCCGCAAAAGCAGG | 43 | 40S ribosomal protein S17, putative | LinJ.28.2750 |
| 9 | CATGACTGGACATTCAAAAGA | 41 | histone H1, putative | LinJ.27.1070 |
| 10 | CATGCGAAAATGTGGTTTCGC | 39 | 60S ribosomal protein L7a, putative | LinJ.07.0550 |
| 11 | CATGAGCGGCCACCCGCTTGT | 38 | ubiquitin-fusion protein | LinJ.31.1930 |
| 12 | CATGCGTGGACGATTTCAAAG | 36 | ribosomal protein S29, putative | LinJ.28.2360 |
| 13 | CATGCCGGCAGCACACCAACA | 35 | histone h4 | LinJ.06.0010 |
| 14 | CATGCCCTTCATTTTCCTCCC | 32 | RNA binding protein, putative | LinJ.32.0790 |
| 15 | CATGCGCCCTTTCATTTTAAC | 29 | hypothetical protein, conserved | LinJ.33.0970 |
| 16 | CATGACGCTCTTGCCAACCTG | 28 | ribosomal protein s26, putative | LinJ.30.3240 |
| 17 | CATGCGAAAAACAGCACAGCA | 27 | 60S ribosomal protein L6, putative | LinJ.15.1060 |
| 18 | CATGGCTGCCGCTACCGCAGC | 26 | ribosomal protein s11 homolog | LinJ.20.1620 |
| 19 | CATGATCAGATTTTGTTTTGT | 26 | ribosomal protein L3, putative | LinJ.34.2730 |
| 20 | CATGTGCGTGCCGGTGCCGGT | 25 | hypothetical protein, conserved | LinJ.32.2840 |
| 21 | CATGCGGCTTGTTTATCCTTT | 25 | 60S ribosomal protein L12, putative | LinJ.35.2230 |
| 22 | CATGCGGGCGCGGACTTTGCG | 24 | ribosomal protein L24, putative | LinJ.36.1130 |
| 23 | CATGCGCGAGGGCTGTGAAGC | 24 | hypothetical protein, conserved | LinJ.27.0130 |
| 24 | CATGTGCAAGACTCACGTCCA | 23 | ribosomal protein S25 | LinJ.34.0460 |
| 25 | CATGGTCGTTTTGCGGGCAGC | 22 | hypothetical protein, conserved | LinJ.13.0270 |
| 26 | CATGAAATGAAAAGGAAAGGC | 22 | ribosomal protein L15, putative | LinJ.30.3710 |
| 27 | CATGTGGATCGCGGGGTGCCA | 22 | 60S ribosomal protein L2, putative | LinJ.32.4050 |
| 28 | CATGGTGGACAGTGGCGAGCG | 22 | 60S acidic ribosomal subunit protein, | LinJ.27.1300 |
| 29 | CATGGCAGCTGCCGCTGCGTC | 21 | 60S ribosomal protein L34, putative | LinJ.36.3930 |
| 30 | CATGGGATGGAAGGCATCGCA | 20 | ATP-dependent zinc metallopeptidase, | LinJ.18.0620 |
| 31 | CATGCAGAGAGAAGCAGTGCA | 20 | heat shock protein 83-1 | LinJ.33.0350 |
| 32 | CATGTGCTGTGATCCGTGTGT | 20 | hypothetical protein, conserved | LinJ.24.2290 |
| 33 | CATGTATGCCTAAAGTACCAC | 20 | hypothetical protein, conserved | LinJ.31.0920 |
| 34 | CATGGGAAACACTCTGCGCCC | 19 | 60S ribosomal protein L19, putative | LinJ.06.0430 |
| 35 | CATGACGGTGGGTGCGCTAAT | 18 | hypothetical protein, conserved | LinJ.30.3590 |
This table shows the 35 most abundant annotated tags that align to only one gene, with their number of occurrences in the L. i. chagasi library and the corresponding gene name and gene ID in GeneDB.
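Some gene IDs appear more than once in the table, since two different tags can align to the same gene (for example LinJ.27.1070 and LinJ.33.0350). The sketch below groups a few rows copied from the table by gene ID and sums their frequencies. Summing tag counts per gene is an illustration only, not an analysis described in this record, and the helper names are hypothetical.

```python
from collections import defaultdict

# (long tag, frequency, gene ID) for a few rows copied from the table.
rows = [
    ("CATGCGCGCGATGGTGCCCCC", 125, "LinJ.17.0100"),
    ("CATGCGACTTAGACCGTGAGG", 91, "LinJ.27.1070"),
    ("CATGGGCGCACGGCGGCGCCG", 68, "LinJ.33.0350"),
    ("CATGACTGGACATTCAAAAGA", 41, "LinJ.27.1070"),
    ("CATGCAGAGAGAAGCAGTGCA", 20, "LinJ.33.0350"),
]


def tags_per_gene(rows):
    """Group (tag, frequency) pairs under their gene ID."""
    grouped = defaultdict(list)
    for tag, freq, gene_id in rows:
        grouped[gene_id].append((tag, freq))
    return dict(grouped)


def combined_frequency(rows):
    """Sum tag frequencies per gene ID (illustrative aggregation)."""
    return {
        gene_id: sum(freq for _, freq in tags)
        for gene_id, tags in tags_per_gene(rows).items()
    }


for gene_id, total in combined_frequency(rows).items():
    print(gene_id, total)
```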