Elida P B Ojopi, Paulo S L Oliveira, Diana N Nunes, Apuã Paquola, Ricardo DeMarco, Sheila P Gregório, Karina A Aires, Carlos F M Menck, Luciana C C Leite, Sergio Verjovski-Almeida, Emmanuel Dias-Neto.
Abstract
BACKGROUND: Five species of the genus Schistosoma, a parasitic trematode flatworm, are causative agents of schistosomiasis, a disease that is endemic in a large number of developing countries and affects millions of patients around the world. Using SAGE (Serial Analysis of Gene Expression), we describe here the first large-scale quantitative analysis of the transcriptome of Schistosoma mansoni, one of the most epidemiologically relevant species of this genus.
Year: 2007 PMID: 17584941 PMCID: PMC1914358 DOI: 10.1186/1471-2164-8-186
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The 50 most abundant transcripts, revealed by SAGE analysis, in S. mansoni adult worms.
| Rank | Annotation | SAGE tag | Tag count | Accession | | |
|---|---|---|---|---|---|---|
| 1 | | actattcggg | 1454 | gi| 160993| gb| M14309.1| SCMFSPA | 50 | 0 |
| 2 | | cctgtaaact | 835 | gi| 44829167| tpg| DAA04497.1| C200397.1 | 99 | 0 |
| 3 | | gaagaagtgg | 640 | gi| 161027| gb| J04017.1| SCMHSP86 | 96 | 0 |
| 4 | | tcatacaaga | 588 | gi| 790657| gb| U24281.1| SMU24281 | 95 | 0 |
| 5 | Similar to | ccggtgtctc | 513 | TC13660 | 97 | 0 |
| 6 | No matches | agcgtccaaa | 472 | TC7406 | 85 | 0 |
| 7 | | aggcaagtgg | 464 | gi| 161025| gb| L02415.1| SCMHSP70X | 67 | 0 |
| 8 | | cataatgaag | 438 | gi| 160994| gb| M92359.1| SCMGAPDH | 89 | 1 |
| 9 | | cggctcagga | 398 | gi| 2598925| gb| AF026805.1| AF026805 | 72 | 0 |
| 10 | Mitochondrial small subunit ribosomal RNA | gtagtgcttg | 364 | AF130787_1459_201 | 94 | 0 |
| 11 | No matches | cttttctaaa | 349 | TC17835 | 93 | 0 |
| 12 | Similar to | tgttgttcgt | 340 | TC17184 | 96 | 0 |
| 13 | | agcaatggaa | 319 | gi| 454257| emb| Z29960.1| SMTANREP TC7340 | 20 | 3 |
| 14 | | tgtacgtcat | 313 | | 94 | 2 |
| 15 | | tatcgttcta | 302 | gi| 160983| gb| M60895.1| SCMFABP14 | 89 | 0 |
| 16 | Similar to unknown protein | gcgagtcgaa | 293 | gi| 56758226| gb| AAW27253.1| TC16783 | 86 | 0 |
| 17 | Similar to | taatatgcgc | 265 | gi| 76161984| gb| AAX30141.2| TC11200 | 71 | 0 |
| 18 | Similar to | gtgctcgaag | 255 | gi| 14581393| gb| AAF98445.1| TC17397 | 53 | 0 |
| 19 | | tgactgatct | 254 | gi| 161010| gb| M98271.1| SCMGSTM | 88 | 1 |
| 20 | | gcacattgtc | 252 | gi| 2454222| gb| U91941.1| U91941 | 62 | 0 |
| 21 | | acatcaacaa | 225 | gi| 924602| gb| U19945.1| SMU19945 | 84 | 0 |
| 22 | similar | ccttcggtac | 225 | gi| 29841092| gb| AAP06105.1| TC13745 | 97 | 0 |
| 23 | Similar to SJCHGC09089 | gaggttatgg | 215 | gi| 56755876| gb| AAW26116.1| TC16990 | 89 | 0 |
| 24 | Similar to | ttggaggcaa | 211 | gi| 28317769| gb| AY223294.1| TC13521 | 93 | 0 |
| 25 | | gagaacacca | 198 | gi| 161086| gb| M37003.1| SCMSM226 | 71 | 0 |
| 26 | Similar to SJCHGC01209 | gtaaccaatg | 193 | gi| 56752993| gb| AAW24708.1| TC11060 | 93 | 0 |
| 27 | Similar to | cagcgtcctt | 191 | gi| 29841163| gb| AAP06176.1| TC10489 | 78 | 3 |
| 28 | Similar to SJCHGC06078 | gggattgccg | 190 | gi| 56758252| gb| AAW27266.1| TC16808 | 87 | 0 |
| 29 | | gtctgctgat | 186 | gi| 19071248| gb| AF422164.1| | 87 | 0 |
| 30 | No matches | ccaggttgtg | 184 | TC9014 | 0 | 0 |
| 31 | Similar to | gttatggcca | 174 | gi| 29841170| gb| AAP06183.1| TC10354 | 75 | 0 |
| 32 | Similar to S. | tggattcttg | 168 | gi| 56754980| gb| AY813941.1| TC14578 | 81 | 0 |
| 33 | | gtacttagtg | 167 | gi| 161043| gb| L01634.1| SCMMYH | 97 | 0 |
| 34 | | tatgttctct | 165 | gi| 4099443| gb| U87629.1| SMU87629 | 87 | 0 |
| 35 | NADH dehydrogenase 4 (ND4) gene | attttgtttg | 159 | AF130788_1003_2262 | 87 | 0 |
| 36 | | gggtatgaat | 159 | gi| 473158| emb| Z32529.1| SMCATHL | 87 | 2 |
| 37 | Similar to | ggattcggtt | 155 | gi| 29841185| gb| AAP06198.1| TC7332 | 95 | 0 |
| 38 | | ccatcagcct | 154 | gi| 34701209| gb| CD164545.1| CD164545 CD164545 | 92 | 0 |
| 39 | Similar to | gccccttgga | 154 | TC13587 | 87 | 0 |
| 40 | | tcgttctgat | 147 | gi| 1002615| gb| U30175.1| SMU30175 | 84 | 0 |
| 41 | Non-available | tccccgtaca | 145 | | | |
| 42 | | cccaccactt | 144 | gi| 33355622| gb| AY334553.1| | 84 | 0 |
| 43 | Similar to | tacctaggcc | 143 | gi| 29841379| gb| AAP06411.1| TC7615 | 79 | 0 |
| 44 | Similar to | gtcgcaaagt | 139 | gi| 56754665| gb| AAW25518.1| TC7475 | 59 | 1 |
| 45 | | ggtttagtag | 138 | gi| 3599492| gb| AF085145.1| AF085145 | 90 | 0 |
| 46 | Non-available | tgcgcgcgtg | 137 | | | |
| 47 | Similar to | cacagacagc | 134 | gi| 60687866| gb| AAX30266.1| TC8025 | 55 | 0 |
| 48 | | tggtgagggc | 130 | gi| 2623827| gb| AF030961.1| AF030961 TC13698 | 46 | 0 |
| 49 | Similar to | ctctcgagga | 128 | gi| 76162217| gb| ABA40776.1| TC7478 | 70 | 0 |
| 50 | Similar to | ggagaagaaa | 123 | gi| 56757579| gb| AAW26951.1| TC7368 | 3 | 3 |
Figure 1. Gene Ontology analysis of the most abundant protein classes in adult worms of S. mansoni. Functional classification of S. mansoni protein groups containing more than 500 tags per functional class.
Transcripts with putative alternative poly-adenylation events in S. mansoni, as suggested by SAGE.
| Gene | Length (nt) | CDS | SAGE tag | Poly-A signal | Coding region affected # | AREs * |
|---|---|---|---|---|---|---|
| Rac GTPase ( | 2,383 | 211..777 | tgtgtgtgta; 951; 366; 1. | No | No | 3 out of 4 |
| | | | acaagttatg; 1959; 351; 0. | No | No | |
| Receptor tyrosine kinase ( | 6,022 | 120..4799 | tcaatcatta; 821; 249; 1. | No | Yes | 4 out of 4 |
| | | | aagaaatgca; 2013; 132; 0. | No | No | |
| Myosin light chain ( | 965 | 30..512 | aatcctaatc; 651; 29; 1. | No | No | 2 out of 2 |
| | | | aatatataca; 819; 44; 0. | Yes | No | |
| Calponin homolog ( | 1,963 | 12..1097 | tttatcttca; 1467; 44; 1. | Yes (AATAAA – 1,480) | No | 2 out of 4 |
| | | | cccaaccctc; 1677; 513; 0. | No | No | |
| Dynein light chain ( | 469 | 6..275 | gcattgtata; 163; 29; 1. | No | No | 0 out of 0 |
| | | | aaacccataa; 207; 689; 0. | No | No | |
| Enolase trans-spliced ( | 1,461 | 40..1344 | caacgttggt; 650; 29; 1. | Yes (ATTAAA – 1,024) | Yes | 2 out of 2 |
| | | | tcgttctgat; 1235; 2154; 0. | Yes (AATAAA – 1,353) | No | |
| Cyclophilin ( | 571 | 16..501 | tgtcagggtg; 189; 44; 1. | Yes (AATAAA – 250) | Yes | 0 out of 0 |
| | | | ttgttttcgg; 382; 1465; 0. | No | No | |
| Actin ( | 1,538 | 45..1175 | gccgacgagg; 44; 29; 2. | No | Yes | 5 out of 5 |
| | | | aagtgtgatg; 893; 1216; 1. | No | Yes | 5 out of 5 |
| | | | acatcaacaa; 1307; 3297; 0. | Yes (AATAAA – 1,366) | No | |
| Phosphofructokinase ( | 3,087 | 147..2492 | ttcctttcat; 2895; 29; 2. | No | No | 1 out of 3 |
| | | | ttttccgttt; 2984; 59; 1. | Yes (AATAAA – 3,025) | No | 0 out of 3 |
| | | | taaaaaaaaa; 3043; 103; 0. | No | No | |
| Triose phosphate isomerase ( | 1,080 | 33..794 | atgtcgatgg; 714; 29; 1. | Yes (ATTAAA – 749) | No | 4 out of 4 |
| | | | tcagttactt; 844; 938; 0. | Yes (AATAAA – 1,066) | No | |
| Superoxide dismutase ( | 623 | 7..561 | gataccccag; 319; 29; 2. | No | No | 0 out of 0 |
| | | | tgctacaata; 530; 29; 1. | No | No | 0 out of 0 |
| | | | aaatgatttt; 557; 249; 0. | Yes (AATAAA – 588) | No | |
| Cu/Zn superoxide dismutase ( | 605 | 23..484 | cctattctcc; 513; 498; 1. | No | No | 2 out of 2 |
| | | | cacaaataaa; 580; 161; 0. | Yes (AATAAA – 588) | No | |
| PUR-alpha-like ( | 1,282 | 77..880 | tgcttaatag; 1110; 191; 1. | No | No | 1 out of 3 |
| | | | ttagatttct; 1242; 15; 0. | No | No | |
Transcripts with alternative poly-adenylation events were considered only when confirmed by at least 2 SAGE tags and when no internal oligo-dT binding sites (fake polyA) were present downstream. # – alternative poly-adenylation events were considered to affect the coding region when the alternative upstream tags were located at least 256 nt before the original stop codon. CDS – the cDNA coordinates of the coding region of the gene. * – the number of putative Adenylate Rich Elements (AREs) in the 3' UTR that are located downstream of the most 5' SAGE tag found in a transcript, out of the total number of 3' UTR AREs of the transcript.
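
The coding-region criterion in the footnote can be illustrated with a short check. This is a minimal sketch under stated assumptions, not the authors' code: the names `SageTag` and `affects_coding_region` are hypothetical, the end coordinate of the CDS is used as a stand-in for the position of the stop codon, and the last number in each tag entry of the table is assumed to be the tag order (0 for the most 3' tag, higher values for tags further upstream).

```python
from dataclasses import dataclass

# Threshold stated in the table footnote: an upstream tag must lie at
# least 256 nt before the original stop codon.
MIN_DISTANCE_NT = 256


@dataclass
class SageTag:
    sequence: str
    position: int  # tag position on the cDNA (second field of a tag entry)
    order: int     # ASSUMED meaning of the last field: 0 = most 3' tag


def affects_coding_region(tag: SageTag, cds_end: int) -> bool:
    """Return True if an alternative upstream tag would truncate the CDS.

    cds_end is the last coordinate of the CDS (e.g. 1175 for "45..1175"),
    used here as an approximation of the stop codon position.
    """
    if tag.order == 0:
        # The most 3' tag is the reference site, not an alternative one.
        return False
    return cds_end - tag.position >= MIN_DISTANCE_NT


# Example: the three actin tags from the table (CDS 45..1175).
actin_tags = [
    SageTag("gccgacgagg", 44, 2),
    SageTag("aagtgtgatg", 893, 1),
    SageTag("acatcaacaa", 1307, 0),
]
for tag in actin_tags:
    print(tag.sequence, affects_coding_region(tag, cds_end=1175))
```

For the actin tag at position 893 the distance to the CDS end is 1175 − 893 = 282 nt, so it passes the 256 nt threshold, whereas the superoxide dismutase tag at 319 (CDS 7..561) is only 242 nt upstream and does not. Both outcomes agree with the corresponding entries in the table.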