| Literature DB >> 21261994 |
Laura I Ruvalcaba-Trejo1, Nancy R Sturm.
Abstract
BACKGROUND: Chagas disease has a diverse pathology caused by the parasite Trypanosoma cruzi, and is indigenous to Central and South America. A pronounced feature of the trypanosomes is the kinetoplast, which is comprised of catenated maxicircles and minicircles that provide the transcripts involved in uridine insertion/deletion RNA editing. T. cruzi exchange genetic material through a hybridization event. Extant strains are grouped into six discrete typing units by nuclear markers, and three clades, A, B, and C, based on maxicircle gene analysis. Clades A and B are the more closely related. Representative clade B and C maxicircles are known in their entirety, and portions of A, B, and C clades from multiple strains show intra-strain heterogeneity with the potential for maxicircle taxonomic markers that may correlate with clinical presentation.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21261994 PMCID: PMC3040149 DOI: 10.1186/1471-2164-12-58
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genes in the Sylvio X10, CL Brener and Esmeraldo maxicircle consensus sequences
| position (5'-3') | length | ||||||
|---|---|---|---|---|---|---|---|
| Gene | Editing | Sylvio | CL Brener | Esmeraldo | Sylvio | CL Brener | Esmeraldo |
| CLADE A | CLADE B | CLADE C | |||||
| 12S rRNA | - | 1-1159 | 1-1161 | 1-1160 | 1159 | 1161 | 1160 |
| 9S rRNA | - | 1203-1812 | 1200-1808 | 1197-1804 | 610 | 608 | 607 |
| ND8 | extensive | 1856-2135 | 1853-2131 | 1957-2127 | 280 | 279 | 271 |
| ND9 | extensive | 2198-2547 | 2195-2532 | 2192-2545 | 350 | 338 | 354 |
| MURF5 | none | 2541-2801 | 2526-2789 | 2522-2788 | 261 | 264 | 267 |
| ND7 | extensive | 2877-3623 | 2857-3611 | 2862-3617 | 747 | 755 | 756 |
| COIII | extensive | 3697-4116 | 3678-4100 | 2685-4109 | 420 | 424 | 425 |
| Cyb | 5' end | 4189-5269 | 4175-5254 | 4167-5246 | 1081 | 1080 | 1080 |
| ATPase 6 | extensive | 5310-5639 | 5292-5627 | 5288-5622 | 330 | 336 | 335 |
| MURF1 | none | 5696-7036 | 5675-7015 | 5681-7029 | 1341 | 1340 | 1347 |
| CR3* | extensive | 7023-7144 | 7002-7120 | 7016-7136 | ~122 | ~119 | ~121 |
| NDI | none | 7138-8079 | 7116-8057 | 7132-8073 | 942 | 942 | 942 |
| COII+gRNA | internal | 8093-8737 | 8071-8715 | 8088-8733 | 645 | 645 | 646 |
| MURF2 | 5' end | 8748-9800 | 8725-9780 | 8750-9794 | 1053 | 1056 | 1045 |
| COI | none | 9791-11440 | 9771-11420 | 9785-11434 | 1650 | 1650 | 1650 |
| CR4 | extensive | 11492-11700 | 11471-11677 | 11487-11658 | 209 | 207 | 172 |
| MURF2 gRNA | - | 11764-11804 | 11744-11784 | - | 41 | 41 | - |
| ND4 | none | 11802-13114 | 11782-13095 | 11659-12872 | 1313 | 1314 | 1214 |
| ND3 | extensive | 13108-13300 | 13087-13279 | 12864-13051 | 193 | 193 | 188 |
| RPS12 | extensive | 13377-13564 | 13357-13547 | 13129-13315 | 188 | 191 | 187 |
| ND5 | none | 13586-15185** | 13568-15337 | 13335-15105 | 1600+ | 1770 | 1771 |
Gene positions are given relative to the start of the 12S rRNA
*CR3 5' and 3' end positions are uncertain
**partial sequence
Figure 1Phylogenetic relationship among the three clade representatives. A Neighbor Joining tree shows 100% bootstrap for clade A and B and for clade A and B. The bootstrap and NJ tree were calculated using MEGA 4.0 default parameters and the coding region of the maxicircle from 12S to partial ND5 for each strain. T. brucei and L. tarentolae were used as outgroups. Sylvio X10 is a representative of clade A; CL Brener is a representative of clade B; Esmeraldo is a representative of clade C.
Figure 2Maxicircle-encoded Sylvio X10 gRNA for MURF2 RNA editing. Alignment of the MURF2 gRNA from Sylvio X10 maxicircle aligned with the predicted editing pattern for the 5' end of the MURF2 transcript. Standard Watson-Crick base pairing is shown by '|' and G-U wobble base pairing is represented by ':'. The cognate CL Brener maxicircle-encoded gRNA is shown below. The transitions between the two gRNAs are underlined in CL Brener.
Average percent identities among the three maxicircle clades
| Comparison | Non-edited | Edited | Intergenic | ||
|---|---|---|---|---|---|
| rRNAs | Genes | Proteins | Genes | ||
| Sylvio vs CL Brener | 93% | 92% | 86% | 93% | 81% |
| CL Brener vs Esmeraldo | 92% | 89% | 86% | 90% | 73% |
| Sylvio vs Esmeraldo | 93% | 89% | 86% | 89% | 74% |
Non-edited genes: Cyb, MURF1, ND1, COII, MURF2, COI, ND4, partial ND5
Proteins: MURF1, ND1, COII, MURF2, COI
Edited genes: ND8, ND7, COIII, ATPase6, CR4, CR5, RPS12
Figure 3An insertion in the unedited region of the Sylvio X10 Cyb gene. A) Portion of the Cyb nucleotide alignment from T. cruzi strains Sylvio X10, CL Brener, and Esmeraldo. The Sylvio X10 Cyb gene contains an insertion ('i' under the alignment) at position 817, disrupting the reading frame and resulting in an early stop codon. Positions of single nucleotide polymorphisms (SNPs) are indicated by 's' above the alignment; the distribution of SNPs is compiled in Table 3. B) The predicted Cyb amino acid alignment based on the predicted edited sequence in T. cruzi strains Sylvio X10, CL Brener and Esmeraldo. The T insertion present in position 817 causes an early stop codon (#) at position 284. Positions in the alignment with '*' below are identical; '.' indicates a conservative change; a blank represents a non-conservative change or three different amino acids. C) Deletion of Sylvio X10 817 insertion restores the full-length reading frame; the relevant portion of the alignment is shown.
SNPs and indels of non-edited and 5'-edited/minimally edited genes
| SNPs | Amino acid alterations | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Indels | SNPs | A+B | B+C | A+C | Unique | total | A+B | B+C | A+C | Unique |
| 12S rRNA | 18 | 125 | 44 | 37 | 33 | 11 | - | ||||
| 9S rRNA | 8 | 46 | 20 | 12 | 14 | 0 | - | ||||
| MURF5 | 17 | 40 | 14 | 12 | 14 | 0 | 27 | 8 | 8 | 4 | 7 |
| Cyb | 1 | 129 | 64 | 36 | 28 | 1 | 15 | 7 | 5 | 3 | 0 |
| MURF1 | 10 | 168 | 77 | 47 | 40 | 4 | 29 | 19 | 2 | 7 | 1 |
| ND1 | 0 | 130 | 43 | 32 | 54 | 1 | 31 | 13 | 10 | 6 | 2 |
| COII | 0 | 75 | 33 | 18 | 23 | 1 | 17 | 5 | 4 | 8 | 0 |
| MURF2 | 20 | 144 | 64 | 35 | 39 | 6 | - | ||||
| COI | 0 | 206 | 80 | 62 | 55 | 9 | 8 | 6 | 2 | 0 | 0 |
| ND4* | 2 | 181 | 80 | 44 | 52 | 5 | 38 | 21 | 6 | 8 | 3 |
| ND5** | 7 | 223 | 84 | 79 | 55 | 5 | 55 | 25 | 18 | 7 | 5 |
| Total | 83 | 1467 | 603 | 414 | 407 | 43 | 220 | 104 | 55 | 43 | 18 |
*for region beyond Esmeraldo 5' deletion; translation includes correction of frameshift in Sylvio
**through available Sylvio sequence, ~1600 of ~1770 nts; Esmeraldo translation corrected for frameshifts
Figure 4Insertion in the non-edited ND4 gene in Sylvio X10. A) Partial representation of the ND4 nucleotide alignment from T. cruzi strains Sylvio X10, CL Brener and Esmeraldo. SNPs are indicated by 's' above the alignment; the distribution of SNPs is compiled in Table 3. The Sylvio X10 strain has an insertion ('i' under the alignment) at position 656. B) The ND4 amino acid alignment from the three T. cruzi strains; the Esmeraldo sequence is partial due to a 99-nt deletion in the 5' end. An early stop codon (#) is present at position 226 in the Sylvio X10 strain. The available Esmeraldo translation is included for comparison. C) Deletion of the extra A in the Sylvio X10 ND4 gene restores the truncated portion of the protein from aa 226.
SNPs and indels of extensively edited genes
| Gene | Indels | A+B | B+C | A+C | SNPs | A+B | B+C | A+C | Unique |
|---|---|---|---|---|---|---|---|---|---|
| ND8 | 31 | 19 | 4 | 8 | 6 | 1 | 2 | 2 | 1 |
| ND9 | 38 | 16 | 16 | 6 | 18 | 13 | 1 | 4 | |
| ND7 | 62 | 32 | 11 | 19 | 33 | 14 | 9 | 10 | |
| COIII | 28 | 16 | 7 | 5 | 14 | 5 | 5 | 4 | |
| ATPase6 | 18 | 8 | 7 | 3 | 17 | 8 | 6 | 3 | |
| CR3 | 4 | 3 | 1 | 0 | 4 | 3 | 1 | 0 | |
| CR4* | 28 | 13 | 11 | 4 | 18 | 8 | 3 | 7 | |
| ND3 | 21 | 14 | 4 | 3 | 18 | 8 | 3 | 7 | |
| RPS12 | 11 | 5 | 2 | 4 | 2 | 1 | 1 | 0 | |
| Total | 241 | 126 | 63 | 52 | 130 | 61 | 31 | 37 | 1 |
*beginning after the Esmeraldo deletion
Figure 5Abundant indels in extensively edited genes. The COIII nucleotide sequence alignment using Sylvio X10, CL Brener and Esmeraldo is shown as an example. SNPs are indicated by 's' above the alignment and indels by 'i' below the alignment. In this particular case, indels outnumber SNPs 2:1. The distributions of SNPs and indels for COIII and other extensively edited genes are compiled in Table 4.
Figure 6. Linear representation of the mitochondrial maxicircle for the T. cruzi strains Sylvio X10, CL Brener and Esmeraldo with labeled indels and genes. Strain-specific stop codons resulting from indels are marked with a red asterisk (*).
Primers for PCR amplification
| Primer name | Primer sequence (5'-3') | Binding site | Tm | MgCl2 | Region amplified | Product |
|---|---|---|---|---|---|---|
| (°C) | (mM) | (nt) | ||||
| Tc. Sylvio.cons.Region.For1 | TTGTAAAAACCTTATCAGCAAAGAAA | Palindrome | 46 | 3.5 | Palindrome-partial 12S | ~2,000 |
| Tc. Maxi.12S.rRNA.Rev | TTACTTGGTACATATATAACAACTG | 12S | ||||
| Tc.CLBmaxi12s.For2 | GCACAGTTGTTATATATGTAC | 12S | 50 | 2.5 | partial 12S- partial 9S | 1,528 |
| Tc.Sylvio.Maxi9s.2R | GATTACTGCACGTTATTTTTATT | 9S | ||||
| Tc.9s.For.unedited | GCCCACCAATTTTTATAATAA | 9S | 38 | 2.5 | partial 9S-partial ND7 | 1,576 |
| Tc.Maxi.ND8-Murf5.UTR.unedited.Rev | ATCCTTCGAACATCCCTCCT | ND7 | ||||
| Tc.ND7.For.unedited | CGGGAAGGAAGAACAGTT | ND7 | 37 | 2.5 | partial ND7-partial Cyb | 2,161 |
| Tc.Cyb.Rev.CDS.not edited | CTAATCTAATCTACATACAAC | Cyb | ||||
| Sylvio.Cyb.For.CDS.not edited | GTTGTATGTAGATTAGATTAG | Cyb | 55 | 2.5 | partial Cyb-partial ND1 | 2,773 |
| Tc.CL Brener.NDI.CDS.Rev1 | TTAATCTTATCAGGATTTGTTAGCC | ND1 | ||||
| Tc.CL Brener.NDI.CDS.For1 | GGCTAACAAATCCTGATAAGATTAA | ND1 | 50 | 3.5 | partial ND1-partial COI | 3,419 |
| Tc.Maxi.4.Rev2 | TTGCTTAAGTGTTTCCCACAAA | COI | ||||
| Tc.Maxi.4.For | TTTGTGGGAAACACTTAAGCAA | COI | 40 | 2.5 | partial COI-partial ND5 | 2,283 |
| Tc.Maxi.ND3-RPS12.Rev.2 | TTGATTGTCAAAAACTTATAAAATGCC | ND5 | ||||
| Tc. maxiRPS12-ND5.unedited.For1 | CCAACTTCCCTTCAAACCAA | RPS12 | 50 | 2.5 | partial RPS12-partial ND5 | 1,993 |
| Tc.maxi.ND5.unedited.Rev1 | TTCAAAATAACATAACAACATCCGTA | ND5 | ||||
Internal primers
| Primer name | Primer sequence (5'-3') | Binding site |
|---|---|---|
| 99904 2-cons-12s F | CACAAGTTGTTATGCATGTAA | conserved region |
| 99905 2-cons-12s R | CTATCACAATTTGTGGGAAAA | conserved region |
| 83251 13-12S-9S F | CGAAAATAAAATTTTAGTAGCATA | 9S |
| 6713 B-9S-ND7 4F | GGTTGCCCTCTTGTTGTCAT | ND9 |
| 1-42 CO3 CYB (R) | CCTAAACTGAACCCCACTCC | Cyb |
| 1-43 CO3 CYB (R) | CCTAAACTGAACCCCACTCC | Cyb |
| 1-69 CO3CyB (R ) | CAAAAGCAAAGTCGCTCAC | Cyb |
| 1-68-CO3CyB (F) | TTGGAGTTGGGTGGAGTTC | ND7 |
| 55103 A Murf4-Cr3 R | CGATTATCTCAGAAAGTGCCTTA | ND1 |
| 64986 B-MURF4-CR3 3F | GGCAATGGGAATTGTACCTA | MURF1 |
| 6611 B-MRF4-CR3 2F | CGGGCAACAACGGTTTTGA | MURF4 (ATPase6) |
| 69245 COII-CO1 Maxi Sylvio F1 | ACCAATTTTGTATAACGCAATTATTA | MURF2 |
| 894-17F 4-COII-COI F | CGTATGCTTCTTAATATTATATTT | MURF2 |
| 894-18 4-COII-COI R | GCTTGATATAATGCTGCATGA | COI |
| 753-5 CO2-CO1 2F | CGGTATCAATTTTTTGATATA | MURF2 |
| 54597 CO2 CO1 2R | CGAGCATTATAAATTCTATTAA | COI |
| 55799 CO2-CO1 F | GTAGAGAACCGGGGAGGTGT | COII |
| 57799 CO2-CO1 R | TTGCACCTGTTATGGTTGGAT | COI |
| 1-58-ND4CR4Bfor-M13F_B08 | GGAGACTTTTTTACCAAGGG | ND3 |
| 1-59-ND4CR4Brev-M13R_CO8 | TCCCCCTTCTTCTCCTTCAC | CR4 |
| 259-67 RPS12-ND5 F | TGGCTAACCTTTTCATGTTCA | ND5 |
| 256-66 RPS12-ND5 R | CCTTGCAATAAAATCCACACAA | ND5 |