| Literature DB >> 18036214 |
Abstract
BACKGROUND: Gene order along the genome sequence of the human parasite Trypanosoma brucei provides evidence for a 0.5 Mb duplication, comprising the 3' regions of chromosomes 4 and 8. Here, the principal aim was to examine the contribution made by this duplication event to the T. brucei genome sequence, emphasising the consequences for gene content and the evolutionary change subsequently experienced by paralogous gene copies. The duplicated region may be browsed online at http://www.genedb.org/genedb/tryp/48dup_image.jspEntities:
Mesh:
Year: 2007 PMID: 18036214 PMCID: PMC2212663 DOI: 10.1186/1471-2164-8-432
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1ACT comparison of whole Chromosomes 4 and 8 in T. brucei and Chromosome 31 in L. major. Chromosomes are presented as in GeneDB, with both forward and reverse strands and loci represented as coloured rectangles; grey arrows indicate the direction of transcription along polycistronic regions. Red rectangles denote the boundaries of the duplicated region on each chromosome; significant tBLASTx matches between homologous genes are linked by red coloured lines. Note that, while indicative, not all sequence affinity between chromosomes was detected in this tBLASTx search.
Figure 2Frequency distribution of sequence identity between paralogs retained on both duplicons. Nucleotide sequence identity is recorded for coding regions and each UTR.
Figure 3Sequence divergence between paralogs. Values for coding regions are correlated with 5' UTR (a) and 3'UTR (b) regions. Horizontal dashed lines denote the 25% nucleotide identity expected between two unrelated sequences.
Shared, paralogous loci showing significant asymmetry in the rate of non-synonymous substitutions per site since duplication, as determined by the canonical relative rates test.
| Chr4 | Chr8 | Identity: | Identity | Length (bp) | Identity | Length (bp) | # Sites | Δ | SD | ratio | P | ||||||
| Chr4 | Chr8 | Chr4 | Chr8 | ||||||||||||||
| 5 | Tb927.4.5350 | Tb927.8.6970 | 3-methylcrotonyl-CoA carboxylase | 0.983 | 0.661 | 0.652 | 0.976 | 42 | 0.984 | 327 | 1361.5 | 0.253 | 0.241 | -0.012 | 0.004 | -2.797 | 0.005 |
| 9 | Tb927.4.5310 | Tb927.8.7110 | S/T-protein kinase A | 0.845 | 0.684 | 0.714 | 0.357 | 8 | 0.239 | ~250 | 887.1 | 0.141 | 0.184 | 0.043 | 0.011 | 3.862 | 0.000 |
| 20 | Tb927.4.5050 | Tb927.8.7380 | dihydrolipoamide dehydrogenase | 0.924 | 0.621 | 0.647 | 0.308 | 0 | 0.303 | 18 | 1085.0 | 0.268 | 0.255 | -0.013 | 0.005 | -2.647 | 0.008 |
| 43 | Tb927.4.4550 | Tb927.8.7780 | Tb927.8.7760 | 0.516 | 0.453 | 0.366 | 0.247 | 0 | 0.436 | 0 | 1723.4 | 0.439 | 0.606 | -0.168 | 0.023 | -7.302 | 0.000 |
| 46 | Tb927.4.4520 | Tb927.8.7820 | Tb927.8.7800 | 0.407 | 0.218* | 0.242* | 0.396 | 80 | 0.47 | 500? | 1881.0 | 0.886 | 0.764 | 0.122 | 0.035 | 3.500 | 0.000 |
| 51 | Tb927.4.4350 | Tb927.8.8030 | Tb927.8.7950 | 0.433 | 0.239 | 0.261 | 0.156 | 0 | 0.25 | 0 | 3308.0 | 0.939 | 0.863 | 0.076 | 0.028 | 2.729 | 0.006 |
| 54 | Tb927.4.4380 | Tb927.8.7980 | monoglyceride lipase | 0.783 | 0.616 | 0.553 | 0.098 | 0 | 0.199 | 0 | 610.9 | 0.300 | 0.343 | -0.043 | 0.018 | -2.346 | 0.019 |
| 58 | Tb927.4.4240 | Tb927.8.8070 | glycosyltransferase | 0.519 | 0.438 | 0.416 | 0.192 | 0 | 0.222 | 0 | 753.3 | 0.497 | 0.565 | -0.068 | 0.034 | -1.975 | 0.048 |
| 59 | Tb927.4.4220 | Tb927.8.8140 | Tb927.8.8070/8110 | 0.145 | 0.13* | 0.098* | 0.219 | 0 | 0.258 | 0 | 151.5 | 0.547 | 0.930 | -0.384 | 0.117 | -3.270 | 0.001 |
| 70 | Tb927.4.3970 | Tb927.8.8320 | amino acid transporter | 0.734 | - | - | 0.261 | 0 | 0.241 | 0 | 944.1 | 0.220 | 0.182 | 0.037 | 0.015 | 2.506 | 0.012 |
| 73 | Tb927.4.3910 | Tb927.8.8350 | Tb927.4.3920 (TMH) | 0.727 | 0.609 | 0.559 | 0.166 | 0 | 0.276 | 0 | 564.4 | 0.265 | 0.360 | -0.095 | 0.023 | -4.127 | 0.000 |
| 74 | Tb927.4.3880 | Tb927.8.8360 | mitotic centromer-associated kinesin | 0.686 | 0.43 | 0.432 | 0.473 | 190 | - | - | 1339.9 | 0.497 | 0.536 | -0.039 | 0.018 | -2.141 | 0.032 |
Locus number refers to paralogous loci retained on both duplicons, as described in Additional files 1 and 2.
An asterisk * denotes the use of T. vivax as the outgroup comparison, in place of T. cruzi.
Shared, paralogous loci showing significant asymmetry in total substitution rate since duplication (P < 0.05), as determined by the Bayesian relative rates test.
| Locus | Identifier | Chromosome 4: | Chromosome 8: | |||||||
| Chr4 | Chr8 | Average | SD | Range | Average | SD | Range | |||
| 42 | Tb927.4.4570 | Tb927.8.7760 | 0.4483 | 0.0220 | 0.404 | 0.492 | 0.0265 | |||
| 45 | Tb927.4.4530 | Tb927.8.7800 | 0.0284 | 0.3339 | 0.0240 | 0.286 | 0.382 | |||
| 50 | Tb927.4.4400 | Tb927.8.7950 | 0.0269 | 0.5099 | 0.0232 | 0.464 | 0.556 | |||
| 58 | Tb927.4.4240 | Tb927.8.8070 | 0.4534 | 0.0774 | 0.299 | 0.608 | 0.1162 | |||
Locus number refers to Additional file 1.
Values in bold denote the paralogs with the faster evolutionary rate.
Retained paralogs with codons showing evidence for positive or negative selection.
| Locus | Identifier Chr4 | Chr8 | Selection analysis: With outgroup | Without outgroup | ||||||
| Method | Global ω | + | - | + codons | - codons | + codons | - codons | |||
| 6 | Tb927.4.5340 | Tb927.8.6980 | SLAC | 0.382 | 0.369 | 0.462 | 0 | 0 | ||
| FEL | 0 | 54 | ||||||||
| REL | 2 | 65 | 67 | 0 | ||||||
| 8 | Tb927.4.5320 | Tb927.8.7090 | SLAC | 0.333 | 0.325 | 0.418 | 0 | 0 | ||
| FEL | 0 | 84 | ||||||||
| REL | 5 | 340 | 7 | 6 | ||||||
| 10 | Tb927.4.5300 | Tb927.8.7140 | SLAC | 0.289 | 0.271 | 0.364 | 0 | 1 | ||
| FEL | 1 | 44 | ||||||||
| REL | 15 | 192 | 20 | 7 | ||||||
| 12 | Tb927.4.5220 | Tb927.8.7190 | SLAC | 0.315 | 0.302 | 0.415 | 0 | 0 | ||
| FEL | 0 | 31 | ||||||||
| REL | 6 | 77 | 19 | 7 | ||||||
| 14 | Tb927.4.5180 | Tb927.8.7220 | SLAC | 0.201 | 0.192 | 0.266 | 0 | 0 | ||
| FEL | 0 | 73 | ||||||||
| REL | 4 | 135 | 2 | 49 | ||||||
| 21 | Tb927.4.5030 | Tb927.8.7390 | SLAC | 0.072 | 0.062 | 0.112 | 0 | 0 | ||
| FEL | 0 | 65 | ||||||||
| REL | 5 | 286 | 3 | 24 | ||||||
| 27 | Tb927.4.4960 | Tb927.8.7460 | SLAC | 0.188 | 0.176 | 0.246 | 0 | 0 | ||
| FEL | 0 | 76 | ||||||||
| REL | 7 | 289 | 49 | 0 | ||||||
| 31 | Tb927.4.4920 | Tb927.8.7500 | SLAC | 0.413 | 0.363 | 0.508 | 0 | 0 | ||
| FEL | 0 | 13 | ||||||||
| REL | 6 | 63 | 15 | 0 | ||||||
| 40 | Tb927.4.4730 | Tb927.8.7740 | SLAC | 0.148 | 0.138 | 0.202 | 0 | 1 | ||
| FEL | 0 | 45 | ||||||||
| REL | 4 | 201 | 40 | 0 | ||||||
| 43 | Tb927.4.4550 | Tb927.8.7780 | SLAC | 0.199 | 0.193 | 0.261 | 0 | 1 | ||
| FEL | 0 | 158 | ||||||||
| REL | 11 | 17 | 5 | 7 | ||||||
| 46 | Tb927.4.4520 | Tb927.8.7820 | SLAC | 0.342 | 0.33 | 0.426 | 0 | 0 | ||
| FEL | 1 | 57 | ||||||||
| REL | 1 | 73 | 6 | 0 | ||||||
| 49 | Tb927.4.4470 | Tb927.8.7860 | SLAC | 0.256 | 0.253 | 0.335 | 0 | 2 | ||
| FEL | 0 | 214 | ||||||||
| REL | 2 | 187 | 20 | 0 | ||||||
| 50 | Tb927.4.4400 | Tb927.8.7950 | SLAC | 0.369 | 0.365 | 0.447 | 0 | 10 | ||
| FEL | 4 | 191 | ||||||||
| REL | 0 | 121 | 2 | 0 | ||||||
| 52 | Tb927.4.4370 | Tb927.8.8000 | SLAC | 0.216 | 0.207 | 0.28 | 0 | 0 | ||
| FEL | 0 | 91 | ||||||||
| REL | 49 | 68 | 8 | 0 | ||||||
| 62 | Tb927.4.4160 | Tb927.8.8170 | SLAC | 0.282 | 0.273 | 0.345 | 0 | 1 | ||
| FEL | 1 | 94 | ||||||||
| REL | 3 | 181 | 5 | 0 | ||||||
| 71 | Tb927.4.3950 | Tb927.8.8330 | SLAC | 0.249 | 0.242 | 0.318 | 0 | 0 | ||
| FEL | 1 | 115 | ||||||||
| REL | 14 | 458 | 27 | 41 | ||||||
Shared, paralogous loci showing significant disparity (p < 0.005) between the ratio of invariable to variable mutations at synonymous (S) and non-synonymous sites (R) respectively, as determined by G test.
| Locus | Description | Mutation type: | G | RatioR | RatioS | RatioRS | |||
| RI | RV | SI | SV | ||||||
| 2 | Tb927.4.5380 (alcohol dehydrogenase-like) | 436 | 10 | 310 | 21 | 8.155 | 43.60 | 14.76 | 2.954 |
| 6 | Tb927.4.5340 | 356 | 284 | 147 | 191 | 13.03 | 1.254 | 0.770 | 1.629 |
| 8 | Tb927.4.5320 | 197 | 384 | 105 | 314 | 9.131 | 0.513 | 0.334 | 1.534 |
| 24 | Tb927.8.7420 | 256 | 5 | 210 | 15 | 6.937 | 51.20 | 14.00 | 3.657 |
| 29 | Tb927.8.7480 | 135 | 484 | 53 | 317 | 8.674 | 0.279 | 0.167 | 1.668 |
| 35 | Tb927.8.7580 (TMH/SP) | 201 | 539 | 75 | 400 | 22.05 | 0.373 | 0.188 | 1.989 |
| 36 | amino acid transporter | 132 | 279 | 89 | 379 | 19.92 | 0.473 | 0.235 | 2.015 |
| 37 | amino acid transporter | 54 | 191 | 41 | 287 | 9.081 | 0.283 | 0.143 | 1.979 |
| 41 | amino acid transporter | 139 | 60 | 83 | 77 | 12.09 | 2.317 | 1.078 | 2.149 |
| 49 | Tb927.8.7850 | 210 | 598 | 84 | 429 | 17.25 | 0.351 | 0.196 | 1.793 |
| 50 | adenylate cyclase GRESAG 4 | 73 | 502 | 98 | 414 | 8.458 | 0.145 | 0.237 | 0.614 |
| 51 | Tb927.8.7950 | 430 | 1671 | 178 | 905 | 7.649 | 0.257 | 0.197 | 1.308 |
| 58 | UDP-GlcNAc-dependent glycosyltransferase | 67 | 318 | 24 | 247 | 10.12 | 0.211 | 0.097 | 2.168 |
| 62 | Tb927.4.4180 | 234 | 775 | 83 | 475 | 15.94 | 0.302 | 0.175 | 1.728 |
| 68 | Tb927.8.8270 | 263 | 748 | 97 | 485 | 19.05 | 0.352 | 0.200 | 1.758 |
Sequence divergence of shared, paralogous loci, displaying four different evolutionary dynamics.
| Chr4 | Chr8 | CDS | 3'UTR | ||
| 1 | Tb927.4.5390 | Tb927.8.6930 | serine/threonine-protein kinase NrkA | 0.983 | 0.968 |
| 2 | Tb927.4.5380 | Tb927.8.6940 | quinonprotein alcohol dehydrogenase-like | 0.993a | 0.995 |
| 3 | Tb927.4.5370 | Tb927.8.6950 | dynein light chain 2B | 0.994 | 0.994 |
| 4 | Tb927.4.5360 | Tb927.8.6960 | TMH/SP | 0.987 | 0.984 |
| 5 | Tb927.4.5350 | Tb927.8.6970 | 3-methylcrotonyl-CoA carboxylase | 0.983b | 0.984 |
| 22 | Tb927.4.5020 | Tb927.8.7400 | RNA polymerase IIA largest subunit | 0.999 | 1 |
| 24 | Tb927.4.5000 | Tb927.8.7420 | C2 calcium/lipid-binding region, CaLB | 0.995a | 0.966 |
| 30 | Tb927.4.4930 | Tb927.8.7490 | 0.987 | 0.956 | |
| 64 | Tb927.4.4150 | Tb927.8.8180 | 0.991 | 0.995 | |
| 65 | Tb927.4.4140 | Tb927.8.8190 | 0.986 | 0.981 | |
| 67 | Tb927.4.4120 | Tb927.8.8210 | 0.984 | 0.982 | |
| 7 | Tb927.4.5330 | Tb927.8.7060 | EGF/Laminin domain | 0.235 | 0.165 |
| 10 | Tb927.4.5240 | Tb927.8.7140 | UDP-GlcNAc-dependent glycosyltransferase | 0.455 | 0.258 |
| 11 | Tb927.4.5230 | Tb927.8.7180 | 0.385 | 0.181 | |
| 12 | Tb927.4.5220 | Tb927.8.7190 | SP | 0.402 | 0.09 |
| 18 | Tb927.4.5120 | Tb927.8.7260 | kinetoplast-associated protein | 0.368 | 0.365 |
| 29 | Tb927.4.4940 | Tb927.8.7480 | phosphopantetheine attachment site | 0.491a | 0.427 |
| 33 | Tb927.4.4900 | Tb927.8.7550 | 0.282 | 0.16 | |
| 34 | Tb927.4.4890 | Tb927.8.7560 | TMH | 0.454 | 0.137 |
| 35 | Tb927.4.4880 | Tb927.8.7580 | TMH/SP, Zn-finger protein | 0.48a | 0.264 |
| 38 | Tb927.4.4810 | Tb927.8.7710 | TMH | 0.402 | 0.425 |
| 39 | Tb927.4.4790 | Tb927.8.7720 | TMH/SP | 0.391 | 0.288 |
| 42 | Tb927.4.4580 | Tb927.8.7750 | protein kinase | 0.446 | 0.422 |
| 46 | Tb927.4.4530 | Tb927.8.7800 | SPla/RYanodine receptor SPRY | 0.407b | 0.47 |
| 47 | Tb927.4.4520 | Tb927.8.7820 | cold-shock protein, DNA-binding | 0.434 | 0.222 |
| 48 | Tb927.4.4500 | Tb927.8.7830 | 0.34 | 0.25 | |
| 51 | Tb927.4.4400 | Tb927.8.7950 | leucine rich repeat | 0.433ab | 0.25 |
| 57 | Tb927.4.4310 | Tb927.8.8050 | spectrin repeat | 0.366 | 0.152 |
| 59 | Tb927.4.4240 | Tb927.8.8070 | Zinc finger, C2H2-type | 0.145 | 0.258 |
| 60 | Tb927.4.4220 | Tb927.8.8140 | small GTP-binding rab protein | 0.452 | 0.433 |
| 62 | Tb927.4.4180 | Tb927.8.8160 | 0.48a | 0.186 | |
| 69 | Tb927.4.4040 | Tb927.8.8280 | 0.436 | 0.209 | |
| 71 | Tb927.4.3970 | Tb927.8.8320 | 0.277 | 0.302 | |
| 9 | Tb927.4.5310 | Tb927.8.7110 | serine/threonine-protein kinase A | 0.845b | 0.239 |
| 15 | Tb927.4.5160 | Tb927.8.7240 | TMH/SP | 0.837 | 0.169 |
| 17 | Tb927.8.7250 | 0.958 | 0.4 | ||
| 20 | Tb927.4.5050 | Tb927.8.7380 | dihydrolipoamide dehydrogenase | 0.924b | 0.303 |
| 45 | Tb927.4.4540 | Tb927.8.7790 | LSD1 zinc finger | 0.818 | 0.227 |
| 54 | Tb927.4.4360 | Tb927.8.8020 | monoglyceride lipase | 0.783b | 0.199 |
| 56 | Tb927.4.4330 | Tb927.8.8040 | diadenosine tetraphosphatase | 0.781 | 0.214 |
| 63 | Tb927.4.4160 | Tb927.8.8170 | SP, CheY-like domain | 0.773 | 0.901 |
| 65 | Tb927.4.5150 | Tb927.8.7240 | 0.901 | 0.983 | |
a denotes evidence for 'constant but different' non-synonymous mutations.
b denotes evidence for significant rate asymmetry between retained paralogs.