| Literature DB >> 18366692 |
Jyoti Srivastava1, Sanjay Premi, Sudhir Kumar, Sher Ali.
Abstract
BACKGROUND: Simple sequence repeats (SSRs) of GACA/GATA have been implicated with differentiation of sex-chromosomes and speciation. However, the organization of these repeats within genomes and transcriptomes, even in the best characterized organisms including human, remains unclear. The main objective of this study was to explore the buffalo transcriptome for its association with GACA/GATA repeats, and study the structural organization and differential expression of the GACA/GATA repeat tagged transcripts. Moreover, the distribution of GACA and GATA repeats in the prokaryotic and eukaryotic genomes was studied to highlight their significance in genome evolution.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366692 PMCID: PMC2346481 DOI: 10.1186/1471-2164-9-132
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Chromosomal distribution of GACA (A) and GATA (B) repeats across the six eukaryotes based on in-silico analysis. The repeat density of the GACA/GATA tetramers across the chromosomes sets in different species is expressed in base-pairs per megabase of each chromosome. Note the differential occurrence of these repeats along different chromosomes. The human and dog genomes were found to be GATA rich. The GATA repeats were predominant on the human and chicken Y chromosomes. Status of these repeats on the Y chromosomes in other species remained unclear due to their unfinished genomes.
Figure 2Microsatellite associated sequence amplification (MASA) performed using oligos based on varying lengths of GACA/GATA repeats and cDNA from different sources (A-D). The amplified transcripts ranged from 0.15 kb to 1.8 kb. MASA using GACA repeat with cDNA from different somatic and gonadal tissues is given in (A) and cDNA from spermatozoa from 4 animals in (B). Similarly, MASA using GATA repeats and cDNA from different somatic tissues (C) and spermatozoa is shown in (D). Note the tissue and spermatozoa-specific transcript profiles generated by GACA and GATA repeats. GATA did not detect any transcripts in lung and heart.
Detailed analysis for the MASA identified somatic and spermatozoal transcripts tagged with the GATA repeat motif from water buffalo Bubalus bubalis#
| pJC29 | Brain/1769 | 1. | 2072671 | - | 109–395 | 90% | ||
| pJC30 | Heart/1768 | 2. | 8212 | 22 | 123–385 | 90% | ||
| pJC31 | Liver/1768 | |||||||
| pJC32 | Lung/1812 | 3. | 188109 | - | 109–386 | 89% | ||
| pJC33 | Ovary/1767 | |||||||
| pJC34 | Spleen/1772 | 4. | 207929 | - | 131–395 | 90% | ||
| pJC43 | Testis/1767 | |||||||
| pJC55 | NS | Kidney/1812 | 5. | 447010 | - | 109–356 | 91% | |
| pJC44 | Kidney/1303 | 1. Pig DNA sequence from clone CH242-277I8 | 206278 | 17 | 104–277 | 86% | ||
| pJC45 | Liver/1303 | 2. Human DNA sequence from clone RP5-1009H6 on chromosome 20 Contains the 3' end of the NFATC2 gene for cytoplasmic calcineurin-dependent (2) nuclear factor of activated T-cells | 89163 | 20 | 158–245 | 90% | ||
| pJC56 | NS | Ovary/1303 | 627–774 | |||||
| pJC57 | NS | Spleen/1303 | ||||||
| pJC58 | NS | Testis/1303 | ||||||
| pJC35 | Heart/1080 | 1. Human DNA sequence from clone RP11-148E14 on chromosome 10 Contains part of the BTRC gene for beta-transducin repeat | 36454 | 10 | 281–884 | 94% | ||
| pJC36 | Liver/1080 | 2. | 206515 | 19 | 282–884 | 90% | ||
| pJC37 | Lung/1080 | |||||||
| pJC38 | Ovary/1080 | |||||||
| pJC39 | Spleen/1080 | |||||||
| pJC41 | Kidney/1080 | |||||||
| pJC59 | Testis/1080 | |||||||
| pJC40 | Testis/1043 | 1. | 104027 | 13q17 | 333–857 | 89% | ||
| 2. | 31412 | - | 333–857 | 88% | ||||
| 3. | 65476 | - | 333–857 | 87% | ||||
| pJC42 | Kidney/1067 | 1. | 4148 | - | 398–597 | 97% | ||
| 2. | 1152 | 10 | 403–446 | 93% | ||||
| pJC46 | Liver/848 | 1. | 412 | - | 131–441 | 97% | ||
| pJC60 | NS | Spleen/850 | ||||||
| pJC61 | NS | Heart/848 | 2. | 4005 | 7 | 131–374 | 97% | |
| pJC62 | NS | Testis/848 | ||||||
| pJC63 | NS | Kidney/848 | 3. | 452 | - | 68–285 | 100% | |
| pJC64 | NS | Ovary/848 | ||||||
| pJC65 | NS | Lung/858 | 4. | 1185 | - | 256–842 | 86% | |
| pJC54 | Testis/725 | 1. | 2072671 | - | 139–261 | 90% | ||
| 2. | 1253 | 6 | 139–261 | 86% | ||||
| 3. | 2828 | - | 174–253 | 91% | ||||
| pJC49 | Ovary/635 | 1. Human DNA sequence from clone RP4-752I6 on chromosome 1 Contains the 5' end of the WASF2 gene for WAS protein family | 71971 | 1 | 445–485 | 91% | ||
| pJC66 | NS | Kidney/635 | 555–635 | |||||
| pJC67 | NS | Heart/635 | ||||||
| pJC68 | NS | Liver/635 | ||||||
| pJC69 | NS | Testis/647 | ||||||
| pJC70 | NS | Spleen/635 | ||||||
| 2. Mouse DNA sequence from clone RP23-125F21 on chromosome 4 | 152069 | 4 | 555–635 | 90% | ||||
| pJC50 | Spleen/612 | 1. | 1470 | 21 | 119–368 | 86% | ||
| pJC71 | NS | Ovary/612 | ||||||
| *pJC48 | Testis/523 | 1. | 1470 | 21 | 156–405 | 86% | ||
| pJC72 | NS | Ovary/523 | ||||||
| pJC47 | Brain/455 | 1. | 419 | - | 43–437 | 100% | ||
| pJC73 | NS | Heart/455 | 2. | 153836 | 12 | 125–251 | 86% | |
| pJC74 | NS | Kidney/455 | ||||||
| pJC75 | NS | Ovary/455 | ||||||
| pJC76 | NS | Spleen/455 | ||||||
| pJC77 | NS | Lung/455 | ||||||
| pJC78 | NS | Testis/455 | ||||||
| pJC79 | NS | Liver/455 | ||||||
| pJC53 | Heart/412 | 1. | 197 | - | 52–224 | 89% | ||
| 2. | 171712 | 6 | 52–233 | 86% | ||||
| 3. | 3406 | 3 | 54–116 | 87% | ||||
| pJC51 | Testis/209 | 1. | 184175 | 1 | 186–209 | 100% | ||
| pJC80 | NS | Liver/209 | ||||||
| pJC81 | NS | Lung/209 | 2. | 191162 | 13 | 188–209 | 100% | |
| pJC82 | NS | Ovary/209 | ||||||
| pJC83 | NS | Spleen/209 | ||||||
| pJC84 | NS | Kidney/209 | ||||||
| pJC85 | NS | Heart/209 | ||||||
| *pJC52 | Testis/217 | 1. | 4601 | 8 | 7–207 | 99% | ||
| 2. | 2660 | 11 | 37–207 | 94% | ||||
| 3. Macaca mulatta ubiquitin associated protein 1 (UBAP1), | 4100 | 15 | 9–207 | 90% | ||||
| 4. | 2752 | 9p13.3 | 9–207 | 90% | ||||
| pJSC1 | 1313 | ▪ Same as pJC44–45 and pJC56–58 | ||||||
| pJSC2 | 857 | ▪ Same as pJC46 and pJC60–61 | ||||||
| pJSC3 | 807 | 1. | 757 | - | 16–792 | 96% | ||
| 2. | 2580 | - | 558–737 | 90% | ||||
| pJSC4 | 789 | 1. Hippopotamus amphibius DNA, SINE-containing sequence | 311 | - | 582–611 | 100% | ||
| 2. | 18838 | 29 | 659–686 | 100% | ||||
| 3. Globicephala macrorhynchus DNA, CHR-2 SINE FL type sequence | 321 | - | 659–742 | 88% | ||||
| pJSC5 | 844 | 1. | 1676 | - | 217–414 | 91% | ||
| 2. Canis familiaris similar to zinc finger, DHHC domain | 1470 | - | 290–397 | 83% | ||||
| pJSC6 | 797 | 1. | 176467 | 4 | 95–401 | 85% | ||
| pJSC7 | 840 | ▪ Same as pJC48 and pJC50 | - | |||||
| pJSC8 | 635 | ▪ Same as pJC49 and pJC66–70 | - | |||||
| pJSC9 | 507 | 1. | 207929 | - | 52–339 | 88% | ||
| 2. | 188109 | - | 52–339 | 87% | ||||
| 3. | 48420 | 1q43 | 52–337 | 87% | ||||
| pJSC10 | 516 | 1. | 805 | - | 272–443 | 97% | ||
| 2. | 2196 | 5p12–p13 | 356–507 | 91% | ||||
| 3. Pan troglodytes similar to disabled 2 p93 | 5113 | 5 | 356–435 | 91% | ||||
| pJSC11 | 523 | ▪ Same as pJC48, pJC50 and pJSC6 | - | |||||
| pJSC12 | 532 | 1. Human DNA sequence from clone RP11-790G19 on chromosome 10 Contains the 5' end of the gene for transmembrane receptor Unc5H2, the 3'end of a novel gene and two CpG islands | 195130 | 10 | 394–431 | 97% | ||
| pJSC13 | 531 | 1. | 187091 | 15 | 46–327 | 83% | ||
| 2. | 118230 | 8 | 133–377 | 84% | ||||
| pJSC27 | 522 | ▪ Same as pJC48, 50, 71 & 72 | - | |||||
| pJSC14 | 455 | ▪ Same as pJC47 and pJC73–79 | - | |||||
| pJSC15 | 392 | 1. | 199601 | 16 | 362–398 | 100% | ||
| pJSC16 | 387 | 1. | 826 | - | 160–335 | 89% | ||
| 2. | 153353 | - | 120–261 | 90% | ||||
| pJSC17 | 354 | 1. | 2561 | 3 | 33–346 | 97% | ||
| pJSC18 | 267 | 1. Zebrafish DNA sequence from clone CH211-222O4 in linkage group 3 | 190220 | - | 2–28 | 96% | ||
| pJSC19 | 277 | 1. | 202934 | 7 | 165–191 | 96% | ||
| pJSC20 | 291 | 1. | 9596 | Xq22–q24 | 97–220 | 91% | ||
| 2. | 1645 | - | 97–218 | 90% | ||||
| 3. | 2960 | - | 97–216 | 90% | ||||
| pJSC21 | 301 | ▪ Same as pJC48, pJC50, pJSC6 and pJSc11 | - | |||||
| pJSC22 | 273 | 1. | 466 | - | 91–203 | 89% | ||
| 2. | 2458 | 17 | 100–203 | 90% | ||||
| 3. | 12039 | 5q23 | 118–205 | 91% | ||||
| pJSC23 | 274 | 1. Human DNA sequence from clone RP11-541N10 on chromosome 10 Contains the 5' end of the SH3MD1 gene for SH3 multiple domains 1, a novel gene and two CpG islands | 190882 | 10 | 103–254 | 89% | ||
| pJC24 | 269 | NA | - | |||||
| pJSC25 | 229 | 1. | 183470 | 7 | 1–26 | 100% | ||
| pJSC26 | 209 | ▪ Same as pJC51, pJC80–85 | - | |||||
# The transcripts uncovered from somatic and gonadal tissues are given in (i) whereas spermatozoal transcripts in (ii). All of the GACA-tagged transcripts were submitted to the GenBank and the accession numbers were obtained for each transcript. The analysis carried out for their homologues, size and chromosomal positions is also given. Blast search showed homology of these transcripts with several genes/gene fragments across the species. Notably, only few of them represented by '*' had homology along the length while others showed partial homology.
Analysis of the MASA uncovered somatic and spermatozoal transcripts tagged with the GATA repeat motifs from water buffalo Bubalus bubalis#
| 1. | pJC86 | Kidney/807 | 1. | pJSC28 | 808 | ||
| 2. | pJC95 | NS | Testis/807 | 2. | pJSC31 | 425 | |
| 3. | pJC94 | NS | Ovary/807 | 3. | pJSC30 | 414 | |
| 4. | pJC93 | NS | Spleen/821 | 4. | pJSC32 | 417 | |
| 5. | pJC96 | NS | Liver/807 | 5. | pJSC33 | 367 | |
| 6. | pJSC29 | Spleen/425 | 6. | pJSC34 | 367 | ||
| 7. | pJC97 | NS | Testis/425 | 7. | pJSC35 | NS | 277 |
| 8. | pJC98 | NS | Ovary/425 | 8. | PJSC36 | NS | 282 |
| 9. | pJC99 | NS | Kidney/425 | 9. | pJSC37 | NS | 150 |
| 10. | pJC100 | NS | Liver/425 | 10. | pJSC38 | NS | 125 |
| 11. | pJC101 | NS | Testis/414 | ||||
| 12. | pJC102 | NS | Testis/417 | ||||
| 13. | pJC89 | Testis/376 | |||||
| 14. | pJC103 | NS | Ovary/367 | ||||
| 15. | pJC104 | NS | Liver/367 | ||||
| 16. | pJC105 | NS | Kidney/367 | ||||
| 17. | pJC106 | NS | Testis/367 | ||||
| 18. | pJC87 | Testis/277 | |||||
| 19. | pJC88 | Testis/282 | |||||
| 20. | pJC107 | NS | Ovary/282 | ||||
| 21. | pJC108 | NS | Spleen/282 | ||||
| 22. | pJC109 | NS | Liver/282 | ||||
| 23. | pJC90 | Testis/150 | |||||
| 24. | pJC91 | Testis/125 | |||||
# The mRNA transcripts detected in somatic tissues are described in (i) whereas spermatozoal transcripts in (ii). Note that these transcripts did not show any homology with genes present in databank.
Figure 3RT-PCR analyses for representative GACA- (A) and GATA- (B) tagged transcripts using internal primers and cDNA from different somatic tissues, gonads and spermatozoa as templates. The transcript IDs are given on the left and names of the tissues on the top. Quality and quantity of the cDNA samples was normalized (C) and genomic contamination in the RNA checked by PCR with β-actin derived primers. Tissue specificities of the transcripts were ascertained on the basis of presence or absence of amplicons using the respective cDNA templates which were further confirmed by real time PCR and Southern blotting.
Relative quantitative expression and Copy number status of the genes/gene fragments tagged with GACA & GATA repeat motifs, originating from different somatic/gonadal tissues and spermatozoa#
| 1. | pJC40 | 194 | 21 | 23 | 17 | 2 | 3 | 274 | 181 | 147 | 239 | ||||
| 2. | pJC42 | 32 | 8 | 2 | 30 | 7 | 51 | 29 | 17 | 21 | 27 | 2–3 | 2–3 | ||
| 3. | pJC52 | 512 | 32 | 34 | 83 | 24 | 60 | 6 | 9 | 5 | 7 | 3 | 3 | ||
| 4. | pJC54 | 208 | 28 | 15 | 69 | 3 | 1 | 107 | 119 | 97 | 157 | 1 | 1 | ||
| 5. | pJC29 | 15 | 10 | 13 | 45 | 20 | 22 | 49 | 52 | 32 | 45 | 1 | 1 | ||
| 6. | pJC35 | 147 | 24 | 51 | 45 | 3 | 3 | 39 | 32 | 51 | 39 | 1–2 | 1–2 | ||
| 7. | pJC44 | 44 | 21 | 17 | 34 | 11 | 25 | 97 | 111 | 97 | 128 | 1 | 1 | ||
| 8. | pJC46 | 7 | 6 | 2 | 18 | 3 | 14 | 22 | 11 | 12 | 2 | 2 | |||
| 9. | pJC47 | 34 | 11 | 14 | 91 | 14 | 2 | 73 | 97 | 87 | 84 | 1 | 1 | ||
| 10. | pJC49 | 3521 | 891 | 330 | 637 | 238 | 630 | 2896 | 4792 | 2702 | 3326 | 1 | 1 | ||
| 11. | pJC51 | 1663 | 157 | 338 | 2521 | 3 | 5 | 362 | 239 | 512 | 676 | 1 | 1 | ||
| 12. | pJC53 | 17 | 13 | 5 | 29 | 4 | 45 | 18 | 14 | 12 | 16 | 2 | 2 | ||
| 13. | pJSC11 | 4390 | 1176 | 664 | 1097 | 2 | 1195 | 6616 | 5120 | 8526 | 7342 | 25–65 | 30–65 | ||
| 14. | pJSC1 | 46 | 35 | 40 | 36 | 12 | 15 | 36 | 21 | 23 | 27 | 1 | 1 | ||
| 16. | pJSC3 | 156 | 45 | 12 | 87 | 2 | 37 | 1176 | 724 | 776 | 1440 | 1 | 1 | ||
| 17. | pJSC4 | 149 | 222 | 376 | 34 | 10 | 6 | 675 | 630 | 608 | 588 | 2 | 2 | ||
| 18. | pJSC5 | 128 | 2 | 9 | 2 | 2 | 62 | 47 | 41 | 38 | 2 | 2 | |||
| 19. | pJSC6 | 31 | 21 | 30 | 14 | 13 | 51 | 52 | 97 | 84 | 55 | 1 | 1 | ||
| 20. | pJSC9 | 53 | 4 | 3 | 4 | 3 | 3 | 15 | 14 | 15 | 14 | 2 | 2 | ||
| 21. | pJSC10 | 3 | 3 | 12 | 4 | 14 | 3 | 6 | 4 | 9 | 3 | 3 | 3 | ||
| 22. | pJSC12 | 91 | 6 | 28 | 52 | 2 | 6 | 138 | 97 | 119 | 97 | 1 | 1 | ||
| 23. | pJSC13 | 228 | 181 | 246 | 34 | 2 | 74 | 1782 | 1910 | 1097 | 1351 | 1 | 1 | ||
| 24. | pJSC15 | 39 | 4 | 26 | 13 | 5 | 2 | 49 | 35 | 45 | 39 | 8–13 | 8–10 | ||
| 25. | pJSC16 | 31 | 14 | 9 | 2 | 2 | 1 | 117 | 112 | 127 | 118 | 1 | 1 | ||
| 26. | pJSC17 | 27 | 22 | 19 | 15 | 9 | 16 | 14 | 29 | 18 | 20 | 1 | 1 | ||
| 27. | pJSC18 | 18 | 7 | 42 | 28 | 9 | 2 | 34 | 23 | 42 | 23 | 1 | 1 | ||
| 28. | pJSC19 | 85 | 24 | 35 | 28 | 2 | 13 | 69 | 68 | 83 | 52 | 2 | 2 | ||
| 29. | pJSC20 | 89 | 74 | 88 | 81 | 65 | 74 | 81 | 88 | 71 | 82 | 1 | 1 | ||
| 30. | pJSC22 | 4 | 4 | 2 | 2 | 8 | 3 | 75 | 69 | 54 | 61 | 2–3 | 2 | ||
| 31. | pJSC23 | 2 | 2 | 12 | 4 | 10 | 2 | 55 | 73 | 41 | 67 | 1 | 1 | ||
| 32. | pJSC24 | 2 | 1 | 12 | 2 | 2 | 1 | 48 | 27 | 42 | 32 | 1 | 1 | ||
| 33. | pJSC25 | 149 | 127 | 104 | 21 | 109 | 64 | 239 | 194 | 195 | 256 | 1 | 1 | ||
| 1. | pJSC28 | 114 | 58 | 16 | 5 | 2 | 3 | 51 | 48 | 34 | 42 | 2–4 | 2–4 | ||
| 2. | pJSC30 | 169 | 20 | 65 | 48 | 1 | 1 | 168 | 128 | 113 | 137 | 1 | 1 | ||
| 3. | pJSC31 | 65 | 30 | 23 | 35 | 10 | 5 | 59 | 48 | 53 | 43 | 2 | 2 | ||
| 4. | pJSC32 | 326 | 33 | 28 | 52 | 8 | 3 | 1351 | 1261 | 1351 | 1261 | 1 | 1 | ||
| 5. | pJSC33 | 239 | 57 | 14 | 68 | 2 | 44 | 42 | 68 | 55 | 73 | 3–5 | 3–5 | ||
| 6. | pJSC34 | 490 | 78 | 19 | 14 | 37 | 3 | 589 | 510 | 465 | 610 | 2 | 2 | ||
| 7. | pJC87 | 386 | 39 | 14 | 9 | 2 | 3 | 314 | 296 | 357 | 260 | 1 | 1 | ||
| 8. | pJC88 | 134 | 87 | 93 | 102 | 4 | 15 | 201 | 174 | 124 | 145 | 1–2 | 1–2 | ||
# The expression for gene fragments tagged with GACA repeat is described in (A) whereas for GATA-tagged ones in (B). Note the highest expression of most of the GACA-tagged and all GATA-tagged genes in testis and/or spermatozoa.
Figure 4Quantitative expression of representative GACA/GATA-tagged transcripts demonstrating variations among somatic/gonadal tissues and spermatozoa. Four types of expressional profiles were uncovered with GACA; some transcripts with highest expression in testis and spermatozoa e.g. Ankyrin repeat domain (a), few in testis only e.g. Ubap1 (b), few in spermatozoa only e.g. novel pJSC3 (c), and others distributed almost uniformly in all the tissues e.g. HBGF-1 (d). Three types of expressional profiles were observed for GATA-tagged transcripts; some showed highest expression both in testis and spermatozoa e.g. novel pJSC34 (e), few in testis only e.g. novel pJSC33 (f), few others in spermatozoa only e.g. novel pJSC32 (g), and others highest in testis and spermatozoa but with minimal variation in comparison to somatic tissues e.g. novel pJSC31 (h). For details, see table 3 and text.
Figure 5Chromosomal mapping for the candidate Ubap1 gene onto the short arm of metacentric chromosome 3 (A) and Ankyrin repeat domain onto the proximal end of the short arm of sub-metacentric chromosome 4 (B). Detailed mapping for these genes with respect to its position on the G-banded ideogram following ISCNDB 2000 is shown in the figure.