Chinta Someswara Rao, S Viswanadha Raju.
Abstract
Next generation sequencing (NGS) technologies have been rapidly adopted in biomedical and biological research in recent years. To provide a comprehensive NGS resource for this research, in this paper we consider 10 loci/codi/repeats: TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA. We developed the NGS Tandem Repeat Database (TandemRepeatDB), covering these loci on all chromosomes of the Homo sapiens, Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla, Macaca fascicularis, Macaca mulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelii genome data sets. We find the successive occurrence frequency of all 10 SSRs (simple sequence repeats) in these genome data sets on a chromosome-by-chromosome basis using multiple pattern 2° shaft multicore string matching.
Keywords: Genome; NGS; SSR; String matching; TandemRepeatDB; chromosomes
Year: 2016 PMID: 26981434 PMCID: PMC4778683 DOI: 10.1016/j.gdata.2016.01.015
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
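The "successive occurrence" count described in the abstract can be illustrated with a short sketch. This is not the authors' multicore multiple-pattern matcher; it is a plain single-threaded, greedy left-to-right scan for one motif. The function name `find_runs`, the 0-based positions and the `min_copies` parameter are assumptions made for illustration.

```python
from typing import List, Tuple


def find_runs(sequence: str, motif: str, min_copies: int = 2) -> List[Tuple[int, int]]:
    """Return (start_position, copy_count) for each tandem run of `motif`.

    Positions are 0-based. Runs with fewer than `min_copies` back-to-back
    copies are skipped.
    """
    runs = []
    k = len(motif)
    n = len(sequence)
    i = 0
    while i <= n - k:
        if sequence.startswith(motif, i):
            start = i
            copies = 0
            # Extend the run while back-to-back copies of the motif continue.
            while sequence.startswith(motif, i):
                copies += 1
                i += k
            if copies >= min_copies:
                runs.append((start, copies))
        else:
            i += 1
    return runs
```

For example, `find_runs("CCTAGATAGATAGACC", "TAGA")` returns `[(2, 3)]`: one run of three successive TAGA copies starting at offset 2.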
Fig. 1. Architecture of TandemRepeatDB.
Table Structure.
| Field | Type |
|---|---|
| sample_id | text |
| sample_name | text |
| sample_chromosome_name | text |
| position | int(10) |
| noofoccurences | int(10) |
| codi | text |
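As a rough illustration of how the six fields above could be stored and queried, here is a minimal sketch using Python's built-in `sqlite3` module. The table name `tandem_repeat`, the helper function names and the use of SQLite are assumptions; the column names come from the table structure above, with SQLite `TEXT`/`INTEGER` standing in for the listed `text` and `int(10)` types.

```python
import sqlite3

# Hypothetical table name; column names follow the table structure above.
# SQLite TEXT/INTEGER stand in for the listed text / int(10) types.
SCHEMA = """
CREATE TABLE IF NOT EXISTS tandem_repeat (
    sample_id              TEXT,
    sample_name            TEXT,
    sample_chromosome_name TEXT,
    position               INTEGER,
    noofoccurences         INTEGER,
    codi                   TEXT
)
"""


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def insert_run(conn, sample_id, sample_name, chromosome, position, copies, codi):
    """Store one tandem-repeat run as a single row."""
    conn.execute(
        "INSERT INTO tandem_repeat VALUES (?, ?, ?, ?, ?, ?)",
        (sample_id, sample_name, chromosome, position, copies, codi),
    )


def longest_runs(conn, sample_name):
    """Return {codi: largest noofoccurences} for one genome."""
    rows = conn.execute(
        "SELECT codi, MAX(noofoccurences) FROM tandem_repeat "
        "WHERE sample_name = ? GROUP BY codi",
        (sample_name,),
    )
    return dict(rows.fetchall())
```

Under these assumptions, each detected run becomes one row, and a per-genome query grouped by `codi` yields the longest run for each repeat.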
Genome sequences used in the study.
| Genome sequence name | Name & number of chromosomes | Total number of tandem repeats extracted (> 1) |
|---|---|---|
| Homo sapiens | 1 to 22, MT, X, Y and Un (26) | 11,99,985 |
| Callithrix jacchus | 1 to 22, X, Y and Un (25) | 11,40,529 |
| Chlorocebus sabaeus | 1 to 29, MT, X, Y and Un (33) | 11,13,445 |
| Gorilla gorilla | 1, 2A, 2B, 3 to 22, MT, X and Un (26) | 11,63,843 |
| Macaca fascicularis | 1 to 20, MT, X and Un (23) | 12,31,029 |
| Macaca mulatta | 1 to 20, MT, X and Un (23) | 12,74,556 |
| Nomascus leucogenys | 1a, 2 to 6, 7b, 8 to 21, 22a, 23 to 25, X and Un (27) | 11,71,594 |
| Pan troglodytes | 1, 2A, 2B, 3 to 22, MT, X, Y and Un (27) | 12,76,766 |
| Papio anubis | 1 to 20, MT, X and Un (23) | 13,51,393 |
| Pongo abelii | 1, 2A, 2B, 3 to 22, MT, X and Un (26) | 13,81,887 |
| Total: 10 | 259 | |
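The chromosome-by-chromosome extraction behind the counts above might look roughly like the following sketch. It assumes FASTA input with one record per chromosome and reads the "(> 1)" column as "runs of at least two successive copies"; both are assumptions, as are the names `read_fasta` and `count_runs_per_chromosome`. Each motif is scanned independently because several of them (for example TAGA, AGAT and GATA) are rotations of one another and their runs overlap. The multicore aspect of the original method is not reproduced here.

```python
import re
from typing import Dict, Iterator, TextIO, Tuple

# The ten repeats listed in the abstract.
MOTIFS = ["TAGA", "TCAT", "GAAT", "AGAT", "AGAA",
          "GATA", "TATC", "CTTT", "TCTG", "TCTA"]

# One regex per motif: two or more back-to-back copies.
RUN_PATTERNS = {m: re.compile("(?:%s){2,}" % m) for m in MOTIFS}


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (record_name, sequence) pairs from a FASTA text stream."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the first word of the header as the record name.
            name, chunks = (line[1:].split() or [""])[0], []
        elif line:
            chunks.append(line.upper())
    if name is not None:
        yield name, "".join(chunks)


def count_runs_per_chromosome(handle: TextIO) -> Dict[str, Dict[str, int]]:
    """Count tandem runs (>= 2 copies) of every motif in every record."""
    counts = {}
    for chrom, seq in read_fasta(handle):
        counts[chrom] = {
            motif: sum(1 for _ in pattern.finditer(seq))
            for motif, pattern in RUN_PATTERNS.items()
        }
    return counts
```

Summing the per-chromosome counts over all motifs and records would give one genome-level total under these assumptions.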
Tandem repeat successive occurrences for all chromosomes of H.sapiens.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 21 | Once |
| AGAA | 42 | Twice |
| GATA | 22 | Once |
| TCTA | 25 | Once |
| TCAT | 12 | Twice |
| GAAT | 12 | Once |
| AGAT | 21 | Once |
| CTTT | 78 | Once |
| TATC | 25 | Once |
| TCTG | 12 | Four Times |
Fig. 2. Max number of successive occurrences of all repeats for all chromosomes of H.sapiens.
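The two result columns in these per-genome tables (the maximum run length and how many times that maximum appears) can be derived from a flat list of detected runs. The sketch below assumes each run is available as a `(repeat, copies)` pair; the function name `summarize_max_runs` is hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def summarize_max_runs(runs: Iterable[Tuple[str, int]]) -> Dict[str, Tuple[int, int]]:
    """Map each repeat to (max copies, number of runs reaching that max)."""
    per_motif: Dict[str, Counter] = {}
    for motif, copies in runs:
        # Histogram of run lengths for this repeat.
        per_motif.setdefault(motif, Counter())[copies] += 1
    summary = {}
    for motif, hist in per_motif.items():
        longest = max(hist)
        summary[motif] = (longest, hist[longest])
    return summary
```

With illustrative input such as `[("TCAT", 12), ("TCAT", 5), ("TCAT", 12)]`, the result is `{"TCAT": (12, 2)}`, which corresponds to a table row reading "12, Twice".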
Tandem repeat successive occurrences for all chromosomes of C.jacchus.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 21 | Once |
| AGAA | 57 | Once |
| GATA | 20 | Once |
| TCTA | 18 | Twice |
| TCAT | 14 | Once |
| GAAT | 13 | Once |
| AGAT | 21 | Once |
| CTTT | 51 | Once |
| TATC | 18 | Thrice |
| TCTG | 14 | Once |
Fig. 3. Max number of successive occurrences of all repeats for all chromosomes of C.jacchus.
Tandem repeat successive occurrences for all chromosomes of C.sabaeus.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 19 | Twice |
| AGAA | 54 | Once |
| GATA | 20 | Once |
| TCTA | 20 | Twice |
| TCAT | 14 | Once |
| GAAT | 14 | Once |
| AGAT | 20 | Once |
| CTTT | 42 | Once |
| TATC | 20 | Once |
| TCTG | 14 | Once |
Fig. 4. Max number of successive occurrences of all repeats for all chromosomes of C.sabaeus.
Tandem repeat successive occurrences for all chromosomes of G.gorilla.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 19 | Thrice |
| AGAA | 41 | Once |
| GATA | 20 | Once |
| TCTA | 26 | Once |
| TCAT | 14 | Once |
| GAAT | 12 | Ten times |
| AGAT | 20 | Twice |
| CTTT | 66 | Once |
| TATC | 26 | Once |
| TCTG | 16 | Once |
Fig. 5. Max number of successive occurrences of all repeats for all chromosomes of G.gorilla.
Tandem repeat successive occurrences for all chromosomes of M.fascicularis.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 29 | Once |
| AGAA | 218 | Once |
| GATA | 29 | Once |
| TCTA | 33 | Once |
| TCAT | 19 | Once |
| GAAT | 14 | Thrice |
| AGAT | 28 | Once |
| CTTT | 221 | Once |
| TATC | 33 | Once |
| TCTG | 16 | Once |
Fig. 6. Max number of successive occurrences of all repeats for all chromosomes of M.fascicularis.
Tandem repeat successive occurrences for all chromosomes of M.mulatta.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 31 | Once |
| AGAA | 84 | Once |
| GATA | 31 | Once |
| TCTA | 21 | Twice |
| TCAT | 19 | Once |
| GAAT | 15 | Once |
| AGAT | 31 | Once |
| CTTT | 79 | Once |
| TATC | 21 | Once |
| TCTG | 12 | Twice |
Fig. 7. Max number of successive occurrences of all repeats for all chromosomes of M.mulatta.
Tandem repeat successive occurrences for all chromosomes of N.leucogenys.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 17 | Thrice |
| AGAA | 52 | Once |
| GATA | 17 | Once |
| TCTA | 22 | Once |
| TCAT | 12 | Once |
| GAAT | 11 | Once |
| AGAT | 17 | Once |
| CTTT | 33 | Once |
| TATC | 23 | Once |
| TCTG | 13 | Once |
Fig. 8. Max number of successive occurrences of all repeats for all chromosomes of N.leucogenys.
Tandem repeat successive occurrences for all chromosomes of P.troglodytes.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 17 | Four Times |
| AGAA | 43 | Once |
| GATA | 18 | Twice |
| TCTA | 18 | Once |
| TCAT | 10 | Five Times |
| GAAT | 11 | Once |
| AGAT | 18 | Once |
| CTTT | 30 | Once |
| TATC | 19 | Once |
| TCTG | 13 | Once |
Fig. 9. Max number of successive occurrences of all repeats for all chromosomes of P.troglodytes.
Tandem repeat successive occurrences for all chromosomes of P.anubis.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 31 | Once |
| AGAA | 54 | Once |
| GATA | 31 | Once |
| TCTA | 22 | Once |
| TCAT | 15 | Once |
| GAAT | 14 | Once |
| AGAT | 32 | Once |
| CTTT | 47 | Twice |
| TATC | 21 | Twice |
| TCTG | 15 | Once |
Fig. 10. Max number of successive occurrences of all repeats for all chromosomes of P.anubis.
Tandem repeat successive occurrences for all chromosomes of P.abelii.
| codi/Repeat name | MAX number of codi in the successive occurrences | Number of times the MAX number appeared |
|---|---|---|
| TAGA | 20 | Once |
| AGAA | 37 | Once |
| GATA | 19 | Once |
| TCTA | 18 | Twice |
| TCAT | 12 | Once |
| GAAT | 13 | Once |
| AGAT | 20 | Once |
| CTTT | 63 | Once |
| TATC | 19 | Once |
| TCTG | 11 | Twice |
Fig. 11. Max number of successive occurrences of all repeats for all chromosomes of P.abelii.