| Literature DB >> 26246889 |
James Chun-I Lee1, Bill Tseng1, Bing-Ching Ho2, Adrian Linacre3.
Abstract
BACKGROUND: Whole-genome sequencing is performed routinely as a means to identify polymorphic genetic loci such as short tandem repeat loci. We have developed a simple tool, called pSTR Finder, which is freely available as a means of identifying putative polymorphic short tandem repeat (STR) loci from data generated from genome-wide sequences. The program performs cross comparisons on the STR sequences generated using the Tandem Repeats Finder based on multiple-genome samples in a FASTA format. These comparisons generate reports listing identical, polymorphic, and different STR loci when comparing two samples.Entities:
Keywords: Bioinformatics; FASTA; Forensic; MASSIVE parallel sequencing; STR; TRF; Whole-genome sequences
Year: 2015 PMID: 26246889 PMCID: PMC4525727 DOI: 10.1186/s13323-015-0027-x
Source DB: PubMed Journal: Investig Genet ISSN: 2041-2223
Summary results of the number of identical, polymorphic and different STR loci among four samples after searching using pSTR
| Samples | AC_000155.1 | CM000685.1 | NC_018934.2 | CM000274.1 |
|---|---|---|---|---|
| AC_000155.1 | 7034 (4654) | 6716 (4096) | 8592 (2906) | |
| CM000685.1 | 4935 (2197) | 10720 (2790) | 10930 (3017) | |
| NC_018934.2 | 4807 (3073) | 3113 (2109) | 8655 (3546) | |
| CM000274.1 | 5033 (2387) | 2676 (2584) | 4330 (3418) |
Figures in the upper quadrant indicate the number of identical STR loci with the total number of polymorphic STR loci in brackets. The figures in the lower quadrant indicate the number of different STR loci, and the numbers in brackets indicate the number of different STR loci after switching the ‘source sample’ with the ‘target sample’. Using AC_000155.1 as the reference data and comparing the other data, there are 5443 identical STR loci, 4305 polymorphic STR loci and 0 unique STR loci (please see Additional file 1 for further information)
Summary results of the number of identical, polymorphic and different STR loci among five samples after searching using pSTR with 10 bp flanking sequences used as a reliability test
| Samples | AC_000155.1 | AC_000155.1a | AC_000155.1b | AC_000155.1c | AC_000155.1d |
|---|---|---|---|---|---|
| AC_000155.1 | 10856 (3) | 11137 (9) | 10440 (4) | 10793 (12) | |
| AC_000155.1a | 3026 (0) | 8108 (12) | 9395 (4) | 8734 (13) | |
| AC_000155.1b | 2739 (0) | 3026 (2739) | 7692 (13) | 8045 (21) | |
| AC_000155.1c | 3441 (0) | 1460 (1045) | 3441 (2739) | 7348 (16) | |
| AC_000155.1d | 3080 (0) | 2112 (2058) | 3080 (2739) | 3441 (3080) |
Figures in the upper quadrant indicate the number of identical STR loci with the total number of polymorphic STR loci in brackets. The figures in the lower quadrant indicate the number of different STR loci and the numbers in brackets indicate the number of different STR loci after switching the ‘source sample’ with the ‘target sample’
Summary results of the number of identical, polymorphic and different STR loci among five samples after searching using pSTR with 100 bp flanking sequences used as a reliability test
| Samples | AC_000155.1 | AC_000155.1a | AC_000155.1b | AC_000155.1c | AC_000155.1d |
|---|---|---|---|---|---|
| AC_000155.1 | 10927 (0) | 11211 (0) | 10502 (0) | 10861 (0) | |
| AC_000155.1a | 3047 (0) | 8164 (0) | 9454 (0) | 8785 (0) | |
| AC_000155.1b | 2763 (1) | 3048 (2763) | 7739 (0) | 8098 (0) | |
| AC_000155.1c | 3472 (0) | 1473 (1048) | 3473 (2763) | 7389 (0) | |
| AC_000155.1d | 3113 (0) | 2142 (2076) | 3114 (2763) | 3472 (3113) |
Figures in the upper quadrant indicate the number of identical STR loci with the total number of polymorphic STR loci in brackets. The figures in the lower quadrant indicate the number of different STR loci and the numbers in brackets indicate the number of different STR loci after switching the ‘source sample’ with the ‘target sample’