| Literature DB >> 27408929 |
Abstract
Analysis of DNA sequences is necessary for higher hierarchical classification of the organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset is chosen to find complexities in the unidentified DNA in the disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. The quick response codes were generated. AT/GC content of the DNA sequences analysis was carried out. The QR is helpful for quick identification of isolates. AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage code and enzyme code studied under the restriction digestion study, which helpful for performing studies using short DNA sequences was reported. The dataset disclosed here is the new revelatory data for exploration of unique DNA sequences for evaluation, identification, comparison and analysis.Entities:
Keywords: BioLABs; Blunt ends; Genomics; NEB cutter; Restriction digestion; Short DNA sequences; Sticky ends
Year: 2016 PMID: 27408929 PMCID: PMC4930343 DOI: 10.1016/j.dib.2016.06.010
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1QR codes of unidentified sequences (AR360580-AR360590, AX000218, AX000220, AX000221, AX000222, AX000224 and AX000225).
Fig. 2GC plot of unidentified sequences (AR360580-AR360590, AX000218, AX000220, AX000221, AX000222, AX000224 and AX000225).
Fig. 3NEB restriction enzyme digestion of unidentified sequences of patents (Accession No.: AR360580-AR360590, AX000218, AX000220, AX000221, AX000222, AX000224 and AX000225).
Unidentified sequences: Genomic analysis and restriction digestion using NEB single cutter restriction enzymes.
| S. N. | Accession number | Name of sequence | Maximum GC% | Average GC% | Average AT% | Number of cleavage code | Number of enzyme code | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Blunt end cut | 5′ extension | 3′ extension | *: cleavage affected by CpG methylation | #: cleavage affected by other methylation | ||||||
| 1 | AR360580 | Sequence 1 from patent US 6596510 | 42 | 37 | 63 | 10 | 10 | 9 | 13 | 4 |
| 2 | AR360581 | Sequence 3 from patent US 6596510 | 38.5 | 22 | 67 | 1 | 7 | – | 3 | – |
| 3 | AR360582 | Sequence 4 from patent US 6596510 | 65 | 51 | 49 | 2 | 12 | 2 | 20 | 7 |
| 4 | AR360583 | Sequence 5 from patent US 6596510 | 45 | 36 | 64 | 11 | 11 | 14 | 17 | 2 |
| 5 | AR360584 | Sequence 7 from patent US 6596510 | 73.37 | 47 | 53 | 3 | 5 | 4 | 6 | – |
| 6 | AR360585 | Sequence 8 from patent US 6596510 | 30 | 27 | 73 | – | 6 | – | 2 | – |
| 7 | AR360586 | Sequence 9 from patent US 6596510 | 55.6 | 51 | 49 | 4 | 20 | 1 | 12 | 5 |
| 8 | AR360587 | Sequence 10 from patent US 6596510 | 58.8 | 51 | 49 | 5 | 18 | 4 | 10 | 6 |
| 9 | AR360588 | Sequence 11 from patent US 6596510 | 54.5 | 41 | 59 | 11 | 15 | 10 | 17 | 2 |
| 10 | AR360589 | Sequence 12 from patent US 6596510 | 44.5 | 38 | 62 | 9 | 12 | 10 | 17 | 4 |
| 11 | AR360590 | Sequence 13 from patent US 6596510 | 54.5 | 41 | 59 | 9 | 19 | 11 | 18 | 1 |
| 12 | AX000218 | Sequence 1 from Patent WO 9906567 | 42 | 37 | 63 | 10 | 11 | 9 | 13 | 4 |
| 13 | AX000220 | Sequence 3 from patent WO 9906567 | 38.5 | 33 | 67 | 1 | 8 | – | 3 | 7 |
| 14 | AX000221 | Sequence 4 from patent WO 9906567 | 65 | 51 | 49 | 2 | 12 | 1 | 21 | 7 |
| 15 | AX000222 | Sequence 5 from patent WO 9906567 | 45 | 36 | 64 | 11 | 23 | 12 | 18 | 2 |
| 16 | AX000224 | Sequence 7 from patent WO 9906567 | 73.3 | 47 | 53 | 3 | 5 | 4 | 6 | – |
| 17 | AX000225 | Sequence 8 from patent WO 9906567 | 30 | 27 | 73 | – | 6 | – | 2 | – |
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | School of Life Sciences, S. R. T. M. University, Nanded, India |
| Data accessibility | Data available within article and via the |