| Literature DB >> 20678256 |
Alireza G Senejani1, Joann B Sweasy.
Abstract
Escherichia coli (E. coli) are commonly used as hosts for DNA cloning and sequencing. Upon transformation of E. coli with recombined vector carrying a gene of interest, the bacteria multiply the gene of interest while maintaining the integrity of its content. During the subcloning of a mouse genomic fragment into a plasmid vector, we noticed that the size of the insert increased significantly upon replication in E. coli. The sequence of the insert was determined and found to contain a novel DNA sequence within the mouse genomic insert. A BLAST search of GenBank revealed the novel sequence to be that of the Insertion Sequence 2 (IS2) element from E. coli that was likely inserted during replication in that organism. Importantly, a detailed search of GenBank shows that the IS2 is present within many eukaryotic nucleotide sequences, and in many cases, has been annotated as being part of the protein. The results of this study suggest that one must perform additional careful analysis of the sequence results using BLAST comparisons, and further verification of gene annotation before submission into the GenBank.Entities:
Year: 2010 PMID: 20678256 PMCID: PMC2907563 DOI: 10.1186/2041-9414-1-2
Source DB: PubMed Journal: Genome Integr ISSN: 2041-9414
Figure 1Schematic diagram illustrating the cloning process that lead to detection of an extra DNA element integrated into the insert during the process. A PCR fragment, amplified from the gene of interest, with a size of about 4.5 kb, was inserted into a plasmid. The recombinant plasmid was then transformed into E. coli. The PCR was performed using the same sets of primers and extracted recombinant plasmids as a template. The resulting PCR fragment appeared to be about 5.8 kb long. This indicates the presence of extra DNA inside the insert. Further multiple restriction digestion analyses and sequencing confirmed the presence of the extra 1.3 kb DNA fragment within the insert.
Figure 2Taxonomy BLAST reports of species submitted into the GenBank that contain the bacterial insertion element IS2 [1,8]. The IS2 elements in each of these organisms is nearly identical. The numbers indicate how often the insertion element IS2 was found in the BLAST hitlist.
List of some selected genes that contain the IS2 element.
| Gene name and ID | Locationa | Surrounding IS2 sequencesb |
|---|---|---|
| 3742928-3744271 | ||
| dbj| | 27366872-27368214 11718374-11719715 13525525-13524204 | |
| emb| | 18027-19368 | |
| gb| | 20138-18796 | |
| gb| | 72232-70890 | |
| gb| | 69921-68580 | |
| gb| | 169869-171210 | |
| gb| | 146766-145426 | |
| gb| | 99933-98589 | |
| gb| | 76175-74834 | |
| gb| | 155658-154318 | |
| gb| | 31412-30070 | |
| gb| | 178002-176660 | |
| dbj| | 2374-3716 | |
| dbj| | 869-2210 | |
| gb| | 1180-2523 | |
| dbj| | 3369-2028 | |
| gb| | 90423-91765 |
aThe location and the duplicated nucleotides bsurrounding IS2 sequences are shown. The underlined sequences are the first and last triplet nucleotides of the IS2 element.