| Literature DB >> 23203982 |
Nikita S Vassetzky1, Dmitri A Kramerov.
Abstract
SINEBase (http://sines.eimb.ru) integrates the revisited body of knowledge about short interspersed elements (SINEs). A set of formal definitions concerning SINEs was introduced. All available sequence data were screened through these definitions and the genetic elements misidentified as SINEs were discarded. As a result, 175 SINE families have been recognized in animals, flowering plants and green algae. These families were classified by the modular structure of their nucleotide sequences and the frequencies of different patterns were evaluated. These data formed the basis for the database of SINEs. The SINEBase website can be used in two ways: first, to explore the database of SINE families, and second, to analyse candidate SINE sequences using specifically developed tools. This article presents an overview of the database and the process of SINE identification and analysis.Entities:
Mesh:
Year: 2012 PMID: 23203982 PMCID: PMC3531059 DOI: 10.1093/nar/gks1263
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Length distribution for 175 eukaryotic SINE families (without tail). The range is from 75 to 662 nt with the mean and median length of 253 and 236 nt, respectively.
Figure 2.Occurrence of different SINE structures. Complex SINEs are highlighted by shades of purple; brown and yellow sectors represent 7SL RNA- and 5S rRNA-derived SINEs, respectively; tRNA-derived are shown in shades of green (dark and light hues correspond to SINEs with and without LINE-derived region, respectively). Dotted sectors indicate SINEs with CORE domains; schematic SINE structures (as in SINETable/Supplementary Table S1) are shown next to or over the corresponding sectors. Percentage is the fraction of a structure among 175 SINE families, and the number in parentheses is the number of SINE families representing the structure.
Figure 3.Conservation of tRNA-derived sequences in SINEs. Sequence logos of (A) 175 tRNA-derived sequences in SINEs (including those in the second and third monomers of complex SINEs) and (B) 359 human tRNA genes (10) were generated by WebLogo 3.1 (13). The multiple alignments were edited to eliminate gaps in the logos. The original alignments of tRNAs and tRNA-derived sequences in SINEs are available in Supplementary Alignments S4.