| Literature DB >> 15239844 |
Luis Sánchez-Pulido1, Ana M Rojas, Karel H van Wely, Carlos Martinez-A, Alfonso Valencia.
Abstract
BACKGROUND: The Split ends (Spen) family are large proteins characterised by N-terminal RNA recognition motifs (RRMs) and a conserved SPOC (Spen paralog and ortholog C-terminal) domain. The aim of this study is to characterize the family at the sequence level.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15239844 PMCID: PMC481055 DOI: 10.1186/1471-2105-5-91
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Representative multiple alignment of the SPOC domain. The colouring scheme indicates average BLOSUM62 score (correlated to amino acid conservation) in each alignment column: cyan (greater than 3), light red (between 3 and 1.5) and light green (between 1.5 and 0.5). The limits of the domains are indicated by the residue positions on each side. The limits of proteins from partially sequenced genes whose full-length proteins are not available are not shown. X-Ray determined structure of the SPOC domain [9], pdb-code: 1OW1 is shown below the SHARP sequence (Swissprot-ID:MINT_HUMAN). PHD secondary structure prediction [29] for DIO family is shown below the DIO-1 human sequence (Swissprot-Id: DAT1_HUMAN), with E indicating a β strand (in red) and H an α helix (in green). The asterisks below the alignment marks the conserved pair arginine-tyrosine mentioned in the text. The sequences are named with their swissprot or sptrembl identifications, and also, if necessary, with their common species name: Human, Homo sapiens; Frog, Xenopus laevis; Drome, Drosophila melanogaster; Caeel, Caenorhabditis elegans; Arath,Arabidopsis thaliana; Ciona, Ciona intestinalis; Yeast, Saccharomyces cerevisiae; Fish, Brachydanio rerio; Plafal, Plasmodium falciparum; Aspnidu, Aspergillus nidulans; Pinus, Pinus taeda; Glycine, Glycine max; Schpo, Schizosaccharomyces pombe; Triti, Triticum aestivum. The "est" prefix identifies consensus sequences manually reconstructed by assembling highly similar expressed sequence tags from identical species (conceptual translations). The "unf" prefix identifies sequences obtained from Genome BLAST server at NCBI [24]. Complementary information is accessible at: .
Figure 2Schematic representation of the domain architecture and common features in representative members of the SPOC domain contained families. The sequences are named with their swissprot or sptrembl identifications, and also, if necessary, with their common species name. The slashes represent inserts that are not shown. The proteins are drawn approximately to scale. The localization of the other domains PHD, TFIIS, BRK (BRM and KIS domain) and RRM is according to Pfam and SMART families databases [35,36].
Figure 3Comparison of SPOC domains from SHARP and DIO proteins. (A) Ribbon representation and electrostatic surface potential map of structure of the SPOC domain of SHARP protein (PDB code: 1OW1). (B) Ribbon representation and electrostatic surface potential map of homology model of the SPOC domain of DIO protein. Blue indicates positively charged regions, whereas red shows negatively charged regions. All the molecules are in the same orientation. Dotted circles indicates the conserved basic cluster, where is located the conserved arginine-tyrosine pair.