| Literature DB >> 17545200 |
Marco Pagni1, Vassilios Ioannidis, Lorenzo Cerutti, Monique Zahn-Zabal, C Victor Jongeneel, Jörg Hau, Olivier Martin, Dmitri Kuznetsov, Laurent Falquet.
Abstract
The MyHits web site (http://myhits.isb-sib.ch) is an integrated service dedicated to the analysis of protein sequences. Since its first description in 2004, both the user interface and the back end of the server were improved. A number of tools (e.g. MAFFT, Jacop, Dotlet, Jalview, ESTScan) were added or updated to improve the usability of the service. The MySQL schema and its associated API were revamped and the database engine (HitKeeper) was separated from the web interface. This paper summarizes the current status of the server, with an emphasis on the new services.Entities:
Mesh:
Year: 2007 PMID: 17545200 PMCID: PMC1933190 DOI: 10.1093/nar/gkm352
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The different computational services that are integrated in MyHits
| | Fast database searches using a multiple sequence alignment (MSA) as input, with taxonomic filter and clustering of the matched sequences by similarity. | Updated |
| | Slow database searches using an MSA as input, with taxonomic filter and clustering of the matched sequences by similarity. | New |
| | Database searches using a regular expression as input, with taxonomic filter and clustering of the matched sequences if identical. | Updated |
| | Scan for motifs (pattern, profile, HMMs) in a sequence. | Updated |
| | Query on pre-computed hit lists using motifs as input, with taxonomic filter, and AND/OR logic on the motif occurrences. | Updated |
| | Query on pre-computed hit lists using sequences as input. | Updated |
| | Compute MSA. | – |
| | See External links below. | Replaced |
| | Compute MSA for a large number of sequences. | New |
| | Automated protein classification. Allows the detection of discriminant regions of sequences. | New |
| | Translate a DNA sequence into selectable ORFs, with the | New |
| | Detect coding regions in a DNA sequence for a given species. | New |
| | Create postscript and PDF document with the graphical representation of motifs and features of proteins. | New |
| | Graphical representation of a local alignment between a protein sequence and a motif (PSI-Blast checkpoint, generalized profile, HMM). | New |
| | Search for synonyms of an identifier in all databases. | New |
| | View the raw text of a database entry. | – |
| | Compute high quality MSA. | Updated |
| | Compute high quality MSA. | New |
| | Compute high quality MSA using 3D structure information. Pretty slow. | New |
| | Produce a DNA MSA based on a protein MSA (automatically extract DNA sequences from GenBank). | New |
| | Consistency-based evaluation of an existing MSA. | New |
| | Allows the selection of representative members from a list of sequences. | – |
| | Create graphical representation of the motifs and features of a list of proteins. | New |
| | Java Applet displaying a dot plot. Allows printing to a PDF, as well as the interactive selection of similarity regions. | Updated |
| | Visualize motifs and features from a Swiss-Prot entry. | – |
| | Allows the selection of representative members from a list of sequences. | – |
| | Create graphical representation of the motifs and features of a list of proteins, starting from an MSA. | New |
| | Java Applet allowing an MSA to be visualized and edited, outputs a PDF or an MSA. | Updated |
| | External server to create a graphical MSA output for publication. | – |
Note: When available, the reference for the program or algorithm is given, as well as the specificity of the deployment on our server. The status reported in the last column contains the differences relative to the previous publication (1).
Figure 1.PDF generated using the Catalogue factory from an output of Jacop. The data set is the one designated as ‘ARLY’ in (14). A representative member of every four sub-groups of the automatically generated classification was picked for graphical display. The histograms indicate where the regions of homology are located. The color scheme corresponds to the sub-groups, i.e. 1#2,4 means that this region of homology matches with a sufficient score proteins of sub-groups 1#2 and 1#4 exclusively. Annotation from the FT lines of the original Swiss-Prot entries, as well as matches from the pre-computed hit list (FUMARATE_LYASES pattern), are represented; numbers refer to: 1, pat:FUMARATE_LYASES [!]; 2, BINDING substrate (by similarity); 3, REGION substrate binding (by similarity); 4, ACT_SITE Proton donor (by similarity); 5, ACT_SITE Proton acceptor (by similarity); 6, CONFLICT P -> A (in Ref. 1); 7, CONFLICT I -> L (in Ref. 1); 8, CONFLICT Y -> K (in Ref. 3).