| Literature DB >> 19484130 |
Nuno A Fonseca1, Cristina P Vieira, Jorge Vieira.
Abstract
BACKGROUND: Comparative studies using hundreds of sequences can give a detailed picture of the evolution of a given gene family. Nevertheless, retrieving only the sequences of interest from public databases can be difficult, in particular, when working with highly divergent sequences. The difficulty increases substantially when one wants to include in the study sequences from many (or less well studied) species whose genomes are non-annotated or incompletely annotated. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2009 PMID: 19484130 PMCID: PMC2685005 DOI: 10.1371/journal.pone.0005748
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Canonical DLX sequence.
The DLX specific TQTQV motif is highlighted in red.
Annotated flanking genes (number of occurrences in brackets) in the close vicinity of DLX genes.
|
| Annotated flanking genes |
|
|
|
|
|
|
| Teleosteii |
|
|
|
|
|
|
|
| Teleosteii | |
| Teleosteii |
|
|
|
|
|
|
|
Fixed amino acid differences between a given DLX gene and all other DLX genes (excluding fast evolving sequences).
| Gene | Amino acid position | ||||
| 17 | 18 | 23 | 56 | 60 | |
|
| W | ||||
|
| A | Y | |||
|
| Q | Y | |||
|
| H | ||||
|
| I,L,M | ||||
|
| H | ||||
including Teleostei sequences.
sequences from Teleostei.
An H is observed in the Elasmobranchii DLX4 sequence.
Not observed in DLX4 Ornithorhynchus anatinus. Amphibian DLX1 and DLX6 sequences also show a Y at this position.
An H is also observed in Amphibian DLX1 sequences and in the Elasmobranchii DLX4 sequence.
Fixed amino acid differences between DLX genes 1/4/6 and 2/3/5.
|
| Amino acid position | ||
| 11 | 14 | 17 | |
|
| L, | Q | N, H, |
|
| F, Y | A | Q |
| Protostomes | L, | Q | N |
In bold are shown rare (three or less occurrences) amino acid variants.
Figure 2Maximum likelihood phylogenetic trees based on a set of 222 sequences (four NK2 and 218 DLX sequences) aligned using different multiple alignment algorithms.
A) M-Coffee [15]; B) Muscle [16]; C) T-coffee [4]. As suggested by Notredame et al. ([4]), only amino acid aligned positions without gaps and with a score greater than 3 were used. Numbers are the number of times a given cluster is obtained out of 500 replicates. Blue – NK2 gene sequences; Red – Non-bilaterian species; Plum – Mammals; Light orange – Aves; Green – Teleostei fish; Gray 50% - Urochordata; Teal – Protostomes; Gold – Reptiles; Brown – Elasmobranchii; Light green – Hemichordata; Turquoise – Cephalocordata; Pink – Amphibians; Lavender – Hyperoartia. Arrows point to sequences that do not show the region where the characteristic DLX amino acids are located.