| Literature DB >> 18205906 |
Botond Sipos1, Kálmán Somogyi, István Andó, Zsolt Pénzes.
Abstract
BACKGROUND: The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models) have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading.Entities:
Mesh:
Year: 2008 PMID: 18205906 PMCID: PMC2267704 DOI: 10.1186/1471-2105-9-27
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Pairwise repeat homology diagrams of selected Tenascin proteins (SVG outputs). The diagrams were produced with the default PhyML parameters of the script (WAG+Γ+I, 4 gamma categories, gamma parameter and proportion of invariable sites estimated by ML, BIONJ starting tree) and with colour gradient paramater w set to 2. The repeats (EGF on A, B and C; Fn-III on A', B' and C') are indicated by red rectangles in the protein sequence schemes (N-termini on top). The identified homology relations are represented by lines or arcs connecting the respective repeats (blue lines between repeats of different protein sequences (orthology), and brown arcs in case of internal relations (paralogy)). The colour scale bar demonstrates the colour intensities as a function of the patristic distance between the respective clades divided by the total tree length. The lines represent the values between 0 and 1 by units of 0.1. The sequence positions are shown on the right of each scheme. Abbreviations: TCH: Homo sapiens Tenascin C; TXH: Homo sapiens Tenascin X and TXM: Mus musculus Tenascin X.
Figure 2Phylogenetic tree of Fn-III repeats of Tenascin X proteins of . Maximum Likelihood tree produced during the analysis of human versus murine Tenascin X sequences with Fn-III profile HMM (diagram on Figure 1C'). Repeats are identified by starting and ending positions. Abbreviations: see Figure 1.