| Literature DB >> 28516027 |
Shaheena Anwar1, Mohammed Perwaiz Iqbal2, Shamshad Zarina3, Zulfiqar A Bhutta1.
Abstract
With so many diverse functions such as transporter of vitamin D metabolites and fatty acids, actin scavenger and macrophage activating factor, Gc must have been one of the most conserved proteins in animal kingdom. Our objective was to investigate the evolution of Gc by analyzing its differences at protein level. Using BLAST (Basic Local Alignment Search Tool) searches, Gc amino acid sequences were analyzed for homology. Clustal W2 and Jalview were used for multiple sequence alignment analysis, phylogenetic tree by PhyML 3.0 while Batch Web CD-Search Tool was used for identification for conserved domains within protein sequences. Gc protein percent identity between human and rabbit was 83%, which decreased to 81% with cow, 78% with mouse, 76% with rat, 51% with chicken, 41% with frog and 28% with zebrafish. Phylogram showed that rat Gc was the most diverged, while chicken Gc was the most conserved protein. Analysis also indicated high homology among mammals (human, rabbit, cow, rat, and mouse). Gc is a highly conserved protein in chicken and zebrafish. However, the distance from ancestral protein gradually increased in amphibian (frog) and mammals (human, rabbit, cow, rat, and mouse). Human Gc and rabbit Gc appear to be recently evolved proteins. There appears to be an interesting evolutionary pattern- chicken Gc has the least distance from the ancestral protein, while rat Gc is the most diverged. There is no vertebrate devoid of Gc which is suggestive of its important role in vitamin D metabolism in vertebrates.Entities:
Keywords: Gc; Gc evolution; albumin superfamily; animal kingdom; conserved regions; phylogenetic tree; vitamin D-binding protein
Year: 2014 PMID: 28516027 PMCID: PMC5424798 DOI: 10.4161/idp.27450
Source DB: PubMed Journal: Intrinsically Disord Proteins ISSN: 2169-0707
Table 1. Gc protein precursor accession numbers and lengths of the eukaryotic organisms
| | | |
| Human | NP_000574.2 | 474 |
| Cow | NP_001030457.1 | 474 |
| Mouse | NP_032122.1 | 476 |
| Chicken | NP_990213.1 | 484 |
| Frog | NP_001015745.1 | 482 |
| Zebrafish | NP_001002568.1 | 464 |
| Rat | AAA41080.1 | 476 |
| Rabbit | BAA06137.1 | 476 |
amino acids
Table 2. Pairwise comparison of protein precursors in the selected vertebrate species
| | | |
| Human | Rabbit | 83 |
| Human | Cow | 81 |
| Human | Mouse | 78 |
| Human | Rat | 76 |
| Human | Chicken | 51 |
| Human | Frog | 41 |
| Human | Zebrafish | 28 |
| Cow | Rabbit | 77 |
| Cow | Mouse | 74 |
| Cow | Rat | 72 |
| Cow | Chicken | 51 |
| Cow | Frog | 41 |
| Cow | Zebrafish | 28 |
| Mouse | Rat | 91 |
| Mouse | Rabbit | 76 |
| Mouse | Chicken | 50 |
| Mouse | Frog | 42 |
| Mouse | Zebrafish | 26 |
| Chicken | Rat | 51 |
| Chicken | Rabbit | 49 |
| Chicken | Frog | 41 |
| Chicken | Zebrafish | 30 |
| Frog | Rabbit | 42 |
| Frog | Rat | 41 |
| Frog | Zebrafish | 27 |
| Rat | Rabbit | 73 |
| Rat | Zebrafish | 26 |
| Rabbit | Zebrafish | 28 |

Figure 1. Clustal W2 sequence alignment of vertebrate Gc amino acid. Single fully conserved residues are denoted by asterisk (*). Conservative change in amino acid is denoted by semi colon (:). A neutral change is denoted by dot (.). Dash (-) represent non homologous segments. The numbers on the right represent the amino acid positions.

Figure 2. (A) Jalview sequence alignment of Gc amino acids.The residues are colored by default settings of Clustal X where a minimum percentage of single residues or a combination of residues must be achieved for a color to be applied. The position of amino acids is specified in the top. The vertebrates are indicated on the left. (B) Protein consensus and conserved sequences in Jalview. The amino acid property conservation defines the measurement of conservation of physiochemical properties in a column. Any change in amino acid homology is calculated as observed substitution by BLOSUM62 program in Jalview and denoted by golden bars. The consensus column defines the most common residues for each column of the alignment. The conserved amino acids are indicated in the bottom line. The black boxes are a visual summary of the degree of conservation in Gc protein which appears to be quite high for some regions.

Figure 3. Phylogenetic tree of Gc amino acids by PhyML 3.0

Figure 4. (A) Conserved domains of Gc protein precursor of human, rabbit, cow, rat, mouse, chicken, and frog. The triangles represent the amino acids of conserved domains in albumin superfamily. The vitamin D binding motif is denoted by a box at the end of the sequence. The positions of the binding sites are specified numerically at the top. (B) Conserved domains of Gc protein of zebrafish. The vitamin D binding site is absent in zebrafish. The albumin superfamily binding sites are indicated by triangles. The blue colored region was not included in database search because it was recognized as biased region by the software.