| Literature DB >> 28680266 |
Shugang Li1, Yiping Cao1, Fang Geng2.
Abstract
Albumins are the most well-known globular proteins, and the most typical representatives are the serum albumins. However, less attention was paid to the albumin family, except for the human and bovine serum albumin. To characterize the features of albumin family, we have mined all the putative albumin proteins from the available genome sequences. The results showed that albumin is widely distributed in vertebrates, but not present in the bacteria and archaea. The phylogenetic analysis of vertebrate albumin family implied an evolutionary relationship between members of serum albumin, α-fetoprotein, vitamin D-binding protein, and afamin. Meanwhile, a new member from the albumin family was found, namely, extracellular matrix protein 1. The structural analysis revealed that the motifs for forming the internal disulfide bonds are highly conserved in the albumin family, despite the low overall sequence identity across the family. The domain arrangement of albumin proteins indicated that most of vertebrate albumins contain 3 characteristic domains, arising from 2 evolutionary patterns. And a significant trend has been observed that the albumin proteins in higher vertebrate species tend to possess more characteristic domains. This study has provided the fundamental information required for achieving a better understanding of the albumin distribution, phylogenetic relationship, characteristic motif, structure, and new insights into the evolutionary pattern.Entities:
Keywords: Albumin; characteristic motif; evolution; vertebrate
Year: 2017 PMID: 28680266 PMCID: PMC5480655 DOI: 10.1177/1176934317716089
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Summary on albumin distribution in animal species.
| SPECIES NUMBER | PERCENTAGE | AVERAGE GENOMIC NUMBER | |
|---|---|---|---|
| Flatworm | 7 | 0 | − |
| Roundworm | 27 | 0 | − |
| Insects | 73 | 1.37 | + |
| Fishes | 32 | 53.1 | 3.2 |
| Amphibians | 2 | 100 | + |
| Reptiles | 9 | 100 | 3.0 |
| Birds | 56 | 100 | 3.2 |
| Mammals | 84 | 100 | 6.1 |
“−” represents the absence of albumin genetic sequence; “+” means that the limited species are not sufficient to calculate the accurate average genetic number.
Species number: the total number of the species we studied.
Percentage: the percentage of the species possessing albumin genetic sequences in the corresponding category.
Average genomic number: the average number of the albumin sequences in one species genome.
Figure 1Phylogenetic tree of albumins. The inner circle is the phylogenetic tree derived from the consensus sequences of albumins. The upper left legend shows their taxonomic groups with different colors in the branches. The middle circle is the corresponding albumins of different taxonomic groups. The dotted line links the branch with each taxon. The outer numbers indicate the 6 types based on this study; alternating black and red show their ranges in phylogenetic tree. The branch length is indicated as scale bar.
Figure 2Structural features of 6 albumin types (I–VI). The classification of albumins into 6 types is referred Figure 1.
Distribution of characteristic domain count within albumins in different taxonomic groups (%).
| DOMAIN COUNT | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | AVERAGE |
|---|---|---|---|---|---|---|---|---|---|---|---|
| All species | 5.37 | 8.18 | 83.2 | 2.30 | 0.38 | 0.13 | – | – | 0.13 | 0.26 | 2.87 |
| Mammals | 3.14 | 3.73 | 90.0 | 1.76 | 0.59 | 0.20 | – | – | 0.20 | 0.39 | 2.97 |
| Birds | 0.56 | 11.7 | 83.3 | 4.44 | – | – | – | – | – | – | 2.92 |
| Reptiles | 7.41 | 29.6 | 59.3 | 3.70 | – | – | – | – | – | – | 2.59 |
| Amphibians | 30.0 | 20.0 | 50.0 | – | – | – | – | – | – | – | 2.20 |
| Fishes | 35.2 | 25.9 | 38.9 | – | – | – | – | – | – | – | 2.04 |
“–” represents that no albumin genetic sequence contains this number of function regions. The percentage is presented per taxonomic groups.
Figure 3Evolutionary pattern of the albumin characteristic domains. The evolutionary relationship were investigated among albumins with 3 characteristic domains and summarized into 2 types: (A) pattern A, 3 characteristic domains derived from different origins and (B) pattern B, 2 characteristic domains originated from the same sequences. (C) Percentage of the evolution pattern A in albumin types I, II, III, IV, V, and VI, whereas the remaining is for the evolution pattern B. The taxonomic groups were indicated by different colors.