| Literature DB >> 24970121 |
Roger S Holmes1, Ujjwal K Rout2.
Abstract
Intregins are heterodimeric α- and β-subunit containing membrane receptor proteins which serve various cell adhesion roles in tissue repair, hemostasis, immune response, embryogenesis and metastasis. At least 18 α- (ITA or ITGA) and 8 β-integrin subunits (ITB or ITGB) are encoded on mammalian genomes. Comparative ITB amino acid sequences and protein structures and ITB gene locations were examined using data from several vertebrate genome projects. Vertebrate ITB genes usually contained 13-16 coding exons and encoded protein subunits with ~800 amino acids, whereas vertebrate ITB4 genes contained 36-39 coding exons and encoded larger proteins with ~1800 amino acids. The ITB sequences exhibited several conserved domains including signal peptide, extracellular β-integrin, β-tail domain and integrin β-cytoplasmic domains. Sequence alignments of the integrin β-cytoplasmic domains revealed highly conserved regions possibly for performing essential functions and its maintenance during vertebrate evolution. With the exception of the human ITB8 sequence, the other ITB sequences shared a predicted 19 residue α-helix for this region. Potential sites for regulating human ITB gene expression were identified which included CpG islands, transcription factor binding sites and microRNA binding sites within the 3'-UTR of human ITB genes. Phylogenetic analyses examined the relationships of vertebrate beta-integrin genes which were consistent with four major groups: 1: ITB1, ITB2, ITB7; 2: ITB3, ITB5, ITB6; 3: ITB4; and 4: ITB8 and a common evolutionary origin from an ancestral gene, prior to the appearance of fish during vertebrate evolution. The phylogenetic analyses revealed that ITB4 is the most likely primordial form of the vertebrate β integrin subunit encoding genes, that is the only β subunit expressed as a constituent of the sole integrin receptor 'α6β4' in the hemidesmosomes of unicellular organisms.Entities:
Year: 2011 PMID: 24970121 PMCID: PMC4030831 DOI: 10.3390/biom1010003
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Vertebrate beta integrin and nematode beta integrin-like genes and proteins. RefSeq: the reference amino acid sequence; 1,3 predicted Ensembl amino acid sequence; 2 not available; 4 Contig refers to a DNA scaffold for sequencing analyses; GenBank IDs are derived NCBI http://www.ncbi.nlm.nih.gov/genbank/; Ensembl ID was derived from Ensembl genome database http://www.ensembl.org/; UNIPROT refers to UniprotKB/Swiss-Prot IDs for individual acid lipases (see http://kr.expasy.org/); bps refers to base pairs of nucleotide sequences; pI refers to theoretical isoelectric points; the number of coding exons are listed.
| Human | 10:33,190,501-33,224,486 | 35.2 | 16 (−) | 88,415 | 798 | NM_002211 | P05556 | |||
| Mouse | 8:131,209,552-131,257,438 | 47.9 | 15 (+) | 88,231 | 798 | NM_010578 | P09055 | |||
| Horse | 29:5,050,336-5,076,608 | 26.3 | 15 (+) | 88,202 | 798 | XP_0014927152 | na | |||
| Opossum | 8:240,074,436-240,104,533 | 30.1 | 15 (+) | 88,329 | 799 | XP_0013665672 | na | |||
| Chicken | 2:13,977,440-14,000,322 | 22.9 | 15 (+) | 88,554 | 803 | NP_001034343 | P07228 | |||
| Frog | 5035:452,582-465,188 | 12.6 | 15 (+) | 88,083 | 798 | NP_989160 | Q6P4X1 | |||
| Zebrafish | 24:1,010,009-1,028,526 | 18.5 | 15 (−) | 88,592 | 798 | NP_001030143 | Q3YAA1 | |||
| Zebrafish | 2:42,692,236-42,708,345 | 16.1 | 15 (+) | 86,570 | 787 | NP_001030151 | Q3YA99 | |||
| Human | 21:46,306,286-46,330,697 | 40.0 | 15 (−) | 84,782 | 769 | NM_001127491 | P05107 | |||
| Mouse | 10:76,993,093-77,028,419 | 35.3 | 15 (+) | 85,026 | 771 | NM_008404 | P11835 | |||
| Mouse | 16:96,643,905-96,665,221 | 21.3 | 15 (−) | 81,547 | 738 | NM_008405 | Q3UV74 | |||
| Horse | 26:39,992,918-40,009,865 | 16.9 | 15 (−) | 85,290 | 770 | XP_0014900522 | na | |||
| Horse | 26:40,054,164-40,069,857 | 15.7 | 14 (−) | 79,824 | 726 | chr26.199.13 | na | |||
| Opossum | 2:539,079,162-539,090,540 | 11.4 | 13 (−) | 83,015 | 761 | chr2.11.580.a3 | na | |||
| Chicken | 7:7,143,899-7,150,094 | 6.2 | 15 (−) | 85,409 | 772 | NP_990582 | na | |||
| Frog | 2185:1,356,257-1,405,734 | 49.5 | 16 (−) | 86,980 | 782 | XP_0029365702 | na | |||
| Zebrafish | 9:47,603,423-47,619,406 | 16.0 | 15 (+) | 84519 | 768 | XP_6860122 | na | |||
| Human | 17:45,331,228-45,387,567 | 37.4 | 9 (+) | 87,058 | 788 | NM_000212 | P05106 | |||
| Mouse | 11:104,469,370-104,528,689 | 59.3 | 15 (+) | 86,694 | 787 | NM_016780 | O54890 | |||
| Horse | 11:17,206,504-17,237,153 | 30.7 | 15 (−) | 86,360 | 784 | NM_001081802 | na | |||
| Opossum | 2:208,102,619-208,154,477 | 51.9 | 14 (+) | 84,497 | 764 | chr2.5.154.a3 | na | |||
| Chicken | 27:2,207,723-2,223,943 | 16.2 | 15 (+) | 86,088 | 781 | NP_989646 | na | |||
| Frog | 9735:169,881-200,256 | 30.4 | 15 (−) | 88,235 | 792 | XP_0029424013 | na | |||
| Zebrafish | 3:16,152,931-16,181,252 | 28.3 | 15 (+) | 85,693 | 785 | NP_001032312 | Q3LTM4 | |||
| Zebrafish | 12:21,495,417-21,520,656 | 25.2 | 15 (−) | 87,700 | 790 | NP_001076417 | B3DIP9 | |||
| Human | 17:73,720,784-73,753,633 | 36.4 | 39 (+) | 202,167 | 1,822 | NM_000213 | P16144 | |||
| Mouse | 11:115,836,039-115,869,725 | 33.7 | 39 (+) | 201,650 | 1,818 | NM_133663 | A2A863 | |||
| Horse | 11:6,286,192-6,312,590 | 26.4 | 39 (−) | 194,864 | 1,752 | XP_0019159152 | na | |||
| Opossum | 2:213,276,209-213,319,767 | 43.6 | 36 (−) | 198,824 | 1,778 | XP_0013776062 | na | |||
| Chicken | 18:4,713,504-4,732,758 | 19.3 | 39 (−) | 203,330 | 1,818 | E1C9G74 | E1C9G7 | |||
| Frog | 5455:495,511-545,134 | 49.6 | 38 (+) | 204,208 | 1,835 | XP_0029400202 | na | |||
| Zebrafish | 8:13,034,599-13,080,896 | 46.3 | 39 (+) | 210,736 | 1,893 | NP_001019557 | Q4U0S1 | |||
| Human | 3:124,482,473-124,605,847 | 139.5 | 15 (−) | 88,054 | 799 | BC0065416 | P18084 | |||
| Mouse | 3:126,102,954-125,963,482 | 139.5 | 15 (+) | 87,909 | 798 | NM_010580 | O70309 | |||
| Horse | 19:34,769,099-34,876,312 | 107.2 | 15 (+) | 87,985 | 803 | XP_0015000772 | na | |||
| Opossum | 4:90,575,064-90,751,974 | 17.7 | 14 (−) | 86,072 | 779 | XP_0013727112 | na | |||
| Chicken | 7:29,463,064-29,500,494 | 37.4 | 15 (−) | 88,475 | 812 | NP_989814 | na | |||
| Frog | 6 | 6 | 6 | 87,561 | 807 | NP_001135704 | B5DEV6 | |||
| Zebrafish | 9:22,031,538-22,101,156 | 69.6 | 16 (+) | 88,956 | 802 | NP_001076305 | A5D6V1 | |||
| Human | 2:160,958,250-161,056,574 | 172.2 | 15 (−) | 85,936 | 788 | NM_000888 | P18564 | |||
| Mouse | 2:160,836,644-160,664,416 | 172.2 | 15 (+) | 86,042 | 787 | NM_021359 | Q9Z0T9 | |||
| Horse | 18:40,790,052-40,860,613 | 70.6 | 15 (−) | 85,817 | 788 | XP_0014929142 | na | |||
| Opossum | 4:165,626,656-165,720,419 | 93.8 | 15 (−) | 86,274 | 787 | ENSMODT65823 | na | |||
| Chicken | 7:23,382,214-23,409,883 | 27.7 | 15 (+) | 86,530 | 789 | XP_4220372 | E1C6K8 | |||
| Frog | 515:1,605,719-1,638,193 | 32.5 | 13 (+) | 76,152 | 696 | NP_001090775 | A4IGI8 | |||
| Human | 12:53,585,343-53,594,227 | 16.3 | 15 (−) | 86,903 | 798 | NM_000889 | P26010 | |||
| Mouse | 15:102,046,428-102,062,317 | 15.9 | 15 (−) | 87,411 | 806 | NM_013566 | P26011 | |||
| Horse | 6:70,193,060-70,201,400 | 8.3 | 14 (−) | 86,666 | 797 | XP_0014949172 | na | |||
| Frog | 226:1,387,775-1,412,224 | 24.5 | 15 (+) | 84,461 | 766 | XP_0029366862 | na | |||
| Zebrafish | 6:43,251,691-43,271,173 | 19.5 | 15 (−) | 74,006 | 661 | XP_0013379492 | na | |||
| Human | 7:20,371,430-20,449,617 | 85.1 | 14 (+) | 85,632 | 769 | NM_002214 | P26012 | |||
| Mouse | 12:120,477,276-120,396,490 | 80.8 | 14 (−) | 84,519 | 767 | NM_177290 | Q0VDB0 | |||
| Horse | 4:52,216,892-52,290,256 | 73.4 | 14 (+) | 85,361 | 767 | XP_0014972712 | na | |||
| Opossum | 8:302,826,175-302,899,169 | 73.0 | 14 (−) | 83,721 | 757 | XP_0013718342 | na | |||
| Chicken | 2:30,002,182-30,052,908 | 49.3 | 15 (+) | 86,990 | 786 | XP_4187062 | na | |||
| Frog | 525:945,865-97,887 | 33.0 | 14 (−) | 81,104 | 728 | XP_0029333442 | na | |||
| Zebrafish | 19:2,377,677-2,441,753 | 64.1 | 14 (−) | 74,877 | 669 | XP_0019196262 | A3RI57 | |||
| Nematode | III:3,909,309-3,914,106 | 4.8 | 8 (−) | 90,138 | 809 | NP_497787 | Q27874 |
Figure 1Predicted domains and transmembrane helix for human ITB2 and ITB4. Domains and key regions are identified for ITB2 and ITB4 amino acid sequences using NCBI web tools (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) to identify functional domains and ExPasy web tools to identify predicted transmembrane domains (TMHMM) (http://www.cbs.dtu.dk/services/TMHMM-2.0/); INB or integrin_beta; integrin B tail (in pink); TMHMM transmembranes (in red); cytosolic domain (in blue); Calx-beta domain (in green); FN3 (fibronectin 3), cytokine receptor and interdomain contacts (red triangles); Note: lack of FN3 binding domains and the cytokine receptor motifs in β4 subunit that interacts only with laminin-332.
Figure 2Amino acid alignments for vertebrate ITB1 cytosolic domain sequences. ITB1 sequences examined included Hu-human; Rh-rhesus; Ma-marmoset; Mo-mouse; Ra-rat; Gp-guinea pig; Ho-horse; Co-cow; Pi-pig; Op-opossum; Ch-chicken; Fr-Xenopus tropicalis; Zf-zebrafish; see Table 1 for details; note that 2 ITB1-like genes were observed in zebrafish (designated as ITB1A and ITB1B); * shows identical residues for ITB subunits; : similar alternate residues;. dissimilar alternate residues; α-helix for vertebrate ITB sequences is in shaded yellow; β-sheet is in shaded grey; colors for amino acids are shown as: basic (R and K); acidic (D and E); neutral hydrophilic (G, Y, Q, S, T, N, Y, C, H); and hydrophobic (M, A, F, I, L, W, P, V); the Cyto-1, Cyto-2 (NPXY) and Cyto-3 (NXXY) domains are shown in dotted lines (see text for details).
Figure 3Amino acid sequence alignments for vertebrate ITB cytosolic domain sequences. (A) Comparison and alignments of human ITB1 major isoforms for cytosolic domain sequences; (B) Consensus sequences of vertebrate ITB cytosolic domains; see Table 1 for sources of beta integrin cytosolic domain sequences: * shows identical residues for ITB subunits; : similar alternate residues;. dissimilar alternate residues; α-helix for vertebrate ITB sequences is in shaded yellow; β-sheet is in shaded grey; colors for amino acids are shown as: basic (R and K); acidic (D and E); neutral hydrophilic (G, Y, Q, S, T, N, Y, C, H); and hydrophobic (M, A, F, I, L, W, P, V); the Cyto-1, Cyto-2 (NPXY) and Cyto-3 (NXXY) domains are shown in dotted lines (see text for details).
Figure 5Comparative tissue expression for mouse beta integrin genes (ITB). Expression ‘heat maps’ (GNF Expression Atlas 2 data) (http://biogps.gnf.org) [91] were examined for comparative gene expression levels among human and mouse tissues for ITB genes showing high (red); intermediate (black); and low (green) expression levels; derived from mouse genome browsers (http://genome.ucsc.edu) [45].
Figure 4Structures and major splicing isoforms for human beta integrin genes. Derived from the AceView website http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/ [57]; mature isoform variants (a) are shown with capped 5′- and 3′- ends for the predicted mRNA sequences; NM refers to the NCBI reference sequence; exons are in pink; the directions for transcription are shown as 5′ → 3′; sizes of mRNA sequences are shown in kilobases (kb).
Predicted transcription factor binding sites (TFBS), CpG islands and MiRNA (MiR) regions for human and mouse ITB Genes. The human and mouse genome browsers (http://genome.ucsc.edu) [45] were used to examine the predicted transcription factor binding sites (TFBS), CpG islands and Mi-RNA binding sites for human and mouse ITB genes.
| Human | CpG:158 | 39 | 8 | 6.1 | wide expression | |||
| CpG18,21,22 | 7 | 0 | 3.8 | thymus, spleen, bone | ||||
| CpG92 | 10 | 5 | 1.4 | placenta, kidney, skin | ||||
| CpG19,22,62,91 | 51 | 2 | 4.6 | colon, ovary, pancreas | ||||
| CpG152 | 47 | 0 | 4.5 | lung, ovary, kidney | ||||
| 0 | 56 | 0 | 1.3 | pancreas, lung, kidney | ||||
| CpG48 | 20 | 0 | 0.8 | leukocytes, spleen | ||||
| CpG133 | 105 | 35 | 2.2 | wide expression | ||||
|
| ||||||||
| Mouse | CpG120 | na | 3 | 2.0 | wide expression | |||
| 0 | na | 0 | 1.6 | thymus, spleen, bone | ||||
| 0 | na | 0 | 0.3 | bone marrow | ||||
| CpG522 | na | 3 | 0.5 | spleen, placenta | ||||
| CpG38,59 | na | 1 | 1.8 | mammary, brain | ||||
| CpG87 | na | - | 3.7 | mammary, lung | ||||
| 0 | na | 1 | 0.4 | kidney, bladder, heart | ||||
| CpG26 | na | 0 | 1.1 | spleen, mammary | ||||
| CpG316 | na | 2 | 1.2 | mammary, kidney | ||||
Figure 6Phylogenetic tree of vertebrate beta integrin cytosolic domain amino acid sequences. The tree is labeled with the ITB name and the name of the animal and is ‘rooted’ with the Caenorhabitis elegans (nematode) ITB-like sequence (see Table 1). Note the 7 major clusters corresponding to the ITB1, ITB2, ITB3, ITB4, ITB5, ITB6, ITB7 and ITB8 gene families. A genetic distance scale is shown (% amino acid substitutions). The number of times a clade (sequences common to a node or branch) occurred in the bootstrap replicates are shown. Only replicate values of 90 or more which are highly significant are shown with 100 bootstrap replicates performed in each case.
Multiplicity and specificity of ligand binding by ITB subunits; ECM refers to extracellular matrix.
| Collagens | β1 | α1β1, α2β1, α10β1, α11β1 |
| E-Cadherin | β1, β7 | αEβ7, α2β1 |
| Endorepellin | β1 | α2β1 |
| Endostatin | β1 | α5β1 |
| Factor X | β2 | αMβ 2 |
| Fibronectin | β1, β2, β3, β6 and β7 | α4β1, α5β1, α8β1, αvβ1, αIIbβ3, αvβ3, αvβ6, α4β7, αMβ2 |
| Fibrinogen | β2, β3 and β5 | αMβ2, αXβ2, αIIbβ3, αvβ3, αvβ5 |
| ICAM-1, -2, -3, -5 | β2 | αLβ2 |
| ICAM-3 | β2 | αDβ2 |
| iC3b | β2 | αMβ2, αXβ2 |
| Laminins | β1 and β4 | α1β1, α2β1, α3β1, α6β1, α6β4, α7β1, |
| MadCAM-1 | β7 | α4β7 |
| Nephronectin (RGD) | β1 | α8β1 |
| Osteopontin | β1, β3 and β5 | αvβ1, αvβ3, αvβ5 |
| Semaphorin 7A | β1 | α1β1 |
| Tenascin-C | β1, β3 and β6 | α1β1, α8β1, α9β1, αvβ3 and αvβ6 |
| TGF-β-LAP (RGD) | β6 and β8 | αvβ6, αvβ8 |
| Tumastatin | β3 | αvβ3 |
| VCAM-1 | β2 and β7 | αDβ2, α4β7, αHβ7 |
| VEGF-C, VEGF-D | β1 | α9β1 |
| Vitronectin | β1, β3, β5 and β8 | αvβ1, α8β1, αvβ3, αvβ5, αvβ8, , |
| Von Willebrandt Factor | β3 | αIIbβ3, αvβ3, |
Figure 7The blue arrows show the predicted evolutionary paths of β subunits from the ancestral β4 subunit. Pairings of different α and β subunits are shown by thin black lines. This concept shows two lines of evolution diverging from β1, one towards blood cell integrins consisting of β2 or β7 subunits and the other towards a cluster consisting of β5, β6, β8 and β3 subunits that are primarily expressed in tissues other than blood with exception of α IIb.β3 that is expressed also in blood platelets (see text).