| Literature DB >> 17026767 |
Richard A Stein1, James V Staros.
Abstract
BACKGROUND: In the time since we presented the first moleculEntities:
Mesh:
Substances:
Year: 2006 PMID: 17026767 PMCID: PMC1618406 DOI: 10.1186/1471-2148-6-79
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
List of Ligands
| Liganda | Recb | Speciesc |
| amphiregulin (AR) | E1 | c, ch, co, d, es, f, h, m, ma, o, p, r, ra, rh, rt(2), t, xt, z |
| betacellulin (BTC) | E1, E4 | ae, c, ca, ch, co, d, es, f, h, m, p, r, ra rh, s, t, xt, z |
| epidermal growth factor (EGF) | E1 | ae, c, ch, ct, d, dn, et, f, h, m, me, o, p, r, t, xt, z |
| epigen | E1 | c, ch, co, d, h, m, o, r, ra, xl(2), xt, z |
| epiregulin (EPR) | E1, E4 | c, ch, co, d, f, h, m, me(2), o, r, ra, rh, rt(2), t, xt, zf |
| gurken | I | da, de, di, dm, dp, dr, ds, dw, dy |
| heparin-binding epidermal growth factor (HB-EGF) | E1, E4 | c, eg, ch, co, d, f, gm, h, m, ma, me, o, p, r, ra, rt, st, t, xt, z |
| interphotoreceptor matrix proteoglycan-2 (IMP2) | UN | c, ch, co, d, f, h, m, o, r, rh, t, xt, z |
| keren | I | da, de, dg, di, dj, dm, dp, dr, ds, dv, dw, dy, g |
| lin3 | I | cb, ce |
| meprin 1α (MEP1α) | UN | ae, c, ch, d, dn, f, h, m, o, r, rh, t, xl, xt, z |
| meprin 1β (MEP1β) | UN | c, ch, co, d, f, h, m, o, p, r, rh, t, xl(2), xt, z |
| mucin 3 (MUC3) | UN | co, h, m, r, rh, rt, xt(3), z |
| mucin 4 (MUC4) | P | c, co, d, h, m, o, r, xt |
| mucin 12 (MUC12) | UN | c, co, d, h, m, ra, rh |
| mucin 17 (MUC17) | UN | d, et, h, m, o, r, rh |
| neuregulin-1α (NRG1α) | E3, E4 | ch, co, d, dn, gu, h, m, ma, o, p, r, ra, rt, xl, z |
| neuregulin-1β (NRG1β) | E3, E4 | c, ch, co, d, h, m, o, r, xl, z |
| neuregulin-2α (NRG2α) | E3, E4 | c, ch, co, d, et, f, gu, h, m, o, r, rh, t, z |
| neuregulin-2β (NRG2β) | E3, E4 | c, ch, co, d, et, f, gu, h, m, o, r, rh, z |
| neuregulin-3 (NRG3) | E4 | c, ch, co, d, es, f, gu, h, m, o, p, r, t, xt, z(2) |
| neuregulin-4 (NRG4) | E4 | c, ch, co, f, gu, h, m, me, o, p, r, rh, rt, xt |
| neuroglycan-C (NGC) | E3 | ab, c, ch, co, d, f, h, m, me(2), o, r, rh, sh, xt, z |
| spitz | I | an, da, de, dg, di, dj, dm, dp, dr, ds, dw, dv, dy, hb, 1, tc, yf |
| tomoregulin-1 (TR1) | E4 | c, ch, co, d, dn, es, et, f, h, m, o, p, r, xl, xt, z(2) |
| tomoregulin-2 (TR2) | E4 | c, ch, co, d, f, h, m, o, p, r, rh, rt, t(2), xl, xt, z |
| transforming growth factor α (TGFα) | E1 | aa, ae, c, ch, co, d, dn, f, h, m, ma, o, or, p, r, ra, rh, sh, t, xl(2), xt(2), z |
| vein | I | an, da, de, dg, di, dj, dm, dp, dr, ds, dv, dy, hb, yf |
| viral growth factor | V | ar, be; bp(2); cl(2), cp, ep(5), fp(2), gp(2), ls(2), mp, my, rf, rp, sa, sp(3), va(4), vc(5), yl |
a List of ligands used in the evolutionary analysis. In parentheses are the abbreviated names used in the text and figures.
b Indication of the receptor that each ligand binds to. E1: EGF receptor; E3: ErbB3; E4: ErbB4; I: the invertebrate ligands bind to the one invertebrate receptor; P: interacts with ErbB2, but it is unknown if it is as a typical ligand; V: the receptor specificity varies and depends on the origin of the poxvirus growth factor; UN: the receptor specificity for these potential ligands is unknown.
c List of species for each ligand. The number in parentheses indicates the number of copies of the ligand found in that species or in the case of the viral growth factors the number of viral strains. The abbreviations used are as follows:
aa: Arvicanthis ansorgei; ab: Astatotilapia burtoni; ae: Loxodonta africana (African elephant); an: Anopheles gambiae (African malaria mosquito); ar: aracatuba; be: BeAn58058; bp: cowpox; c: Pan troglodytes (chimp); ca: Cyprinus carpio (carp); cb: Caenorhabditis briggsae; ce: Caenorhabditis elegans; cg: Cricetulus grises (Chinese hamster); ch: Gallus gallus (chicken); cl: camelpox; co: Bos taurus (cow); cp: canarypox; ct: Felis catus (cat); d: Canis familiaris (dog); da: Drosophila ananasse; de: Drosophila erectus; dg: Drosophila gritnshawi; di: Drosophila sitnulans; dj: Drosophila mojavensis; dm: Drosophila melanogaster, dn: Dasypus novemcinctus (armadillo); dp: Drosophila pseudoobscura; dr: Drosophila persimilis; ds: Drosophila sechellia; dv: Drosophila virilis; dw: Drosophila willistoni; dy: Drosophila yakuba; ep: ectromelia; es: Sorex araneus (European shrew); et: Echinops telfaira (small Madagascar hedgehog); f: Takifugu rubripes (fugu); fp: fowlpox; g: Glossina morsitans (tsetse fly); gm: Cercopithecus aethiops (green monkey); gp: goatpox; gu: Cavia porcellus (guinea pig); h: Homo sapiens (human); hb: Apis mellifera (honey bee); 1: Homerus americanus (American lobster); Is: lumpy skin disease; m: Mus musculus (mouse); ma: Mesocricetus auratus (golden hamster); me: Oryzias latipes (medaka); mp: monkeypox; my: myxoma; o: Monodelphis domestica (opossum); or: Pongo pygmaeus (orangutan); p: Sus scrofa (pig); r: Rattus novegicus (rat); ra: Oryctolagus cuniculus (rabbit); rf: rabbit fibroma; rh: Macaca mulatta (rhesus monkey); rp: rabbitpox; rt: Oncorhynchus mykiss (rainbow trout); s: Salmo salar (salmon); sa: SPAN232; sh: Ovis aries (sheep); sp: sheeppox; st: Gasterosteus aculeatus (stickleback); t: Tetraodon nigroviridis (tetraodon); tc: Tibolium castaneum; va: variola; vc: vaccinia; xl: Xenopus laevis; xt: Xenopus tropicalis; yf: Aedes aegypti (yellow fever mosquito); yl: yaba-like; z: Danio rerio (zebrafish)
Figure 1Phylogenetic relationship of the EGF modules from the invertebrate ErbB ligands. This tree was generated using neighbor joining with poisson correction of protein sequences in MEGA version 3.1 [61]. Some of the bootstrap percentages for the various branch points are shown.
Figure 2Consensus sequences for the mammalian ligands. Alignment was generated in ClustalX [60]. To minimize errors in amino acid sequence from the DNA sequences used in the analysis, a conserved residue was called conserved if it was in at least 75% of the sequences for an individual ligand. In the alignment, gaps are denoted by a dash (-) and non-conserved residues are indicated by an X. Reverse text (white text on black background) denotes residues that are at least 75% conserved among the different ligands, with grey shaded text (black text on grey background) denoting residues that are different at these conserved positions. Shown for comparison at the bottom is the sequence of human EGF and numbering for the mature ligand.
Figure 3Phylogenetic relationship of the EGF modules from the vertebrate ErbB ligands. The tree shown was generated using neighbor joining with poisson correction of protein sequences in MEGA version 3.1 [61]. Each colored oval highlights the cluster of branches for a different ligand. Shown are some of the bootstrap percentages for the split between the two ligand families. Though the bootstrap percentages show low confidence in some of the branches of the tree, trees generated using different methods of distance correction exhibited similar separation of EGF receptor ligands and ErbB3/ErbB4 ligands and the positions of ligands relative to each other were comparable. Similar trees were generated using the Phylip [62] group of programs.
Figure 4Detailed trees for the AR/HB-EGF and TR1/TR2 pairs. (A) The AR/HB-EGF pair from the tree in Fig. 3. HB-EGF exists in both teleosts and tetrapods, but there is no teleost AR, while the teleosts do have a second sequence labeled AHP, which is slightly more similar to HB-EGF than to AR. (B) The TR1/TR2 pair from the tree in Fig. 3. This tree shows an additional duplication pattern with both TR1 and TR2 forms in the teleosts segregating together. This tree is complicated by the different ligand length for TR2 in the teleosts compared to the rest of the ligands on this branch. The difference in length does suggest an alteration in the sequence requirement for TR2 in the teleosts. (C) Tomoregulin 1 and 2 sequences from human and zebrafish. These sequences are representative of the sequences from other species. TR2 is two amino acids shorter in the zebrafish than the other sequences. Reverse text (white text on black background) denotes residues that are at least 75% conserved between the four ligands, with grey shaded text (black text on grey background) denoting residues that are different at these conserved positions.
List of Receptors
| Receptora | Speciesb |
| EGF receptor | an, c, cb, ce, ch, ci, co, cv, d, dm, dp, ds, ef, em, f, h, hb, m, op, p, r, rh, sm, t, xt, xx, yf, z |
| ErbB2 | ch, co, d, f, h, m, ma, op, r, rh, t, xt, z |
| ErbB3 | c, ch, co, d, f(2), h, m, o, op, r, rh, t(2), xt, z(2) |
| ErbB4 | c, ch, d, f(2), h, m, r, rh, t(2), xt, z |
aList of receptors used in the evolutionary analysis.
bList of species for each receptor. The number in parentheses indicates the number of copies of the receptor found in that species. The abbreviations used are as follows:
an: Anopheles gambiae (African malaria mosquito); c: Pan troglodytes (chimp); cb: Caenorhabditis briggsae; ce: Caenorhabditis elegans; ch: Gallus gallus (chicken); ci: Ciona intestinalis; co: Bos taurus (cow); cv: Caenorhabditis vulgaris; d: Canis familiaris (dog); dm: Drosophila melanogaster; dp: Drosophila pseudoobscura; ds: Drosophila simulans; ef: Ephydatia fluviatilis; em: Echinococcus multilocularis; f: Takifugu rubripes (fugu); h: Homo sapiens (human); hb: Apis mellifera (honey bee); m: Mus musculus (mouse); ma: Mesocricetus auratus (golden hamster); mo: Anopheles gambiae (mosquito); o: Pongo pygmaeus (orangutan); op: Monodelphis domestica (opossum); p: Sus scrofa (pig); r: Rattus novegicus (rat); rh: Macaca mulatta (rhesus monkey); sm: Schistosoma mansoni; t: Tetraodon nigroviridis (tetraodon); xt: Xenopus tropicalis; xx: Xiphiphorus xiphidium; yf: Aedes aegypti (yellow fever mosquito); z: Danio rerio (zebrafish)
Figure 5Consensus sequences for the teleost and tetrapod ErbB receptors. The alignment was generated in ClustalX [60]. To minimize errors in amino acid sequence from the DNA sequences used in the analysis, a conserved residue was called conserved if it was in 75% of the sequences. In the alignment, gaps are denoted by a dash (-) and non-conserved residues are indicated by an X. Reverse text (white text on black background) denotes residues that are at least 75% conserved between the different ligands, with grey shaded text (black text on grey background) denoting residues that are different at these conserved positions. The color bars along the top denote different subdomains within the receptor: red, subdomain I; magenta, subdomain II; green, subdomain III; cyan, subdomain IV; yellow, transmembrane; blue, intracellular juxtamembrane domain; and orange, kinase domain. The sequences start at the beginning of the second exon, and the residue numbers are for the human receptors. The regions or residues of interest are: (A) extended regions that are not well conserved in ErbB2 sequences; (B) extracellular juxtamembrane region that is alternatively spliced in ErbB4 yielding a long and short form; (C) the one glycosylation site that is conserved in the four receptors; (D) regions in the kinase domain where ErbB3 differs relative to the other three receptors, corresponding to the C-helix (D1) and the activation loop (D2); (E) the C-terminal portion of the kinase domain that has receptor-specific sequences and has been shown to be involved in mediating high affinity binding; (#) residue involved in subdomain II-subdomain II interactions in the receptor dimer and subdomain II-subdomain IV interactions in the tethered receptor monomer; (&) and (*) residues involved in subdomain II-subdomain II interactions in the receptor dimer; (+) residues involved in subdomain II-subdomain IV interactions in the tethered receptor monomer; and (^) residues that interact with ligand.
Figure 6Phylogenetic relationship of the ErbB receptors. Shown is a tree generated using neighbor joining with p-distance correction of protein sequences in MEGA version 3.1 [61]. Shown are the bootstrap percentages for the split between invertebrate and vertebrate receptors. Similar trees were generated using different methods of distance correction. The invertebrate receptors lead into the vertebrate receptors separating ErbB3 and ErbB4 from EGF receptor and ErbB2. This structure suggests three gene duplication events, depicted by the filled circles, the first generating EGF receptor/ErbB2 and ErbB3/ErbB4 progenitors. Two more gene duplication events generated the four receptors seen in the vertebrates.
Ligand/Receptor Interactions
| EGFR | ErbB2 | ErbB3 | ErbB4 | EGF | TGFα | |||||
| #a | tet | tel | tet | tel | tet | tel | tet | tel | ||
| 45 | Tyr | His/Tyr | Tyr | His | Leu | Gln/Met | Ser | Ser | Met/Ile | Arg |
| 90 | Glub | Gluc | Glud | var | Aspe | Gluf | Glu | Glu | Lys | Lys |
| 350 | Val | Thr | Glu | var | Thrg | Thrh | Thr | Thr | Leu | Phe |
| 355 | Asp | Asp | Aspi | Aspi | Asp | Asp | Asp | Asp | Arg | Arg |
| 357 | Phe | Tyrj | var | var | Trp | Phe/Tyr | Tyr | Tyrk | Tyr | Phe |
| 384 | Gln | Gln | Serl | Gln | Gluf | Gln | Gln | Gln | Gln/Arg | Glu |
a: EGFR residue number
b: Asp in chicken, Lys in X. tropicalis
c: Asp in tetraodon
d: Gln in X. tropicalis
e: Glu in chicken
f: Asp in zebrafish
g: Ile in X. tropicalis
h: Leu in zebrafish
i: Asn in mouse, golden hamster, and zebrafish
j: His in zebrafish
k: Phe in zebrafish
l: Glu in X. tropicalis