| Literature DB >> 18053128 |
Amir Ali Abbasi1, Karl-Heinz Grzeschik.
Abstract
BACKGROUND: The human chromosomes 2q, 7, 12q and 17q show extensive intra-genomic homology, containing duplicate, triplicate and quadruplicate paralogous regions centered on the HOX gene clusters. The fact that two or more representatives of different gene families are linked with HOX clusters is taken as evidence that these paralogous gene sets might have arisen from a single chromosomal segment through block or whole chromosome duplication events. This would imply that the constituent genes including the HOX clusters reflect the architecture of a single ancestral block (before vertebrate origin) where all of these genes were linked in a single copy.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18053128 PMCID: PMC2235844 DOI: 10.1186/1471-2148-7-239
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Gene families with members on at least three of the human HOX-bearing chromsomes 2, 7, 12 and 17. Restricted location of members of many of these gene familes near the HOX clusters suggests that these paralogons may have duplicated simultaneously by block/whole chromosome duplication. SLC4, solute carrier family 4; INHB, inhibins; GLI, glioma-associated oncogene homolog belonging to kruppel family; ITGB, integrin β chains; SP, transcription factor Sp; HOX, homeobox; COL, collagens; MYL, myosin light chains; EGFR/ERBB, epidermal growth factor receptor/erythroblastoma; ZNFN1A, zinc finger protein, subfamily 1A; IGFBP, insulin-like growth factor-binding protein; HH, hedgehog. None of the features of this Figure are drawn to scale.
Human Gene Families used in Analysis
| COL2A1 | 12q13.11-q13.2 | P02458 | Extracellular matrix structural constituent, Structural constituent of bone, Phosphate transport, Cell adhesion, Skeletal development, Perception of sound. | |
| COL3A1 | 2q31 | P02461 | ||
| COL5A2 | 2q14-q32 | Q7KZ55 | ||
| COL1A2 | 7q22.1 | P08123 | ||
| COL1A1 | 17q21.33 | P02452 | ||
| ERBB2 | 17q21.1 | Q96RT1 | Epidermal growth factor receptor activity, Protein serine/threonine kinase activity, Electron transporter activity, Cell proliferation, ATP binding. | |
| EGFR | 7p12 | P00533 | ||
| ERBB4 | 2q33.3-q34 | Q15303 | ||
| ERBB3 | 12q13 | P21860 | ||
| IGFBP4 | 17q12-q21.1 | P22692 | Regulation of cell growth, Signal transduction, Skeletal development Cell proliferation. | |
| IGFBP1 | 7p13-p12 | P08833 | ||
| IGFBP2 | 2q33-q34 | P18065 | ||
| IGFBP6 | 12q13 | P24592 | ||
| IGFBP3 | 7p13-p12 | P17936 | ||
| IGFBP5 | 2q33-q36 | P24593 | ||
| ITGB3 | 17q21.32 | P05106 | Receptor activity, Cell-matrix adhesion, Integrin-mediated signaling pathway. | |
| ITGB5 | 3q21.2 | P18084 | ||
| ITGB6 | 2q24.2 | P18564 | ||
| ITGB7 | 12q13.13 | P26010 | ||
| ITGB4 | 17q25 | P16144 | ||
| ITGB8 | 7p15.3 | P26012 | ||
| MYL4 | 17q21-qter | P12829 | Phosphoprotein phosphatase activity, Structural constituent of muscle, Muscle development, Microfilament motor activity. | |
| MYL6 | 12q13.2 | P60660 | ||
| MYL1 | 2q33-q34 | P06741 | ||
| MYL7 | 7p21-p11.2 | Q01449 | ||
| MYL2 | 12q23-q24.3 | P10916 | ||
| SP1 | 12q13.1 | P08047 | RNA polymerase II transcription factor activity. | |
| SP2 | 17q21.32 | Q02086 | ||
| SP3 | 2q31 | Q02447 | ||
| SP4 | 7p15 | Q02446 | ||
| SP8 | 7p21.2 | Q8IXZ3 | ||
| ZNFN1A1 | 7p13-p11.1 | Q13422 | DNA-dependent regulation of transcription, Specification and the maturation of the lymphocyte. | |
| ZNFN1A2 | 2qter | Q9UKS7 | ||
| ZNFN1A3 | 17q21 | Q9UKT9 | ||
| ZNFN1A4 | 12q13 | Q96JP3 | ||
| SLC4A1 | 17q21-q22 | P02730 | Inorganic anion exchanger activity, Bicarbonate transport, Chloride transport. | |
| SLC4A2 | 7q35-q36 | P04920 | ||
| SLC4A3 | 2q36 | P48751 | ||
| SLC4A5 | 2p13 | Q14203 | ||
| SLC4A8 | 12q13 | O95233 | ||
| SLC4A10 | 2q23-q24 | Q9HCQ6 | ||
| GLI1 | 12q13.2-q13.3 | P08151 | Regulation of transcription from RNA polymerase II promoter, Morphogenesis of limb and brain. | |
| GLI2 | 2q14 | P10070 | ||
| GLI3 | 7p13 | P10071 | ||
| SHH | 7q36 | Q15465 | Mesodermal cell fate determination, Proteolysis and peptidolysis, Cell-cell signaling, Intein-mediated protein splicing. | |
| DHH | 12q12-q13.1 | O43323 | ||
| IHH | 2q33-q35 | Q14623 | ||
| INHBA | 7p15-p13 | P08476 | Cytokine activity, Growth factor activity, Induction of apoptosis, Mesoderm development, Defense response. | |
| INHBB | 2cen-q13 | P09529 | ||
| INHBC | 12q13.1 | P55103 | ||
| INHBE | 12q13.3 | P58166 |
Figure 2Neighbor-Joining tree of the COL family members. Uncorrected p-distance was used. Complete-deletion option was used. Numbers on branches represent bootstrap values (based on 1000 replications) supporting that branch; only the values ≥50% are presented here. Scale bar shows amino acid substitution per site.
Figure 3Neighbor-Joining tree of the ERBB family. Symbols and parameters are the same as described in Figure 2.
Figure 4Neighbor-Joining tree of the IGFBP family. Symbols and parameters are the same as described in Figure 2.
Figure 5Neighbor-Joining tree of the Integrin β chain family. Symbols and parameters are the same as described in Figure 2.
Figure 6Neighbor-Joining tree of the (A) Myosin light chain family (B) SP family (C) ZNFN1A family (D) SLC4A family. Symbols and parameters are the same as described in Figure 2.
Figure 7Neighbor-Joining tree of the (A) GLI family (B) Hedgehog family and (C) Inhibin family. Symbols and parameters are the same as described in Figure 2.
Figure 8Members of HOX linked gene families that have arisen early in vertebrate history. Order of branching within phylogenetic trees was used to estimate the time windows (double headed arrows on the right) of gene duplication events relative to major cladogenetic events. For each family the lower limit of time window was defined from fish-tetrapod split and the upper limit from the branching order of available closest invertebrate ancestral sequence (Protostomes: Drosophila, Apis mellifera; Echinoderm: Sea Urchin; Cephalochordates: Amphioxus; Urchordate: Ciona intestinalis, Ciona savignyi). The INHBE, INHBC and SLC4A8, SLC4A10 genes arose after the fish-tetrapods split. Previously Proposed timing [3-6] of extensive gene duplications during early chordate evolution is given on the left of the diagram.
Summary of the Phylogenetic Analysis of Gene Families
| ERBB | ERBB4 | EGFR | ERBB3 | ERBB2 | - | (((17, 7)2)12) |
| Collagen | COL3A1 | COL1A2 | COL2A1 | COL1A1 | - | ((((12,17)7)2)2) |
| IGFBP | IGFBP2 | IGFBP1 | IGFBP6 | IGFBP4 | - | ((17, 7)2) |
| INTB | ITGB6 | ITGB5* | ITGB7 | ITGB3 | - | (((3, 17)2)12) |
| MYL | MYL1 | - | MYL6 | MYL4 | - | ((17, 2)12) |
| SP | Sp3 | Sp4 | Sp1 | Sp2 | Yes | (((12,2)7)17) |
| ZNFN1A | ZNFN1A2 | ZNFN1A1 | ZNFN1A4 | ZNFN1A3 | - | (((7,17)2)12) |
| INHB | INHBB | INHBA | INHBC | - | - | ((7,2)12) |
| SLC4A | SLC4A3 | SLC4A2 | - | SLC4A1 | - | ((7,2)17) |
| GLI | GLI2 | GLI3 | GLI1 | - | - | ((7,2)12) |
| HH | IHH | SHH | DHH | - | ((7,2)12) |
For each gene family the chromosomal location and topologies (in the Newick format) of those genes are given, which arose through duplications after the invertebrates-vertebrates split and before the tetrapods-fishes divergence. The percentage bootstrap support of the internal branches is given below each relevant topology.
Figure 9Consistencies in phylogenies of families having members on at least three of the HOX-bearing chromosomes (A) schematic topology of GLI, INHB, IGFBP, HH and SLC4A families (B) schematic topology of ERBB, ZNFN1A and IGFBP family members (C) schematic topology of HOX clusters and SP gene family (D) schematic topology of integrin beta chain and myosin light chain gene families. In each case the percentage bootstrap support of the internal branches is given in parentheses. The connecting bars on the left depict the close physical linkage of relevant genes.