| Literature DB >> 20939927 |
Shengfeng Huang1, Haozhen Tian, Zelin Chen, Ting Yu, Anlong Xu.
Abstract
BACKGROUND: The vertebrate tetraspanin family has many features which make it suitable for preserving the imprint of ancient sequence evolution and amenable for phylogenomic analysis. So we believe that an in-depth analysis of the tetraspanin evolution not only provides more complete understanding of tetraspanin biology, but offers new insights into the influence of the two rounds of whole genome duplication (2R-WGD) at the origin of vertebrates.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20939927 PMCID: PMC2965184 DOI: 10.1186/1471-2148-10-306
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1The functions and structures of vertebrate tetraspanins. (A) Schematic of some tetraspanin functions. (B) The tetraspanin overall structure. C1~C8 indicate the conserved cysteines; 100% conserved cysteines are labeled; 'CCG' is the so-called tetraspanin signature. (C) The LEL structure of tetraspanins.
Figure 2The protein phylogenetic (ME) tree of invertebrate deuterostome tetraspanins. Proteins marked with a rectangle are members of the TSPAN4 gene cluster. This tree includes all 39 amphioxus sequences but omitted 9 C. instestinalis sequences and 5 S. purpuratus. Readers interested in the omitted genes are referred to Figure S1-4.
Figure 3The chordate TSPAN4 gene clusters in amphioxus and humans. Triangles show the gene direction. Picture is not drawn to scale. The relationships between four human paralogous region are shown using a tree pattern. An inferred TSPAN4 cluster in the pre-WGD vertebrate ancestor is also provided.
Classification of all vertebrate tetraspanin LELs
| subtype | cysteine pattern of LEL | tetraspanin families |
|---|---|---|
| 4-cys | CCG--C--C | CD81 |
| 6-cys-a | CCG--[DN][WY]--PXXCC--C--GC | CD151, CD63, TSPAN3, TSPAN6, TSPAN12, TSPAN18, RDS, UPK1, |
| 6-cys-b1 | CCG--[DN][WY]--[P]XXCXC--C--GC | TSPAN8, some amphioxus and ascidian TSPAN8-like |
| 6-cys-b2 | CCG--[DN][WY]--PCXC--C--GC | CD37 |
| 6-cys-c | types other than 6a, 6b1, 6b2 | TSPAN13, some amphioxus TSPAN8-like |
| 8-cys | CCG--[DN][WY]--C--C--PXXCC--C--GC | TSPAN5, TSPAN10, TSPAN15 and TSPAN33 |
Figure 4The protein phylogenetic (ME) trees of all 17 ancestral (pre-WGD) vertebrate tetraspanin lineages. The bony vertebrate ortholog lineages and the ohnolog patterns are highlighted as shown in panel A. Red-filled rectangles are used to mark each ancestral lineage in panel J and K. #1, a distant tetraspanin, only found in ray-finned fish otocephala; #2, found in ray-finned fishes, xenopus and reptiles, probably an independent duplicate of TSPAN4; #3, a tetraspanin pseudogene from mouse; #4, a divergent tetraspanin found in all bony vertebrates, originated by retrotransposition; #5, mammalian ROM1, the true ortholog of bony vertebrate ROM1, but too divergent to cluster with other ROM1; #6, mammal TSPAN16, the true ortholog of teleost TSPAN16, but too divergent to cluster with other TSPAN16; #7, a divergent lineage, its position is not determined; #8 and #9, no synteny shared between TSPAN33 and TSPAN33-like, but since they were separated before the radiation of bony vertebrates, here we treated them as an ohnolog pair; #10, found in reptiles and mammals, as a result of tandem duplication of CD81, becoming too divergent to cluster with CD81; #11, only found in ray-finned fish otocephala, as a result of independent duplication of CD9, becoming too divergent to cluster with CD9; #12, has weak support for its clustering with tetrapod CD37 and sharing no synteny, so its identity is questionable.
Figure 5(A) Schematic representation of the tree topology. Three types of branches (in thick lines) are selected for branch-site model tests for positive selection. (B) The one-ratio tree of RDS. The branches selected for tests are marked with "#". (C-F) RDS gene trees inferred using branch-specific models, with the mammalian ROM1 branch as the foreground branch. (C) Synonymous substitution (dS) tree of transmembrane region (TM). (D) Nonsynonymous substitution (dN) tree of TMs. (E) dS tree of the non-TM regions. (F) dN tree of the non-TM region.
Statistics on branches under significant positive selection
| Branch type | No. of branches | Proportion of sites under positive selection average(median) | Significant LRTa (P < 0.05) (proportion) | Fisher's exact testb | Significant LRTa (P < 0.01) (proportion) | Fisher's exact testb |
|---|---|---|---|---|---|---|
| * | ||||||
| A. duplications | 34 | - | 26(76%) | 0.0425;A/B | 19(56%) | 0.0045;A/B |
| B. mammalsc (portion 1,2,3 as a whole) | 31 | - | 15(48%) | 0.7961;B/D | 6(19%) | 0.7552;B/D |
| C. mammalsc (portion 1,2,3 as independent branches) | 81 | - | 26(32%) | 1.4e-5;A/C | 13(16%) | 3.0e-5;A/C |
| D. teleosts | 28 | - | 15(54%) | 0.0667;A/D | 7(25%) | 0.0202;A/D |
| * | ||||||
| E. duplications | 36 | 18.4%(19.0%) | 28(78%) | 0.0138;E/F | 20(56%) | 0.0025;E/F |
| F. mammalsc (portion 1,2,3 as a whole) | 33 | - | 16(48%) | 0.8025;F/H | 6(18%) | 0.7577;F/H |
| G. mammalsc (portion 1,2,3 as independent branches) | 86 | 7.2%(4.1%) | 27(31%) | 3.0e-6;E/G | 14(16%) | 0.0001;E/G |
| H. teleosts | 30 | 9.6%(8.3%) | 16(53%) | 0.0652;E/H | 7(23%) | 0.0118;E/H |
| * | ||||||
| I. duplications | 0 | - | - | - | - | - |
| J. speciation (as independent branches) | 6 | - | 0(0%) | - | 0(0%) | - |
aNumber of branches on which positive selection is detected using branch-site model at given p-value; Hochberg's method (1988) was used for multiple testing correction.
bFisher's exact test was used for testing the difference between different types of branches
cThe bony vertebrate-mammal branch was interrupted by amphibians and reptiles, hence produced portion 1,2,3, see Figure 5A for details.