| Literature DB >> 28871169 |
Abhishek Kumar1,2, Anita Bhandari3, Sandeep J Sarde4,5, Chandan Goswami6.
Abstract
HSP47/SERPINH1 is key-regulator for collagen biosynthesis and its structural assembly. To date, there is no comprehensive study on the phylogenetic history of HSP47. Herein we illustrate the evolutionary history of HSP47/SERPINH1 along with sequence, structural and syntenic traits for HSP47/SERPINH1. We have identified ancestral HSP47/SERPINH1 locus in Japanese lamprey (Lethenteron japonicum). This gene remains on the same or similar locus for ~500 million years (MY), but chromosomal duplication was observed in ray-finned fishes, leading into three sets of three sets (I-III) of HSP47/SERPINH1. Two novel introns were inserted at the positions 36b and 102b in the first exon of only HSP47_1 gene from the selected ray-finned fishes. On the evolutionary time scale, the events of HSP47 duplications took placed between 416-360 MY ago (MYA) while intron insertion dates back to 231-190 MYA after early divergence of ray-finned fishes.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28871169 PMCID: PMC5583329 DOI: 10.1038/s41598-017-10740-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Bayesian phylogeny of representative vertebrate serpins depicts ray-finned fishes specific three sets of HSP47/SERPINH1 within group V6. Set I appears to be close to single copy of HSP47/SERPINH1 in tetrapods, coelacanth and lamprey. Set II is recent duplicate of set I, while set III is very early branching out, hints for its ancestral nature.
Figure 2Gene structural patterns of different HSP47/SERPINH1 genes illustrate that intron insertion is only confined to HSP47_1 of selected ray-finned fishes.
Figure 3Synteny analyses depict origin of different HSP47/SERPINH1 genes. (A) Orthology is shared by tetrapod HSP47/SERPINH1 gene and ray-finned specific HSP47_1/SERPINH1 gene and selected ray-finned fishes have intron gain. Tetrapod HSP47/SERPINH1 shares loci with ray-finned specific HSP47_1. HSP47_1 locus is conserved in different ray-finned fishes as shown in the red box, but not all ray-finned fishes intron gain and fishes with no intron gain are shown in green box. (B) HSP47_2 is originated by recent duplication of HSP47_1. (C) Locus of HSP47_3 is distinct with only few conserved marker genes. + = presence of two additional introns at the positions 36b and 102b; X = Gene is either partial or lost.
Summary of gene annotation for the flanking genes on the ancestral locus of HSP47/SERPINH1 on the scaffold00131 from Japanese lamprey (L. japonicum) genome, A total 45 genes are residing on this locus of size 1 Mb. The gene g32.t1 is LjaHSP47/SERPINH1 and the g19.t1 is P2RY6-like GPCR (also known as lysophosphatic acid receptor, LPA6R) and these two genes are conserved in several vertebrate genomes (Fig. 3) and hence marked in red color. Gene annotation was performed using BLAST2GO 3.0 [8].
| Gene ID | Gene Annotation# | Protein Length | e-Value | Mean Similarity |
|---|---|---|---|---|
|
| diacylglycerol kinase partial | 99 | 5,40E − 57 | 96% |
|
| —NA—$ | 109 | — | — |
|
| —NA— | 108 | — | — |
|
| phosphatidylinositol-glycan biosynthesis class f protein | 207 | 2,10E − 33 | 76,05% |
|
| ovotransferrin-like | 794 | 0,00E + 00 | 58,80% |
|
| conserved oligomeric golgi complex subunit 3 | 913 | 0,00E + 00 | 74,25% |
|
| hypothetical chloroplast rf2 | 813 | 2,70E − 18 | 50,30% |
|
| glypican- partial | 202 | 3,80E − 07 | 68,55% |
|
| glypican-5 isoform × 25 | 537 | 1,40E − 101 | 60,35% |
|
| endoplasmic reticulum-golgi intermediate compartment protein partial | 161 | 1,20E − 86 | 88,15% |
|
| endoplasmic reticulum-golgi intermediate compartment protein 3 isoform × 1 | 266 | 2,50E − 89 | 76,95% |
|
| progestin and adipoq receptor family member 9 | 354 | 4,10E − 74 | 59,30% |
|
| procollagen c-endopeptidase enhancer 2 | 389 | 3,10E − 64 | 52,75% |
|
| short transient receptor potential channel 1 | 497 | 0,00E + 00 | 84,20% |
|
| inhibitor of nuclear factor kappa-b kinase-interacting protein isoform × 1 | 254 | 5,30E − 04 | 47,44% |
|
| ninein-like protein | 366 | 4,70E − 12 | 48,70% |
|
| nucleoredoxin-like protein 2 | 128 | 2,00E − 15 | 55,45% |
|
| —NA— | 169 | — | — |
|
|
|
|
|
|
|
| ef-hand calcium-binding domain-containing protein 2 | 82 | 5,50E − 26 | 85,10% |
|
| —NA— | 70 | — | — |
|
| coiled-coil domain-containing protein 160 | 326 | 7,20E − 17 | 47,40% |
|
| low quality protein: wd repeat-containing protein 78 | 665 | 0,00E + 00 | 59,65% |
|
| growth hormone secretagogue receptor type 1 | 260 | 4,60E − 54 | 67,65% |
|
| fibronectin type iii domain-containing protein 3b | 843 | 2,00E − 152 | 50,75% |
|
| —NA— | 438 | — | — |
|
| fibronectin type iii domain-containing protein 3b | 94 | 4,80E − 08 | 70,71% |
|
| —NA— | 201 | — | — |
|
| —NA— | 423 | — | — |
|
| glycerol kinase 5 | 532 | 9,10E − 175 | 64,60% |
|
| zinc finger b-box domain-containing protein 1 | 236 | 1,70E − 14 | 42,65% |
|
|
|
|
|
|
|
| lysosome-associated membrane glycoprotein 1 | 380 | 3,30E − 32 | 40,05% |
|
| haus augmin-like complex subunit partial | 355 | 8,10E − 08 | 63,10% |
|
| —NA— | 225 | — | — |
|
| adp-ribosylation factor | 208 | 3,10E − 53 | 67,15% |
|
| rcc1 and btb domain-containing protein 1 | 531 | 0,00E + 00 | 83,15% |
|
| neuroligin-3 isoform × 4 | 656 | 0,00E + 00 | 66,25% |
|
| neuroligin- x-linked-like | 187 | 2,50E − 69 | 70,55% |
|
| neuroligin-2-like isoform × 5 | 164 | 2,50E − 57 | 84,95% |
|
| —NA— | 96 | — | — |
|
| protein ect2 isoform × 1 | 368 | 1,40E − 35 | 55,80% |
|
| hypothetical protein H310_04227 | 72 | 7,30E − 04 | 61% |
|
| protein ect2 isoform × 1 | 1002 | 5,30E − 101 | 73,35% |
|
| gpalpp motifs-containing protein 1 | 320 | 7,90E − 56 | 55,25% |
#Full details available in Table S2.
$—NA—– Not available.
*Used in Fig. 3, matching to syntenic data.
Summary of sequence conservation on the secondary structural element levels of HSP47/SERPINH1 proteins.
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||
| N-terminal segment | 0 | 8 | 9 | |||||
| hA | 3 | 4 | 7 |
| ||||
| s6B | 1 | 0 | 2 | N49 | S53 | |||
| hB | 3 | 4 | 4 | P54 | S56 | L61 | G67 | |
| hC | 2 | 3 | 2 | T72 | L80 | |||
| hD | 2 | 5 | 5 | |||||
| s2A | 0 | 6 | 3 | |||||
| hE | 1 | 5 | 4 | F130 | ||||
| s1A | 0 | 1 | 2 | |||||
| hF | 3 | 4 | 9 | F147 | I157 | N158 |
| T165 |
| Loop between hF/s3A | 1 | 1 | 2 | I169 | T180 | |||
| s3A | 5 | 4 | 8 |
| N186 | F190 | K191 |
|
| hF1 | 0 | 2 | 1 | |||||
| s4C | 2 | 0 | 3 | F198 | T203 | F208 | ||
| s3C | 5 | 4 | 2 | V218 | M220 | M221 | ||
| s1B | 1 | 0 | 1 | |||||
| s2B | 1 | 2 | 1 |
| ||||
| s3B | 0 | 1 | 1 |
| P255 | |||
| hG | 2 | 4 | 1 | |||||
| hH | 1 | 1 | 2 | |||||
| s2C | 3 | 2 | 1 | |||||
| s6A | 0 | 1 | 2 | P289 | K290 | |||
| hI | 4 | 0 | 2 | L299 | L303 | G307 | ||
| hI1 | 0 | 2 | 0 | |||||
| Loop between hI/s5A | 6 | 6 | 6 |
| A316 | L327 | ||
| s5A | 2 | 3 | 1 | H334 |
| |||
| s4A (RCL) | 3 | 4 | 10 | G344 |
| |||
| s1C | 3 | 0 | 0 | |||||
| s4B | 2 | 3 | 2 | P369 | F370 | |||
| s5B | 2 | 2 | 5 | L383 |
| G386 | ||
| C terminal end | 4 | 1 | 5 | P391 | ||||
$As proposed by Irving et al. (2000). !Bold – Missing.
%Underscore – Partially present (<50% of HSP47).
Figure 4Collagen-specific molecular chaperone HSP47/SERPINH1 has originated in lampreys, dating ~500 million years ago (MYA) and ray-finned fishes has 1–4 copies of this chaperone. (A) Characteristics of different HSP47/SERPINH1 as depicted with the Neighbor-Joining tree. Except for HSP47/SERPINH1 set I from selected ray-finned fishes (marked red). All HSP47/SERPINH1 genes have 4exon-3intron gene structures with variable syntenic organization and sequence identities. All HSP47/SERPINH1 proteins possess non-inhibitory RCL and ER-retention signals with some variations. (B) Homology model of ancestral HSP47/SERPINH1 from Japanese lamprey, illustrating non-inhibitory RCL and ER-retention signal as the HDEL motif. (C) Timescales for the Evolutionary history of HSP47/SERPINH1 depicts origin of HSP47 duplications (pink shade), dated between 416–360 MYA and intron insertion events for HSP47_1 (blue shade), dated between 231–190 MYA. Number of HSP47 is shown in bracket.