| Literature DB >> 26527411 |
Abstract
Homologues of calpain, often thought to be an essential, cytoplasmic, calcium-dependent cysteine endopeptidase found exclusively in eukaryotes, have been found in bacterial proteomes. The homologues lack calcium-binding sites, have differing domain architectures, and can be secreted or membrane-associated. Homologues are rare and occur in a minority of bacterial phyla and often in a minority of species in a genus. However, the differences in domain architecture argue against a recent, horizontal gene transfer from a eukaryote. From analysis of a phylogenetic tree and absence of homologues in archaea, calpains in eukaryotes may be derived from genes horizontally transferred from a bacterium.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26527411 PMCID: PMC4631099 DOI: 10.1186/s13062-015-0095-0
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 1Phylogenetic tree for selected members of the calpain family. The tree generated by FastTree is shown. Coloured branches indicate significant bootstrap values where light green is 100 %, dark green is >90 %, brown is >85 % and red >80 %. Tips are labelled with UniProt identifiers (where possible). Coloured dots indicate classification, additional domains and non-peptidase homologues, where brown is Bacteria, green is a signal peptide, pink is a transmembrane region, red is a CIII domain, yellow is an EF-hand structure domain, blue is a zinc-finger domain, orange is a C2 domain, turquoise is another additional domain, and black is a non-peptidase homologue. For the key to sequences, see Additional file 4: Table S2
Fig. 2Representative domain architectures for calpain homologues. Domain architecture diagrams were downloaded from the Pfam website (http://pfam.xfam.org). Diagrams are shown in the same order as they occur in Fig. 1, starting with homologues from bacteria and proceeding in a clockwise direction. The UniProt identifier for each protein is shown on the left. In each domain architecture diagram sequence length is indicated by the horizontal grey line (the “string”): sequence lengths are approximately proportional to one another. Domains are shown in the order they appear in the sequence as “beads on a string”. A domain with rounded ends is complete; a domain with a ragged end matches only a fragment of the complete domain as defined in Pfam. All calpain domains are shown in green and labelled “peptidase C2”; other domains are shown in other colours. Small rectangles indicate transmembrane regions (pink), disordered regions (grey), regions of low complexity (pale blue), and regions of coiled-coil (green). Blue “lollipops” at the top edge of a domain indicate active site residues (in the order Cys, His, Asn). Carbohydrate attachment sites are shown as square-topped lollipops at the bottom edge of a domain (where known). Key to sequences (unp = unnamed protein, domains are shown as strings with each domain separated by a tilde, pC2 = calpain domain): E5XTU0_9ACTO, unp (Segniliparus rugosus strain ATCC BAA-974), pC2; E5C759_9BACE, unp (Bacteroides sp. D2), pC2; Q7NDU2_GLOVI, Glr4140 protein (Gloeobacter violaceus strain PCC 7421), HemolysinCabind x 8 ~ pC2 Q8YZT8_NOSS1, Alr0369 protein (Nostoc sp. strain PCC 7120), PPC ~ pC2; K1Q056_CRAGI, calpain-A (Crassostrea gigas (Pacific oyster)), pC2 ~ Calpain III x 2 ~ EF hand 8; K9TN70_9CYAN, unp (Oscillatoria acuminata strain PCC 6304), DUF4114 ~ pC2; N9TPR9_ENTHI, unp (Entamoeba histolytica strain HM-1:IMSS-A), pC2 ~ LIM; ADGB_HUMAN, androglobin (human), pC2; I1CS59_RHIO9, unp (Rhizopus delemar), MIT ~ pC2; R7V270_CAPTE, unp (Capitella teleta), MIT x 2 ~ pC2 ~ Calpain III x 2; G4LXK3_SCHMA, calpain-7 (Schistosoma mansoni), UCH ~ MIT ~ pC2 ~ Calpain III; CAN7_HUMAN, calpain-7 (human), MIT ~ pC2 ~ Calpain III; D2VVS6_NAEGR, unp (Naegleria gruberi), TPR 1 ~ TPR 11 ~ pC2 ~ Calpain III; D2V3V2_NAEGR, unp (Naegleria gruberi), TPR 8 ~ TPR 11 ~ pC2; L1J1Q4_GUITH, unp (Guillardia theta strain CCMP2712), pC2 ~ Calpain III x 3; L1J739_GUITH, unp (Guillardia theta strain CCMP2712), pC2 ~ Calpain III ~ pC2 ~ Calpain III; L1J0U9_GUITH, unp (Guillardia theta strain CCMP2712), pC2 x 2; Q012Z6_OSTTA, CG1391-PB, isoform B (Ostreococcus tauri), UBA 4 ~ zf GRF ~ pC2; C1MTG3_MICPC, unp (Micromonas pusilla strain CCMP1545), UBA 4 ~ zf GRF ~ pC2~; CAN15_HUMAN, calpain-15 (human), zf RANBP x 3 ~ pC2; G4TER7_PIRID, unp (Piriformospora indica strain DSM 11827), pC2 ~ ZZ x 2; D6W573_HUMAN, HCG1988128 isoform CRA_b (human), pC2 ~ Calpain III; CAN13_HUMAN, calpain-13 (human), pC2 ~ Calpain III; K1PHX3_CRAGI, calpain-9 (Crassostrea gigas (Pacific oyster)), Pkinase x 2 ~ pC2 ~ Calpain III; CAN6_HUMAN, calpain-6 (human), pC2 ~ Calpain III ~ C2; CAN5_HUMAN, calpain-5 (human), pC2 ~ Calpain III ~ C2; CAN10_HUMAN, calpain-10 (human), pC2 ~ Calpain III x 2; X1Y3H7_ANODA, unp (Anopheles darlingi), Pkinase Tyr ~ pC2 ~ Calpain III ~ pC2 ~ Calpain III; CAN12_HUMAN, calpain-12 (human), pC2 ~ Calpain III; CAN3_HUMAN, calpain-3 (human), pC2 ~ Calpain III ~ Calpain u2 ~ EF-hand 5; CAN9_HUMAN, calpain-9 (human), pC2 ~ Calpain III; CAN11_HUMAN, calpain-11 (human), pC2 ~ Calpain III; CAN8_HUMAN, calpain-8 (human), pC2 ~ Calpain III; CAN2_HUMAN, calpain-2 (human), pC2 ~ Calpain III ~ EF-hand 8; CAN1_HUMAN, calpain-1 (human), pC2 ~ Calpain III ~ EF-hand 8
Human calpain genes
| Gene | Chromosome location | Segment ID | Paralogy group | Duplication product |
|---|---|---|---|---|
| ADGB | 6q24.3 | 6.2 | 4 | C |
| CANP7 | 3p25.1 | 3.1 | 13 | C |
| CANP15 | 16p13.3 | 16.1 | 15 | B |
| CANP14 | 2p23-21 | 2.2 | 11 | A |
| CANP13 | 2p21-22 | 2.2 | 11 | A |
| CANP6 | Xq23 | X.7 | 9, 13 | A, B |
| CANP5 | 11q13.5 | 11.7 | 9 | C |
| CANP10 | 2q37.3 | 2.9 | 10 | A |
| CANP12 | 19q13.2 | 19.2 | 1, 2 | C, D |
| CANP3 | 15q15.1 | 15.1 | 11 | B |
| CANP9 | 1q42.2 | 1.16 | 11 | A |
| CANP11 | 6p21.1 | 6.4-5 | 11 | D |
| CANP8 | 1q41 | 1.16 | 11 | A |
| CANP2 | 1q41 | 1.16 | 11 | A |
| CANP1 | 11q13.1 | 11.4-5 | 11, 17 | C, D |
Genes for human calpains are shown in divergence order with the gene derived from the most ancient divergence first. Segment IDs and paralogy groups are taken from Supplementary Table 14 from Putnam et al., [50]. Note that the chromosome locations for the CANP6, CANP12 and CANP1 genes overlap more than one paralogy group