Alexander G Martynov, Elena N Elpidina, Lindsey Perkin, Brenda Oppert.
Abstract
BACKGROUND: Larvae of the tenebrionids Tenebrio molitor and Tribolium castaneum have highly compartmentalized guts, with primarily cysteine peptidases in the acidic anterior midgut that contribute to the early stages of protein digestion.
Year: 2015 PMID: 25757364 PMCID: PMC4336737 DOI: 10.1186/s12864-015-1306-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Predicted cysteine cathepsin genes (B, L, O, K, and F) in the T. castaneum genome, and relative expression levels in the larval gut, as estimated by transcriptome and microarray data
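| Protein accession | Gene ID | Tc ID¹ | Chromosome | Transcriptome expression | Microarray rank² | Active site residues³ | Annotation |
|---|---|---|---|---|---|---|---|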
| NP_001164001 | LOC659441 | 11001 | 10 | 77,228.22 | 82 | QCHN | cathepsin L |
| NP_001164314 | LOC659502 | 11000 | 10 | 25,848.86 | 14 | QCHN | cathepsin L |
| XP_970644 | LOC659226 | 11003 | 10 | 42.77 | 7 | QCHN | cathepsin L |
| XP_970773* | LOC659367 | 11002 | 10 | 35.06 | 2 | ESHN | cathepsin L homolog |
| XP_970951 | LOC659565 | 10999 | 10 | 0.28 | 2 | QCHN | cathepsin L |
| XP_971698 | LOC660368 | 09365 | 7 | 2,387.62 | 44 | QCHN | cathepsin L |
| XP_971867 | LOC660551 | 09362 | 7 | 1.99 | 1 | QCHN | cathepsin L |
| XP_971752 | LOC660428 | 09364 | 7 | 0.98 | 1 | QCHN | cathepsin L |
| XP_971975 | LOC660669 | 09448 | 7 | 0.02 | 2 | QCHN | cathepsin L |
| NC_007422 | LOC660491 | pseudogene? | 7 | - | NOC | - | |
| XP_974298 | LOC663145 | 02952 | 3 | 3,142.15 | 43 | QCHN HH | cathepsin B |
| NP_001164205 | LOC663117 | 02953 | 3 | 1,132.96 | 46 | QCHN HH | cathepsin B |
| XP_974244 | LOC663090 | 02954 | 3 | 248.71 | 9 | QCHN HH | cathepsin B |
| XP_974220 | LOC663066 | 02955 | 3 | 79.91 | 2 | QCHN | cathepsin B-like |
| XP_966750 | LOC655148 | 05431 | 8 | 443.27 | 3 | QCHN | cathepsin B-like |
| XP_966663 | LOC655077 | 05432 | 8 | 1.18 | 3 | QCHN | cathepsin B-like |
| XP_968689* | LOC657117 | 05954 | 8 | 57.60 | NOC | QSTN | cathepsin B homolog |
| XP_968767 | LOC657203 | 05953 | 8 | 82.17 | 3 | QCHN | cathepsin B-like |
| XP_008196467⁴ | LOC656957 | 05955/05956 | 8 | - | NOC | QCHN | cathepsin B-like |
| XP_008196465⁴ | LOC657038 | --- | 8 | - | NOC | QCHN | cathepsin B-like |
| NP_001164088 | LOC663234 (26-29-p) | --- (09486) | 7 | 1,309.16 | NOC | QCHN | cathepsin L |
| XP_969833 | LOC658343 | 02843 | 3 | 0.02 | 1 | QCHN | cathepsin L |
| XP_967834* | LOC656198 | 09217 | 7 | 28.64 | 1 | QSHN | cathepsin B homolog |
| XP_970512 | LOC659087 | 07214 | 4 | 11.99 | NOC | QCHN | cathepsin O |
| XP_973607 | LOC662417 | --- | 7 | 2.32 | NOC | QCHN | cathepsin F |
| XP_001814509 | LOC100141668 | 13582 | 1 (X) | 0 | 1 | QCHN | cathepsin K |
¹ From [1]. Tc09363 and Tc01950 were in the original annotation but have been removed from the annotations of cysteine cathepsins; Tc09486 was missed in the original annotation.
² As defined in [21], from microarray gene expression data from larval gut tissue (higher ranks = higher expression); NOC: not on chip.
³ Active site residues, including those in the occluding loop [55].
⁴ Changed in the Tcas4 genome build; listed as a pseudogene in Tcas3, and no expression values available.
⁵ Now annotated by NCBI as tubulointerstitial nephritis antigen-like.
⁶ Changed in the Tcas4 genome build.
* Predicted homologs, based on the lack of sequence conservation in active site residues.
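The homolog flag in the table can be read directly off the active-site column: entries whose residues depart from the conserved QCHN pattern are the ones marked as predicted homologs. The following is a minimal Python sketch of that reading, not the authors' annotation procedure. The function name and the returned labels are hypothetical, and it assumes that a trailing `HH` in the column denotes the two occluding-loop histidines.

```python
# Conserved catalytic residues as they appear in the table column
# (Gln, Cys, His, Asn).
CONSERVED_CORE = "QCHN"


def classify_active_site(motif: str) -> str:
    """Label a string taken from the 'active site residues' column.

    Hypothetical helper for illustration only; the labels are not
    terminology from the paper.
    """
    parts = motif.split()
    if not parts or parts[0] == "-":
        return "no data"                      # e.g. the pseudogene row
    if parts[0] != CONSERVED_CORE:
        return "predicted homolog"            # e.g. ESHN, QSTN, QSHN
    if parts[1:] == ["HH"]:
        # Assumption: 'HH' marks the occluding-loop histidines.
        return "conserved, with occluding-loop His-His"
    return "conserved"


# A few rows copied from the table above.
rows = [
    ("NP_001164001", "QCHN"),
    ("XP_970773", "ESHN"),
    ("XP_974298", "QCHN HH"),
    ("XP_968689", "QSTN"),
    ("NC_007422", "-"),
]
for accession, motif in rows:
    print(accession, classify_active_site(motif))
```

Applied to the rows above, the sketch flags the same three starred entries (XP_970773, XP_968689, XP_967834) that the table marks as homologs.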
Predicted cysteine cathepsin genes in the T. molitor genome, and relative expression levels in the larval gut, as estimated by transcriptome data
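| Name | Accession¹ | Corresponding published sequences | Expression estimate 1 | Expression estimate 2 | Active site residues⁷ | Annotation |
|---|---|---|---|---|---|---|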
| TmL13 | KP303287 | AM4-22 (ABC88769, 99%), AM3-32 (ABC88768, 99%)²; TmCysII, TmCysIII³; ppCal3 (AAP94048)⁴; 3QT4⁵; Cont-08897, Bt-07583⁵ | 19,726.5 | 8,496.6 | QCHN | cathepsin L |
| TmL5 | KP303279 | ppCAL2 (AAR05023, 97%)⁴; 3QJ3⁵; Cont-01354, Bt-07528⁶ | 1,356.6 | 572.7 | QCHN | cathepsin L |
| TmL11 | KP303285 | Cont-00009, Bt-01497⁶ | 1,149.4 | 354.9 | QCHN | cathepsin L |
| TmL2 | KP303276 | ppCAL1a,b,c (AAP94046, 100%)⁴; Cont-09057, Bt-00111⁵ | 337.3 | 263.5 | QCHN | cathepsin L |
| TmL4 | KP303278 | | 326.9 | 168.9 | QCHN | cathepsin L |
| TmL1 | KP303275 | | 162.2 | 113.9 | QCHN | cathepsin L |
| TmL30* | KP303289 | | 130.3 | 104.4 | ESHN | cathepsin L homolog |
| TmL29* | KP303289 | | 130.3 | 104.4 | QAHN | cathepsin L homolog |
| TmL3 | KP303277 | AAP94047 (91%)³ | 62.1 | 31.3 | QCHN | cathepsin L |
| TmL9 | KP303283 | | 72.0 | 46.0 | QCHN | cathepsin L |
| TmL7 | KP303281 | | 25.3 | 11.7 | QCHN | cathepsin L |
| TmL6 | KP303280 | Bt-07886 | 15.8 | 5.8 | QCHN | cathepsin L |
| TmL8 | KP303282 | | 5.7 | 3.7 | QCHN | cathepsin L |
| TmL15 | KP303288 | | 0.2 | 0.2 | QCHN | cathepsin L |
| TmB33 | KP303302 | AM4-18 (ABC88766, 98%)²; TmCysII³; Cont-09310, Bt-00249⁵ | 2,489.6 | 1,160.4 | QCSN HH | cathepsin B |
| TmB20 | KP303293 | AM3-87 (ABC88767, 99%)²; Cont-00890⁶ | 448.4 | 221.5 | QCHN | cathepsin B-like |
| TmB25 | KP303297 | | 672.6 | 296.7 | QCHN | cathepsin B-like |
| TmB26 | KP303298 | Cont-08975, Bt-08237⁶ | 657.5 | 431.6 | QCHN | cathepsin B-like |
| TmB18 | KP303291 | | 283.2 | 175.8 | QCHN HH | cathepsin B |
| TmB17 | KP303290 | Cont-00240, Bt-01453⁶ | 163.9 | 99.0 | QCHN HH | cathepsin B |
| TmB32* | KP303301 | | 77.9 | 37.5 | QSHN | cathepsin B homolog |
| TmB23 | KP303295 | | 48.8 | 29.3 | QCHN | cathepsin B-like |
| TmB19 | KP303292 | | 34.2 | 24.9 | QCHN | cathepsin B-like |
| TmB27 | KP303299 | | 26.5 | 13.9 | QCHN | cathepsin B-like |
| TmB24 | KP303296 | | 20.0 | 6.5 | QCHN | cathepsin B-like |
| TmB28 | KP303300 | | 4.0 | 2.2 | QCHN | cathepsin B-like |
| TmB22 | KP303294 | | 1.2 | 0.7 | QCHN | cathepsin B-like |
| TmO12 | KP303286 | | 38.4 | 22.2 | QCHN | cathepsin O |
| TmF10 | KP303284 | | 23.4 | 8.4 | QCHN | cathepsin F |
¹ Accession numbers are for predicted mRNA.
² [16].
³ [14,18].
⁴ [19].
⁵ [41].
⁶ [27].
⁷ Active site residues, including those in the occluding loop [55].
* Predicted homologs, based on the lack of sequence conservation in active site residues.
Figure 1. Cladogram of B, L, O, and F cysteine cathepsins from T. molitor (TmL_, TmB_, Tm_O, Tm_F) and T. castaneum (TcL_, TcB_, Tc_O, Tc_F), papain from Carica papaya (AAB02650), and human cathepsin L (HsCathL1, NP_001903; HsCathL2, AAI10513) or B (HsCathB, AAH10240), with cathepsin K from T. castaneum (TcK_XP_001814509) as the outgroup sequence. Clades corresponding to chromosomes in T. castaneum, as well as cathepsin groups (B or L), are indicated.
Comparison of putative cathepsin L orthologs in T. molitor and T. castaneum, and comparison of key residues to those in human cathepsin L1 (NP_001903) and L2 (AAI10513)
| T. molitor | Key residues¹ | | | | | T. castaneum | Key residues¹ | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Cathepsin L1 human | GG | LM | A | M | A | | GG | LM | A | M | A |
| Cathepsin L2 human | GG | FM | A | L | A | | GG | FM | A | L | A |
| TmL13 | GG | WM | A | L | A | NP_001164001 | GG | WM | A | L | A |
| TmL1 | GG | YM | A | L | Q | NP_001164314 | GG | WM | A | L | A |
| TmL2 | GG | LM | A | L | E | XP_970644 | GG | LM | A | L | Q |
| TmL3 | GG | LM | A | L | E | XP_970951 | AG | LM | A | V | Q |
| TmL29* | GG | LT | A | L | S | XP_970773* | GG | HA | T | L | S |
| TmL30* | GG | SI | A | L | D | | | | | | |
| TmL4 | GG | WM | A | F | K | | | | | | |
| TmL5 | GG | WM | A | F | V | XP_971698 | GG | WM | A | F | K |
| TmL6 | GG | WM | A | F | K | XP_971867 | GG | WM | A | F | K |
| TmL15 | GG | WM | A | F | Q | XP_971752 | GG | YL | S | K | R |
| | | | | | | XP_971975 | GG | WM | A | L | H |
| | | | | | | XP_969833 | GG | WI | A | L | H |
| TmL11 | GG | ED | G | L | T | NP_001164088 | GG | ED | A | L | T |
| TmL7 | MQ | LD | T | F | I | | | | | | |
| TmL8 | MQ | LD | T | F | R | | | | | | |
| TmL9 | LE | ME | I | Y | Y | | | | | | |
| TmF10 | GG | LM | A | L | P | XP_008195656 | GG | LM | A | L | P |
| TmO12 | GG | DV | A | L | E | XP_970512 | GG | DI | A | L | E |
| | | | | | | XP_001814509 | GG | SL | S | V | Y |
¹ Papain numbering.
* Predicted homolog.
Comparison of putative cathepsin B orthologs in T. molitor and T. castaneum, and comparison of key residues to those in human cathepsin B (P07858)
| T. molitor | Key residues¹ | | | | | T. castaneum | Key residues¹ | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Cathepsin B human | GG | YP | A | G | E | | GG | YP | A | G | E |
| TmB33 | GG | WP | D | G | D | XP_974298 | GG | WP | D | G | D |
| TmB18 | GG | YP | S | G | D | NP_001164205 | GG | MP | S | G | G |
| TmB17 | GG | FP | A | G | E | XP_974220 | GG | FP | A | G | S |
| TmB20 | GG | YM | N | G | Y | XP_974244 | GG | YM | S | G | N |
| TmB19 | GG | YI | G | G | Y | | | | | | |
| TmB22 | GG | YM | G | G | N | | | | | | |
| TmB23 | GG | YV | T | G | Y | | | | | | |
| TmB24 | GG | AP | N | G | N | XP_966663 | GG | YS | S | G | N |
| TmB25 | GG | WP | S | G | N | XP_966750 | GG | AP | H | G | Y |
| TmB27 | GG | WM | A | F | Q | | | | | | |
| TmB26 | GG | SS | S | G | N | | | | | | |
| | | | | | | XP_008196467 | GG | YO | Y | G | E |
| | | | | | | XP_008196465 | GG | YT | T | X | E |
| | | | | | | XP_968767 | GG | YS | G | G | S |
| | | | | | | XP_968689* | SG | YT | A | G | S |
| TmB28* | SG | SS | I | S | H | | | | | | |
| TmB32* | GG | YL | T | G | F | XP_008195382* | GG | YL | T | G | F |
¹ Papain numbering.
* Predicted homolog.
Figure 2. Predicted structure of TmB19, an atypical B-like peptidase from T. molitor, obtained by 3D modeling. The active site (residues Gln-24, Cys-30, His-187, Asn-207) is shown in dark blue. His-109 (purple) and Ile-110 and Asn-111 (green) lie in the short occluding loop, which is marked in light green.
Binding energy of peptide substrates in models of the active site of T. molitor cathepsins, compared to that of human cathepsin L1 (3OF8), using the substrates FRF and LRF
| Peptidase | FRF | LRF |
|---|---|---|
| Human cathepsin L1 | −7.7 | −6.1 |
| TmL2 | −7.6 | - |
| TmL5 | −7.1 | −3.3 |
| TmL7 | - | - |
| TmL8 | - | - |
| TmL13 | −6.9 | −5.5 |
| TmB18 | −7.6 | −3.9 |
| TmB33 | −8.7 | −6.6 |
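To make the comparison with the human enzyme explicit, the tabulated values can be expressed as differences from human cathepsin L1. The Python sketch below is illustrative only. It assumes the two value columns are FRF and LRF in the order given in the caption, and it treats a `-` entry as a missing value (`None`). The helper name is hypothetical, and no unit is attached because this section does not state one.

```python
from typing import Optional

# Values copied from the table; None stands for a "-" entry.
# Assumption: first value = FRF, second value = LRF (caption order).
ENERGIES = {
    "Human cathepsin L1": (-7.7, -6.1),
    "TmL2": (-7.6, None),
    "TmL5": (-7.1, -3.3),
    "TmL7": (None, None),
    "TmL8": (None, None),
    "TmL13": (-6.9, -5.5),
    "TmB18": (-7.6, -3.9),
    "TmB33": (-8.7, -6.6),
}
SUBSTRATES = ("FRF", "LRF")
REFERENCE = "Human cathepsin L1"


def delta_vs_reference(name: str, substrate: str) -> Optional[float]:
    """Binding energy of `name` minus that of the human reference.

    Returns None when either value is missing from the table.
    """
    col = SUBSTRATES.index(substrate)
    value = ENERGIES[name][col]
    ref = ENERGIES[REFERENCE][col]
    if value is None or ref is None:
        return None
    # Round to the one-decimal precision used in the table.
    return round(value - ref, 1)


for name in ENERGIES:
    if name != REFERENCE:
        print(name, [delta_vs_reference(name, s) for s in SUBSTRATES])
```

Under the usual docking convention that a more negative score indicates stronger predicted binding (an assumption here, since the section does not say so), a negative difference means tighter predicted binding than in human cathepsin L1. In this table only TmB33 shows that for both substrates.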