| Literature DB >> 23945933 |
Sean W Harshman1, Nicolas L Young, Mark R Parthun, Michael A Freitas.
Abstract
H1 and related linker histones are important both for maintenance of higher-order chromatin structure and for the regulation of gene expression. The biology of the linker histones is complex, as they are evolutionarily variable, exist in multiple isoforms and undergo a large variety of posttranslational modifications in their long, unstructured, NH2- and COOH-terminal tails. We review recent progress in understanding the structure, genetics and posttranslational modifications of linker histones, with an emphasis on the dynamic interactions of these proteins with DNA and transcriptional regulators. We also discuss various experimental challenges to the study of H1 and related proteins, including limitations of immunological reagents and practical difficulties in the analysis of posttranslational modifications by mass spectrometry.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23945933 PMCID: PMC3834806 DOI: 10.1093/nar/gkt700
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Histone H1 posttranslational modifications identified by mass spectrometry
| H1 variant | Length | Phosphorylation sites | Acetylation sites | Methylation sites | Ubiquitination sites | Formylation sites | References |
|---|---|---|---|---|---|---|---|
| 213 | S2, T4, | S2 | K34, K52, K64, K97, K106, K119, K168, K187 | K46, K64, K75, K85, K90, K97, K106 | K17, K34, K46, K63, K64, K75, K85, K90, K97, K160 | ( | |
| H1.3 | 221 | T4, | S2a, K17, K34, K46, K52, K63, K64, K85, K90, K97, K169 | K52, K64, K97, K106, K169 | K47, K65, K76, K86, K91, K98, K107 | K34, K46, K63, K64, K75, K85, K90, K97, K141, K160 | ( |
| H1.4 | 219 | S2, T4, | S2a, K17, K26, K34, K46, K52, K63, K64, K85, K90, K97, K169 | K26, K52, K64, K97, K106, K119, K148, K169 | K17, K21, K34, K46, K64, K75, K85, K90, K97, K106 | K17, K34, K46, K63, K64, K75, K85, K90, K97, K110, K140, K160 | ( |
| H1.5 | 226 | S2, T4, T11, | S2a, K17, K49, K88, K93, K109, K168, K209 | K27, K168, K169 | K67, K85, K88 | ( |
A list of the posttranslational modifications on the most common histone H1 variants (H1.2, H1.3, H1.4 and H1.5), as identified by mass spectrometry. Phosphorylation sites in bold are consensus CDK sites (S/T-P-X-K, where X is any amino acid).
aDenotes N-α-acetylation of the N-terminal residue after methionine removal.
Figure 1.A search of PubMed for primary research articles containing histone H1 or histone H3 in the title or abstract. Data show a steady-state low number (<100) of publications for histone H1, whereas histone H3 displays an increasing trend over time.
Figure 2.Sequence alignment of histone H1 variants. (A) Amino acid sequence alignment for the histone H1 variants H1.2, H1.3, H1.4 and H1.5. (B) Pairwise scores of sequence homology. The alignment shows a high homology between the human H1 variants.
Figure 3.An illustration of the MS-identified posttranslational modifications on histone H1.4. Asterisk denotes N-α-acetylation of the N-terminal residue after methionine removal.
Figure 4.A graphical representation of the tryptic peptide length (A) and a histogram of relative hydrophobicities (B) for histone H1.4, histone H4 and bovine serum albumin.
In silico tryptic digestion and relative peptide hydrophobicity
| Histone H1.4 | Histone H4 | Bovine serum albumin | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Position | Domain | Mass (Da) | Tryptic peptide sequence | Relative hydrophobicity | Position | Tryptic peptide sequence | Relative hydrophobicity | Position | Tryptic peptide sequence | Relative hydrophobicity |
| 1–17 | NTD | 1608.7817 | MSETAPAAPAAPAPAEK | 17.96 | 25–36 | DNIQGITKPAIR | 18.67 | 45–65 | GLVLIAFSQYLQQCPFDEHVK | 45.00 |
| 35–46 | Globular | 1197.6605 | ASGPPVSELITK | 23.86 | 81–92 | TVTAMDVVYALK | 34.21 | 3–19 | WVTFISLLLLFSSAYSR | 60.49 |
| 65–75 | Globular | 1106.5607 | ALAAAGYDVEK | 17.53 | 47–56 | ISGLIYEETR | 24.65 | 319–336 | DAIPENLPPLTADFAEDK | 36.72 |
| 111–119 | CTD | 857.4606 | AASGEAKPK | 4.9 | 69–78 | DAVTYTEHAK | 11.85 | 169–183 | HPYFYAPELLYYANK | 33.98 |
| 55–63 | Globular | 844.5018 | SGVSLAALK | 20.51 | 61–68 | VFLENVIR | 31.34 | 529–544 | LFTFHADICTLPDTEK | 32.14 |
| 98–106 | Globular | 810.3872 | GTGASGSFK | 10.76 | 97–103 | TLYGFGG | 23.94 | 508–523 | RPCFSALTPDETYVPK | 26.62 |
| 160–168 | CTD | 783.4602 | KPAAAAGAK | 2.81 | 1–4 | MSGR | 3.40 | 469–482 | MPCTEDYLSLILNR | 42.61 |
| 91–97 | Globular | 745.4334 | GTLVQTK | 12.49 | 57–60 | GVLK | 9.04 | 184–197 | YNGVFQECCQAEDK | 19.55 |
| 141–148 | CTD | 715.3864 | ATGAATPK | 1.04 | 22–24 | VLR | 267–280 | ECCHGDLLECADDR | 18.00 | |
| 130–136 | CTD | 641.3860 | KPAGAAK | 4.22 | 10–13 | GLGK | 4.91 | 347–359 | DAFLGSFLYEYSR | 44.12 |
| 47–52 | Globular | 545.3173 | AVAASK | 2.92 | 42–45 | GGVK | 3.97 | 139–151 | LKPDPNTLCDEFK | 22.23 |
| 203–207 | CTD | 543.3380 | TAKPK | 2.29 | 94–96 | QGR | 438–451 | VPQVSTPTLVEVSR | 25.49 | |
| 193–197 | CTD | 541.3588 | AVKPK | 1.77 | 38–40 | LAR | 387–399 | DDPHACYSTVFDK | 21.44 | |
| 86–90 | Globular | 532.3220 | SLVSK | 9.08 | 14–17 | GGAK | 3.66 | 421–433 | LGEYGFQNALIVR | 34.17 |
| 178–182 | CTD | 513.3275 | AAKPK | 2.27 | 19–20 | HR | 569–580 | TVMENFVAFVDK | 40.25 | |
| 198–202 | CTD | 513.3275 | AAKPK | 2.27 | 7–9 | GGK | 375–386 | EYEATLEECCAK | 19.43 | |
| 208–212 | CTD | 513.3275 | AAKPK | 2.27 | 5–6 | GK | 286–297 | YICDNQDTISSK | 17.06 | |
| 27–32 | NTD | 503.2703 | SAGAAK | 2.47 | 18 | R | 106–117 | ETYGDMADCCEK | 15.45 | |
| 76–79 | Globular | 489.2296 | NNSR | 2.31 | 37 | R | 89–100 | SLHTLFGDELCK | 26.51 | |
| 18–21 | NTD | 443.2744 | TPVK | 4.05 | 41 | R | 76–88 | TCVADESHAGCEK | 6.12 | |
| 82–85 | Globular | 429.2951 | LGLK | 11.37 | 46 | R | 402–412 | HLVDEPQNLIK | 18.84 | |
| 123–127 | CTD | 416.2383 | AGAAK | 2.88 | 79 | R | 361–371 | HPEYAVSVLLR | 25.00 | |
| 187–190 | CTD | 401.2274 | SPAK | 3.28 | 93 | R | 300–309 | ECCDKPLLEK | 13.08 | |
| 107–109 | Globular | 373.2325 | LNK | 21 | K | 66–75 | LVNELTEFAK | 28.92 | ||
| 137–139 | CTD | 371.2532 | KPK | 80 | K | 460–468 | CCTKPESER | 5.07 | ||
| 214–217 | CTD | 359.2168 | AAAK | 3.16 | 588–597 | EACFAVEGPK | 18.92 | |||
| 154–156 | CTD | 344.2060 | TPK | 499–507 | CCTESLVNR | 13.86 | ||||
| 172–174 | CTD | 330.1903 | SPK | 310–318 | SHCIAEVEK | 8.92 | ||||
| 184–186 | CTD | 314.1954 | APK | 549–557 | QTALVELLK | 31.48 | ||||
| 150–152 | CTD | 304.1747 | SAK | 413–420 | QNCDQFEK | 9.92 | ||||
| 53–54 | Globular | 303.1543 | ER | 598–607 | LVVSTQTALA | 22.03 | ||||
| 80–81 | Globular | 259.1896 | IK | 123–130 | NECFLSHK | 15.81 | ||||
| 24–25 | NTD | 245.1488 | AR | 37–44 | DLGEEHFK | 14.77 | ||||
| 120–121 | CTD | 217.1426 | AK | 161–167 | YLYEIAR | 25.57 | ||||
| 128–129 | CTD | 217.1426 | AK | 249–256 | AEFVEVTK | 18.96 | ||||
| 158–159 | CTD | 217.1426 | AK | 131–138 | DDSPDLPK | 14.11 | ||||
| 170–171 | CTD | 217.1426 | AK | 483–489 | LCVLHEK | 14.10 | ||||
| 176–177 | CTD | 217.1426 | AK | 562–568 | ATEEQLK | 8.95 | ||||
| 191–192 | CTD | 217.1426 | AK | 257–263 | LVTDLTK | 16.78 | ||||
| 33 | NTD | 174.1117 | R | 341–346 | NYQEAK | 4.02 | ||||
| 22 | NTD | 146.1055 | K | 581–587 | CCAADDK | 3.29 | ||||
| 23 | NTD | 146.1055 | K | 29–34 | SEIAHR | 4.83 | ||||
| 26 | NTD | 146.1055 | K | 212–218 | VLASSAR | 8.89 | ||||
| 34 | NTD | 146.1055 | K | 198–204 | GACLLPK | 17.86 | ||||
| 64 | Globular | 146.1055 | K | 236–241 | AWSVAR | 16.55 | ||||
| 110 | CTD | 146.1055 | K | 490–495 | TPVSEK | 3.58 | ||||
| 122 | CTD | 146.1055 | K | 118–122 | QEPER | 0.97 | ||||
| 140 | CTD | 146.1055 | K | 205–209 | IETMR | 11.47 | ||||
| 149 | CTD | 146.1055 | K | 223–228 | CASIQK | 6.48 | ||||
| 153 | CTD | 146.1055 | K | 524–528 | AFDEK | 8.64 | ||||
| 157 | CTD | 146.1055 | K | 101–105 | VASLR | 9.87 | ||||
| 169 | CTD | 146.1055 | K | 157–160 | FWGK | 19.96 | ||||
| 175 | CTD | 146.1055 | K | 281–285 | ADLAK | 8.03 | ||||
| 183 | CTD | 146.1055 | K | 558–561 | HKPK | 1.57 | ||||
| 213 | CTD | 146.1055 | K | 229–232 | FGER | 7.76 | ||||
| 218 | CTD | 146.1055 | K | 25–28 | DTHK | 2.24 | ||||
| 219 | CTD | 146.1055 | K | 20–23 | GVFR | 13.33 | ||||
| 242–245 | LSQK | 2.96 | ||||||||
| 337–340 | DVCK | 3.49 | ||||||||
| 152–155 | ADEK | 2.46 | ||||||||
| 434–436 | YTR | |||||||||
| 456–459 | VGTR | 3.32 | ||||||||
| 452–455 | SLGK | 4.66 | ||||||||
| 246–248 | FPK | |||||||||
| 545–547 | QIK | |||||||||
| 264–266 | VHK | |||||||||
| 496–498 | VTK | |||||||||
| 233–235 | ALK | |||||||||
| 372–374 | LAK | |||||||||
| 219–220 | QR | |||||||||
| 35–36 | FK | |||||||||
| 221-222 | LR | |||||||||
| 1–2 | MK | |||||||||
| 210–211 | EK | |||||||||
| 298–299 | LK | |||||||||
| 400–401 | LK | |||||||||
| 24 | R | |||||||||
| 168 | R | |||||||||
| 360 | R | |||||||||
| 156 | K | |||||||||
| 437 | K | |||||||||
| 548 | K | |||||||||
A list of the sequences, peptide lengths, domain, relative hydrophobicities and masses produced by an in silico digestion of histone H1.4 with trypsin. Histone H4 and BSA were also added for comparison.
V8 protease (Glu-C) in silico digestion and charge states
| Histone H1.4 | Histone H4 | Bovine serum albumin | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | Length | Charge (+) | Peptide amino acid composition | Sequence | Length | Charge (+) | Peptide amino acid composition | Sequence | Length | Charge (+) | Peptide amino acid composition |
| 116–219 | 104 | 44 | AKPKAKKAGAAKAKKPAGAA KKPKKATGAATPKKSAKKTP KKAKKPAAAAGAKKAKSPKK AKAAKPKKAPKSPAKAKAVK PKAAKPKTAKPKAAKPKKAA AKKK | 1–54 | 54 | 17 | MSGRGKGGKGLGKGGAKRHR KVLRDNIQGITKPAIRRLAR RGGVKRISGLIYEE | 383–419 | 37 | 5 | CCAKDDPHACYSTVFDKLKH LVDEPQNLIKQNCDQFE |
| 75–115 | 41 | 10 | KNNSRIKLGLKSLVSKGTLV QTKGTGASGSFKLNKKAASG E | 76–103 | 28 | 7 | HAKRKTVTAMDVVYALKRQG RTLYGFGG | 1–30 | 30 | 6 | MKWVTFISLLLLFSSAYSRG VFRRDTHKSE |
| 17–42 | 26 | 10 | KTPVKKKARKSAGAAKRKAS GPPVSE | 65–75 | 11 | 2 | NVIRDAVTYTE | 424–448 | 25 | 4 | YGFQNALIVRYTRKVPQVST PTLVE |
| 54–74 | 21 | 4 | RSGVSLAALKKALAAAGYDV E | 55–64 | 10 | 3 | TRGVLKVFLE | 125–149 | 25 | 4 | CFLSHKDDSPDLPKLKPDPN TLCDE |
| 4–16 | 13 | 1 | TAPAAPAAPAPAE | 276–300 | 25 | 5 | CADDRADLAKYICDNQDTIS SKLKE | ||||
| 43–53 | 11 | 3 | LITKAVAASKE | 42–62 | 21 | 2 | HFKGLVLIAFSQYLQQCPFD E | ||||
| 1–3 | 3 | 1 | MSE | 211–231 | 21 | 6 | KVLASSARQRLRCASIQKFG E | ||||
| 232–250 | 19 | 6 | RALKAWSVARLSQKFPKAE | ||||||||
| 528–543 | 16 | 2 | KLFTFHADICTLPDTE | ||||||||
| 449–465 | 17 | 3 | VSRSLGKVGTRCCTKPE | ||||||||
| 503–518 | 16 | 3 | SLVNRRPCFSALTPDE | ||||||||
| 474–488 | 15 | 2 | DYLSLILNRLCVLHE | ||||||||
| 573–588 | 16 | 1 | NFVAFVDKCCAADDKE | ||||||||
| 177–190 | 14 | 2 | LLYYANKYNGVFQE | ||||||||
| 254–267 | 14 | 4 | VTKLVTDLTKVHKE | ||||||||
| 165–176 | 12 | 3 | IARRHPYFYAPE | ||||||||
| 364–375 | 12 | 3 | YAVSVLLRLAKE | ||||||||
| 155–164 | 10 | 4 | KKFWGKYLYE | ||||||||
| 345–356 | 12 | 2 | AKDAFLGSFLYE | ||||||||
| 31–41 | 11 | 3 | IAHRFKDLGEE | ||||||||
| 555–565 | 11 | 4 | LLKHKPKATEE | ||||||||
| 544–554 | 13 | 4 | KQIKKQTALVE | ||||||||
| 595–607 | 10 | 2 | GPKLVVSTQTALA | ||||||||
| 335–344 | 11 | 3 | DKDVCKNYQE | ||||||||
| 324–334 | 11 | 1 | NLPPLTADFAE | ||||||||
| 196–206 | 10 | 3 | DKGACLLPKIE | ||||||||
| 88–97 | 10 | 2 | KSLHTLFGDE | ||||||||
| 107–116 | 9 | 1 | TYGDMADCCE | ||||||||
| 519–527 | 9 | 2 | TYVPKAFDE | ||||||||
| 98–106 | 9 | 3 | LCKVASLRE | ||||||||
| 73–81 | 7 | 2 | FAKTCVADE | ||||||||
| 357–363 | 8 | 3 | YSRRHPE | ||||||||
| 301–308 | 8 | 2 | CCDKPLLE | ||||||||
| 495–502 | 8 | 3 | KVTKCCTE | ||||||||
| 268–275 | 7 | 1 | CCHGDLLE | ||||||||
| 566–572 | 7 | 2 | QLKTVME | ||||||||
| 63–69 | 7 | 2 | HVKLVNE | ||||||||
| 309–315 | 6 | 2 | KSHCIAE | ||||||||
| 468–473 | 6 | 2 | RMPCTE | ||||||||
| 318–323 | 6 | 2 | KDAIPE | ||||||||
| 489–494 | 6 | 2 | KTPVSE | ||||||||
| 589–594 | 5 | 1 | ACFAVE | ||||||||
| 117––121 | 5 | 2 | KQEPE | ||||||||
| 150–154 | 6 | 2 | FKADE | ||||||||
| 82–87 | 5 | 1 | SHAGCE | ||||||||
| 378–382 | 5 | 1 | ATLEE | ||||||||
| 191–195 | 4 | 1 | CCQAE | ||||||||
| 207–210 | 4 | 2 | TMRE | ||||||||
| 420–423 | 3 | 2 | KLGE | ||||||||
| 122–124 | 3 | 2 | RNE | ||||||||
| 251–253 | 3 | 1 | FVE | ||||||||
| 70–72 | 3 | 1 | LTE | ||||||||
| 376–377 | 2 | 1 | YE | ||||||||
| 316–317 | 2 | 1 | VE | ||||||||
| 466–467 | 2 | 1 | SE | ||||||||
A list of peptide lengths, charge and sequences produced by an in silico digestion of histone H1.4, histone H4 and BSA with V8 protease (Glu-C).