| Literature DB >> 22650316 |
Paul B Talbert1, Kami Ahmad, Geneviève Almouzni, Juan Ausió, Frederic Berger, Prem L Bhalla, William M Bonner, W Zacheus Cande, Brian P Chadwick, Simon W L Chan, George A M Cross, Liwang Cui, Stefan I Dimitrov, Detlef Doenecke, José M Eirin-López, Martin A Gorovsky, Sandra B Hake, Barbara A Hamkalo, Sarah Holec, Steven E Jacobsen, Kinga Kamieniarz, Saadi Khochbin, Andreas G Ladurner, David Landsman, John A Latham, Benjamin Loppin, Harmit S Malik, William F Marzluff, John R Pehrson, Jan Postberg, Robert Schneider, Mohan B Singh, M Mitchell Smith, Eric Thompson, Maria-Elena Torres-Padilla, David John Tremethick, Bryan M Turner, Jakob Harm Waterborg, Heike Wollmann, Ramesh Yelagandula, Bing Zhu, Steven Henikoff.
Abstract
Histone variants are non-allelic protein isoforms that play key roles in diversifying chromatin structure. The known number of such variants has greatly increased in recent years, but the lack of naming conventions for them has led to a variety of naming styles, multiple synonyms and misleading homographs that obscure variant relationships and complicate database searches. We propose here a unified nomenclature for variants of all five classes of histones that uses consistent but flexible naming conventions to produce names that are informative and readily searchable. The nomenclature builds on historical usage and incorporates phylogenetic relationships, which are strong predictors of structure and function. A key feature is the consistent use of punctuation to represent phylogenetic divergence, making explicit the relationships among variant subtypes that have previously been implicit or unclear. We recommend that by default new histone variants be named with organism-specific paralog-number suffixes that lack phylogenetic implication, while letter suffixes be reserved for structurally distinct clades of variants. For clarity and searchability, we encourage the use of descriptors that are separate from the phylogeny-based variant name to indicate developmental and other properties of variants that may be independent of structure.Entities:
Year: 2012 PMID: 22650316 PMCID: PMC3380720 DOI: 10.1186/1756-8935-5-7
Source DB: PubMed Journal: Epigenetics Chromatin ISSN: 1756-8935 Impact factor: 4.954
Summary of nomenclature guidelines
| Core histone name | Use in an inclusive sense for the protein family. Specify subgroups with a descriptor, prefix, letter suffix, or number suffix. | ‘H2A can be ubiquitylated.’ ‘H3 can be methylated on K4.’ |
| Capitalization | Upper and lower case are equivalent in meaning, but upper case is preferred for designating core histones, their suffixes, and modifiable amino acids. Use lowercase for modifications and for prefixes. | H3.3K4me3, H2BK123ub1, cenH3 |
| Descriptors | Descriptors can be used before the core histone name to specify a feature, group variants developmentally or functionally, indicate the species of origin, or other uses. There should be a space between the descriptor and the core histone name. There is no requirement that a descriptor specifies a clade. | RC H2A, early H4, testis-specific H3.4 or TS H3.4, |
| Prefixes | These should be few in number and specify a structurally distinct clade of a core histone that is universal or characteristic of a high-level taxonomic clade. Lower case is preferred for prefixes. | macroH2A, cenH3, subH2B |
| Letter suffixes | These should be preceded by a period (.) and specify a structurally distinct monophyletic clade of a histone family (exception: H2A.X). A suffix may be applied judiciously at any taxon level. | H2A.Z, H3.X, H2A.B |
| Number suffixes | These should be preceded by a period (.) and specify a particular variant of a core histone, without constraint as to distinctiveness and without implication as to phylogeny. Number suffixes should be assumed to be species-specific, but it is convenient to name variants in related species consistently where unique orthologies are clear. A number suffix should be the default designation of new variants. | H3.5, H2A.1, macroH2A.2, H1.0 |
| Punctuation | Use a period (.) after core histone names to indicate a subtype (letter or number suffix). Use additional periods as necessary to separate finer divisions of subtypes. A period is equivalent to a branch point in a phylogenetic tree. | H2A.Z.1, H2A.L.1 |
| Splice variants | Use a period (.) before a splice variant number. Treat the same as paralog number suffixes, except that a lowercase ‘s’ may precede the number to indicate that the isoform is a splice variant. | macroH2A.1.2, H2A.Z.s3 |
| Synonyms | For names changed by this nomenclature, refer to both old and new synonyms in the abstract of papers to facilitate literature searches. Optional descriptors can aid identification. | ‘Avian H1.0, also known as H5’ |
Figure 1Unrooted H2A phylogeny. H2A.Z is a monophyletic clade present in all eukaryotes, while macroH2A (mH2A) is restricted to animals and H2A.B (H2A.Bbd) and H2A.L (H2AL) are confined to mammals. Paraphyletic or polyphyetic H2A.X and replication-coupled H2As have diverged repeatedly. Alignments and trees constructed using default ClustalW parameters and displayed using Dendroscope [45].
Figure 2Unrooted H2B phylogeny. TS H2B.1 (TH2B), H2B.W (H2BFWT) and subH2B (SubH2Bv) are mammal-specific clades. Highly divergent generative cell H2Bs in plants do not form a clear clade. Apicomplexan H2Bv does not appear to be related to trypanosome H2BV, despite the fact that both are thought to interact with H2A.Z.
Specific name change suggestions for histones
| H2A (with SPKK motifs) | plants | H2A.W |
| H2A.Bbd | mammals | H2A.B |
| H2Abd1, H2Abd2, H2Abd | bdelloid rotifers | (bdelloid) H2A.1 |
| H2AL | mammals | H2A.L |
| H2Av, H2AvD, D2, hv1, Htz1p | H2A.Z | |
| SubH2Bv | mammals | subH2B |
| H2BL1 | mammals | subH2B |
| H2Bv | apicomplexans | H2B.Z |
| H2BV | trypanosomes | H2B.V |
| H2BFWT | mammals | H2B.W |
| TH2B, hTSH2B | mammals | (TS) H2B.1 |
| H3(P) | H3.P | |
| H3t | mammals | (TS) H3.4 |
| H3v1 to H3v10 | H3.1 to H3.10 | |
| H3V | trypanosomes | H3.V |
| H3.X | human | H3.Y.2 |
| H3.Y | human | H3.Y.1 |
| H4V | trypanosomes | H4.V |
| H1° | animals | H1.0 |
| H5 | birds | H1.0 |
| H1δ | echinoderms | H1.0 |
| H1t | mammals | (TS) H1.6 |
| H1T2 | mammals | (TS) H1.7 |
| H1oo | mammals | (OO) H1.8 |
| Hils1 | mammals | (TS) H1.9 |
| H1x | vertebrates | H1.10 |
| B4 | frogs | (Amphibian) H1.4 |
Parentheses ( ) indicate an optional descriptor.
Figure 3Unrooted H3 phylogeny. (A) cenH3s are not clearly separable from the divergent H3s of excavates and of plant generative cells. (B) Replication-independent H3.3s and replication-coupled H3s have diverged repeatedly in different lineages.
Sequence variation in ‘H3.3’ and ‘H2A.X’ variants
| Kingdom | Organism | Histone Variant | Residue | Residues |
|---|---|---|---|---|
| Animals | H3.3 | |||
| | H3.2 | |||
| Fungi | H3 | S | SAIG | |
| Plants | H3.3 | |||
| | H3.1 | |||
| Rhodophytes | H3.3? | |||
| | H3.1? | |||
| Alveolates | H3.3 | |||
| | H3.1 | |||
| Heterokonts | H3.3? | |||
| | H3.1? | |||
| Amoebozoa | H3.3? | AAI | ||
| | H3.1? | AAI | ||
| Excavates | H3 | A | NAIL | |
| | | | ||
| Animals | H2A.X | PNIQAVLLPKKSATVGPKAPSGGKKATQA | ||
| | H2A.2.2 | PNIQAVLLPKKTSHKPGKNK | ||
| Fungi | H2A.X | PNIHQNLLPKKSAKATKA | ||
| Plants | H2A.X.3 | PNIHQTLLPSKVGKNKGDIGSA | ||
| | H2A.1 | PNIHNLLLPKKAGASKPQED | ||
| Rhodophytes | H2A.X | PNIHQVLMPRKKTKGDA | ||
| | H2A | PNIHAVLLPKKKAKGE | ||
| Alveolates | H2A.X | PNINPMLLPSKSKKTESRGGA | ||
| | H2A.1 | PNINPMLLPSKTKKSTEPEH | ||
| Heterokonts | H2A.X | PNIHAILLPKKTIKTKGP | ||
| | H2A.3 | PNIHAILLPKKSGPTK | ||
| Amoebozoa | H2A.X? | PTPQQSTGEKKKKPSKKAAEGS | ||
| | H2A | PTPQSNTEGKKKKATSKKS | ||
| Excavates | H2A.X | RSAKEGREGKGSHR | ||
| H2A | PSLNKALAKKQKSGKHAKATPSV | |||
RI ‘H3.3’ variants and ‘H2A.X’ variants are paraphyletic or polyphyletic, but have recognizable sequence features. Upper panel: Divergence between RI and RC H3 variants usually involves differences at residue 31 and residues 86 to 89. Lower panel: ‘H2A.X’ variants are distinguished from related H2A variants by bearing a consensus SQ(E/D)Φ phosphorylation motif at the C-terminus. Residue numbers refer to the human H3.3 and H2A.X protein sequences, and to orthologous positions in other variants.
Figure 4Unrooted H4 phylogeny. Most eukaryotes have a single form of H4, and most divergence in H4s is found in excavates, amoebozoans, and ciliates versus other eukaryotes.
Figure 5Unrooted H1 phylogeny. H1.0 (H1°) is an animal-specific clade, including avian H5. H1.10 (H1x) is found in vertebrates. Mammal-specific clades include H1.7 (H1T2) and H1.9 (Hils1). H1.8 (H1oo) and H1.6 (H1t) are also monophyletic in mammals, but other TS H1s and oocyte H1s are not clearly members of the same clades.
Unified nomenclature and synonyms for mammalian H1 variants
| H1.1 | | H1a | H1a | H1.1 | |||||
| H1.5 | H1a | H1s-3 | H1b | H1.5 | |||||
| H1.2 | H1d | H1s-1 | H1c | H1.2 | |||||
| H1.3 | H1c | H1s-2 | H1d | H1.3 | |||||
| H1.4 | H1b | H1s-4 | H1e | H1.4 | |||||
| H1t | | | H1t | (TS) H1.6 | |||||
| | |||||||||
| H1.0, H1° | H1 histone family, member 0 | H1(0) | H1.0 | ||||||
| H1T2 | H1 histone family, member N, testis-specific | H1t2 | (TS) H1.7 | ||||||
| H1oo | H1 histone family, member O, oocyte-specific | H1oo | (OO) H1.8 | ||||||
| | Histone H1-like protein in spermatids 1 | TISP64 | (TS) H1.9 | ||||||
| H1x | H1 histone family, member X | H1X | H1.10 | ||||||
Application of nomenclature to
| At1g06760 | H1.1 | H1 | |
| At2g30620 | H1.2 | " | |
| At2g18050 | H1.3 | " | |
| At5g54640 | H2A.1 | Canonical H2A | |
| At4g27230 | H2A.2 | " | |
| At1g51060 | H2A.10 | " | |
| At3g20670 | H2A.13 | " | |
| At1g54690 | H2A.X.3 | H2A.X | |
| At1g08880 | H2A.X.5 | " | |
| At5g59870 | H2A.W.6 | SPKK-bearing H2As | |
| At5g27670 | H2A.W.7 | " | |
| At5g02560 | H2A.W.12 | " | |
| At2g38810 | H2A.Z.8 | H2A.Z | |
| At1g52740 | H2A.Z.9 | " | |
| At3g54560 | H2A.Z.11 | " | |
| At4g13570 | H2A.Z.4 | none | |
| At1g07790 | H2B.1 | H2B | |
| At5g22880 | H2B.2 | " | |
| At2g28720 | H2B.3 | " | |
| At5g59910 | H2B.4 | " | |
| At2g37470 | H2B.5 | " | |
| At3g53650 | H2B.6 | " | |
| At3g09480 | H2B.7 | " | |
| At1g08170 | H2B.8 | " | |
| At3g45980 | H2B.9 | " | |
| At5g02570 | H2B.10 | " | |
| At3g46030 | H2B.11 | " | |
| At5g65360 | H3.1 | H3.1 | |
| At1g09200 | " | " | |
| At3g27360 | " | " | |
| At5g10400 | " | " | |
| At5g10390 | " | " | |
| At4g40030 | H3.3 | H3.3 | |
| At4g40040 | " | " | |
| At5g10980 | " | " | |
| At1g13370 | H3.6 | none | |
| At1g19890 | H3.10 | MGH3/HTR10 | |
| At1g75600 | H3.14 | none | |
| At1g01370 | cenH3 | CENH3/CENP-A/HTR12 | |
| At1g75610 | H3.7 | | |
| At3g46320 | H4 | H4 | |
| At5g59690 | " | " | |
| At2g28740 | " | " | |
| At1g07820 | " | " | |
| At3g53730 | " | " | |
| At5g59970 | " | " | |
| At3g45930 | " | " | |
| At1g07660 | " | " |