| Literature DB >> 35524943 |
Joshua T Trujillo1, Jiaxin Long1, Erin Aboelnour1,2, Joseph Ogas1, Jennifer H Wisecaver1.
Abstract
Chromatin remodelers play a fundamental role in the assembly of chromatin, regulation of transcription, and DNA repair. Biochemical and functional characterizations of the CHD family of chromatin remodelers from a variety of model organisms have shown that these remodelers participate in a wide range of activities. However, because the evolutionary history of CHD homologs is unclear, it is difficult to predict which of these activities are broadly conserved and which have evolved more recently in individual eukaryotic lineages. Here, we performed a comprehensive phylogenetic analysis of 8,042 CHD homologs from 1,894 species to create a model for the evolution of this family across eukaryotes with a particular focus on the timing of duplications that gave rise to the diverse copies observed in plants, animals, and fungi. Our analysis confirms that the three major subfamilies of CHD remodelers originated in the eukaryotic last common ancestor, and subsequent losses occurred independently in different lineages. Improved taxon sampling identified several subfamilies of CHD remodelers in plants that were absent or highly divergent in the model plant Arabidopsis thaliana. Whereas the timing of CHD subfamily expansions in vertebrates corresponds to whole genome duplication events, the mechanisms underlying CHD diversification in land plants appear more complicated. Analysis of protein domains reveals that CHD remodeler diversification has been accompanied by distinct transitions in domain architecture, contributing to the functional differences observed between these remodelers. This study demonstrates the importance of proper taxon sampling when studying ancient evolutionary events to prevent misinterpretation of subsequent lineage-specific changes and provides an evolutionary framework for functional and comparative analysis of this critical chromatin remodeler family across eukaryotes.Entities:
Keywords: evolutionary innovation; gene duplication; gene loss; protein domain prediction; subfunctionalization; whole genome duplication
Mesh:
Substances:
Year: 2022 PMID: 35524943 PMCID: PMC9113485 DOI: 10.1093/gbe/evac066
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 4.065
Fig. 1.Distribution of CHD gene family across eukaryotes and model domain architecture. (A) Maximum-likelihood phylogeny of CHD homologs. Branches corresponding to subfamily (sf) I, II, and III are indicated. Grey circles indicate branches with ultrafast bootstrap support ≥ 0.95. Clades of animal (red), plant (green), or fungi (blue) are collapsed. (B) PFAM domain architecture of CHD homologs from model eukaryotes. Width of ovals and rectangles are proportional to the width of the protein domain.
Summary Counts of All CHD Homologs
| Lineage | Subfamily I Counts | Subfamily II Counts | Subfamily III Counts | Combined Counts | ||||
|---|---|---|---|---|---|---|---|---|
| Species | Sequences | Species | Sequences | Species | Sequences | Species | Sequences | |
|
|
|
| — |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| — |
|
|
|
|
| — |
|
|
|
|
|
|
|
|
|
|
| — |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Microsporidia | — | — | — | — | 10 | 10 | 10 | 10 |
| Chytridiomycota | 3 | 3 | — | — | 3 | 3 | 3 | 6 |
| Mucoromycota | 4 | 4 | 3 | 3 | 3 | 5 | 4 | 12 |
| Basidiomycota | 53 | 53 | 30 | 31 | — | — | 53 | 84 |
| Ascomycota | 221 | 227 | 170 | 172 | — | — | 222 | 399 |
|
|
|
|
|
|
|
|
|
|
|
| — |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Other Metazoans | 10 | 10 | 12 | 18 | 12 | 12 | 12 | 40 |
| Other Protostomes | 22 | 30 | 23 | 40 | 21 | 27 | 24 | 97 |
| Arthropods | 146 | 166 | 147 | 167 | 138 | 277 | 149 | 610 |
| Other Deuterostomes | 5 | 6 | 6 | 6 | 5 | 5 | 6 | 17 |
| Chondrichthyes | 2 | 5 | 2 | 3 | 2 | 7 | 2 | 15 |
| Other Bony Vertebrates | 78 | 231 | 79 | 376 | 79 | 425 | 79 | 1,032 |
| Amphibians | 5 | 14 | 5 | 18 | 5 | 29 | 5 | 61 |
| Reptiles | 91 | 231 | 91 | 221 | 91 | 371 | 91 | 823 |
| Mammals | 129 | 430 | 130 | 677 | 130 | 706 | 130 | 1,813 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| — |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| — |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Chlorophyta | 71 | 76 | 45 | 54 | 30 | 52 | 94 | 182 |
| Other Streptophytes | 18 | 18 | 21 | 25 | 1 | 1 | 27 | 44 |
| Other Embryophytes | 37 | 40 | 55 | 139 | 26 | 31 | 55 | 210 |
| Lycophytes | 11 | 11 | 13 | 29 | 2 | 2 | 15 | 42 |
| Ferns | 20 | 20 | 47 | 68 | 13 | 14 | 47 | 102 |
| Gymnosperms | 37 | 37 | 59 | 112 | — | — | 59 | 149 |
| Other flowering plants | 29 | 29 | 47 | 101 | — | — | 47 | 130 |
| Monocots | 58 | 62 | 91 | 229 | — | — | 92 | 291 |
| Eudicots | 279 | 317 | 454 | 1,153 | — | — | 455 | 1,470 |
|
|
|
|
|
|
|
|
|
|
note.—Main eukaryotic lineages are bolded. Sub-lineages of Fungi, Metazoa, and Viridiplantae are also listed (unbolded).
Fig. 2.Detailed subfamily phylogenies with domains. Maximum-likelihood phylogenies for (A) subfamily I, (B) subfamily II, and (C) subfamily III. Location of CHD homologs from model eukaryotes are indicated. Branches are colored as in figure 1. Additional taxonomic resolution is provided by the color bars. The outer track indicates the PFAM domain architecture for each homolog.
Summary Counts of Viridiplantae Sequences in Subfamily II
| Lineage | PKL Counts | PKR1 Counts | PKR4 Counts | MOM Counts | ||||
|---|---|---|---|---|---|---|---|---|
| Species | Sequences | Species | Sequences | Species | Sequences | Species | Sequences | |
| Chlorophyta | 41 | 47 | 4 | 4 | 3 | 3 | — | — |
| Other Streptophytes | 16 | 16 | 8 | 8 | 1 | 1 | — | — |
| Other Embryophytes | 54 | 70 | 26 | 30 | 37 | 39 | 1* | 1* |
| Lycophytes | 12 | 18 | 9 | 9 | 2 | 2 | 5* | 5* |
| Ferns | 47 | 47 | 21 | 21 | — | — | 6* | 7* |
| Other flowering plants | 46 | 51 | 23 | 25 | 2 | 2 | 15 | 23 |
| Gymnosperms | 59 | 62 | 18 | 19 | 25 | 25 | 6 | 6 |
| Monocots | 90 | 107 | 53 | 62 | 13 | 15 | 27 | 45 |
| Eudicots | 440 | 587 | 262 | 317 | — | — | 164 | 249 |
note.—Asterisk (*) indicates sequences that were manually added based on presence of conserved MOM motif(s), see Materials and Methods.
Fig. 3.Novel conserved motifs and disordered regions in CHD proteins: (A) AtMOM1, (B) HsCHD6, (C) HsCHD7, (D) HsCHD8, (E) HsCHD9, and (F) DmKismet. IUPred score denotes the disorder tendency of each residue in the given protein, where higher values correspond to a higher probability of disorder. The top domain track for each protein indicates the location of the canonical PFAM conserved and accessory structural domains. The bottom track (*) indicates the location of predicted IUPred-derived structural domains in MOM (CMM1/2) and subfamily III (SF3M1-6). Width of ovals and rectangles are proportional to the width of the protein domain.