Literature DB >> 31881057

Genome-wide identification and expression analysis of the PHD-finger gene family in Solanum tuberosum.

Mingyue Qin1,2,3, Wenbin Luo4, Yan Zheng1,2,3, Huazhong Guan2,3, Xiaofang Xie1,2,3.   

Abstract

Plant homeodomain (PHD) proteins are prevalent in eukaryotes and play important roles in plant growth, development and abiotic stress response. In this study, the comprehensive study of the PHD family (StPHD) was performed in potato (Solanum tuberosum L.). Seventy-two PHD genes (named StPHD1-72) were identified and grouped into 10 subfamilies based on phylogenetic analysis. Similar structure organizations were found within each subfamily according to the exon/intron structures and protein motif analysis. These genes were unequally scattered on the chromosomes of potato, with 9 pairs of segmental duplicated genes and 6 pairs of tandem duplicated genes showing that both segmental duplicated and tandem duplicated events contributed to the expansion of the potato PHD family. The gene ontology (GO) analysis suggests that StPHD mainly functioned at the intracellular level and was involved in various binding, metabolic and regulation processes. The analysis of expression patterns of StPHD genes showed that these genes were differentially expressed in 10 different tissues and responded specifically to heat, salt and drought stress based on the FPKM (Fragments per kilobase of transcript per million mapped reads) values of the RNA-seq data. Furthermore, the real-time quantitative PCR for 12 selected StPHD genes revealed the various levels of gene expression corresponding to abiotic stress. Our results provide useful information for a better understanding of PHD genes and provide the foundation for additional functional exploration of the potato PHD gene family.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31881057      PMCID: PMC6934267          DOI: 10.1371/journal.pone.0226964

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Zinc-finger proteins are widely dispersed in eukaryotic organisms. Zinc-finger domains are rich in cysteine or histidine and have been classified into several types, including RING (Really Interesting New Genes), LIM (Lin11, Isl-1 and Mec-3), and plant homeodomain (PHD) [1]. The majority of PHD proteins are in the nucleus [2]. A typical PHD protein usually contains one to several PHD-finger domains, and each of the proteins contains approximately 60 amino acids with the structure composition of Cys4-His-Cys3 zinc-binding motif [3]. The residues of the pair of cysteines or between cysteine and histidine are usually conserved, whereas the second amino acid residue before the final pair of cysteines is usually tryptophan or another aromatic amino acid [4]. The combination of the core amino acid residues and two zinc ions plays a fundamental role in maintaining a firm spatial framework for the domain, a roughly globular domain in a three-dimensional conformation [5]. PHDs proteins have been widely studied in plant species since the first PHD protein HAT3.1 was identified in Arabidopsis [6]. Studies have shown that PHD proteins perform critical roles in plant development [7]. In Arabidopsis, the PHD protein VIL1 regulates the expression of floral repressors through photoperiod and vernalization pathways. DUET, a PHD protein, is also essential for chromosome organization and progression during spermatogenesis in Arabidopsis [8]. The PHD protein VIM1 contains the histone residue and participates in the regulation of the chromatin state [9]. As a PHD domain containing protein, SIZ1 codes a SUMO E3 ligase, which contains the PHD domain and regulates the anther dehiscence for spikelet fertility. Furthermore, SIZ1 regulates both vegetative and reproductive development in plants [10, 11]. The Ehd3 gene, which encodes a PHD protein, is an important promoter of rice flowering [12]. Recent studies showed that PHD proteins are essential epigenetic members in methylation maintenance [13, 14]. For example, the PHD proteins ATX1 and ATX2 control the expression of genes that encode histone methyltransferase [15]. In addition, some PHD proteins are involved in abiotic stress response [16]. Six soybean PHD proteins, encoding the Alfin1-type PHD protein, showed differential expression in response to cold, ABA and drought treatment, and displayed resistance to salt stress in transgenic Arabidopsis [7]. The PHD gene families have been studied in several plant species, such as Arabidopsis (Arabidopsis thaliana), maize (Zea mays) [17], poplar (Populus trichocarpa) [18], Chinese pear (Pyrus bretschneideri) [19] and bamboo (Phyllostachys edulis) [20]. Potato (Solanum tuberosum) is one of the most important food crops in the world, with more than 380 million tons of field production worldwide (http://www.fao.org). However, the PHD gene family in potato has not been thoroughly examined. Genome sequence and annotation resources of the potato genome provides an excellent opportunity for understanding the PHD gene family [21]. In this study, a comprehensive investigation of the PHD gene family in the potato genome was conducted, including PHD gene structure, chromosomal localization, gene duplication, phylogenetic relationship, gene ontology, tissue-specific expression profile, and expression patterns following exposure to drought, heat and salt stress. The objectives of this study were to identify the genome-wide sequence structures and the evolutionary relationship of the potato PHD gene family and provide a better understanding of functional mechanisms of the PHD-finger gene family in plants.

Materials and methods

Identification of PHD genes

The PHD protein sequences of Arabidopsis thaliana were used as the queries to identify the amino acid orthologs in potato through the BLASTP tool of SpudDB [22] and Phytozome v12.1 (https://phytozome.jgi.doe.gov). Potato candidate PHD proteins were identified with more than 30% similarity to the query sequence and an E value less than E-10. The genomic sequences of Arabidopsis thaliana PHD proteins were downloaded from the Phytozome database (https://phytozome.jgi.doe.gov/pz/portal.html). The domains for PHD proteins were confirmed by using the Conserved Domain Database (CDD) from NCBI (https://www.ncbi.nlm.nih.gov/cdd/) [23] with an E value of 1e-10. Finally, the sequences with complete PHD domains were retained and were assigned as potato PHD (StPHD) genes. The information for gene IDs, physical position, sequence of gene and protein, and coding sequences (CDS) were retrieved from Phytozome. These StPHD genes were renamed according to the order of their physical position.

Gene structure and conserved motif identification

The parameters for the final confirmed PHD proteins were calculated using online ExPASy programs (http://web.expasy.org/protparam/) [24]. The exon-intron structures for the StPHD genes were identified on the Gene Structure Display Server (GSDS, http://gsds.cbi.pku.edu.cn/) [25]. The conserved motifs were identified by the MEME program (version 5.0.3, http://alternate.meme-suite.org/tools/meme) [26] using an optimum motif width of 50–250 residues and 20 as the number of maximum motifs. The function of the identified motifs was annotated with the CDD program [23].

Chromosomal location and gene duplication

The chromosomal location image of StPHD genes was generated by the MapInspect tool (http://www.plantbreeding.wur.nl/uk/software_mapinspect.html) based on gene physical position data. Gene duplication was determined by the length and similarity (>70%) [27, 28]. Tandem duplicated genes were defined when the physical interval of two duplicated gene was less than 100 kb and they were separated by less than five genes [29].

Phylogenetic analysis

Alignment of all potato PHD-finger protein sequences was generated using ClustalX1.83 [30]. A phylogenetic tree with 1,000 bootstrap replicates was constructed based on the IQ-TREE maximum-likelihood method [31]. BLASTN was used to identify the orthologs among potato, Arabidopsis and maize [32]. The two sequences with the best alignment were confirmed as orthologs if the alignment was more than 300 bp and shared at least 40% similarity.

GO annotation and RNA-seq data analysis

GO analysis for StPHD genes was performed using PlantRegMap (http://plantregmap.cbi.pku.edu.cn/go.php). FPKM values of StPHD for various tissues and treatments derived from RNA-seq (DM_v4.03) [21] were extracted from SpudDB. The heatmap2 function of the R package for heatmap functions were used to analyze the expression of StPHD genes [33].

Plant growth and treatments

T Virus-free plantlets (S. tuberosum L. autotetraploid cultivar Zhongshu 3) were produced from in vitro cuttings. Using the nodal cutting method, the potato shoots were cultured in full MS solid media with a 16-h day temperature of 22°C and an 8-h night temperature of 18 °C for one month. The plantlets were then grown in a tray with a half-strength modified Hoagland solution [34] for 6 days prior to treatment. Small foam squares were used to suspend plantlets in the solution, which was adjusted to cover the roots. Plantlets were exposed to abiotic stress conditions included heat (35°C), drought (260 mM mannitol) and salt (150 mM NaCl) treatment, untreated plantlets were used as control groups. All treated plantlets and controls were collected 6 h after treatment and stored at −80°C before RNA samples were extracted.

Quantitative real-time PCR analysis

The total RNA of the plantlets was extracted using TRIzol reagent (Invitrogen, http://www.invitrogen.com). The cDNA samples were assayed by qRT-PCR using SYBR Premix Ex Taq (Takara). The primers used for qPCR analysis are listed in S1 Table. Actin was used as an internal control. Three biological replicates (each replicate contained 6 plants) and three technical replicates were tested. The relative expression level of a gene was measured using the 2−ΔΔCt method [35].

Results

Identification and characterization of PHD genes

A BLASTP search was performed using 70 PHD protein sequences of Arabidopsis [36] as the queries against the potato genome database to analyze PHD genes in potato. In addition, the CDD analysis was used to validate the presence of the PHD conserved domain in the predicted sequences. Finally, a total of 72 PHD genes were confirmed in potato and these genes were named StPHD1 through StPHD72 based on their physical position on chromosomes (Table 1). However, a single gene (StPHD1) was located on the unassembled scaffolds, and therefore, StPHD1 could not be assigned to any of the potato chromosomes. The StPHD genes varied greatly in terms of the number of amino acids and molecular weights. The StPHD proteins contain 115 (StPHD42) to 1,569 (StPHD38) amino acids, with an average of 532.7 amino acids, whereas the molecular weights ranged from 12.94 kDa (StPHD42) to 175.35 kDa (StPHD38) and the theoretical isoelectric points (pI) varied between 4.43 (StPHD24) and 9.27 (StPHD63). The detailed information for StPHD genes, including the gene ID, protein length (aa), molecular mass (MS), pI and location are listed in Table 1.
Table 1

The PHD family genes in Solanum tuberosum.

Gene nameGene IDProteinChr.Location
Length (a.a.)Mol. wt. (Da)PI
StPHD1PGSC0003DMG40000674017318752.746.18039070649‥39071170
StPHD2PGSC0003DMG40002277353160482.719.011381797‥384537
StPHD3PGSC0003DMG40002277453160328.358.721389334‥392000
StPHD4PGSC0003DMG4000068591167129839.476.09165467581‥65472212
StPHD5PGSC0003DMG400024731949108238.644.92177859513‥77863125
StPHD6PGSC0003DMG40001826124828022.565.16179431944‥79437244
StPHD7PGSC0003DMG40002576057863075.324.78183891354‥83895400
StPHD8PGSC0003DMG40003095521124452.448.49186394125‥86398488
StPHD9PGSC0003DMG40001803970778191.687.13188623560‥88626720
StPHD10PGSC0003DMG40002111854560035.477.01229095938‥29104284
StPHD11PGSC0003DMG40001671221124070.48.46236806739‥36811222
StPHD12PGSC0003DMG40101281274884165.628.6731914594‥1918304
StPHD13PGSC0003DMG40002253782288752.295.2033774837‥3781249
StPHD14PGSC0003DMG400016917919104398.788.05316816173‥16820858
StPHD15PGSC0003DMG40001196270977804.295.43322309009‥22311760
StPHD16PGSC0003DMG40000912170480208.218.87345237890‥45246090
StPHD17PGSC0003DMG40001816264373457.518.86352582102‥52584033
StPHD18PGSC0003DMG40000060573584248.626.14356433707‥56436222
StPHD19PGSC0003DMG40000248527631569.314.73361241418‥61245409
StPHD20PGSC0003DMG40000600670578358.888.3941614152‥1617748
StPHD21PGSC0003DMG40002950253460406.385.7843056100‥3058915
StPHD22PGSC0003DMG40002948874783866.015.2443252237‥3254776
StPHD23PGSC0003DMG40003789851458420.767.8143281893‥3284133
StPHD24PGSC0003DMG40201800815317300.494.43456927171‥56927632
StPHD25PGSC0003DMG40001800628932109.284.83456954320‥56955189
StPHD26PGSC0003DMG40000935759567593.616.28465066151‥65070283
StPHD27PGSC0003DMG40003392168676008.657.17517769678‥17773971
StPHD28PGSC0003DMG40002742967877234.896.36519370632‥19373940
StPHD29PGSC0003DMG40101704322826080.798.45532754127‥32757896
StPHD30PGSC0003DMG40000845654961964.656.74549835834‥49840825
StPHD31PGSC0003DMG40001626215316795.425.45640408827‥40409456
StPHD32PGSC0003DMG40001676832036468.839.03645161128‥45168547
StPHD33PGSC0003DMG40002047236241318.58.65645612751‥45616624
StPHD34PGSC0003DMG40002613431936736.17.53645906361‥45911506
StPHD35PGSC0003DMG40003700128031695.416.63645930651‥45935015
StPHD36PGSC0003DMG40000488725729094.655.05648587673‥48593031
StPHD37PGSC0003DMG40000406065071454.448.79651123204‥51129858
StPHD38PGSC0003DMG4000071261569175353.57.48655175748‥55182922
StPHD39PGSC0003DMG40002874071981505.828.5276333109‥6341436
StPHD40PGSC0003DMG40002095124728051.454.99737507602‥37511402
StPHD41PGSC0003DMG40000239134739664.278.09740086321‥40087538
StPHD42PGSC0003DMG40000984911512938.178.37740131259‥40131606
StPHD43PGSC0003DMG40000614654661748.025.81744316143‥44320770
StPHD44PGSC0003DMG40000067323426735.496.50744939178‥44941263
StPHD45PGSC0003DMG40003109638643398.37.88751461135‥51463722
StPHD46PGSC0003DMG40000708630233717.546.02752252668‥52254002
StPHD47PGSC0003DMG40001243148553833.925.19752735489‥52742279
StPHD48PGSC0003DMG40202220282389381.038.39755159113‥55164799
StPHD49PGSC0003DMG40002625862271489.428.60851235299‥51240722
StPHD50PGSC0003DMG402012227946102296.147.90856760683‥56768601
StPHD51PGSC0003DMG400003902942103697.198.7794612598‥4621802
StPHD52PGSC0003DMG40002307824127297.645.3298326536‥8334789
StPHD53PGSC0003DMG4000220791217138563.936.54939798638‥39807441
StPHD54PGSC0003DMG40002310035439514.27.74942979927‥42982358
StPHD55PGSC0003DMG40003017440645444.279.19943275771‥43285203
StPHD56PGSC0003DMG40003017519421554.598.00943358605‥43359881
StPHD57PGSC0003DMG40004197023526104.015.56943720347‥43721437
StPHD58PGSC0003DMG40003226246651807.816.25953864208‥53868363
StPHD59PGSC0003DMG40001158823626231.556.14957716591‥57724943
StPHD60PGSC0003DMG40001917824027090.435.771050195563‥50201617
StPHD61PGSC0003DMG40001102224027074.375.181053903104‥53908590
StPHD62PGSC0003DMG40001105951456806.27.351054485212‥54491878
StPHD63PGSC0003DMG40002818176784820.199.271055326036‥55335829
StPHD64PGSC0003DMG40002371886995197.695.591057644472‥57648188
StPHD65PGSC0003DMG40000836725829818.384.961059176774‥59180143
StPHD66PGSC0003DMG40001324916718644.648.05111051899‥1052552
StPHD67PGSC0003DMG40003642333337045.914.52111088487‥1089488
StPHD68PGSC0003DMG40001324768377481.465.66111102178‥1104593
StPHD69PGSC0003DMG40002865639545262.0778.12116978554‥6982962
StPHD70PGSC0003DMG40002779244549761.588.12119596045‥9597382
StPHD71PGSC0003DMG4000004591059116446.34.951139273198‥39281957
StPHD72PGSC0003DMG4000378171328150283.026.521141080332‥41083612

Phylogenetics and structure of PHD protein in potato

Phylogenetic analysis was performed, and an unrooted phylogenetic tree was generated using the 72 potato PHD protein sequences, with 1,000 bootstrap replicates, to reveal the evolutionary relationship for the PHD family (Fig 1A). Moreover, the exon/intron structures for individual PHD genes were investigated (Fig 1B). Together with the results of gene structure analysis, the StPHD family genes were clustered into 10 groups (I through X) with the bootstrap values (≥60%) on the phylogenetic tree. However, 4 StPHD genes (StPHD12, -10, -27 and -38) could not be grouped into any subfamily for the 10 groups due to the low bootstrap values (<60%). Among these 10 groups: group X was the largest group, which contained 18 members; group I, V and VIII had 8 members; group IX had 7 members, and these five clades represented 68.06% of the total StPHD proteins. In contrast, groups III and VII only contained two to three members.
Fig 1

Phylogenetic relationships and gene structures for the StPHD family.

A. Phylogenetic tree generated by maximum-likelihood method with bootstrapping analysis (1,000 replicates) based on the protein sequences of StPHD genes (percentage of bootstrap value was displayed at each node). The branches of different groups are shown in different colors. B. The graphic displays exon-intron structures for StPHD gene family members using GSDS. The horizontal black lines and the blue boxes show introns and exons, and the scale displays the relative length and position of the introns and exons.

Phylogenetic relationships and gene structures for the StPHD family.

A. Phylogenetic tree generated by maximum-likelihood method with bootstrapping analysis (1,000 replicates) based on the protein sequences of StPHD genes (percentage of bootstrap value was displayed at each node). The branches of different groups are shown in different colors. B. The graphic displays exon-intron structures for StPHD gene family members using GSDS. The horizontal black lines and the blue boxes show introns and exons, and the scale displays the relative length and position of the introns and exons. Structure analysis of StPHD genes (Fig 1B) showed the number of introns ranged from 0 to 20. Among them, 7 genes had no introns (StPHD24, -25, -17, -70, -67, -42 and -1) and StPHD51 had the largest number of introns (20). Most of the StPHD genes that shared the same group showed similar exon/intron structure, including the intron numbers and exon length, and however, exceptions were also found among these genes. For example, StPHD genes in group I contained 2 to 6 introns, and the members in StPHD gene groups II contained 5 to 9 introns. Interestingly, the members in group V had larger gene sequences and contained a wide range of introns (7–20) whereas they exhibited great diversity in exon length.

Potato PHD-finger domain and motifs

To further investigate the similarity between the potato PHD-finger domains, 72 PHD-finger domain sequences were aligned (S1 Fig). The highlighted figures in black and pink colors were the Zn ion binding sites with seven cysteines and one histidine residue (Cys4-His-Cys3), and the result indicated that the PHD-finger domain of potato is very conservative. Based on the result of alignment, the consensus sequence of the potato PHD-finger domain was C–X(1–2)-C–X(4–24)-C–X(2–17)-C–X(4–15)-H-X2-C–X(2–25)-C–X(0–4)-C. The consensus sequence of potato PHD domain was basically consistent with previous research [17, 18, 37] The online MEME software was used to investigate the conserved motifs of 72 StPHD proteins within each subfamily to investigate the diversity of motif components among StPHDs. A total of 20 distinct motifs were identified (Fig 2), and the detailed sequences for each motif is listed in S2 Table. By searching CDD database, 11 putative motifs were acquired functional annotation and motifs 1, 2, 3, 5 and 8 were annotated for the components of the conserved PHD domain (Fig 2), and however, no functional annotation was found for the other 9 putative motifs. The majority of StPHD proteins in the same subfamily have similar motif components and distribution, suggesting that the StPHD proteins share the same groups may possess similar functions. For example, proteins in group I possessed motifs 4, 19 and 3, while group E members contained motifs 9, 3 and 6. However, significant differences were also observed between different groups, and some motifs were exclusively presented in a particular group, suggested that these motifs might play specific functions in the group. For instance, motifs 4 and 19 contain a histone-binding component, which is common for all members in group I and play an important role in transcription start [38]. Motifs 6 and 9 present in group E members, and motif 6 is a type of N-Acetyltransferases (NAT) and mostly catalyzes the transfer of an acyl group. Motif 9 possesses Jas domain and this motif appears to bind to the Groucho/Tup1-type co-repressor TOPLESS (TPL) and TPL-related proteins (TPRs) and with an important role in the jasmonate signalling pathway [39]. Motif 7 is the Bromo-adjacent Homology domain (BAH) in group A members and plays an important role in DNA methylation, replication and transcriptional regulation [40]. While many members in group J contain motif 10, it is Oberon Coiled-coil region (Oberon_cc) and plays an important role for maintenance and establishment of both the shoot and root apical meristems [41].
Fig 2

Schematic diagram of conserved motifs for StPHD proteins.

Different motifs are displayed with different colored boxes and numbers (1–20). The annotation of each motif listed on the right, which was identified by CDD program.

Schematic diagram of conserved motifs for StPHD proteins.

Different motifs are displayed with different colored boxes and numbers (1–20). The annotation of each motif listed on the right, which was identified by CDD program.

Chromosomal locations and duplications of StPHD genes

A total of 71 out of the 72 PHD genes were assigned to the 11 potato chromosomes, except chromosome 12 (Fig 3). Chromosome 7 contained the largest number of PHD genes (10), followed by chromosomes 9 (9 PHD genes), whereas chromosomes 2 and 8 possessed the smallest number of PHD gene members (2). A similar uneven distribution pattern of PHD gene families was also observed in Arabidopsis [36] and Populus trichocarpa [18]. The regions on the proximate or the distal ends of potato chromosomes exhibited a dense distribution of StPHD genes, such as the bottom of chromosomes 1, 6, 7, 9, 10 and 11. Both tandem duplication and segmental duplication perform an important role in the expansion of the gene family [42]. The potential duplication events were investigated to reveal the detailed mechanism for the PHD gene family. According to the phylogenetic and comparative analysis of the StPHD genes, 9 gene pairs (StPHD6/40, StPHD 9/15, StPHD 6/36, StPHD 9/64, StPHD 15/64, StPHD 14/53, StPHD52/60, StPHD52/61 and StPHD51/63) were found to be involved in the segmental duplication events (Fig 3). A biased distribution pattern was also found among the 9 segmental duplication pairs and no pairs were distributed on chromosomes 2, 4, 5, 8, 11 and 12. In addition, 6 gene pairs (StPHD2/3, StPHD21/22, StPHD67/68, StPHD33/34, StPHD33/35 and StPHD34/35) were confirmed to be tandem duplicated genes, and they were distributed on chromosomes 1, 4, 5 and 11. These results suggest that segmental duplication events and segmental duplication may play important roles in the amplification of the potato PHD gene family.
Fig 3

Chromosomal locations of potato PHD genes.

The left scale represents the length of potato chromosomes (Mb). The tandem duplicated gene pairs are marked with gray boxes, and the segmental duplicated gene pairs are marked with colored boxes and are linked by corresponding color lines.

Chromosomal locations of potato PHD genes.

The left scale represents the length of potato chromosomes (Mb). The tandem duplicated gene pairs are marked with gray boxes, and the segmental duplicated gene pairs are marked with colored boxes and are linked by corresponding color lines.

Phylogenetic analysis of the StPHD gene family

To explore the evolutionary process of the PHD gene family in potato, 72 StPHD protein sequences were aligned with 70 PHD proteins from Arabidopsis [36] and 67 PHD proteins from maize [17]. The phylogenetic tree was classified into 11 distinct groups (group A through K) with high bootstrap values (≥60%) support (Fig 4). The PHD proteins with low bootstrap values were not included with any subfamily. Group C was the largest subfamily, with 42 proteins, whereas the smallest groups, E, G and I, contained 8 to 9 proteins. StPHD proteins were more closely related to those in the same subfamily from Arabidopsis and maize than those of the other potato PHD proteins. Recent reports showed that PHD proteins for many plant species were highly similar to those of Arabidopsis [43]. Based on our analysis, most PHD protein structures in potato, Arabidopsis and maize were highly similar. A total of 18 members of PHD subfamily K were identified and 5 times more members were found in potato than Arabidopsis; only 3 members were identified in Arabidopsis. Only 2 members of subfamily K were identified in maize and 8 more times were found in potato than in maize (Fig 4).
Fig 4

Phylogenetic analysis of potato, Arabidopsis and maize PHDs.

The tree was constructed by maximum-likelihood method. Each PHD subfamily was marked by a specific color, except for black, which represented genes that did not belong to any subfamily due to low bootstrap values. The square, circle and diamond represent maize, potato and Arabidopsis PHD proteins, respectively.

Phylogenetic analysis of potato, Arabidopsis and maize PHDs.

The tree was constructed by maximum-likelihood method. Each PHD subfamily was marked by a specific color, except for black, which represented genes that did not belong to any subfamily due to low bootstrap values. The square, circle and diamond represent maize, potato and Arabidopsis PHD proteins, respectively. In total, 77 ortholog pairs (St-At) were identified between potato and Arabidopsis (S3 Table). However, only 68 ortholog pairs were found between potato and maize (St-Zm) (S4 Table). The number of orthologs between potato and Arabidopsis were greater than that between potato and maize, which may be due to the closer evolutionary distance between potato and Arabidopsis. One StPHD gene identified two or more orthologs from Arabidopsis or maize. For example, AT1G14510, AT2G02470, AT3G42790, AT5G26210 and AT3G11200 are orthologs to StPHD6, and these paralogs may perform important roles in the expanding process of the PHD gene family in plants.

GO annotation for StPHD proteins

Previous reports showed that PHD is involved in many biological processes at the nuclear level [4], including chromatin modification and mediation of molecular interactions in gene transcription. In addition, PHD displayed the complicated histone sequence reading ability mediated by histone modifications [44]. Diverse functions were found by the GO analysis of StPHD proteins (Fig 5). The majority of StPHD proteins were involved in intracellular activities, whereas some of them were located in the nucleus, others were located in the organelle. Some StPHD proteins had molecular function of binding, including ion, protein, DNA and chromatin. Meanwhile, some StPHD proteins showed function for acetyltransferase activity. In terms of biological processes, StPHD proteins participated in various biological pathways and regulated the various metabolic and biological processes, such as mediation of various organizations of cellular components, chromosomes, macromolecules, complex subunits, and organelles. In addition, some StPHD proteins engaged in gene expression. The diverse biological functions of StPHD proteins may be due to diverged sequences rather than the conserved StPHD domain.
Fig 5

GO annotation of StPHD proteins.

The annotation was performed on three categories, including biological process (blue color), cellular component (green color), and molecular function (yellow color).

GO annotation of StPHD proteins.

The annotation was performed on three categories, including biological process (blue color), cellular component (green color), and molecular function (yellow color).

Tissue expression patterns for StPHD genes

To reveal the potential functions of the StPHD genes, the expression patterns of StPHD genes were investigated based on the FPKM values extracted from SpudDB from 10 main tissues (root, shoot, petal, carpel, sepal, stamen, tuber, leaf, flower and petiole) (Fig 6). However, 20 genes were excluded from the heat map analysis because of low expression (FPKM < 0.5) or lack of expression in all examined tissues. Based on the heatmap, considerable variations in expression were found among individual genes from different tissues. A total of 52 StPHD genes were clustered into 3 groups (Fig 6), and 16 genes (StPHD52, -6, -40, -65, -47, -59, -61, -13, -20, -4, -60, -5, -43, -30, -29 and -8) were included in group II, which showed high expression levels in most of the analyzed tissues. Among them, two genes (StPHD29 and StPHD8) were especially abundant. In addition, 12 genes were included in group III and showed lower expression levels in most of the tissues. Some StPHD genes showed tissue-specific expression patterns. For example, StPHD27 was found to show abundant expression in root, shoot and stamen but had lower expression levels in petal, carpel and leaf. The expression patterns for these genes provided preliminary information for their functions.
Fig 6

Expression heatmap of StPHD genes in different tissues.

FPKM values for StPHD genes were transformed by log10.

Expression heatmap of StPHD genes in different tissues.

FPKM values for StPHD genes were transformed by log10.

Expression profiles for StPHD genes in response to abiotic stress

To further explore the response of the 52 StPHD genes following exposure to various types of stress (heat, salt and drought), a heatmap was generated using the relative expression level (represented by the FPKM values of stress/control) (Fig 7). In response to heat stress, most of the StPHD genes were downregulated. StPHD15, StPHD39, StPHD70 and StPHD11 showed significant downregulation, and a few StPHD genes (e.g., StPHD49 and StPHD24) exhibited significant upregulation. In contrast, the variation in expression levels in response to salt and drought stress were not as divergent as that of heat stress. Most StPHD genes were upregulated; StPHD10, StPHD7 and StPHD24 displayed the most significant upregulation in response to salt. However, StPHD59 showed significant downregulation in response to drought stress. A few StPHD genes (e.g., StPHD10, StPHD69 and StPHD46), showed contrasting expression patterns among the three stress types. StPDH46 was not sensitive to salt stress, but it was upregulated when exposed to heat stress and downregulated with drought stress. The StPHD genes with diverse stress responses and expression patterns may be due to the functional dissimilation of StPHD proteins.
Fig 7

Expression heatmap for StPHD genes exposed to heat (H), salt (S) and drought (D) stress.

It has been confirmed that a subfamily of the maize PHD gene family was involved in abiotic stress response (namely, GRMZM5G893976, GRMZM2G017142, GRMZM2G148810, GRMZM2G158918, GRMZM2G016817, GRMZM2G172001, GRMZM2G080917, GRMZM2G050495, GRMZM2G156088, GRMZM2G063864, GRMZM2G107807, GRMZM2G008259, GRMZM2G115424, GRMZM2G153087, AC225147.4, GRMZM2G110952 and GRMZM2G047316) [17]. In our study, phylogenetic analysis showed that all of these members in this maize subfamily were distributed in group J (Fig 4), suggesting the members in this group shared a close evolutionary relationship. As the functions were related to their protein structures, we concluded the 7 StPHD genes (StPHD40, -36, -19, -6, -52, -61 and -60) in group J (Fig 4) may also perform a similar role in response to various stress types in potato. Therefore, to further understand the function of these StPHD genes, qRT-PCR was used to study the expression patterns of 7 StPHD genes in response to abiotic stress (heat, salt and drought). In addition, 5 StPHD genes (StPHD10, -24, -38, -46 and -49) that exhibited significant response to abiotic stress, based on the FPKM values from SpudDB (Fig 7), were also selected for qRT-PCR analysis. The expression patterns for the 12 StPHD genes in response to heat, salt and drought are shown in Fig 8. In total, StPHD48 showed a slight response to all three abiotic treatments. Of the 12 PHD-finger genes, most of the genes were upregulated in response to heat stress, except StPHD52 and StPHD60. StPHD10, StPHD19 and StPHD61 were significantly upregulated (>2 fold). StPHD49 showed extreme sensitivity (>5-fold) in response to heat stress (Fig 7). Although 9 genes were upregulated in response to salt treatment, StPHD10, StPHD38 and StPHD48 were slightly downregulated compared to that of the control. In response to drought stress, 5 (StPHD6, StPHD19, StPHD24, StPHD60 and StPHD61) out of the 9 upregulated genes showed significant variation in expression level (>2 fold), whereas StPHD46, StPHD48 and StPHD49 were slightly downregulated.
Fig 8

Relative expression level of 12 StPHD genes in response to heat, salt and drought stress at 6 h.

The error bars indicated standard deviation. Asterisks indicated that the expression level was significantly different from the value of the control (*P<0.05, **P<0.01).

Relative expression level of 12 StPHD genes in response to heat, salt and drought stress at 6 h.

The error bars indicated standard deviation. Asterisks indicated that the expression level was significantly different from the value of the control (*P<0.05, **P<0.01).

Discussion

Previously reported PHD proteins, which participate in many biological processes including chromatin modification and mediating the molecular interactions in gene transcription and histone modifications, played important roles for the growth and development of plants [4, 44]. To comprehensively analyze the PHD gene family, genome-wide identification and annotation of PHD genes have been performed for many species including moso bamboo (60), maize (67) [17], Populus trichocarpa (73) [18], rice (58) and Arabidopsis (70) [45]. The total members of PHD and genome size appeared not to be always related when the PHD gene family of the reported species was compared. For example, moso bamboo has much larger genome sizes than that of poplar and Arabidopsis but possesses fewer PHD genes than these two species. Therefore, PHD gene families may undergo differential expansion in different species. It was hypothesized that PHD gene family expansion might be related to three whole-genome duplication events in Arabidopsis [46] and the specific “Salicoid” duplication event in poplar [47]. In maize, the segmental duplication events were found to perform an important role in the amplification of the PHD gene family during the evolutionary process [17]. It was reported that potato genome had gone through at least two rounds of genome duplication; one ancient event might have occurred after the divergence between dicots and monocots approximately 185 ± 55 million years ago, and the recent duplication occurred approximately 67 million years ago [48]. In our study, a total of 72 nonredundant potato PHD genes were identified. Of these genes, 18.06% were identified from segmental duplication suggesting that whole genome duplication events performed essential roles during the amplification process of PHD gene family. Compared to random duplicated genes in maize (3.0%) and poplar (5.5%), more than two times the tandem duplicated genes (12.5%) were found in potato. These results suggest that the random duplicated event performed an important role during the PHD gene family expansion process in potato, which differed from that of maize and poplar. According to the phylogenetic tree (Fig 4), we found potato had many more members than Arabidopsis (5 times) and maize (8 times) in group K, furthermore, two paralog pairs (StPHD2/3 and StPHD21/22) were derived from random duplicated events distributed in this subfamily. These results further support the important role of random duplicated events during the PHD gene family expansion process in potato. Protein structure is closely linked with protein function. In our study, most of the StPHD proteins included in the same subfamily exhibited similar domain architectures, which indicated that they could perform similar functions. Furthermore, the structures and motif constitutions for the StPHD protein within each group were basically matched with the phylogenetic analysis (Figs 1 and 2). However, a few branches of the phylogenetic tree failed to categorize into any subfamily because of their low bootstrap values, which may be due to the relatively few feature positions except the highly conserved PHD domain. A similar phenomenon was found in other species, such as maize [17], Arabidopsis and poplar [18]. StPHD members appeared to have different patterns for gaining or losing introns since their origin, and the proteins in the same group were not entirely identical in terms of their structure and motifs (Figs 1 and 2). These results imply that the highly diverse characteristics of sequences among these genes, other than that of the conserved PHD-finger domain, have contributed to the functional diversity of the PHD in potato. Based on the RNA-seq data for potato, the expression of StPHD genes exhibited incongruous expression profiles in various tissues (Fig 6). Two genes, StPHD8 and StPHD29, showed the specific action of housekeeping expression and they displayed significant expression levels in most of the tissues. Studies for 6 pairs of tandem duplicated genes found that many of these paralogs (StPHD22, -67, -68, -33, -34 and -35) exhibited lower or no expression level in the investigated tissues; these results suggest that they may be expressed at specific developmental stages or have gone through pseudogenization. Most paralogs displayed similar expression patterns, and only a few segmental duplicated StPHD gene pairs displayed diverse expression patterns with each other (e.g., StPHD8 and StPHD44), suggesting that the functional differentiation may be a main character for the surviving duplicated genes [49]. It was reported that some PHD proteins were involved in abiotic stress responses [16]. For example, a subfamily of the maize PHD gene family was reported to be involved in abiotic stress response by RT-PCR [17]. According to the phylogenetic analysis, 7 StPHD proteins were grouped in the same cluster with these maize subfamily members (Fig 4), indicating that these 7 StPHD genes have the potential function of abiotic stress response in potato. This conclusion was verified by qRT-PCR results (Fig 8), and 7 StPHD genes showed varying response to heat, salt and drought treatments. To analyze the trend of the gene expression derived from qRT-PCR (Fig 8) and the FPKM values, we compared the results from these two different platforms (Fig 7). Similar tendency of the gene expression was found between the two different platforms. Two genes (StPHD6 and StPHD10) had the same expression trends in response to the three abiotic treatments. Four genes (StPHD24, -38, -46 and -49) had similar expression patterns in response to heat and salt stress. StPHD49 was extremely sensitive in both sets of results, whereas three genes (StPHD52, -60 and -61) had similar expression patterns in response to salt and drought stress. For other genes, the same direction changes were identified in response to one of the abiotic stress types. However, the results of qRT-PCR did not totally agree with the pattern of gene expression from RNA-seq data, and this observation may be due to several putative reasons. First, the potato varieties used in the two experiments were different; a doubled monoploid potato variety (DM) was used in RNA-seq [21], whereas autotetraploid cultivar Zhongshu 3 was employed in qRT-PCR. Second, the experimental treatments were different. The plants for qRT-PCR were exposed to a photoperiod of 16 h light/8 h dark in this study, whereas the plants for RNA-seq were placed in the dark environment [21]. Third, the relative expression data was generated by using the 2−ΔΔCt method depending on the SYBR, and the RNA-seq data used in the present study was indicated by FPKM. The FPKM value should be a good method to decrease sample differences, but it may cause deviation for highly expressed genes [50]. The bias of the FPKM value may result in different expression compared to that of qRT-PCR.

Potato PHD gene-specific primers used for qRT-PCR analysis.

(DOCX) Click here for additional data file.

Sequences of 20 predicted motifs of StPHD proteins.

(DOCX) Click here for additional data file.

Information about orthologs between potato and Arabidopsis.

(XLSX) Click here for additional data file.

Information about orthologs between potato and maize.

(XLSX) Click here for additional data file.

Multiple sequence alignment of PHD-finger domain of 72 potato PHD proteins.

Identical residues are labeled in black while similar residues are colored in pink, and the high conserved amino acids are positioned at the bottom of the figure. (TIF) Click here for additional data file.
  45 in total

1.  PERSISTENT TAPETAL CELL1 encodes a PHD-finger protein that is required for tapetal cell death and pollen development in rice.

Authors:  Hui Li; Zheng Yuan; Gema Vizcay-Barrena; Caiyun Yang; Wanqi Liang; Jie Zong; Zoe A Wilson; Dabing Zhang
Journal:  Plant Physiol       Date:  2011-04-22       Impact factor: 8.340

2.  The DUET gene is necessary for chromosome organization and progression during male meiosis in Arabidopsis and encodes a PHD finger protein.

Authors:  Thamalampudi Venkata Reddy; Jagreet Kaur; Bhavna Agashe; Venkatesan Sundaresan; Imran Siddiqi
Journal:  Development       Date:  2003-10-22       Impact factor: 6.868

3.  ExPASy: The proteomics server for in-depth protein knowledge and analysis.

Authors:  Elisabeth Gasteiger; Alexandre Gattiker; Christine Hoogland; Ivan Ivanyi; Ron D Appel; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  Genome-wide identification, classification and expression analysis of the PHD-finger protein family in Populus trichocarpa.

Authors:  Shengnan Wu; Min Wu; Qing Dong; Haiyang Jiang; Ronghao Cai; Yan Xiang
Journal:  Gene       Date:  2015-08-24       Impact factor: 3.688

5.  [GSDS: a gene structure display server].

Authors:  An-Yuan Guo; Qi-Hui Zhu; Xin Chen; Jing-Chu Luo
Journal:  Yi Chuan       Date:  2007-08

6.  Systematic analysis and comparison of the PHD-Finger gene family in Chinese pear (Pyrus bretschneideri) and its role in fruit development.

Authors:  Yunpeng Cao; Yahui Han; Dandan Meng; Muhammad Abdullah; Dahui Li; Qing Jin; Yi Lin; Yongping Cai
Journal:  Funct Integr Genomics       Date:  2018-04-20       Impact factor: 3.410

7.  HAT3.1, a novel Arabidopsis homeodomain protein containing a conserved cysteine-rich region.

Authors:  U Schindler; H Beckmann; A R Cashmore
Journal:  Plant J       Date:  1993-07       Impact factor: 6.417

8.  Identification of UHRF1/2 as new N-methylpurine DNA glycosylase-interacting proteins.

Authors:  Chao Liang; Xueli Zhang; Shanshan Song; Chunyan Tian; Yuxin Yin; Guichun Xing; Fuchu He; Lingqiang Zhang
Journal:  Biochem Biophys Res Commun       Date:  2013-03-26       Impact factor: 3.575

9.  The Arabidopsis OBERON1 and OBERON2 genes encode plant homeodomain finger proteins and are required for apical meristem maintenance.

Authors:  Shunsuke Saiga; Chihiro Furumizu; Ryusuke Yokoyama; Tetsuya Kurata; Shusei Sato; Tomohiko Kato; Satoshi Tabata; Mitsuhiro Suzuki; Yoshibumi Komeda
Journal:  Development       Date:  2008-04-09       Impact factor: 6.868

10.  Systematic Analysis of the Maize PHD-Finger Gene Family Reveals a Subfamily Involved in Abiotic Stress Response.

Authors:  Qianqian Wang; Jinyang Liu; Yu Wang; Yang Zhao; Haiyang Jiang; Beijiu Cheng
Journal:  Int J Mol Sci       Date:  2015-09-30       Impact factor: 5.923

View more
  4 in total

1.  Identification of novel PHD-finger genes in pepper by genomic re-annotation and comparative analyses.

Authors:  Ji-Yoon Guk; Min-Jeong Jang; Seungill Kim
Journal:  BMC Plant Biol       Date:  2022-04-20       Impact factor: 5.260

2.  Response of phytohormone mediated plant homeodomain (PHD) family to abiotic stress in upland cotton (Gossypium hirsutum spp.).

Authors:  Huanhuan Wu; Lei Zheng; Ghulam Qanmber; Mengzhen Guo; Zhi Wang; Zuoren Yang
Journal:  BMC Plant Biol       Date:  2021-01-06       Impact factor: 4.215

3.  A Large-Scale Genomic Association Analysis Identifies the Candidate Genes Regulating Salt Tolerance in Cucumber (Cucumis sativus L.) Seedlings.

Authors:  Dongrang Liu; Shaoyun Dong; Han Miao; Xiaoping Liu; Caixia Li; Jianan Han; Shengping Zhang; Xingfang Gu
Journal:  Int J Mol Sci       Date:  2022-07-27       Impact factor: 6.208

4.  Genome-wide identification, evolution, and transcript profiling of Aldehyde dehydrogenase superfamily in potato during development stages and stress conditions.

Authors:  Md Sifatul Islam; Md Soyib Hasan; Md Nazmul Hasan; Shamsul H Prodhan; Tahmina Islam; Ajit Ghosh
Journal:  Sci Rep       Date:  2021-09-14       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.