Literature DB >> 32743916

A module of multifactor-mediated dysfunction guides the molecular typing of coronary heart disease.

Yuewei Li^1,2, Maohuan Lin^1,2, Kangjie Wang³, YaQing Zhan⁴, Wenli Gu^1,2, Guanghao Gao^1,2, Yuna Huang^1,2, Yangxin Chen^1,2, Tucheng Huang^1,2, Jingfeng Wang^1,2.

Abstract

BACKGROUND: Coronary atherosclerotic heart disease (CHD) is the most common cardiovascular disease and has become a leading cause of death globally. Various molecular typing methods are available for the diagnosis and treatment of tumors. However, molecular typing results are not routinely used for CHD. METHODS AND
RESULTS: Aiming to uncover the underlying molecular features of different types of CHD, we screened the differentially expressed genes (DEGs) associated with CHD based on the Gene Expression Omnibus (GEO) data and expanded those with the NCBI-gene and OMIM databases to finally obtain 2021 DEGs. The weighted gene co-expression analysis (WGCNA) was performed on the candidate genes, and six distinctive WGCNA modules were identified, two of which were associated with CHD. Moreover, DEGs were mined as key genes for co-expression based on the module network relationship. Furthermore, the differentially expressed miRNAs in CHD and interactions in the database were mined in the GEO data set to build a multifactor regulatory network of key genes for co-expression. Based on the network, the CHD samples were further classified into five clusters and we defined FTH1, HCAR3, RGS2, S100A9, and TYROBP as the top genes of the five subgroups. Finally, the mRNA levels of FTH1, S100A9, and TYROBP were found to be significantly increased, while the expression of HCAR3 was decreased in the blood of CHD patients. We did not detect measurable levels of RGS2.
CONCLUSION: The screened core clusters of genes may be a target for the diagnosis and treatment of CHD as a molecular typing module.

Entities: Chemical Disease Gene Species

Keywords: Coronary heart disease (CHD); GEO; WGCNA; difference analysis; disease classification; network mining

Mesh：

Substances：

Year: 2020 PMID： 32743916 PMCID： PMC7549572 DOI： 10.1002/mgg3.1415

Source DB: PubMed Journal: Mol Genet Genomic Med ISSN： 2324-9269 Impact factor: 2.183

INTRODUCTION

Coronary atherosclerotic heart disease (CHD), also known as coronary heart disease, is caused by coronary atherosclerosis resulting in stenosis or occlusion of the cardiac coronary arteries. This may lead to myocardial ischemia and hypoxia in patients (Arnett et al., 2019; Thygesen et al., 2018). Over the past century, a dramatic increase in coronary heart disease has been seen in developed and developing countries. Diagnosis of CHD relies on clinical manifestation, electrocardiogram data, as well as levels of cardiac enzymes and troponin. The complexity of this diagnosis in an emergency setting necessitates rapid diagnosis, especially in developing countries. When choosing conservative or invasive treatment for CHD, each patient must weigh the risk of invasive treatment complications, such as the possibility of restenosis or occlusion. Therefore, individualized risk stratification and treatment are urgently needed. Molecular typing facilitates rapid screening and diagnosis and may guide further individualized treatment. In 1999, the National Cancer Institute (NCI) put forward the concept of tumor molecular typing, based on molecular features through comprehensive molecular analysis techniques, to transform the basis of tumor classification from morphology to a new tumor classification system. Perou et al. proposed the molecular typing concept (Perou et al., 2000), for which various molecular typing methods can be used to diagnose and treat cancers. However, molecular typing results are currently not routinely used for CHD. Therefore, we used bioinformatics analysis to identify the molecular typing of CHD. We validated multifactor‐mediated dysfunction, predicted molecular typing, and confirmed in blood samples of patients with CHD.

MATERIALS AND METHODS

Data sets

All data sets we used were downloaded from the Gene Expression Omnibus (GEO submission: GSE66306, GSE64554, GSE59421, GSE28858, and GSE64563). Two mRNA expression profiles (GSE66360 and GSE64554) of human CHD and healthy samples were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) (Barrett et al., 2013). The numbers of each data set are depicted in Table 1. To obtain miRNA profiles of human CHD, the GSE59421, GSE28858, and GSE64563 data sets were fetched from the GEO database. The numbers of each data set are shown in Table 2. CHD‐related genes were searched and extracted from the NCBI‐gene and OMIM databases.

Table 1

GEO database mRNA expression profile data sets

Data set ID	Platform	Tissue	CHD	Normal
GSE66360	GPL570	Whole Blood	49	50
GSE64554	GPL6947	EAT	13	10
GSE64554		SAT	13	10

Abbreviations: CHD, coronary atherosclerotic heart disease; EAT, epicardial adipose tissue; SAT, subcutaneous adipose tissue.

Table 2

GEO database miRNA expression profile data sets

Data set ID	Platform	Tissue	CHD	Normal
GSE59421	GPL10850	Blood	33	63
GSE28858	GPL8179	Blood	12	12
GSE64563	GPL8179	EAT	13	10
GSE64563	GPL8179	SAT	13	10

Abbreviations: CHD, coronary atherosclerotic heart disease; EAT, epicardial adipose tissue; SAT, subcutaneous adipose tissue.

GEO database mRNA expression profile data sets Abbreviations: CHD, coronary atherosclerotic heart disease; EAT, epicardial adipose tissue; SAT, subcutaneous adipose tissue. GEO database miRNA expression profile data sets Abbreviations: CHD, coronary atherosclerotic heart disease; EAT, epicardial adipose tissue; SAT, subcutaneous adipose tissue.

Methods

Ethical compliance

The study protocol was approved by the Ethics Committee of Sun Yat‐sen Memorial Hospital (affiliated to Sun Yat‐sen University, Guangzhou, China). Written informed consent was obtained from each participant.

Data preprocessing and differential gene analysis

The data processing flowchart is depicted in Figure 1. The downloaded data sets were log2‐transformed and we quantile‐normalized signal intensity. First, after corresponding probes to genes/miRNAs, we removed unloaded probes. Second, we selected the expression median of probes as the expression value of the genes/miRNAs, if more than one probe was mapped to the same gene. Then, we used the R package Limma to perform differential expression analysis of the processed genes/miRNAs. Finally, differentially expressed genes/miRNAs between CHD and control samples were selected by fold changes and significance of differentially expressed analysis.

Figure 1

Data processing flowchart

Candidate gene sets and expression data

Candidate gene sets were mined through the merging of DEGs and CHD‐related genes from the GENE and OMIM databases, and redundancy was removed. Candidate genes also expressed in the GSE66360 data set were selected for further analysis.

Weighted gene co‐expression network analysis (WGCNA)

As a systems‐biology approach, WGCNA was applied to conduct a scale‐free network using genes expression data (Zhang & Horvath, 2005). First, we constructed a similarity matrix based on gene expression data. The absolute value of the Pearson correlation coefficient between two genes was calculated using Formula 1, where x and y represent the gene i and gene j expression values, respectively. Formula 1 Next, we converted the similarity matrix of gene expression into an adjacency matrix using Formula 2, with the network type assigned, in which β was the soft threshold. We evaluated the Pearson correlation for each pair of genes to the power of β. As a result, strong correlations were strengthened while weak ones weakened at the exponential level. Formula 2 Then, we transformed the adjacency matrix into topological overlap measure (TOM) using Formula 3, to depict the degree of correlation between genes. Formula 3 TOM represents the degree of dissimilarity between gene i and gene j. With 1‐TOM used as the distance metric, hierarchical clustering was performed on genes. Then, the dynamic shear tree method was used to identify modules. Module eigenvector genes (ME), the most representative ones, and the first principal component in each module, were calculated by Formula 4, in which i represents the gene, and l represents the chip sample in module q. Formula 4 A gene module membership (MM) was evaluated by the Pearson correlation between the gene expression profile in all samples and the ME expression profile in the module. MM was calculated with Formula 5, where x represents the expression profile of gene i, MEq represents the ME of module q and represents the MM of gene i in module q. When , gene iwas excluded from the module q. The closer was to +1 or −1, the more highly correlated gene iwas with module q. The plus or minus sign indicates whether there was a positive or negative correlation between gene i and module q. Formula 5 We used gene significance (GS) to assess the association between genes and external information. A higher GS indicates that the gene has a higher biological significance. When GS = 0, it can be considered that the gene is not involved in the biological process. The WGCNA was constructed to identify CHD‐correlated modules for the selected gene expression data using the WGCNA R package. Gene set enrichment analysis was conducted using the ClusterProfiler R package to evaluate the significant gene sets involved in the CHD‐correlated modules using GO and KEGG annotation pathways. A false discovery rate below 0.05 was considered significantly enriched. The interaction subnet was constructed for the co‐expression gene sets in the CHD‐correlated modules, and the core genes of the module were screened according to the node degree of the network.

Construction of a multi‐factor regulatory network

LncRNAs and miRNAs that interacted with core genes were screened out from starBase v2.0 database (http://starbase.sysu.edu.cn/starbase2/), which included miRNA‐lncRNA and miRNA‐mRNA interaction pairs (Li, Liu, Zhou, Qu, & Yang, 2014). Transcription factors (TF) regulating the expression of core genes were screened from the TRRUST v2 database, which included the data of human TF‐mRNA interaction pairs. Based on the identified interaction pairs, a multifactor regulatory network was constructed, including mRNA, TF, lncRNA, and miRNA.

CHD molecular typing mediated by core genes

CHD samples were classified according to the module core gene pairs. The optimal K value (number of categories) was determined by finding the best sum of squared error (SSE) inflection point. CHD was divided into different subclasses by using the unsupervised clustering method K‐means combined with t‐SNE dimension reduction. We examined the expression patterns of these core genes in different subclasses and analyzed the genes with significant differences in different subclasses to find potential marker genes of CHD subclasses.

Blood samples preparation

Blood samples were collected from CHD patients and healthy controls. Five milliliters of peripheral venous blood was collected in an anticoagulant tube with ethylenediaminetetraacetic acid (EDTA). Subsequently, the blood samples were centrifuged in primary blood collection tubes for 10 min at 1000 × g and 4 °C using a swinging bucket rotor. The substratum blood cells were stored at –80 °C until analysis.

Reverse‐transcription polymerase chain reaction

Total RNA of blood cells was extracted using Trizol reagent (Invitrogen). Individual RNA samples were reversely transcribed into cDNA using PrimeScript® RT Master Mix (Takara, Japan) according to the manufacturer's instructions. The primers used are listed in Table 3. The relative levels of target gene mRNA transcripts were determined by quantitative real‐time polymerase chain reaction (qRT‐PCR) using the FastStart Universal SYBR Green Master (Roche Applied Sciences) and specific primers on a Roche LightCycler 480 Real‐Time PCR System.

Table 3

List of primers used in qRT‐PCR

	Sense primers (5′−3′)	Antisense primers (5′−3′)
FTH1	GCTTGGCGGAATATCTCTT	AACTGAACAACGGCACTTA
RGS2	GAATCAGGAAGCCAGTAACT	TCAACACCATAGCACTCATT
S100A9	GCTGGAACGCAACATAGA	CTCCTGATTAGTGGCTGTG
TYROBP	CTGCTGGCTGTAAGTGATT	GTGTTGAGGTCGCTGTAG
HCAR3	TGGCGGTAGACAGGTATT	TGAGCAGAACAGGATGATG

Abbreviations: FTH1, ferritin heavy chain 1; HCAR3: hydroxycarboxylic acid receptor 3; RGS2, regulator of G protein signaling 2; S100A9, S100 calcium binding protein A9; TYROBP, transmembrane immune signaling adaptor TYROBP.

List of primers used in qRT‐PCR Abbreviations: FTH1, ferritin heavy chain 1; HCAR3: hydroxycarboxylic acid receptor 3; RGS2, regulator of G protein signaling 2; S100A9, S100 calcium binding protein A9; TYROBP, transmembrane immune signaling adaptor TYROBP.

RESULT

Differential mRNA gene screening

To obtain the mRNA expression profile of human CHD, we downloaded two data sets, GSE66360 and GSE64554, from the GEO database. The GSE64554 data set was divided into EAT and SAT groups based on tissue locations. Therefore, there were 49_vs_50, 13_vs_10, and 13_vs_10 samples (CHD vs. normal) in the three groups, respectively. The R package Limma was used for differential analysis among the three groups. Differentially‐expressed genes (DEGs) were selected based on the fold change (|logFC|>0.584962501) and the significance threshold (adj. p. Val <0.01). We selected 921 DEGs from data set GSE66360, among which 561 DEGs were upregulated while 360 were downregulated, 330 DEGs from GSE64554_EAT, among which 175 DEGs were upregulated while 173 were downregulated, 407 DEGs from GSE64554_SAT, among which there were 175 upregulated DEGs and 232 downregulated (Table 4). The volcano diagram of differential genes in each group is shown in Figure 2a. (Partial genes were not concentrated; therefore, only the majority is shown).

Table 4

Statistical table of differential expressed genes

Data set ID	DEG num	Up	Down
GSE66360	921	561	360
GSE64554_EAT	330	157	173
GSE64554_SAT	407	175	232

Differential expressed genes in the Data set of GSE66360, GSE64554_EAT and GSE64554_SAT.

Abbreviations: EAT, epicardial adipose tissue; SAT, subcutaneous adipose tissue.

Figure 2

Candidate gene sets were selected from DEGs. (a) DEGs volcano map mined from the GSE66360 and GSE64554 datasets. (b) Venn diagram of CHD‐related genes from different sources

Statistical table of differential expressed genes Differential expressed genes in the Data set of GSE66360, GSE64554_EAT and GSE64554_SAT. Abbreviations: EAT, epicardial adipose tissue; SAT, subcutaneous adipose tissue. Candidate gene sets were selected from DEGs. (a) DEGs volcano map mined from the GSE66360 and GSE64554 datasets. (b) Venn diagram of CHD‐related genes from different sources

Candidate gene sets

We downloaded 797 CHD‐related genes (CRGs) from the GENE database and 115 CRGs from the OMIM database. By merging DEGs and CRGs from both databases and removing redundancy, 2339 candidate genes were selected. Of those, 2021 genes expressed in GSE66360 were selected for the next WGCNA (Figure 2b).

Construction of WGCNA for candidate gene sets

We constructed a weighted co‐expression network for candidate gene sets with the R software package WGCNA. The cluster suggested that eight samples were outliers, and the remaining 91 samples were analyzed. This analysis indicated that the network of gene co‐expression accorded with the scale‐free network. In other words, the logarithm log (k) of the node with k connectivity was negatively correlated with the logarithm log (P (k)) of the probability of the node, and the correlation coefficient was higher than 0.8. To ensure that the expression network was a scale‐free one, we set β = 6 (Figure S1). Next, we transformed the expression matrix into an adjacency matrix, which was subsequently transformed into a topological matrix. In the light of TOM, genes were clustered using the average‐linkage hierarchical clustering method (Langfelder & Horvath, 2008). The minimum number of genes in each gene network module was set to 30 based on hybrid dynamic shear pruning tree criteria. After using the dynamic shearing method to determine the gene module, the eigengenes value of each module was calculated at once. A clustering analysis was then performed on the modules. Next, we merged modules with close distances. With height set to 0.25, six modules were obtained (Figure 3a). The number of genes in each module is shown in Table 5.

Figure 3

Table 5

Statistical results of corresponding genes for each module

Module	No. of genes
Blue	536
Brown	448
Green	41
Grey	119
Turquoise	711
Yellow	166

Six modules were defined by using weighted gene co‐expression network analysis. There were 536 genes in blue modules, 448 genes in brown modules, 41 genes in green modules, 119 genes in grey modules, 711 genes in turquoise modules, 166 genes in yellow modules, respectively.

The results of WGCNA. (a) Gene dendrogram and module colors; (b) Module‐trait relationships; (c) Gene significance across modules. (d–e）Relationships between module membership and gene significance in CHD. Statistical results of corresponding genes for each module Six modules were defined by using weighted gene co‐expression network analysis. There were 536 genes in blue modules, 448 genes in brown modules, 41 genes in green modules, 119 genes in grey modules, 711 genes in turquoise modules, 166 genes in yellow modules, respectively. The Pearson correlation coefficient was calculated for the ME of each module and CHD sample. The higher the coefficient, the more important the module was considered. The rows in Figure 3b represent the eigenvector genes of each module, and the correlation coefficients are presented from high to low (red to blue). The numbers in each cell represent the correlation coefficient between the gene module and the CHD sample, while the numbers in brackets represent the p value of significance. The GS value of each gene module was then calculated. The larger the GS value, the more relevant the module was for the CHD sample. From the results of the above two methods of analyzing the correlation between modules and phenotypes, the modules most relevant to CHD were colored yellow and blue (Figure 3c). Pathway enrichment analysis was conducted on the yellow gene and blue gene modules (only the top 10 are shown) (Figure S2). Genes in the yellow module were associated with biological processes such as positive regulation of cell activation, muscle cell proliferation, cell activation, and regulation of leukocyte‐cell adhesion. In contrast, genes in the blue module were associated with signaling pathways such as fluid shear stress and atherosclerosis, and the NF‐kappa B signaling pathway. Moreover, genes in both modules correlated with CHD (Figure 3d,e).

Module core gene analysis

To determine the CHD‐related core gene, functional analysis of the yellow and blue modules was performed. Based on the gene expression relationship in the yellow module, a co‐expression network diagram was constructed with co‐expression pairs with the connection threshold ≥0.13 as its edge, and the genes with node degree ≥10 (N = 10) as its core genes. Similarly, based on the blue module's gene expression relationships, we created a co‐expression network diagram with co‐expression pairs with the connection threshold ≥0.25 as its edge, and the genes with node degree ≥10 (N = 9) as its core genes (Figure 4a). Finally, 19 genes were determined as co‐expression key genes.

Figure 4

Module core gene analysis and construction of multifactor regulation network. (a) module gene network visualization. (b) Multifactor regulation network: yellow represents mRNA, green represents TF, purple represents miRNA, and red represents lncRNA. (c) Enrichment analysis of target genes of core regulators

Differential miRNA screening

miRNA data of the GSE59421, GSE28858, and GSE64563 CHD and control samples were fetched from the GEO database. The GSE64563 data set was divided into EAT and SAT groups based on tissue locations. There were 33_vs_63, 12_vs_12, 13_vs_10, and 13_vs_10 samples (CHD vs. normal) in the four groups. The R package Limma was used for analysis between the four groups, among which DEGs were selected based on the difference multiple (|logFC|>0.2) and significance threshold (adj. p. Val <0.05). We selected 38 differentially‐expressed miRNAs from dataset GSE59421, among which 9 miRNAs were upregulated and 19 were downregulated, 138 differentially expressed miRNAs from data set GSE28858, among which 74 miRNAs were upregulated and 64 were downregulated, 47 differentially expressed miRNAs from GSE64563_EAT, among which 17 miRNAs were upregulated and 30 were downregulated, 28 differentially expressed miRNAs from GSE64563_SAT, among which 12 were upregulated and 16 were downregulated (Table 6). Differential genes from each group were displayed in a volcano diagram (Figure S3a). (Partial miRNAs were not concentrated, so only the majority is shown). We acquired 227 candidate miRNAs by merging the differentially expressed miRNAs of the four groups (Figure S3b).

Table 6

Differential Expressed miRNAs

Data set ID	DEG num	Up	Down
GSE59421	38	9	29
GSE28858	138	74	64
GSE64563_EAT	47	17	30
GSE64563_SAT	28	12	16

Differential expressed miRNAs in the Data set of GSE59421, GSE28858, GSE64563_EAT and GSE64563_SAT. EAT: Epicardial Adipose Tissue; SAT: Subcutaneous Adipose Tissue.

Differential Expressed miRNAs Differential expressed miRNAs in the Data set of GSE59421, GSE28858, GSE64563_EAT and GSE64563_SAT. EAT: Epicardial Adipose Tissue; SAT: Subcutaneous Adipose Tissue.

Construction of multifactor regulatory network

From the starBase v2.0 database, the number of supporting experiments ≥high stringency (≥3), number of cancer types (pan‐cancer) ≥1 cancer type, miRNA‐lncRNA interactions were screened, and 376 miRNA‐lncRNAs were obtained. In addition to the above two conditions, the screening of miRNA‐mRNA interactions from starBase v2.0 also met the condition that the interaction pairs could be identified by at least one of these software packages: targetScan, picTar, RNA22, PITA, or miRanda/mirSVR. Finally, 160,774 miRNA‐mRNA interactions were obtained. Human TF‐mRNA regulatory relationships were obtained from the TRRUST v2 database, resulting in 9396 TF‐mRNA interactions. Firstly, TF interacting with the 19 co‐expressed key genes were selected. Next, five miRNAs, which interacted with co‐expressed key genes and belonged to candidate miRNA, were selected as the final miRNA. Then, we screened for lncRNA interacting with these five miRNAs. Finally, a multifactor regulatory network containing 63 interaction pairs was obtained (Figure 4b). The degree of each regulator (miRNA, lncRNA, TF) was calculated, and the top 10 regulators were defined as core regulators (degree >1) (Table 7). To explore the functions and signaling pathways of these core TFs, 5,706 target genes interplaying with core TFs were selected from miRNA‐lncRNA, miRNA‐mRNA, and TF‐mRNA interactions. Pathway enrichment analysis on target genes revealed that these genes were related to the TNF signaling pathway and protein processing in the endoplasmic reticulum (Figure 4c).

Table 7

Core regulatory factors

Factor	Spe	Degree
NFKB1	TF	4
RELA	TF	4
CEBPB	TF	3
MALAT1	lncRNA	2
hsa‐miR‐1	miRNA	2
hsa‐miR‐137	miRNA	2
hsa‐miR‐206	miRNA	2
CEBPA	TF	2
ETS2	TF	2
PPARG	TF	2

10 core regulators were screened out by constructing a multifactor regulatory network.

Core regulatory factors 10 core regulators were screened out by constructing a multifactor regulatory network.

CHD clustering based on dysfunction module genes

The 19 co‐expressed key genes were mapped to GSE66360 for unsupervised clustering, selecting the CHD samples to be classified by K‐means unsupervised clustering. First, the optimal value of K was determined by the SSE node, which represents the sum of the squares of the distances from all points to the center of the cluster to which they belong. As depicted in Figure S4a, SSE decline slowed down when K > 5, therefore, we defined K = 5. We used the R software package Rtsne for dimension reduction of the gene expression data. As displayed in Figure S4b, all CHD samples could be classified into five categories. The expression of co‐expressed key genes in all CHD samples is shown in Figure 5a. Although there was no taxonomic difference in the expression of a single gene, the results were highly consistent with those combining 19 co‐expressed key genes in Figure S4b. Therefore, we inferred that these 19 co‐expressed key genes were of significance for CHD typing.

Figure 5

The expression of co‐expressed key genes in CHD samples or in each cluster and validation in blood samples. (a,b) the expression of 19 co‐expressed key genes in CHD samples or in each cluster. (c–g) Gene expression in different clusters (each dot represents a sample). (h) mRNA expression of S100A9, HCAR3, TYROBP, and FTH1 in blood samples To further explain the differential expression of the 19 co‐expressed key genes in different clusters, the mean value of each gene in each category was viewed as the expression value of genes in this category. FTH1, HCAR3, RGS2, S100A9, and TYROBP were differentially expressed in five clusters (Figure 5b). To explore the expression of these five genes in different CHD subtypes, we looked separately at the expression of these five genes in all CHD samples. We found significant differences among the five genes in different categories (p < 0.01), suggesting these genes could be used as marker genes for different CHD subcategories (Figure 5c–g).

Verification of the DEGs in patient samples

To estimate the levels of the five DEGs likely implicated in CHD molecular typing, RT‐qPCR was performed on blood samples from CHD patients and healthy volunteers. As shown in Figure 5h, the mRNA expression levels of S100A9, TYROBP, and FTH1 in the blood of CHD patients were significantly higher than that in healthy subjects, while a decreased expression level of HCAR3 was observed in patients with CHD. However, RGS2 could hardly be detected at the transcriptional level. These results indicate that S100A9, TYROBP, FTH1, and HCAR3 could be used for classifying the molecular typing of CHD, while RGS2 should be excluded as it could not be detected.

DISCUSSION

This study was motivated by the molecular typing for cancers, which could identify the molecular typing based on their gene expression profiles or network data. We screened the differential expression of genes associated with CHD based on the GEO data and expanded those data with the NCBI‐gene and OMIM databases to obtain the candidate genes associated with CHD. Next, a co‐expression network analysis was performed on the candidate genes for CHD using WGCNA, and the two most relevant modules of CHD were mined, including 75 CHD and 70 healthy mRNA samples. Both modules correlated with CHD, indicating that the mining results of WGCNA were ideal. Core genes were mined as key genes for co‐expression based on their module network relationship, and the differential miRNAs of CHD and interactions in the database were mined in the GEO data set to build a multi‐factor regulatory network of key genes for co‐expression. The functions of target genes of core regulators and CHD correlated, indicating that the co‐expressed key genes obtained by mining are valuable. The classification of CHD samples based on the co‐expression of key genes suggested significant differences in five genes between the analyzed subgroups, and these five genes could be used as biomarker genes to differentiate CHD subtypes. FTH1, HCAR3, RGS2, S100A9, and TYROBP genes were differentially expressed in five clusters. Several studies have shown that S100A9 (acute coronary syndrome biomarker) is related to CHD (Buyukterzi et al., 2017; Li et al., 2019). The FTH1 gene encodes the heavy subunit of ferritin, the major intracellular iron storage protein in prokaryotes and eukaryotes. It is a substrate of ferroptosis, which emerges to play vital roles in CHD (Fisher et al., 2007; Kobayashi et al., 2018, Sukseree et al., 2020). Mitochondrial ferritin is a protein that regulates iron storage and H‐ferritin‐like protein, which exists only in mitochondria and possesses ferroxidase activity, plays a critical role in cell survival and reactive oxygen species regulation in acute myocardial injury (Lee et al., 2014), including percutaneous coronary intervention. However, the physiological roles of FTH1 in CHD are poorly understood. In this study, we verified by RT‐qPCR in blood samples that FTH1 was significantly increased in CHD patients. We speculate that patients with coronary heart disease with elevated FTH1 may have plaque rupture or myocardial injury, but whether it is associated with coronary no‐reflow injury or myocardial fibrosis remains to be confirmed. HCAR3 is a hydroxycarboxylic acid receptor involved in fatty acid metabolism (Offermanns et al., 2011). Human HCAR3 is activated by MAPK cascades in the PLC/PKC and MMP/EGFR pathways, which substantially increases ATP production by enhancing free fatty acid oxidation under conditions such as diabetic ketoacidosis and fasting (Offermanns et al., 2011; Zhou et al., 2012). Free fatty acid oxidation is vital in myocardial metabolism. The myocardial free fatty acid metabolism of CHD patients with elevated HCAR3 might be better than those with low levels, affecting CHD symptoms. Symptoms of coronary heart disease depend on the balance between oxygen and energy metabolism. RGS2 has GTPase‐activating protein activity and acts as an essential controller of the magnitude and kinetics of cell responses after GPCR activation (Biesemann et al., 2014). Though we could not detect RGS2 mRNA in human blood, previous research indicated that the upregulation of RGS2 protected against myocardial injury via inhibition of cAMP levels and maintenance of cardiomyocyte contractility in a failing heart (Sjogren, Parra, Atkins, Karaj, & Neubig, 2016). Moreover, RGS2 facilitates uterine artery adaptation during pregnancy in mice (Koch, Dahlen, Owens, & Osei‐Owusu, 2019), but it remains unclear whether it increases coronary artery perfusion in CHD patients, in particular in acute myocardial infarction patients. As a biomarker of immunity and inflammation, S100A9 is related to CHD, especially acute coronary syndrome (Buyukterzi et al., 2017; Li et al., 2019). One study showed that S100A9 blockage protects against myocardial infarction by reducing postischemic myocardial damage and improving cardiac function (Marinkovic, 2019). TYROBP is a key regulator of the immune system and is significantly upregulated in the brains of patients with Alzheimer's disease (Haure‐Mirande et al., 2017). Two co‐expression analyses showed that TYROBP is a promising target for preventing unstable carotid plaque formation in the carotid artery (Chen et al., 2019; Liu, Huan, Wu, Zou, & Qu, 2020). Whether TYROBP plays a role in unstable plaque in CHD remains unclear, and further research is required to confirm such effects. FTH1, HCAR3, S100A9, and TYROBP genes were differentially expressed in the blood of CHD patients, but the significance and function of these genes involved in the development of CHD are different. Perhaps the molecular typing of CHD can be guided by the changes and dysfunction of these five genes to guide the individual diagnosis and treatment of CHD patients. All in all, HCAR3, RGS2, and TYROBP may play a beneficial role for CHD patients, but FTH1 and S100A9 may adversely affect CHD patients. In this study, we used co‐expression analysis to explore the core regulators (miRNA, lncRNA, TF) and performed an enrichment analysis on target genes to reveal the mechanisms behind CHD. We found that these clusters of genes were related to the TNF signaling pathway, the hepatitis B pathway, and protein processing in the endoplasmic reticulum. TNF‐α interacts with apoptosis and autophagy in CHD (Dong et al., 2019). Several studies have shown that endoplasmic reticulum stress affects coronary artery endothelial cell function in patients with CHD and participates in CHD development. Abnormal endoplasmic reticulum function affects protein synthesis and disturbs cellular metabolism (Bi et al., 2018; Chang et al., 2019; Li, Hu, Gong, & Yan, 2011). These core factors show the different mechanisms involved in CHD, which may lead to distinctive, individualized treatment of CHD. Therefore, these co‐expression clusters of genes contribute to the classification of CHD molecular typing and could guide the clinical management of the disease. However, we only verified our results in a limited number of patients. The molecular typing guided by the dysfunction of different factors described in this study needs further research to confirm and verify its impact on clinical management and prognosis.

CONCLUSION

Taken together, the hub‐genes FTH1, HCAR3, RGS2, S100A9, and TYROBP are differentially expressed in five clusters associated with CHD, and the top microRNAs hsa‐miR‐1, hsa‐miR‐206, and hsa‐miR‐137, the top TF NFKB1, RELA, CEBPB, CEBPA, ETS2, and PPARG, the LncRNA MALAT1, may be potential targets for diagnosis and treatment of CHD. The core genes may contribute to the classification of the molecular typing of CHD. Further research to confirm and verify its impact on clinical management and prognosis is required.

Limitations

There were several limitations to this study. First, this study was based on bioinformatics analysis of gene expression and did not include assessment of changes in proteins. Therefore, some translational modifications or posttranslational modifications were not considered. Second, the small sample size for validation could not fully explain the scientific issues. Third, the study lacked mechanism verification, and the roles of the dysfunction of the core genes in molecular typing needs to be confirmed. Large and prospective clinical studies are necessary to validate our results.

CONFLICT OF INTEREST

The authors declare that there is no conflict of interests.

AUTHOR CONTRIBUTIONS

The contributions of the authors are as follows: data analysis, Yuewei Li, Maohuan Lin, Kangjie Wang; sample collection, YaQing Zhan, Wenli Gu, Guanghao Gao, Yuna Huang, Yangxin Chen; experiment, Yuewei Li, Maohuan Lin; writing, Yuewei Li, Tucheng Huang and Jingfeng Wang; overall concept and coordination: Yuewei Li, Tucheng Huang and Jingfeng Wang. Fig S1 Click here for additional data file. Fig S2 Click here for additional data file. Fig S3 Click here for additional data file. Fig S4 Click here for additional data file.

27 in total

1. Ferritin: a novel mechanism for delivery of iron to the brain and other organs.

Authors: J Fisher; K Devraj; J Ingram; B Slagle-Webb; A B Madhankumar; X Liu; M Klinger; I A Simpson; J R Connor
Journal: Am J Physiol Cell Physiol Date: 2007-04-25 Impact factor: 4.249

Review 2. Fourth Universal Definition of Myocardial Infarction (2018).

Authors: Kristian Thygesen; Joseph S Alpert; Allan S Jaffe; Bernard R Chaitman; Jeroen J Bax; David A Morrow; Harvey D White
Journal: J Am Coll Cardiol Date: 2018-08-25 Impact factor: 24.094

3. Inhibition of pro-inflammatory myeloid cell responses by short-term S100A9 blockade improves cardiac function after myocardial infarction.

Authors: Goran Marinković; Helena Grauen Larsen; Troels Yndigegn; Istvan Adorjan Szabo; Razvan Gheorghita Mares; Lisa de Camp; Matthew Weiland; Lukas Tomas; Isabel Goncalves; Jan Nilsson; Stefan Jovinge; Alexandru Schiopu
Journal: Eur Heart J Date: 2019-08-21 Impact factor: 29.983

4. Molecular portraits of human breast tumours.

Authors: C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal: Nature Date: 2000-08-17 Impact factor: 49.962

5. Myostatin regulates energy homeostasis in the heart and prevents heart failure.

Authors: Nadine Biesemann; Luca Mendler; Astrid Wietelmann; Sven Hermann; Michael Schäfers; Marcus Krüger; Thomas Boettger; Thilo Borchardt; Thomas Braun
Journal: Circ Res Date: 2014-05-07 Impact factor: 17.367

6. Digoxin-Mediated Upregulation of RGS2 Protein Protects against Cardiac Injury.

Authors: Benita Sjögren; Sergio Parra; Kevin B Atkins; Behirda Karaj; Richard R Neubig
Journal: J Pharmacol Exp Ther Date: 2016-03-03 Impact factor: 4.030

7. NCBI GEO: archive for functional genomics data sets--update.

Authors: Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal: Nucleic Acids Res Date: 2012-11-27 Impact factor: 16.971

8. Deficiency of TYROBP, an adapter protein for TREM2 and CR3 receptors, is neuroprotective in a mouse model of early Alzheimer's pathology.

Authors: Jean-Vianney Haure-Mirande; Mickael Audrain; Tomas Fanutza; Soong Ho Kim; William L Klein; Charles Glabe; Ben Readhead; Joel T Dudley; Robert D Blitzer; Minghui Wang; Bin Zhang; Eric E Schadt; Sam Gandy; Michelle E Ehrlich
Journal: Acta Neuropathol Date: 2017-06-13 Impact factor: 17.088

9. ZYZ-803 Mitigates Endoplasmic Reticulum Stress-Related Necroptosis after Acute Myocardial Infarction through Downregulating the RIP3-CaMKII Signaling Pathway.

Authors: Lingling Chang; Zhijun Wang; Fenfen Ma; Bahieu Tran; Rui Zhong; Ying Xiong; Tao Dai; Jian Wu; Xiaoming Xin; Wei Guo; Ying Xie; Yicheng Mao; Yi-Zhun Zhu
Journal: Oxid Med Cell Longev Date: 2019-06-02 Impact factor: 6.543

10. IGFBP6 Is Downregulated in Unstable Carotid Atherosclerotic Plaques According to an Integrated Bioinformatics Analysis and Experimental Verification.

Authors: Yandong Liu; Wei Huan; Jianjin Wu; Sili Zou; Lefeng Qu
Journal: J Atheroscler Thromb Date: 2020-02-08 Impact factor: 4.928

5 in total

1. Construction and validation of molecular subtypes of coronary artery disease based on ferroptosis-related genes.

Authors: Lina Ding; Fei Long; Dan An; Jing Liu; Guannan Zhang
Journal: BMC Cardiovasc Disord Date: 2022-06-22 Impact factor: 2.174

2. Comprehensive bioinformatics analysis identifies LAPTM5 as a potential blood biomarker for hypertensive patients with left ventricular hypertrophy.

Authors: Tiegang Li; Weiqi Wang; Wenqiang Gan; Silin Lv; Zifan Zeng; Yufang Hou; Zheng Yan; Rixin Zhang; Min Yang
Journal: Aging (Albany NY) Date: 2022-02-14 Impact factor: 5.682

3. Identification and validation of glomerulotubular crosstalk genes mediating IgA nephropathy by integrated bioinformatics.

Authors: Yawen Bai; Yajing Li; Yali Xi; Chunjie Ma
Journal: BMC Nephrol Date: 2022-04-13 Impact factor: 2.388

4. A module of multifactor-mediated dysfunction guides the molecular typing of coronary heart disease.

Authors: Yuewei Li; Maohuan Lin; Kangjie Wang; YaQing Zhan; Wenli Gu; Guanghao Gao; Yuna Huang; Yangxin Chen; Tucheng Huang; Jingfeng Wang
Journal: Mol Genet Genomic Med Date: 2020-08-02 Impact factor: 2.183

5. Alterations in Kynurenine and NAD⁺ Salvage Pathways during the Successful Treatment of Inflammatory Bowel Disease Suggest HCAR3 and NNMT as Potential Drug Targets.

Authors: Artur Wnorowski; Sylwia Wnorowska; Jacek Kurzepa; Jolanta Parada-Turska
Journal: Int J Mol Sci Date: 2021-12-16 Impact factor: 5.923

5 in total