Literature DB >> 35875838

RNA-seq-driven expression analysis to investigate cardiovascular disease genes with associated phenotypes among atrial fibrillation patients.

Asude Berber1, Habiba Abdelhalim1, Saman Zeeshan2, Sreya Vadapalli1, Barr von Oehsen3, Naveena Yanamala4, Partho Sengupta4, Zeeshan Ahmed1,5.   

Abstract

Entities:  

Mesh:

Year:  2022        PMID: 35875838      PMCID: PMC9309637          DOI: 10.1002/ctm2.974

Source DB:  PubMed          Journal:  Clin Transl Med        ISSN: 2001-1326


× No keyword cloud information.
To the Editor Atrial fibrillation (AF) is defined as the high‐frequency excitation of the atrium, resulting in both dyssynchronous atrial contraction and the irregularity of ventricular excitation. According to its condition, AF disease is divided into two sub‐types: paroxysmal and persistent. In contrast to persistent AF, paroxysmal AF is diagnosed in the first phase of the disease, which later progresses to persistent AF. Furthermore, AF includes risk factors such as obesity, diabetes, smoking and a sedentary lifestyle and is prevalent in the older males of European ancestry. Previous studies have shown that both heart failure (HF) and cardiovascular diseases (CVD) contribute to an increased risk of AF. In this study, we investigated genes responsible for AF with sub‐disease groups through transcriptomic analysis (Additional file 1: High‐resolution figures). It was conducted as a continuation of our thorough CVD research focusing on HF performed on 61 CVD patients (Sample IDs: 1058–1118) and 10 patients without CVD (Control IDs: 648–658) (Additional file 2: population details). When grouped by gender and race, there were 40 males and 21 females, 42 Whites, 7 Blacks (Blacks or African Americans), 1 Asian, 1 Decline to Answer, 2 others, and 8 NA (Table 1 and Figure 1A). Peripheral blood samples were used for RNA extraction, and sequencing was performed using Illumina NovaSeq 6000‐S4 to assess the RNA quality. An efficient data management system (PROMIS‐LCR) with data extraction, transfer and loader system (ETL), created by the authors, was used for patient recruitment and consent tracking as well as dealing with the multi‐omics data, respectively. We also created a publicly available gene‐disease database, PAS‐Gen, which includes over 59 000 protein‐coding and non‐coding genes, and over 90 000 classified gene‐disease associations, to ease the gene‐disease visualization for researchers, medical practitioners and pharmacists.
TABLE 1

A total number of atrial fibrillation (AF) patient samples were used for the investigative study

ID Gender Race Age Type
648MaleWhite30Control
649MaleWhite38Control
650MaleWhite69Control
651FemaleWhite67Control
652MaleWhite63Control
653FemaleWhite34Control
655MaleWhite62Control
656FemaleWhite62Control
657FemaleWhite72Control
658FemaleWhite60Control
1058FemaleWhite72Case
1059MaleWhite79Case
1060MaleNA58Case
1061MaleWhite70Case
1062MaleWhite67Case
1063MaleWhite66Case
1064FemaleNA54Case
1065FemaleWhite51Case
1066MaleWhite82Case
1067MaleWhite62Case
1068MaleNA70Case
1069FemaleWhite65Case
1070MaleWhite57Case
1071FemaleAsian52Case
1072FemaleWhite91Case
1073FemaleWhite89Case
1074FemaleWhite81Case
1075FemaleWhite59Case
1076MaleWhite45Case
1077MaleWhite73Case
1078FemaleWhite72Case
1079MaleNA92Case
1080MaleWhite86Case
1081MaleBlack57Case
1082FemaleBlack59Case
1083MaleWhite85Case
1084FemaleOther69Case
1085MaleOther64Case
1086MaleBlack65Case
1087FemaleNA69Case
1088FemaleWhite65Case
1089MaleWhite55Case
1090MaleWhite70Case
1091MaleWhite77Case
1092MaleWhite62Case
1093FemaleWhite70Case
1094MaleWhite64Case
1095MaleWhite66Case
1096MaleBlack59Case
1097FemaleWhite57Case
1098MaleNA83Case
1099MaleWhite67Case
1100MaleNA81Case
1101MaleWhite64Case
1102MaleBlack71Case
1103MaleWhite80Case
1104MaleWhite73Case
1105FemaleWhite71Case
1106MaleNA79Case
1107MaleWhite84Case
1108FemaleBlack57Case
1109MaleWhite75Case
1110MaleDecline to Answer80Case
1111FemaleWhite86Case
1112MaleWhite72Case
1113MaleWhite60Case
1114FemaleBlack54Case
1115MaleWhite67Case
1116FemaleWhite63Case
1117MaleWhite66Case
1118MaleWhite88Case

Note: This table includes patient ID, gender (40 males and 21 females), age and race (42 White, 7 Black: Black or African American, 1 Asian, 1 declined to answer, and 8 NA). Samples 1058–1118 were obtained from CVD patients, whereas samples 648–658 were obtained from healthy individuals. The age of healthy individuals is not available.

Abbreviations: CVD, cardiovascular diseases; NA, not available.

FIGURE 1

(A) Gender, age and race details of the atrial fibrillation (AF) population. (B) Gene‐disease annotation and expression analysis of all AF genes. (C) Differential expression analysis of AF gene. Gender, age and race information table for both patient and healthy control groups. The X‐axis signifies samples (AF ids: 1058–1118 and healthy ids: 648–658), and the Y‐axis indicates ages. The blue, yellow, grey and orange bars indicate both race and gender groups; White, Black, male and female, respectively. Genes‐disease heat map for the expression analysis of AF among all diseased and healthy control patients. The X‐axis signifies samples (AF ids: 1058–1118 and healthy ids: 648–658), the left Y‐axis shows genes, and the right Y‐axis presents genes associated with the AF. Differential gene expression heat map of AF for all patients and healthy control groups

A total number of atrial fibrillation (AF) patient samples were used for the investigative study Note: This table includes patient ID, gender (40 males and 21 females), age and race (42 White, 7 Black: Black or African American, 1 Asian, 1 declined to answer, and 8 NA). Samples 1058–1118 were obtained from CVD patients, whereas samples 648–658 were obtained from healthy individuals. The age of healthy individuals is not available. Abbreviations: CVD, cardiovascular diseases; NA, not available. (A) Gender, age and race details of the atrial fibrillation (AF) population. (B) Gene‐disease annotation and expression analysis of all AF genes. (C) Differential expression analysis of AF gene. Gender, age and race information table for both patient and healthy control groups. The X‐axis signifies samples (AF ids: 1058–1118 and healthy ids: 648–658), and the Y‐axis indicates ages. The blue, yellow, grey and orange bars indicate both race and gender groups; White, Black, male and female, respectively. Genes‐disease heat map for the expression analysis of AF among all diseased and healthy control patients. The X‐axis signifies samples (AF ids: 1058–1118 and healthy ids: 648–658), the left Y‐axis shows genes, and the right Y‐axis presents genes associated with the AF. Differential gene expression heat map of AF for all patients and healthy control groups First, the transcriptomic data analysis involved the development of an RNA‐seq processing pipeline that contained four operating parts: (I) data pre‐processing, (II) data quality checking, (III) data storage and management and (IV) data visualization (Additional file 1: High‐resolution figures). The analysis of transcripts per million (TPM) was performed to normalize the RNA‐seq data by using the visualizing genes with disease‐causing variants environment with the findable, accessible, intelligent and reproducible approach (Additional file 4: AF analysis ‐ gene expression data). It reveals all genes annotated with their associated clinical AF phenotype using gene–disease association. , This expression analysis was expanded to visualize the classification of protein‐ and non‐coding genes in detail as gender‐ and race‐based. First, we looked across the AF‐annotated genes to identify protein‐ and non‐coding genes together and found 71 genes related to AF and relative diseases (Additional file 3: Complete Gene List). Next, we observed expression in protein‐coding genes and found 22 genes associated with direct and relative AF diseases, which are denominated as AF phenotypes (SCN1B, NPPA‐AS1, KCNQ1, KCNE1, VKORC1, ATF7, KCNH2, SELP, PDE4D, ACE, PRKAR1B, NUP155, CYP4F2, ABCC9, KCNJ2‐AS1, CFAP20, KCNJ2, MYBPC3, KCNE3, PF4, PPBP, MYL4) (Figure 1B and Table 2). After the initial analysis, differential gene expression analysis was implemented to further investigate AF genes. Of the protein‐coding genes, seven AF‐associated genes (MYL4, PPBP, PF4, KCNE3, VKORC1, KCNQ1 and CYP4F2) showed differentially regulated expression (Figure 1C). A previous study has reported some of these genes (GJA5, KCNA5, KCNE2, KCNJ2, KCNQ1, KCNH2, NPPA and SCN5A) as novel genes for familial AF in the absence of mutations, whereas mutations in MYL4 have been strongly associated with AF disease in humans.
TABLE 2

List of genes associated with atrial fibrillation (AF) diseases

ENSEMBL ID Gene name Category Disease
ENSG00000105711SCN1BProtein codingAtrial fibrillation
ENSG00000242349NPPA‐AS1Antisense/non‐codingAtrial fibrillation familial 6
ENSG00000053918KCNQ1Protein codingAtrial fibrillation familial 3
ENSG00000180509KCNE1Protein codingAtrial fibrillation
ENSG00000167397VKORC1Protein codingAtrial fibrillation
ENSG00000170653ATF7Protein codingFamilial atrial fibrillation
ENSG00000055118KCNH2Protein codingAtrial fibrillation
ENSG00000174175SELPProtein codingAtrial fibrillation
ENSG00000113448PDE4DProtein codingAtrial fibrillation and stroke
ENSG00000159640ACEProtein codingAtrial fibrillation
ENSG00000188191PRKAR1BProtein codingFamilial atrial fibrillation
ENSG00000113569NUP155Protein codingAtrial fibrillation familial 15
ENSG00000186115CYP4F2Protein codingAtrial fibrillation
ENSG00000069431ABCC9Protein codingAtrial fibrillation familial 12
ENSG00000267365KCNJ2‐AS1Antisense/non‐codingFamilial atrial fibrillation
ENSG00000070761CFAP20Protein codingFamilial atrial fibrillation
ENSG00000123700KCNJ2Protein codingAtrial fibrillation familial 9
ENSG00000134571MYBPC3Protein codingAtrial fibrillation
ENSG00000175538KCNE3Protein codingFamilial atrial fibrillation
ENSG00000163737PF4Protein codingAtrial fibrillation
ENSG00000163736PPBPProtein codingAtrial fibrillation
ENSG00000198336MYL4Protein codingAtrial fibrillation familial 18

Note: This table includes the ENSG ID, gene name, category and disease associated with that gene.

List of genes associated with atrial fibrillation (AF) diseases Note: This table includes the ENSG ID, gene name, category and disease associated with that gene. With a deeper investigation of the normalized expression analysis, we found that PF4, PPBP, MYL4, KCNE3, VKORC1, KCNQ1 and CYP4F2 genes are highly expressed in AF (Figure 1B) with relative diseases as AF phenotypes; AF (both for PF4 and PPBP); AF familial 18; familial AF, AF, AF familial 3, AF, respectively (Additional file 5: Information about AF phenotypes). The phenotypes represent different subsets of how the disease presents when it is inherited based on the gene of interest. Additionally, these findings were supported by another study in which two long non‐coding RNAs genes were found to interact with protein‐coding genes associated with AF. A subsequent analysis was performed based on two groupings: race‐ and gender‐based. The race‐based analysis involved Black, White and all other races in which PF4, PPBP and MYL4 were found to be highly expressed protein‐coding genes in AF in all different race groups (Figure 2A–C). Although KCN3 appeared in the analysis, it did not show consistent expression across the patients. In addition, the PPBP gene, which is one of the three immune‐related genes (CXCL12, CCL4), has been found to have a positive relationship with the infiltration of immune cells (e.g. neutrophils, plasma cells and resting dendritic cells) and plays a role in the development of AF disease. Furthermore, the gene expression analysis based on gender segregation showed similar results, with PF4, PPBP and MYL4 genes as highly expressed with AF disease in both female and male groups (Figure 2D,E).
FIGURE 2

Race‐ and gender‐based gene expression analysis of atrial fibrillation (AF) genes. All and highly expressed protein‐coding genes related to AF in self‐described Whites (A), Blacks/African Americans (B), and all other races (C), and male (D) and female (E)

Race‐ and gender‐based gene expression analysis of atrial fibrillation (AF) genes. All and highly expressed protein‐coding genes related to AF in self‐described Whites (A), Blacks/African Americans (B), and all other races (C), and male (D) and female (E) In summary, we performed the systematic transcriptomic characterization of AF‐associated genes. Our findings report three highly expressed genes and their associated diseases as AF phenotypes; PF4: AF; PPBP: AF and MYL4: AF familial 18, with a similar expression pattern across races and genders. Moreover, when we compared the genes associated with HF from our previous CVD/HF study with those associated with AF, we discovered that two genes (ACE and MYBPC3) were associated with both diseases (HF and AF). These findings are valuable for future research studies as they signify the potential to further investigate these genes for mutations and disease‐specific variants. This will provide a new path focusing on a more personalized approach to therapy and treatment. In the future, we seek to evaluate the causal basis for AF by moving beyond the one gene‐one disease model through the integration of the expressed genome, characterization of mutations derived from genomic signatures and mapping them on phenotypic traits in the electronic medical records. We aim to contribute to the paradigm shift in the application and interpretation of genetic and genomically informed medicine for AF, moving from a deterministic conceptualization to a probabilistic interpretation of genetic risk. This will support diagnostic and preventive care delivery strategies beyond traditional symptom‐driven, disease‐causal medical practice. We aim to construct machine learning models to identify a baseline transcriptional signature highly predictive of response across these indications. This might accelerate our ability to leverage and extend the information contained within the original data and to model patient‐specific genomics and clinical data for significant transcriptional correlations, highlighting the association of genetic variants to clinical outcomes of treatment in AF and other CVD. , ,

CONFLICT OF INTEREST

The authors declare no conflict of interests regarding financial or non‐financial aspects. Supplementary Material 1: High‐resolution figures. Click here for additional data file. Supplementary Material 2: Population details. Click here for additional data file. Supplementary Material 3: Detailed genes list. Click here for additional data file. Supplementary Material 4: AF analysis – gene expression data. Click here for additional data file. Supplementary Material 5: Information about AF phenotypes. Click here for additional data file.
  10 in total

Review 1.  Atrial Fibrillation: Epidemiology, Pathophysiology, and Clinical Outcomes.

Authors:  Laila Staerk; Jason A Sherer; Darae Ko; Emelia J Benjamin; Robert H Helm
Journal:  Circ Res       Date:  2017-04-28       Impact factor: 17.367

Review 2.  Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine.

Authors:  Sreya Vadapalli; Habiba Abdelhalim; Saman Zeeshan; Zeeshan Ahmed
Journal:  Brief Bioinform       Date:  2022-09-20       Impact factor: 13.994

3.  Debutant iOS app and gene-disease complexities in clinical genomics and precision medicine.

Authors:  Zeeshan Ahmed; Saman Zeeshan; Ruoyun Xiong; Bruce T Liang
Journal:  Clin Transl Med       Date:  2019-10-04

4.  Comprehensive RNA-Seq Data Analysis Identifies Key mRNAs and lncRNAs in Atrial Fibrillation.

Authors:  Dong-Mei Wu; Zheng-Kun Zhou; Shao-Hua Fan; Zi-Hui Zheng; Xin Wen; Xin-Rui Han; Shan Wang; Yong-Jian Wang; Zi-Feng Zhang; Qun Shan; Meng-Qiu Li; Bin Hu; Jun Lu; Gui-Quan Chen; Xiao-Wu Hong; Yuan-Lin Zheng
Journal:  Front Genet       Date:  2019-10-02       Impact factor: 4.599

5.  Human gene and disease associations for clinical-genomics and precision medicine research.

Authors:  Zeeshan Ahmed; Saman Zeeshan; Dinesh Mendhe; XinQi Dong
Journal:  Clin Transl Med       Date:  2020-01

6.  Identification of key immune-related genes and immune infiltration in atrial fibrillation with valvular heart disease based on bioinformatics analysis.

Authors:  Shuo Li; Zhiyuan Jiang; Xiaoying Chao; Chenyang Jiang; Guoqiang Zhong
Journal:  J Thorac Dis       Date:  2021-03       Impact factor: 2.895

7.  RNA-seq driven expression and enrichment analysis to investigate CVD genes with associated phenotypes among high-risk heart failure patients.

Authors:  Zeeshan Ahmed; Saman Zeeshan; Bruce T Liang
Journal:  Hum Genomics       Date:  2021-11-13       Impact factor: 4.639

8.  A mutation in the atrial-specific myosin light chain gene (MYL4) causes familial atrial fibrillation.

Authors:  Nathan Orr; Rima Arnaout; Lorne J Gula; Danna A Spears; Peter Leong-Sit; Qiuju Li; Wadea Tarhuni; Sven Reischauer; Vijay S Chauhan; Matthew Borkovich; Shaheen Uppal; Arnon Adler; Shaun R Coughlin; Didier Y R Stainier; Michael H Gollob
Journal:  Nat Commun       Date:  2016-04-12       Impact factor: 14.919

9.  Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine.

Authors:  Zeeshan Ahmed; Khalid Mohamed; Saman Zeeshan; XinQi Dong
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451

10.  Advancing clinical genomics and precision medicine with GVViZ: FAIR bioinformatics platform for variable gene-disease annotation, visualization, and expression analysis.

Authors:  Zeeshan Ahmed; Eduard Gibert Renart; Saman Zeeshan; XinQi Dong
Journal:  Hum Genomics       Date:  2021-06-26       Impact factor: 4.639

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.