Literature DB >> 33156839

Whole blood transcriptome biomarkers of unruptured intracranial aneurysm.

Kerry E Poppenberg1,2,3, Lu Li4, Muhammad Waqas3, Nikhil Paliwal1,2, Kaiyu Jiang5, James N Jarvis5,6, Yijun Sun5,7, Kenneth V Snyder1,3,8,9, Elad I Levy1,3,8, Adnan H Siddiqui1,3,8, John Kolega1,10, Hui Meng1,2,3,11, Vincent M Tutino1,2,3,10.   

Abstract

BACKGROUND: The rupture of an intracranial aneurysm (IA) causes devastating subarachnoid hemorrhages, yet most IAs remain undiscovered until they rupture. Recently, we found an IA RNA expression signature of circulating neutrophils, and used transcriptome data to build predictive models for unruptured IAs. In this study, we evaluate the feasibility of using whole blood transcriptomes to predict the presence of unruptured IAs.
METHODS: We subjected RNA from peripheral whole blood of 67 patients (34 with unruptured IA, 33 without IA) to next-generation RNA sequencing. Model genes were identified using the least absolute shrinkage and selection operator (LASSO) in a random training cohort (n = 47). These genes were used to train a Gaussian Support Vector Machine (gSVM) model to distinguish patients with IA. The model was applied to an independent testing cohort (n = 20) to evaluate performance by receiver operating characteristic (ROC) curve. Gene ontology and pathway analyses investigated the underlying biology of the model genes.
RESULTS: We identified 18 genes that could distinguish IA patients in a training cohort with 85% accuracy. This SVM model also had 85% accuracy in the testing cohort, with an area under the ROC curve of 0.91. Bioinformatics reflected activation and recruitment of leukocytes, activation of macrophages, and inflammatory response, suggesting that the biomarker captures important processes in IA pathogenesis.
CONCLUSIONS: Circulating whole blood transcriptomes can detect the presence of unruptured IAs. Pending additional testing in larger cohorts, this could serve as a foundation to develop a simple blood-based test to facilitate screening and early detection of IAs.

Entities:  

Year:  2020        PMID: 33156839      PMCID: PMC7647097          DOI: 10.1371/journal.pone.0241838

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Intracranial aneurysms (IAs) are pathological outpouchings within cerebrovasculature whose natural history is driven by inflammation [1-3]. Although the rupture of an IA occurs at a rate of approximately 1% per year, the consequences are devastating. Rupture is the main cause of non-traumatic subarachnoid hemorrhage (SAH), which carries high rates of mortality (up to 50%) [4-6]. Early IA detection would enable closer monitoring and preventive treatments, which can drastically reduce the rate of rupture [7, 8]. For instance, one study found that for a 50-year-old man with an IA, the probability of rupture during his remaining lifetime was 22.8%, but can be reduced to 1.6% after surgical clipping or 3.4% after endovascular coiling [7]. Currently, the only way to diagnose IAs is with cerebral imaging such as MR angiography, computed tomography angiography (CTA), or digital subtraction angiography (DSA). But as the vast majority of IAs are asymptomatic, they are mostly detected incidentally. These imaging modalities are generally not suited for regular IA screening due to prohibitively high costs and potential risks, such as invasive complications (allergic reaction, injection site infection, hematoma, death) and radiation exposure (e.g., DSA, CTA) [9]. And while MRI without contrast enhancement is non-invasive, it is unable to accurately detect many IAs and typically requires invasive follow-up DSA. As it stands, the American Stroke Association does not recommend IA screening in the general population by medical imaging because of the high costs [10], though it has been shown to be cost effecitve in those with an IA family history [11, 12]. Therefore, a blood test would provide an inexpensive, rapid, and minimally-invasive screening test for detecting unruptured IAs in a large population. Those who test positive could then plan for diagnostic imaging and preventive maintenance. In search of blood-based IA biomarkers, we previously studied gene expression profiles in circulating neutrophils. An initial case-controlled study used RNA-seq to profile neutrophils from individuals with and without IAs (confirmed by angiography) and identified an 82-gene signature that was associated with IA [13]. In a follow-up study, we implemented machine learning to test the feasibility of using gene expression profiles to detect unruptured IAs [14]. The classification algorithm we developed achieved a predictive accuracy of 90% and an area under the curve (AUC) of 0.80 in a small validation cohort. Bioinformatics analyses demonstrated that predictive genes were related to neutrophil activation and dysregulated inflammatory responses, which may explain why they distinguished patients with IAs. While these studies demonstrated that neutrophil transcriptomes can potentially identify patients with IA, a leukocyte-based diagnostic would not be ideal for clinical implementation. In this case, neutrophils must be processed immediately after collection, and the abundance of neutrophil-derived endonucleases [15] makes it difficult to obtain high quality RNA. An assay using whole blood would overcome these challenges, as whole blood RNAs can be rapidly stabilized at room temperature and do not require rigorous extraction procedures. Such an assay could also be run using standard equipment in diagnostic labs. Thus, in this study, we investigated if gene expression differences in whole blood can distinguish individuals with IA from those without IA, and further, if machine learning could use those differences to build an IA prediction model.

Methods

Study enrollment

All methods in this study were approved by the University at Buffalo Institutional Review Board (study no. 030–474433). Written informed consent was obtained from all subjects prior to sample collection and the study was carried out in accordance with the approved protocol. Patients at Gates Vascular Institute (Buffalo, NY) receiving cerebral DSA with and without IA diagnosis were enrolled in this study. Indications for DSA include confirmation of IAs detected on noninvasive imaging or follow-up noninvasive imaging of previously-detected IAs for IA group, or to identify presence or absence of vascular disease (i.e. malformations, carotid stenosis) for control group. Patients who consented to participate in this study were over 18 years old, English-speaking, and had not previously been treated for IA. Patients who were pregnant, had a fever (>100°F), recently had invasive surgery, were receiving chemotherapy treatments, had autoimmune diseases, or were on immunomodulating drugs, as noted in their medical records, were excluded. Information about patient’s history and comorbidities was collected from electronic medical records.

Whole blood RNA processing

A volume of 2.5 mL of blood was taken from the femoral access sheath and transferred into a PAXgene blood RNA tube (PreAnalytiX, Hombrechtikon, Switzerland). Total RNA was extracted using the PAXgene Blood RNA kit (Qiagen, Venlo, Limburg, Netherlands) according to manufacturer’s instructions. Globin mRNA was removed by magnetic-bead capture with the GLOBINclear kit (Ambion, Austin, TX, USA) following manufacturer’s instructions. RNA purity and concentration were assessed by absorbance at 260 nm and were measured by the Quant-iT RiboGreen Assay (Invitrogen, Carlsbad, CA) and the Agilent 2100 BioAnalyzer RNA 6000 Pico Chip (Agilent, Las Vegas, NV), respectively.

RNA sequencing and data analysis

Libraries were prepared using the Illumina TruSeq stranded total RNA gold kit (Illumina, San Diego, CA). Samples underwent 50-cycle single-read sequencing in a HiSeq2500 Illumina system and were demultiplexed with Bcl2Fastq. After sequencing, per-cycle basecall files generated by the Illumina HiSeq2500 were converted to per-read FASTQ files using bcl2fastq version 2.20.0.422 using default parameters. The quality of the sequencing was reviewed using FastQC version 0.11.5. Potential contamination was detected using FastQ Screen version 0.11.1. No adapter sequences were detected, so no trimming was performed. Genomic alignments were performed using HISAT2 version 2.1.0 using default parameters. NCBI reference GRCh38 was used as reference genome and gene annotation set. Sequence alignments were compressed and sorted into binary alignment map files using samtools version 1.3. Mapped reads for genomic features were counted using Subread featureCounts version 1.6.2 using the parameters -s 2 –g gene_id–t exon–Q 60; the annotation file specified with–a was the NCBI GRCh38 reference provided by Illumina’s iGenomes. Raw counts were normalized as transcripts per million (TPM), and ComBat in R was used to correct TPM levels for any bias introduced by sequencing on different flow cells [16-18]. Dispersion of control and aneurysm groups was plotted using log10(TPM+1) for all transcripts with a group average>0 for both aneurysm and control groups. Cell composition analysis was performed using open-access CIBERSORT application (version 1.06) with the TPM normalized gene expression values and the provided 6 cell-type leukocyte signatures (B cells, CD8 T cells, CD4 T cells, NK cells, monocytes, neutrophils) [19]. CIBERSORT uses a linear support vector regression to estimate cell proportions. Transcripts with approved HGNC symbol names (n = 22,924) were used in this analysis. We also visualized how transcriptomes separated control and IA samples by hierarchical clustering via hclust in R under default settings using raw counts with sum>0 across all samples.

Model gene selection

Prior to selecting genes for model building, we randomly divided the whole blood transcriptome dataset into training and testing cohorts, following a 70:30 split for both aneurysm and control groups. Within the training cohort, we reduced the feature space to protein coding transcripts with an average TPM>1. Candidate genes for the predictive model were then identified using Hilbert-Schmidt Independence Criterion Least Absolute Shrinkage and Selection Operator (HSIC LASSO). Using the l-regularizer, HSIC LASSO finds a combination of genes that consists of non-redundant features with strong dependence on disease status. Principal component analysis (PCA) with the prcomp function in R visualized how the selected transcripts separate IA from control. Statistical significance of differential expression between IA and control groups was tested by independent samples t-test for equal variance and by Mann-Whitney U test for unequal variance.

Development of IA prediction model

Features selected by LASSO were used to train a classification model by SVM with a Gaussian kernel in MATLAB’s Statistics and Machine Learning Toolbox. SVM has been successfully used in a variety of disease classification applications, including our previous efforts using neutrophil transcriptomes [14]. A 10-fold cross-validation within the training set was performed as the model was being developed to reduce likelihood of overfitting. We compared model predictions to clinical diagnoses to determine number of true positives, true negatives, false positives, and false negatives, which were used to calculate sensitivity, specificity, and accuracy as defined elsewhere [14]. Receiver operating characteristic (ROC) curves were created, from which area under ROC curve (AUC) was calculated as a metric of model performance in the training cohort. We also calculated positive and negative predictive values (PPV, NPV) to examine how disease prevalence influences model’s predictive ability using equations enumerated elsewhere [14]. The model was subsequently applied to the independent testing cohort. The TPM values of LASSO-selected features for each subject in the testing cohort were input to the model by a blinded operator to make predictions. Then, the true diagnoses of the subjects (positive or negative for IA) were compared to model predictions to evaluate model performance in the testing cohort.

Bioinformatics

Ontologies that were significantly enriched in model genes compared to the background list of 34,605 expressed genes in our sequencing were identified using the Gene Ontology enRIchment anaLysis and visuaLizAtion tool (GORILLA) [20]. We reported biological processes with a p-value<0.0005. Pathways and networks associated with the LASSO genes were studied with Ingenuity Pathway Analysis (IPA) software [21], using fold-changes calculated in the training cohort for the 18 model genes. IPA maps our genes of interest to a gene object in the Ingenuity Knowledge Base to create networks based on known interactions between products of the genes and to identify enriched ontologies and upstream regulators. Networks with a p-score ≥15, ontologies that assign at least 3 model genes and have a p-value<0.05, and upstream regulators with an activation z-score≥|1.5| were considered significant.

Results

Study population characteristics

We included a total of 67 peripheral blood samples (34 IA, 33 control) that met our inclusion/exclusion and quality criteria in this study. These samples were randomly divided into an n = 47 training cohort (24 IA) and an n = 20 testing cohort (10 IA). Control and IA populations in both cohorts had similar demographics and comorbidities (Table 1), with the exception of smoking, which was higher in the IA training group. Aneurysm size (largest diameter measured on DSA) ranged from 1 to 19 mm, with a mean of 5.6 mm (S1 Table). There were 41 IAs total, as 6 patients had multiple IAs.
Table 1

Clinical characteristics of training and testing cohorts*.

Training CohortTesting Cohort
Control (n = 23)Aneurysm (n = 24)Control (n = 10)Aneurysm (n = 10)
Age (Mean±SE)58±3.655±2.656±4.057±4.6
Over 5560.87%45.83%40.00%60.00%
Sex
Female56.52%70.83%60.00%90.00%
Smoker
Yes0.00%33.33%10.00%30.00%
Comorbidities
Hypertension30.43%29.17%20.00%40.00%
Heart Disease17.39%16.67%0.00%20.00%
High Cholesterol34.78%33.33%40.00%30.00%
Stroke History17.39%8.33%10.00%0.00%
Diabetes13.04%8.33%10.00%0.00%
Osteoarthritis26.09%25.00%30.00%30.00%

*Clinical characteristics of the randomly-created training and testing cohorts. With exception of age, these factors were quantified as binary data points. The clinical factors were retrieved from patients’ medical records via latest “Patient Medical History” form administered before imaging. SE = standard error.

*Clinical characteristics of the randomly-created training and testing cohorts. With exception of age, these factors were quantified as binary data points. The clinical factors were retrieved from patients’ medical records via latest “Patient Medical History” form administered before imaging. SE = standard error. RNA quality and sequencing metrics are reported in S2 Table. The 67 sequenced samples had an average 260/280 of 1.9 and average RNA integrity number of 8.4. On average, 57.29 million sequences per sample and 96% aligned rate were obtained. Expression dispersion between IA and control groups are visualized in Fig 1A. To verify differentially expressed transcripts were derived from expression differences related to presence of IA, rather than differences in cell populations, we estimated the proportions of different cell populations in each sample using CIBERSORT [19]. This showed no statistically significant difference in proportions of cell types between control and IA groups. On average across all samples, neutrophils represent the majority (45%) followed by monocytes (19%), CD4 T cells (16%), CD8 T cells (8%), B cells (7%), and NK cells (5%) (Fig 1B).
Fig 1

Differential expression analysis.

A) Scatterplot depicts dispersion in expression between IA and control groups. B) No difference between cell type proportions of aneurysm and control groups was found. In both, neutrophils comprise majority of cells expressed in whole blood transcriptomes. C) Hierarchical clustering using genes with TPM sum>0 for all 67 whole blood transcriptomes. Teal indicates control samples, while pink indicates aneurysm samples.

Differential expression analysis.

A) Scatterplot depicts dispersion in expression between IA and control groups. B) No difference between cell type proportions of aneurysm and control groups was found. In both, neutrophils comprise majority of cells expressed in whole blood transcriptomes. C) Hierarchical clustering using genes with TPM sum>0 for all 67 whole blood transcriptomes. Teal indicates control samples, while pink indicates aneurysm samples. All 67 transcriptomes were sorted by supervised hierarchical clustering using raw counts without any feature reduction. Generally, clusters were either predominantly IA or control, as seen in Fig 1C. The first few clusters in the dendrogram are composed of mainly control samples, but progressing rightwards, there are larger groups of mostly aneurysm samples followed by a group of equal composition at the far right.

gSVM model can detect IA with high accuracy

We employed a regression-based technique, LASSO, to select genes with the greatest predictive ability (S3 Table) to use in our model. Fold-change and p-value of the 18 genes in the training cohort are reported in Table 2. PCA (Fig 2A) using these 18 genes demonstrates they are able to clearly separate disease from control cases in the training cohort. The first 3 components capture the majority (56%) of total variance. These 18 genes were used to train our SVM prediction model.
Table 2

18 transcripts selected during model training.

GeneGene IDAccession no.F-CP-value
ATF3467NM_001674-1.86<0.001
CBWD6644019NM_0010854571.350.001
CCDC85B11007NM_0068481.350.001
CCR81237NM_0052011.68<0.001
CHMP4B128866NM_176812-1.140.007
CLEC4F165530NM_173535-2.810.002
CXCL103627NM_001565-2.64<0.001
FN12335NM_212476-2.880.06
MT2A4502NM_005953-1.65<0.001
MZT2B80097NM_0250291.180.008
PCSK1N27344NM_0132711.560.018
PIM3415116NM_0010018521.31<0.001
SLC37A384255NM_0322951.230.032
ST6GALNAC155808NM_0184141.71<0.001
TCN26948NM_000355-1.77<0.001
TIFAB497189NM_001099221-1.480.007
TNFRSF47293NM_0033271.48<0.001
UFSP1402682NM_0010150721.320.003

P-values were calculated in the training dataset by independent t-test if equal population variances, Mann-Whitney U test if not. No. = number, F-C = fold-change.

Fig 2

Performance of 18 gene SVM biomarker in training and testing.

Training. A) PCA shows this panel can distinguish between aneurysm (red) and control (blue) samples. B) Accuracy, sensitivity, specificity, 5% NPV, and 5% PPV of model in training. C) ROC curve for model has AUC of 0.92. Testing. D) PCA illustrates panel was able to separate samples in a new cohort. E) Accuracy, sensitivity, specificity, 5% NPV, and 5% PPV of model in testing. F) ROC shows high performance in testing cohort (AUC = 0.91).

Performance of 18 gene SVM biomarker in training and testing.

Training. A) PCA shows this panel can distinguish between aneurysm (red) and control (blue) samples. B) Accuracy, sensitivity, specificity, 5% NPV, and 5% PPV of model in training. C) ROC curve for model has AUC of 0.92. Testing. D) PCA illustrates panel was able to separate samples in a new cohort. E) Accuracy, sensitivity, specificity, 5% NPV, and 5% PPV of model in testing. F) ROC shows high performance in testing cohort (AUC = 0.91). P-values were calculated in the training dataset by independent t-test if equal population variances, Mann-Whitney U test if not. No. = number, F-C = fold-change. In training, this model achieved greater than 80% accuracy, sensitivity, and specificity. Using an estimated prevalence of 5% [22], we calculated that the model had an NPV of approximately 1 (Fig 2B). ROC analysis confirmed the robust predictive ability as the model had an AUC of 0.92 (Fig 2C). We examined how this set of model genes could separate IA and control groups within the independent testing cohort, consisting of 20 patients (10 IA). PCA visualization demonstrates that these transcripts can still discriminate IA and control groups within this new dataset, again accounting for the majority of variance (64%) within the first 3 principal components (Fig 2D). As shown in Fig 2E and 2F, the model performed well in independent testing with an accuracy of 85% and an AUC of 0.91.

Genes in model reflect inflammatory processes

Biological functions of the 18 model genes were investigated through gene ontology and pathway analysis. GORILLA identified 4 significant processes: “negative regulation of secretion”, “negative regulation of protein secretion”, “negative regulation of peptide secretion”, and “cytokine-mediated signaling pathway” (Table 3). IPA analysis indicated 2 significant networks (Fig 3A and 3B) with functions related to cell death and survival, cardiovascular system development and function, and tissue development (A); cancer, endocrine system disorders, and gastrointestinal disease (B). Network A is a highly connected network with dense signaling centered around AKT, ERK, FN1, IL1, JNK, MAPK, PI3K, and VEGF. In Network B, TP53 and CTNNB1 function as central nodes. Genes in each network are reported in S4 Table. The disease and biological functions reported by IPA include activation of leukocytes, cell death of immune cells, activation of macrophages, inflammatory response, recruitment of leukocytes, apoptosis of leukocytes (full list presented in S5 Table). Progesterone, OSM, and IL1B (the latter two being important cytokines involved in inflammatory signaling [23, 24]) were upstream regulators predicted to be inhibited (Fig 3D).
Table 3

GORILLA ontologies for the 18 transcripts selected by LASSO.

GO termDescriptionP-valueGenes
GO:0051048Negative regulation of secretion4.29E-05FN1, PIM3, TIFAB, TNFRSF4
GO:0050709Negative regulation of protein secretion2.46E-04FN1, PIM3, TNFRSF4
GO:0002792Negative regulation of peptide secretion2.71E-04FN1, PIM3, TNFRSF4
GO:0019221Cytokine-mediated signaling pathway3.23E-04CCR8, CXCL10, FN1, MT2A, TNFRSF4
Fig 3

IPA Network analysis of 18 genes identified by LASSO.

Transcripts with increased expression in IA are red; transcripts with lower expression are green; fold-change represented by intensity. A) The first network (p-score = 20) reflects cardiovascular system development and function, cell death and survival, and tissue development. B) The second network (p-score = 17) has ontologies of cancer, endocrine system disorders, and gastrointestinal disease. C) Network constructed using 3 significant upstream regulators (progesterone, OSM, IL1B).

IPA Network analysis of 18 genes identified by LASSO.

Transcripts with increased expression in IA are red; transcripts with lower expression are green; fold-change represented by intensity. A) The first network (p-score = 20) reflects cardiovascular system development and function, cell death and survival, and tissue development. B) The second network (p-score = 17) has ontologies of cancer, endocrine system disorders, and gastrointestinal disease. C) Network constructed using 3 significant upstream regulators (progesterone, OSM, IL1B).

Discussion

There is a critical need for a minimally-invasive prescreen to identify patients who have an unruptured IA and would, therefore, maximally benefit from cerebral vascular imaging (such as MRA) for IA detection. Previously, we hypothesized that circulating blood cells have altered expression profiles after contact with IA tissue or inflammatory mediators released by IAs [25]. We investigated this by performing a transcriptome profiling study of circulating neutrophils in patients with and without IA and found an IA signature in neutrophils [13], which when trained via a machine learning pipeline demonstrated predictive ability to detect unruptured IAs. In this study, we discovered that a unique IA signature exists in whole blood transcriptomes, albeit there were no common genes between neutrophil and whole blood IA biomarkers. Our data yielded 18 genes that discriminated IA from control cases via LASSO regression. This type of feature selection overcomes shortcomings of traditional statistical filtering, since filtering methods consider genes independently, neglecting functional interactions between genes/gene products. Consequently, feature selection by traditional filtering methods may omit genes that constructively work together during a particular disease state, and may select redundant genes. During training, we used an SVM algorithm which separates binary labeled samples by transforming them into a multidimensional space and establishing a hyper-plane that maximizes the distance between samples of either class. Our Gaussian SVM (gSVM) model using the 18 selected genes had a prediction accuracy of 85% in both the training cohort (via cross-validation) and an independent testing cohort. Using isolated neutrophils in our previous study, we achieved a maximum accuracy of 90% in an independent testing cohort. The decrease in observed biomarker performance may be due to additional noise associated with a heterogeneous cell population in whole blood. There may also be greater inter-patient variability with whole blood transcriptomes due to contribution of multiple cell types. However, the whole blood model still achieved an NPV>0.98 in both training and testing, supporting its feasibility as a pre-screen for IA, for which high NPV is desired. We suspect that the 18 classifier transcripts detect IAs because they capture key facets of the disease, related to inflammation, infiltration, and degradation of the IA wall. Four of the model genes (TNFRSF4, TIFAB, MT2A, PIM3) are associated with NF-κB, an important inflammatory signaling pathway implicated in IA pathogenesis [26]. Most notably, NF-κB upregulates MMP-9 [27], a main driver of IA wall degradation [28], and MCP-1, which recruits macrophages to the IA wall (a hallmark of aneurysmal tissue) [28]. TNFRSF4 (increased in our study) is a member of the TNF-receptor superfamily that is involved in NF-κB pathway activation and has also been found to be increased in aneurysm tissue. Both the TNF family and NF-κB complex contribute to vessel degradation and are captured in the first IPA-derived network with direct connections to multiple model genes (ATF3, CCR8, CXCL10, FN1, PIM3, TNFRSF4). TNFα, a cytokine within the TNF family, has increased expression in plasma of aneurysm patients [29] and IA walls, and leads to EC dysfunction, inflammation, and apoptosis [28]. Conversely, TIFAB and MT2A (both decreased in our study) inhibit activation of NF-κB, suggesting another mechanism for NF-κB activation in patients with IAs. A role for inflammation is also reflected in other biomarker genes (CCR8, CXCL10) related to cytokine/chemokine signaling. For example, CCR8 (increased in our study) is a member of the beta chemokine receptor family and has greater expression in M1 pro-inflammatory macrophages (vs. M2 macrophages). M1 macrophages have been shown to be more prevalent in IAs [30] and may contribute more to pathologic remodeling during IA natural history [31]. Critical signaling pathways are also represented in the first network, with the transcription factor AP1 interacting with ERK, JNK, and MAPK complexes. AP1 has been linked to other inflammatory diseases, such as arthritis [32], and has been shown to regulate MMP-2, which could degrade extracellular matrix in IA [33]. Furthermore, a predicted upstream regulator of AP1 is IL1B, which is significantly increased in plasma of IA patients [29] and is associated with extracellular matrix destruction, NF-κB signaling, and vascular SMC apoptosis [34]. IL1B upregulates adhesion molecules on endothelial cells that recruit neutrophils and monocytes, and can induce both reactive oxygen species production and MMP-9 degradation via NGAL [35, 36]. Another important cytokine implicated in IPA analysis is OSM, an upstream regulator. OSM can regulate production of other cytokines, including IL6, which has been implicated in polymorphism studies of multiple populations [37-39]. While numerous model genes have clear associations to IA pathogenesis via inflammatory and signaling pathways, others are related to functions that have not been extensively explored in IA. For example, intra-and extra-cellular signaling, reflected by ATF3, CHMP4B, and PCSK1N may play a role in the complex reactions of circulating cells to IA presence. Overall, these and the other remaining genes require further study to elucidate their roles in IA pathogenesis, since they may represent unique predictive targets in whole blood RNA expression profiles. One way of determining which transcripts are most associated with IA may be to investigate if their “signal” increases when blood is collected from the intracranial vessels or from the aneurysm sac, as others have done [25, 40]. We hypothesize that differences in the IA-associated transcripts would be exaggerated in blood samples drawn closer to the IA tissue, as the circulating cells that confer transcriptomic changes due to a blood-IA interaction would be most concentrated at that location. Future studies are required to test this hypothesis. This study has several limitations. First, this is a single-center study, which may have introduced selection bias into our experimental design. Second, every subject underwent imaging by DSA. Thus, while our control population was confirmed to not have an IA, they may have other health issues that prompted DSA imaging. Furthermore, we used DSA for identification of IAs, as it is the gold standard in cerebrovascular imaging because of its high resolution. However, future studies may use less sensitive modalities, such as CTA or MR angiography, to confirm the presence of IA, which therefore, may result in false positives for the proposed biomarker. Third, there was an imbalance in comorbidities between IA and control groups that could have contributed to differential expression. To address these limitations, we are currently planning a multi-center study to prospectively validate our biomarker in patients receiving both DSA and non-invasive imaging, such as MRA. This large study will increase both our sample size and the diversity of patient population. It will also allow us to incorporate multiple control groups, such as those with other types of aneurysm or vascular abnormalities, to identify transcripts most specific to IA.

Conclusions

In this study we developed an accurate (85%) machine learning classifier derived from whole blood transcriptomes to predict presence of unruptured IA. Bioinformatics analyses indicate that critical inflammatory pathways are captured by the model genes, which is consistent with our previous findings using neutrophils. While other groups have studied whole blood transcriptomes for IA biomarkers, they used single, small RNA molecules [41, 42], did not perform cerebral imaging on control subjects, or did not use an independent testing cohort. We addressed these shortcomings by confirming presence or absence of IA with cerebral imaging, using gSVM with a panel of genes to better handle inter-sample variability, performing feature identification and model construction in a separate training cohort, and assessing true model performance in an independent testing cohort. While we implemented an improved study design, we still need to confirm our biomarker in a large, multi-center study.

Characteristics of 27 aneurysms in all patients with intracranial aneurysms (6 patients had multiple intracranial aneurysms).

(DOCX) Click here for additional data file.

Cohort assignment and RNA quality.

(DOCX) Click here for additional data file.

Per-gene performance of the 18 model transcripts.

(DOCX) Click here for additional data file.

Transcripts in the 2 significant networks constructed by ingenuity pathway analysis (IPA).

(DOCX) Click here for additional data file.

Significant disease and biological functions assigned by ingenuity pathway analysis for genes identified by LASSO.

(DOCX) Click here for additional data file.

The testing data.

(DOCX) Click here for additional data file. 16 Sep 2020 PONE-D-20-25156 Whole blood transcriptome biomarkers of unruptured intracranial aneurysms PLOS ONE Dear Dr. Tutino, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by October 15th. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Jinglu Ai, M.D., Ph.D. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Thank you for stating the following in the Competing Interests section: "I have read the journal's policy and the authors of this manuscript have the following competing interests: JNJ—Principal Investigator: NIH Grant R01-AR-060604. KVS—Consulting/teaching: Canon Medical Systems Corporation, Penumbra Inc., Medtronic, Jacobs Institute. Co-Founder: Neurovascular Diagnostics, Inc. EIL—Intratech Medical Ltd. NeXtGen Biologics. Principal investigator: Medtronic US SWIFT PRIME Trials. Honoraria–Medtronic. Consultant–Pulsar Vascular. Advisory Board-Stryker, NeXtGen Biologics, MEDX, Cognition Medical. Other financial support—Abbott Vascular for carotid training sessions. AHS—Financial Interest/Investor/Stock Options/Ownership: Amnis Therapeutics, Apama Medical,BlinkTBI, Inc., Buffalo Technology Partners, Inc., Cardinal Health, Cerebrotech Medical Systems, Inc., Claret Medical, Cognition Medical, Endostream Medical, Ltd., Imperative Care, International Medical Distribution Partners, Rebound Therapeutics Corp., Silk Road Medical, StimMed, Synchron, Three Rivers Medical, Inc., Viseon Spine, Inc. Consultant/Advisory Board: Amnis Therapeutics, Boston Scientific, Canon Medical Systems USA, Inc., Cerebrotech Medical Systems, Inc., Cerenovus, Claret Medical, Corindus, Inc., Endostream Medical, Ltd., Guidepoint 15Global Consulting, Imperative Care, Integra, Medtronic, MicroVention, Northwest University—DSMB Chair for HEAT Trial, Penumbra, Rapid Medical,Rebound Therapeutics Corp., Silk Road Medical, StimMed, Stryker, Three Rivers Medical, Inc.,VasSol, W.L. Gore & Associates. National PI/Steering Committees: Cerenovus LARGE Trial and ARISE II Trial,Medtronic SWIFTPRIME and SWIFT DIRECT Trials, MicroVention FRED Trial & CONFIDENCE Study, MUSC POSITIVE Trial,Penumbra 3D Separator Trial, COMPASS Trial, INVEST Trial. HM—Principal investigator:NIH Grants R01-NS-091075 and R01-NS-064592. Grant support: Canon Medical Systems. Co-founder: Neurovascular Diagnostics, Inc. VMT—Principal investigator: National Science Foundation Award No. 1746694, Brain Aneurysm Foundation grant, Center for Advanced Technology grant, and Cummings Foundation grant. Co-founder:Neurovascular Diagnostics, Inc." Please confirm that this does not alter your adherence to all PLOS ONE policies on sharing data and materials, by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests).  If there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared. Please include your updated Competing Interests statement in your cover letter; we will change the online submission form on your behalf. Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests 3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. In your revised cover letter, please address the following prompts: a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. We will update your Data Availability statement on your behalf to reflect the information you provide. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors present work on a transcriptome method of unruptured IA. This is both a critical public health need and appears to be conducted in a scientifically sound manner which improves on previous work. I recommend this manuscript for publication. There are a few issues which must be corrected: 1) Proofreading. One example: "Currently, the only way to diagnose IAs is with cerebral imaging such as MR angiographjy, 59 computed tomographgy angiography (CTA), or digital subtraction angriography (DSA)." This is easily solved and does not affect either understanding or deliberation. 2) Please include in the discussion what differences in signal occur testing peripheral blood compared to blood taken from an intracranial catheter during DSA. 3) Mention that DSA is the gold standard and that CTA/MRA is less sensitive and what affect that may have, please. 4) Femoral access is done with a sheath, not a true catheter. Reviewer #2: This is a well written manuscript & of great interest to all physicians who are involved in caring for subjects with unruptured intracranial aneurysms. One minor concern/question: Were the authors blinded when they performed the analysis on the tested group (not controlled/trained)?? If not, then this MUST be done first before establishing the sensitivity, specificity, accuracy, PPV & NPV?? ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 19 Oct 2020 Authors response to reviewer comments for Manuscript PONE-D-20-25156, entitled “Whole blood transcriptome biomarkers of unruptured intracranial aneurysms”: Authors: We thank the editors and reviewers for their insightful comments, which have helped to improve the manuscript. On the basis of these comments, we have made several changes, which are tracked by yellow highlighting in the revised manuscript. Below are our point-by-point responses to the comments. The page numbers mentioned correlate with the revised version of the manuscript. Thank you. Editorial Comments: Journal Requirements: 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. Authors: We have ensured that our manuscript meets PLOS ONE’s style requirements. 2. Thank you for stating the following [not shown] in the Competing Interests section. Please confirm that this does not alter your adherence to all PLOS ONE policies on sharing data and materials, by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests). If there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared. Please include your updated Competing Interests statement in your cover letter; we will change the online submission form on your behalf. Authors: The conflicts of interest do not alter our adherence to all PLOS ONE policies on sharing data and materials. We have included the Competing Interests statement in the cover letter and added the following language: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” 3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. In your revised cover letter, please address the following prompts: a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Authors: There are some legal restrictions, as the data are partially owned by a third-party organization, Neurovascular Diagnostics, with whom this study was performed in conjunction. The data can still be accessed via request to their Data Administrator. As per PLOS ONE’s policy, we have now our data availability statement: “Data cannot be shared publicly because it is partially owned by the study sponsor, Neurovascular Diagnostics, Inc. Data are available from the Neurovascular Diagnostics Data Access Committee (contact Dr. Hamidreza Rajabzadeh-Oghaz, Data Administrator at info@nvdiag.com, hacademic1811@gmail.com [alternate], or 1-866-552-6402) for researchers who meet the criteria for access to confidential data.” This statement has been added to the cover letter. We have also added this information to the revised manuscript on page 12, under “Availability of RNA expression data”. Reviewer Comments: Reviewer 1: 1. Proofreading. One example: "Currently, the only way to diagnose IAs is with cerebral imaging such as MR angiographjy, 59 computed tomographgy angiography (CTA), or digital subtraction angriography (DSA)." This is easily solved and does not affect either understanding or deliberation. Authors: Thank you for pointing this out. We have gone through the entire manuscript to correct spelling errors and typos throughout. 2. Please include in the discussion what differences in signal occur testing peripheral blood compared to blood taken from an intracranial catheter during DSA. Authors: This is a good suggestion. Intuitively, we suspect that blood closer to the aneurysm may give greater signal in the biomarker. Differences in truly IA-associated transcripts would likely be exaggerated in blood samples drawn closer to the aneurysm tissue because the circulating cells that confer transcriptomic changes due to a blood-IA interaction would be most concentrated there. This is now discussed in the Discussion on page 15 of the revised manuscript. 3. Mention that DSA is the gold standard and that CTA/MRA is less sensitive and what affect that may have, please. Authors: Thank you for this suggestion. In the limitations section of the Discussion on page 15 we now have stated that DSA is the gold standard, and that in future studies the use of CTA or MR angiography (which may be less sensitive to IA presence) to diagnose IAs could unintentionally give false positives for the proposed biomarker. 4. Femoral access is done with a sheath, not a true catheter. Authors: Thank you for this comment. We have now corrected this in the Methods on page 4 of the revised manuscript. Reviewer 2: 1. Were the authors blinded when they performed the analysis on the tested group (not controlled/trained)? If not, then this MUST be done first before establishing the sensitivity, specificity, accuracy, PPV & NPV? Authors: Thank you for this question. Yes. The operators who input the data into the machine learning models were blinded to the class of the subjects. Only after the predictions were made, did the operator check the prediction with the true aneurysm status (yes or no) of the patient. This is now detailed in the Methods on page 7 of the revised manuscript. Submitted filename: Response_to_Reviewers.docx Click here for additional data file. 22 Oct 2020 Whole blood transcriptome biomarkers of unruptured intracranial aneurysms PONE-D-20-25156R1 Dear Dr. Tutino, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Jinglu Ai, M.D., Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 28 Oct 2020 PONE-D-20-25156R1 Whole blood transcriptome biomarkers of unruptured intracranial aneurysm Dear Dr. Tutino: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Jinglu Ai Academic Editor PLOS ONE
  38 in total

1.  Treatment options of unruptured intracranial aneurysms.

Authors:  Seppo Juvela
Journal:  Stroke       Date:  2004-02       Impact factor: 7.914

2.  The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors:  Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

Review 3.  Guidelines for the management of aneurysmal subarachnoid hemorrhage: a statement for healthcare professionals from a special writing group of the Stroke Council, American Heart Association.

Authors:  Joshua B Bederson; E Sander Connolly; H Hunt Batjer; Ralph G Dacey; Jacques E Dion; Michael N Diringer; John E Duldner; Robert E Harbaugh; Aman B Patel; Robert H Rosenwasser
Journal:  Stroke       Date:  2009-01-22       Impact factor: 7.914

4.  Localized increase of chemokines in the lumen of human cerebral aneurysms.

Authors:  Nohra Chalouhi; Lauren Points; Gary L Pierce; Zuhair Ballas; Pascal Jabbour; David Hasan
Journal:  Stroke       Date:  2013-07-25       Impact factor: 7.914

Review 5.  Case-fatality rates and functional outcome after subarachnoid hemorrhage: a systematic review.

Authors:  J W Hop; G J Rinkel; A Algra; J van Gijn
Journal:  Stroke       Date:  1997-03       Impact factor: 7.914

6.  Exosome-Derived MiRNAs as Biomarkers of the Development and Progression of Intracranial Aneurysms.

Authors:  Bao Liao; Meng-Xiao Zhou; Feng-Kun Zhou; Xiu-Mei Luo; Song-Xin Zhong; Yuan-Fang Zhou; Yan-Sheng Qin; Ping-Ping Li; Chao Qin
Journal:  J Atheroscler Thromb       Date:  2019-10-10       Impact factor: 4.928

7.  Optimal screening strategy for familial intracranial aneurysms: a cost-effectiveness analysis.

Authors:  A Stijntje E Bor; Hendrik Koffijberg; Marieke J H Wermer; Gabriel J E Rinkel
Journal:  Neurology       Date:  2010-05-25       Impact factor: 9.910

Review 8.  Oncostatin M: a pleiotropic cytokine in the central nervous system.

Authors:  Shao-Hua Chen; Etty N Benveniste
Journal:  Cytokine Growth Factor Rev       Date:  2004-10       Impact factor: 7.638

9.  Causal analysis approaches in Ingenuity Pathway Analysis.

Authors:  Andreas Krämer; Jeff Green; Jack Pollard; Stuart Tugendreich
Journal:  Bioinformatics       Date:  2013-12-13       Impact factor: 6.937

10.  Circular RNA hsa_circ_0021001 in peripheral blood: a potential novel biomarker in the screening of intracranial aneurysm.

Authors:  Lingfang Teng; Yu Chen; Huihui Chen; Xijun He; Junyou Wang; Yujiang Peng; Hongyu Duan; Huiyong Li; Da Lin; Bo Shao
Journal:  Oncotarget       Date:  2017-11-10
View more
  3 in total

1.  Isolation of RNA from Acute Ischemic Stroke Clots Retrieved by Mechanical Thrombectomy.

Authors:  Vincent M Tutino; Sarah Fricano; Kirsten Frauens; Tatsat R Patel; Andre Monteiro; Hamid H Rai; Muhammad Waqas; Lee Chaves; Kerry E Poppenberg; Adnan H Siddiqui
Journal:  Genes (Basel)       Date:  2021-10-14       Impact factor: 4.141

Review 2.  Intracranial aneurysm wall (in)stability-current state of knowledge and clinical perspectives.

Authors:  Philippe Bijlenga; Brenda R Kwak; Sandrine Morel
Journal:  Neurosurg Rev       Date:  2021-11-06       Impact factor: 2.800

3.  Identification of the key immune-related genes in aneurysmal subarachnoid hemorrhage.

Authors:  Xing Wang; Dingke Wen; Chao You; Lu Ma
Journal:  Front Mol Neurosci       Date:  2022-09-12       Impact factor: 6.261

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.