Literature DB >> 20667078

Novel proteins associated with risk for coronary heart disease or stroke among postmenopausal women identified by in-depth plasma proteome profiling.

Ross L Prentice1, Sophie Paczesny, Aaron Aragaki, Lynn M Amon, Lin Chen, Sharon J Pitteri, Martin McIntosh, Pei Wang, Tina Buson Busald, Judith Hsia, Rebecca D Jackson, Jacques E Rossouw, Joann E Manson, Karen Johnson, Charles Eaton, Samir M Hanash.   

Abstract

BACKGROUND: Coronary heart disease (CHD) and stroke were key outcomes in the Women's Health Initiative (WHI) randomized trials of postmenopausal estrogen and estrogen plus progestin therapy. We recently reported a large number of changes in blood protein concentrations in the first year following randomization in these trials using an in-depth quantitative proteomics approach. However, even though many affected proteins are in pathways relevant to the observed clinical effects, the relationships of these proteins to CHD and stroke risk among postmenopausal women remains substantially unknown.
METHODS: The same in-depth proteomics platform was applied to plasma samples, obtained at enrollment in the WHI Observational Study, from 800 women who developed CHD and 800 women who developed stroke during cohort follow-up, and from 1-1 matched controls. A plasma pooling strategy, followed by extensive fractionation prior to mass spectrometry, was used to identify proteins related to disease incidence, and the overlap of these proteins with those affected by hormone therapy was examined. Replication studies, using enzyme-linked-immunosorbent assay (ELISA), were carried out in the WHI hormone therapy trial cohorts.
RESULTS: Case versus control concentration differences were suggested for 37 proteins (nominal P < 0.05) for CHD, with three proteins, beta-2 microglobulin (B2M), alpha-1-acid glycoprotein 1 (ORM1), and insulin-like growth factor binding protein acid labile subunit (IGFALS) having a false discovery rate < 0.05. Corresponding numbers for stroke were 47 proteins with nominal P < 0.05, three of which, apolipoprotein A-II precursor (APOA2), peptidyl-prolyl isomerase A (PPIA), and insulin-like growth factor binding protein 4 (IGFBP4), have a false discovery rate < 0.05. Other proteins involved in insulin-like growth factor signaling were also highly ranked. The associations of B2M with CHD (P < 0.001) and IGFBP4 with stroke (P = 0.005) were confirmed using ELISA in replication studies, and changes in these proteins following the initiation of hormone therapy use were shown to have potential to help explain hormone therapy effects on those diseases.
CONCLUSIONS: In-depth proteomic discovery analysis of prediagnostic plasma samples identified B2M and IGFBP4 as risk markers for CHD and stroke respectively, and provided a number of candidate markers of disease risk and candidate mediators of hormone therapy effects on CHD and stroke. CLINICAL TRIALS REGISTRATION: ClinicalTrials.gov identifier: NCT00000611.

Entities:  

Year:  2010        PMID: 20667078      PMCID: PMC2923740          DOI: 10.1186/gm169

Source DB:  PubMed          Journal:  Genome Med        ISSN: 1756-994X            Impact factor:   11.117


Background

Blood protein concentrations provide a source for novel disease risk markers that may be modifiable by treatments or other exposures. As such, protein markers have potential to enhance the understanding of disease pathogenesis, and to elucidate biological processes whereby an exposure affects disease risk. We report here on a large-scale proteomic study that aimed to uncover novel associations between plasma proteins and the risk of subsequent coronary heart disease (CHD) or stroke. These diseases were key outcomes in Women's Health Initiative (WHI) randomized postmenopausal hormone therapy trials of 0.625 mg/d conjugated equine estrogen (E-alone), or this same preparation plus 2.5 mg/d medroxyprogesterone acetate (E+P). We also sought to identify proteins that both distinguished cases from controls and were altered by E-alone or E+P as candidate biomarkers for elucidation of hormone therapy effects on these diseases [1-6]. E-alone and E+P were each found to yield an elevation in stroke risk [3,4], whereas E+P effects were unfavorable, and unfavorable compared to E-alone effects, for CHD [5,6]. A related research effort is considering case versus control comparisons for breast cancer [7,8]. We recently reported blood proteomic changes between baseline and 1 year for 50 women assigned to active treatment in each of the E-alone and E+P trials [9,10]. An intact protein analysis system (IPAS) [11-14] was used for these analyses. Under stringent criteria for protein identification and relative quantification, 378 proteins were quantified [10]. There was some evidence (nominal P < 0.05) of change from baseline to 1 year with either or both of E-alone and E+P for a remarkable 44.7% of these proteins. These proteins were involved in coagulation, inflammation, immune response, metabolism, cell adhesion, growth factors, and osteogenesis; pathways that plausibly relate to observed clinical effects [1-8] for these regimens. A comparatively larger number of study subjects is needed to detect modest associations between plasma proteins and subsequent risk of CHD or stroke. Hence, we contrasted pools formed by equal plasma volumes from 100 cases or from 100 pair-matched controls, with eight such pool pairs for each of the study diseases. We report here on proteins, and sets of proteins, having evidence of a case-control difference in plasma concentration for CHD or stroke, and on the overlap of these proteins with those altered by E-alone or E+P. Enzyme-linked-immunosorbent assay (ELISA) replication studies in the WHI hormone therapy trial cohorts were carried out subsequently for selected proteins.

Methods

Study subjects and outcome ascertainment

Cases and controls were drawn from the WHI observational study, a prospective cohort study of 93,676 postmenopausal women in the age range 50 to 79 years at enrollment during 1993 to 1998 [15,16]. Fasting blood specimens were obtained at baseline as a part of eligibility screening. Serum and plasma samples were shipped to a central repository and stored at -70°C. Disease events during cohort follow-up were initially self-reported, followed by physician adjudication at participating WHI clinical centers, and central adjudication of some outcomes [17]. CHD was composed of myocardial infarction and death due to coronary disease. Cases of hospitalized stroke were based on rapid neurologic deficit attributable to obstruction or rupture of the arterial system or on a demonstrable lesion compatible with acute stroke. CHD and stroke cases were chosen as the earliest 800 incident cases during cohort follow-up for which a suitable plasma specimen was available. Each case was 1-1 matched to a control woman who did not develop any of the study diseases during cohort follow-up. Cases and controls were matched on baseline age (within 1 year), self-reported ethnicity, hysterectomy status, prior history of the study disease, and enrollment date (median difference 1 month). Non-overlapping sets of controls were chosen for CHD, stroke, and breast cancer. Diagnosis occurred an average of 2.2 and 4.5 years after blood draw for the CHD and stroke cases, respectively.

Sample preparation, protein fractionation, and mass spectrometry analysis

We used 3,200 patient samples (800 stroke cases, 800 CHD cases, and 1,600 controls) to form case and control pool pairs for 16 IPAS experiments (8 stroke + 8 CHD). For each IPAS experiment, a case and control pool was created using 5 μl of EDTA plasma for each of the 100 cases or 100 controls for proteomic analysis. The pools were independent, with each sample used in only one pool. The IPAS analytic methods used for this project have been described [13] and detailed information is available in Additional file 1. Following immunodepletion of the six most abundant proteins (albumin, IgG, IgA, transferrin, haptoglobin, antitrypsin), pools were concentrated and case and control pools were isotopically labeled with either the 'light' C12 or the 'heavy' C13 acrylamide. The case and corresponding control pools were then mixed together for further analysis. The combined sample was diluted, and each sample was separated into eight fractions using anion exchange chromatography, and each fraction was further separated using reversed-phase chromatography. Lyophilized aliquots from the reversed-phase fractionation were subjected to in-solution trypsin digestion, and individual digested fractions from each reversed-phase run were combined, giving a total of 96 (8 × 12 reversed-phase) fractions for analysis from each original mixed case and control pool. Tryptic peptides were analyzed by a LTQ-FT mass spectrometer. Spectra were acquired in a data-dependent mode in a mass/charge range of 400 to 1,800, and the 5 most abundant + 2 or + 3 ions were selected from each spectrum for tandem mass spectrometry (MS/MS) analysis.

Protein identification and case versus control concentration assessment

The acquired liquid chromatography MS/MS data were processed by a Computational Proteomics Analysis System [18]. Database searches were performed using X!Tandem against the human International Protein Index (IPI) using tryptic search [18]. Database search results were analyzed using PeptideProphet [19] and ProteinProphet [20]. Protein identification was based on ProteinProphet scores that indicate an error rate of less than 10%. The relative quantification of case versus control concentration for cysteine-containing peptides (acrylamide label binds to cysteine) identified by MS/MS was extracted using a script [11] that calculates the relative peak areas of heavy to light acrylamide-labeled peptides; see [10] for further details. Proteins from all IPAS experiments for a specific disease were aligned by their protein group number, assigned by ProteinProphet, in order to identify master groups of indistinguishable proteins across experiments. Ratios for these protein groups were logarithmically transformed and median-centered at zero for each IPAS experiment. Groups that had fewer than four peptide ratios across all experiments for a specific disease, groups that contained proteins that were targeted for depletion, and groups in which all proteins had been annotated as 'defunct' by IPI, were excluded from analysis.

Statistical analysis of case versus control protein concentrations

Data analysis was based on log(base2) concentration ratios from case versus control pools. The log-ratios for a particular protein were analyzed using linear models that included a disease-specific mean parameter plus a variable defined as 1 if the heavy acrylamide label was assigned to the case group and -1 otherwise. A weighted moderated t-test [21], implemented in the R package LIMMA [22], was used to examine whether there was evidence of a disease-specific mean parameter that differs from zero, after adjusting for any labeling effect. The log-ratios were weighted by the number of quantified peptides for each protein. Log-ratios for all three diseases were used to jointly estimate model parameters (the heavy acrylamide label was randomly assigned to the case or control pool for both stroke and breast cancer, and to the case pools for CHD), and to increase the degrees of freedom for log-ratio variance estimation. One of the breast cancer pool pairs gave log-ratios that were comparatively highly variable, and is excluded from all analyses. Benjamini and Hochberg's method [23] was used to accommodate multiple testing, through the calculation of estimated false discovery rates (FDRs), separately for each study disease.

Biological pathway analyses

A regularized Hotelling T2 procedure was used to identify sets of proteins, defined by biological pathways, that differ in concentrations between cases and controls for each study disease. This testing procedure takes advantage of the correlation structure among the log-ratios for proteins in a given set. Protein sets were defined using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [24,25].

ELISA replication analyses

Selected protein associations with disease risk were further evaluated by ELISA testing of CHD and stroke cases and controls drawn from the non-overlapping WHI hormone therapy trial cohorts. Baseline plasma samples were evaluated for women who developed CHD or stroke during the first year following randomization, along with 1-1 matched disease-free controls. Matching variables included age, randomization date, hysterectomy status, and prevalent study disease. Assays were performed according to manufacturer's direction, for beta-2 microglobulin (B2M; Genway San Diego, CA, USA) and insulin-like growth factor binding protein 4 (IGFBP4; R & D Systems Minneapolis, MN, USA). All samples were assayed with sample characteristics blinded and in duplicate.

Results

Plasma protein risk markers

Additional file 2 provides information on baseline characteristics for the 800 CHD and 800 stroke cases and their non-overlapping 1-1 matched controls. All women were postmenopausal and in the age range 50 to 79 years at recruitment. Most were white. About two-thirds were overweight or obese. There were few current cigarette smokers. Sixteen percent of CHD cases had experienced a myocardial infarction and 15% of stroke cases had experienced a stroke prior to WHI enrollment. Case versus control concentration ratios were determined following application of stringent standards for identification and quantification (see Methods). Following application of an additional requirement that proteins were quantified for at least two of the pool pairs for a disease, 346 proteins for CHD and 366 proteins for stroke were included in statistical analyses. Of these, a total of 37 proteins have nominal significance levels of P < 0.05 for CHD cases versus controls, compared to 17.3 expected by chance; and 47 have P < 0.05 for stroke cases versus controls, compared to 18.3 expected by chance. These proteins are listed in Tables 1 and 2 along with their mean log-intensity ratios, P-values, and FDRs.
Table 1

Proteins having some evidence (P < 0.05) of difference in concentration between coronary heart disease cases and controls

ProteinDescriptionLog(base2) case vs control ratioP-value a FDR a
B2MBeta-2-microglobulin.0.2125.07e-050.0176
ORM1Alpha-1-acid glycoprotein 10.1200.0001820.0315
IGFALSInsulin-like growth factor-binding protein complex acid labile chain-0.1120.0003840.0443
THBS1Thrombospondin-1-0.6320.001330.0749
LPAApolipoprotein(A)0.3470.001380.0749
CFDComplement factor D preproprotein0.2100.001410.0749
PRG4Isoform C of proteoglycan 40.2320.001520.0749
GPX3Glutathione peroxidase 3-0.2240.003080.133
IGFBP1Insulin-like growth factor-binding protein 10.4230.003810.146
MST1Hepatocyte growth factor-like protein homolog-0.3060.005920.205
ITIH2Inter-alpha-trypsin inhibitor heavy chain H2-0.1400.007860.247
ENO1Isoform alpha-enolase of alpha-enolase-0.4180.009500.255
C9Complement component C90.08270.009890.255
SFTPBPulmonary surfactant-associated protein B precursor0.5510.01120.255
FHL1cDNA FLJ55259 highly similar to four and a half lim domains protein 1-0.4810.01160.255
CRISP3cDNA FLJ752070.1470.01180.255
SERPIND1Serpin peptidase inhibitor clade D (heparin cofactor) member 10.2100.01760.334
CD5LCD5 antigen-like0.1520.01810.334
SOD3Extracellular superoxide dismutase [Cu-Zn]0.4530.01830.334
TPI1Triosephosphate isomerase 1 isoform 2-0.1440.02320.401
C1QBComplement component 1 Q subcomponent B chain precursor-0.1060.02710.407
ATRNIsoform 1 of attractin-0.1510.02740.407
INHBEInhibin beta E chain0.3840.02840.407
CHRDL2Isoform 2 of chordin-like protein 2-0.6470.02870.407
LIMS1cDNA FLJ55516 highly similar to particularly interesting new Cys-His protein-0.4120.03180.407
VASPVasodilator-stimulated phosphoprotein-0.4990.03560.407
C8AComplement component C8 alpha chain0.1700.03590.407
C2Complement C2 (fragment)-0.2300.03610.407
CD14Monocyte differentiation antigen CD140.1050.03610.407
GCVitamin D-binding protein-0.04510.03640.407
MTPNMyotrophin-0.2400.03720.407
SERPINF2Serpin peptidase inhibitor, clade F, member 2-0.1100.03830.407
ACTA2Actin aortic smooth muscle-1.220.03880.407
TAGLN2Transgelin-2-0.1860.04260.433
FERMT3Isoform 2 of fermitin family homolog 3-0.5600.04620.454
F12Coagulation factor XII-0.1470.04720.454
AFMAfamin-0.07640.04900.458

aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate.

Table 2

Proteins having some evidence (P < 0.05) of difference in concentration between stroke cases and controls

ProteinDescriptionLog(base2) case vs control ratioP-value a FDR a
APOA2Apolipoprotein A-II-0.1202.71e-050.00991
PPIAPeptidyl-prolyl cis-trans isomerase A0.1947.68e-050.0141
IGFBP4Insulin-like growth factor-binding protein 40.4090.0003200.0391
F2Prothrombin (fragment)-0.07320.0007020.0642
IGF2Isoform 1 of insulin-like growth factor II-0.06940.002250.138
C6Complement component 6 precursor-0.1400.002270.138
LILRA3Leukocyte immunoglobulin-like receptor subfamily a member 30.3160.003410.177
HPXHemopexin-0.04480.004070.177
IGFBP6Insulin-like growth factor-binding protein 60.6670.004350.177
LOC650157Similar to peptidyl-pro cis trans isomerase0.2370.005100.187
IGFBP2Insulin-like growth factor-binding protein 20.4800.006090.189
GCVitamin D-binding protein-0.05320.006990.189
CADM1Isoform 1 of cell adhesion molecule 1-0.1990.007620.189
PIN1Peptidyl-prolyl cis-trans isomerase NIMA-interacting 10.1900.007670.189
CTSDCathepsin D0.4900.007760.189
COL1A1Collagen alpha-1(I) chain0.1950.008260.189
F13BCoagulation factor XIII b chain0.1210.009030.194
MANSC1MANSC domain-containing protein 1-0.9620.01020.207
COL6A3Isoform 1 of collagen alpha-3(VI) chain0.8280.01090.210
GRNcDNA FLJ13286 fis clone OVARC1001154 highly similar to Homo sapiens clone 24720 epithelin 1 and 2 mRNA0.3160.01300.238
RNASE1Ribonuclease pancreatic0.5820.01430.243
MTPNMyotrophin0.2490.01460.243
GLIPR2Golgi-associated plant pathogenesis-related protein 10.6230.01680.265
ADAMTSL2ADAMTS-like protein 20.2050.01840.265
ITIH4Isoform 2 of inter-alpha-trypsin inhibitor heavy chain H4-0.2380.01870.265
HLA-DRB5bNon-secretory ribonuclease0.7840.01880.265
KLKB1Plasma kallikrein-0.1150.02020.270
CD59CD59 glycoprotein0.8660.02080.270
CD14Monocyte differentiation antigen CD140.1040.02140.270
CSF1RMacrophage colony-stimulating factor 1 receptor0.2590.02230.272
GRB2Isoform 1 of growth factor receptor-bound protein 21.580.02350.278
CD5LCD5 antigen-like0.1470.02530.289
B2MBeta-2-microglobulin0.07280.02800.310
SERPINC1Antithrombin-III-0.06310.03120.325
FCN3Isoform 1 of ficolin-30.1320.03230.325
HGFACHepatocyte growth factor activator-0.5920.03240.325
RBP4Retinol-binding protein 40.04780.03460.325
CFHR5Complement factor H-related 5-0.08000.03480.325
PRDX2Peroxiredoxin-2-0.5330.03610.325
C8AComplement component C8 alpha chain-0.1790.03730.325
ADAMTSL4Isoform 1 of ADAMTS-like protein 4-0.1300.03730.325
QSOX1Isoform 1 of sulfhydryl oxidase 10.3700.03760.325
CPB2Isoform 1 of carboxypeptidase B2-0.2280.03810.325
FETUBFetuin-B0.06620.04100.332
PPIFPeptidyl-prolyl cis-trans isomerase mitochondrial0.3180.04140.332
LCN2Neutrophil gelatinase-associated lipocalin0.1720.04170.332
DSC1Isoform 1B of desmocollin-1-0.2650.04380.341

aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate. bThe DRB5 protein group also includes ZNF749, LOC100133811, LOC100133484, LOC100133661, HLA-DRB1, HLA-DRB4, RNASE2, and HLA-DRB3.

Proteins having some evidence (P < 0.05) of difference in concentration between coronary heart disease cases and controls aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate. Proteins having some evidence (P < 0.05) of difference in concentration between stroke cases and controls aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate. bThe DRB5 protein group also includes ZNF749, LOC100133811, LOC100133484, LOC100133661, HLA-DRB1, HLA-DRB4, RNASE2, and HLA-DRB3. Proteins having small FDRs are likely to be associated with disease risk. Three proteins, B2M, alpha-1-acid glycoprotein 1 (ORM1), and insulin-like growth factor binding protein, acid labile subunit (IGFALS) have a FDR < 0.05 for association with CHD risk; and three proteins, apolipoprotein A-II precursor (APOA2), peptidyl-prolyl isomerase A (PPIA), and IGFBP4 have a FDR < 0.05 for association with stroke risk. Six other proteins have a FDR < 0.20 for CHD association, and 14 have a FDR < 0.20 for stroke association. Figure 1 shows peptide coverage and case versus control concentration ratios for B2M, ORM1, PPIA, and IGFBP4 separately for each plasma pool pair. Additional files 3 and 4 show P-values and FDRs for the entire set of proteins quantified separately for the CHD and stroke analyses. These tables also provide information on the number of peptides and unique peptides identified, and on the number of peptides and unique peptides quantified for each listed protein. IPI numbers corresponding to the gene/protein are also listed.
Figure 1

Identification and quantitative analysis of peptides in plasma. From CHD cases and controls in eight experiments for (a) beta-2 microglobulin (B2M) and (b) alpha-1-acid glycoprotein 1 (ORM1); and from stroke cases and controls in eight experiments for (c) peptidyl-prolyl isomerase A (PPIA) and (d) insulin-like growth factor binding protein 4 (IGFBP4). Tryptic peptides from the amino terminus (1) to the carboxyl terminus are shown at the top. S, C and G indicate signal peptide, cysteine-containing and glycosylated peptides, respectively. Peptides identified, but which lack cysteine for quantification, are shown in gray. The log2 case/control ratio is shown for cysteine-containing peptides with the number of MS events for that peptide shown in parentheses. The number of plasma fractions where each peptide was quantified is indicated.

Identification and quantitative analysis of peptides in plasma. From CHD cases and controls in eight experiments for (a) beta-2 microglobulin (B2M) and (b) alpha-1-acid glycoprotein 1 (ORM1); and from stroke cases and controls in eight experiments for (c) peptidyl-prolyl isomerase A (PPIA) and (d) insulin-like growth factor binding protein 4 (IGFBP4). Tryptic peptides from the amino terminus (1) to the carboxyl terminus are shown at the top. S, C and G indicate signal peptide, cysteine-containing and glycosylated peptides, respectively. Peptides identified, but which lack cysteine for quantification, are shown in gray. The log2 case/control ratio is shown for cysteine-containing peptides with the number of MS events for that peptide shown in parentheses. The number of plasma fractions where each peptide was quantified is indicated.

Protein levels that are also affected by postmenopausal hormone therapy

Table 3 shows the subset of Table 1 proteins that appeared to have concentrations affected (P < 0.05) by one or both of E+P or E-alone in earlier proteomic discovery work [10], while Table 4 provides this information for the corresponding subset of Table 2. Five of the 6 proteins having a FDR < 0.05 for disease association are influenced by hormone therapy. In addition to these, certain other IGF binding proteins are evidently influenced by hormone therapy and may be related to CHD (IGFBP1) or stroke (IGFBP2, IGFBP6).
Table 3

Proteins having some evidence (P < 0.05) of concentration difference between CHD cases and controls that are altered (P < 0.05) by postmenopausal hormone therapy

CHDE+PE-alone



ProteinDescriptionLog(base2) case vs control ratioP-value a FDR a Log(base2) case vs control ratioP-value a Log(base2) case vs control ratioP-value a
B2MBeta-2-microglobulin0.2125.07e-050.01760.2080.002050.2300.00110
IGFALSInsulin-like growth factor-binding protein complex acid labile chain-0.1120.0003840.04430.1510.007850.1430.0282
CFDComplement factor D preproprotein0.2100.001410.0749-0.2460.00871-0.04720.620
PRG4Isoform C of proteoglycan 40.2320.001520.07490.07350.1810.1280.0327
IGFBP1Insulin-like growth factor-binding protein 10.4230.003810.1460.5280.002421.2703.66e-06
MST1Hepatocyte growth factor-like protein homolog-0.3060.005920.2050.5300.01000.6330.00195
C9Complement component C90.08270.009890.2550.1010.06450.1790.00858
SERPIND1Serpin peptidase inhibitor clade D (heparin cofactor) member 10.2100.01760.3340.4500.02400.1560.344
C1QBComplement component 1 Q subcomponent B chain precursor-0.1060.02710.4070.01130.4650.04800.0125
ATRNIsoform 1 of attractin-0.1510.02740.407-0.1900.000213-0.1260.00366
INHBEInhibin beta E chain0.3840.02840.4070.2580.07230.5200.00734
CHRDL2Isoform 2 of chordin-like protein 2-0.6470.02870.407-0.3010.0415-0.0009060.993
C8AComplement component C8 alpha chain0.1700.03590.407-0.2060.000163-0.2020.000121
C2Complement C2 (fragment)-0.2300.03610.4070.3340.003710.2910.0107
GCVitamin D-binding protein-0.04510.03640.4070.2313.10e-060.2372.75e-06
SERPINF2Serpin peptidase inhibitor, clade F, member 2-0.1100.03830.4070.09220.1480.1660.0247
F12Coagulation factor XII-0.1470.04720.4540.2610.0001020.2520.000219
AFMAfamin-0.07640.04900.4580.05800.1190.1770.000330

aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate.

Table 4

Proteins having some evidence (P < 0.05) of concentration difference between stroke cases and controls that are altered (P < 0.05) by postmenopausal hormone therapy

StrokeE+PE-alone



ProteinDescriptionLog(base2) case vs control ratioP-value a FDR a Log(base2) case vs control ratioP-value a Log(base2) case vs control ratioP-value a
APOA2Apolipoprotein A-II-0.1202.71e-050.009910.2120.0005320.3021.75e-05
PPIAPeptidyl-prolyl cis-trans isomerase A0.1947.68e-050.01410.3810.008990.2010.126
IGFBP4Insulin-like growth factor-binding protein 40.4090.0003200.03910.1790.1020.5110.000697
F2Prothrombin (fragment)-0.07320.0007020.06420.06330.003660.02820.138
C6Complement component 6 precursor-0.1400.002270.138-0.1230.00151-0.1710.000123
LILRA3Leukocyte immunoglobulin-like receptor subfamily A member 30.3160.003410.177-0.2370.00874-0.2810.000277
HPXHemopexin-0.04480.004070.1770.1236.65e-050.1170.000124
IGFBP6Insulin-like growth factor-binding protein 60.6670.004350.1770.08680.2350.2070.0158
IGFBP2Insulin-like growth factor-binding protein 20.4800.006090.189-0.4200.00477-0.2870.0317
GCVitamin D-binding protein-0.05320.006990.1890.2313.10e-060.2372.75e-06
CADM1Isoform 1 of cell adhesion molecule 1-0.1990.007620.189-0.01390.8750.1800.0249
COL1A1Collagen alpha-1(I) chain0.1950.008260.189-0.8965.40e-07-0.5758.80e-05
COL6A3Isoform 1 of collagen alpha-3(VI) chain0.8280.01090.210-0.1970.00852-0.01340.834
RNASE1Ribonuclease pancreatic0.5820.01430.2430.03460.3110.09530.0427
ITIH4Isoform 2 of inter-alpha-trypsin inhibitor heavy chain H4-0.2380.01870.2650.4580.0007330.3740.00495
KLKB1Plasma kallikrein-0.1150.02020.2700.2520.002080.2300.00187
B2MBeta-2-microglobulin0.07280.02800.3100.2080.002050.2300.00110
SERPINC1Antithrombin-III-0.06310.03120.325-0.1965.05e-06-0.1435.50e-05
FCN3Isoform 1 of ficolin-30.1320.03230.3250.03510.02870.03570.0333
HGFACHepatocyte growth factor activator-0.5920.03240.325-0.1910.0979-0.3080.00765
RBP4Retinol-binding protein 40.04780.03460.3250.1670.0001170.1770.000262
CFHR5Complement factor H-related 5-0.08000.03480.3250.1790.0002640.2412.76e-05
PRDX2Peroxiredoxin-2-0.5330.03610.3250.6910.0201-0.02660.925
C8AComplement component C8 alpha chain-0.1790.03730.325-0.2060.000163-0.2020.000121
FETUBFetuin-B0.06620.04100.3320.7831.09e-090.7411.02e-09

aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate.

Proteins having some evidence (P < 0.05) of concentration difference between CHD cases and controls that are altered (P < 0.05) by postmenopausal hormone therapy aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate. Proteins having some evidence (P < 0.05) of concentration difference between stroke cases and controls that are altered (P < 0.05) by postmenopausal hormone therapy aP-value = significance level for no difference in protein concentration; FDR = estimated false discovery rate.

Protein set (pathway) analyses

For each disease, we focused attention on KEGG pathways for which relative quantification was available for three or more proteins and tested for evidence of a case versus control difference in plasma concentrations for the set of quantified proteins. For CHD there were two pathways having P < 0.05, namely a mitogen-activated protein kinase (MAPK) signaling pathway (P = 0.02), which included six quantified proteins (NTRK2, FLNA, CD14, TGFB1, FGFR1, and CACNA2D1), and a glycolysis and gluconeogenesis metabolic pathway (P = 0.03), which included nine quantified proteins (LDHB, LDHA, PKM2, ALDOA, ALDOC, TPI1, GAPDH, ENO1, PGK1). The FDRs were 0.09 for both pathways. In comparison, there were six pathways having P < 0.05 for stroke; four of which had a FDR < 0.05. These four were a hematopoietic cell lineage pathway (CD44, GP1BA, C5F1R, CD59, CD14), a purine metabolism pathway (AK1, AK2, PKM2), a peroxisome proliferator-activated receptor signaling pathway (APOA2, FABP4, FABP1), and a glycolysis and gluconeogenesis pathway having a set of quantified proteins (PKM2, ALDOA, ALDOC, ALDOB, TPI1, ENO2, GAPDH, ENO1, PGK1) that strongly overlaps that listed above for CHD. Figure 2 shows the substantial peptide coverage of glycolytic pathway proteins in the stroke IPAS experiments.
Figure 2

Glycolysis/gluconeogenesis pathway. Enzymes identified in stroke experiments are indicated by shading. Red and yellow indicate increased and no change in cases compared to controls, respectively. Gray indicates proteins identified but not quantified.

Glycolysis/gluconeogenesis pathway. Enzymes identified in stroke experiments are indicated by shading. Red and yellow indicate increased and no change in cases compared to controls, respectively. Gray indicates proteins identified but not quantified.

ELISA replication studies

B2M is of specific interest for CHD in view of higher levels in cases versus controls, and higher levels following 1-year of use of either E+P or E-alone (Table 3). IGFBP4 is of specific interest for stroke for these same reasons (Table 4). Hence, these proteins were selected for ELISA replication studies in the WHI hormone therapy trial cohorts. Based on individual plasma samples from 106 CHD cases occurring during the first year following randomization in the hormone therapy trials, and from 1-1 matched controls, ELISA evaluation yielded B2M concentrations that were 17.9% higher (P < 0.001) in cases versus controls (geometric mean of log-ratios of 1.179 with 95% confidence interval (CI) of 1.107 to 1.290), very similar to the 15.8% (20.212 = 1.158) higher concentration in cases compared to controls from the IPAS analyses of Table 1. Further analysis of case versus control log-ratios, which included the matching variables and several other CHD risk factors to control for possible confounding, produced similar findings (geometric mean of 1.275 with 95% CI of 1.122 to 1.450). Based on individual plasma samples from 68 stroke cases occurring during the first year following randomization in the hormone therapy trials, and from 1-1 matched controls, ELISA evaluation yielded IGFBP4 concentrations that were 16.6% higher (P = 0.005) in cases versus controls (geometric mean of log-ratios of 1.166 with 95% CI of 1.050 to 1.295). The ELISA case versus control ratio was little altered by additional control for several other potential stroke confounding factors (geometric mean of 1.149 with 95% CI of 1.008 to 1.309 following this control). Figure 3 shows the B2M assessments for individual CHD cases and controls and the IGFBP4 assessments for individual stroke cases and controls in these replication studies.
Figure 3

Baseline plasma B2M concentrations for CHD cases and controls, and IGFBP4 concentrations for stroke cases and controls, from the Women's Health Initiative hormone therapy trials. Individual ELISA-based concentrations are shown along with boxplots showing the median (dark line) and the 25th and 75th percentiles (bottom and top of box). The notches indicate 95% confidence intervals for the median.

Baseline plasma B2M concentrations for CHD cases and controls, and IGFBP4 concentrations for stroke cases and controls, from the Women's Health Initiative hormone therapy trials. Individual ELISA-based concentrations are shown along with boxplots showing the median (dark line) and the 25th and 75th percentiles (bottom and top of box). The notches indicate 95% confidence intervals for the median.

Discussion

The proteomic discovery and replication studies presented here show plasma B2M to be a risk marker for CHD in postmenopausal women. B2M is an amyloidogenic protein that is elevated in hemodialysis patients and in patients having bone disease [26,27]. B2M has been reported to be associated with CHD risk factors, and an inverse association with HDL cholesterol [28]. Positive associations with peripheral arterial disease [29] and with total mortality among elderly Japanese men and women [30] have also been reported. Our finding of B2M elevation in plasma obtained months or years prior to CHD diagnosis appears to be novel. Logistic regression analysis of ELISA B2M data yield odds ratios (95% CI) for the second, third, and fourth quartile of B2M, compared to the first, of 1.28 (0.46, 3.53), 1.77 (0.63, 4.96), and 3.40 (1.23, 9.35), with a trend test having P = 0.002, in analyses that control for case-control matching factors as well as hormone therapy randomization assignment, hysterectomy status, ethnicity, and history of myocardial infarction. From Table 3 we see that B2M levels increased by an estimated 15.5% (20.208 = 1.155) following E+P use and by 17.3% (20.230 = 1.173) following E-alone use. A 16% elevation in B2M projects a CHD odds ratio (95% CI) of 1.30 (1.11, 1.54) based on a logistic regression analysis with a linear term in log B2M, as determined by ELISA, and these same confounding control variables. Hence, the B2M elevation resulting from hormone therapy use could contribute importantly to an explanation for observed early elevations in CHD risk. The fact that CHD elevations evidently dissipate with longer-term hormone therapy use [5,6] could, for example, reflect concurrent favorable changes in plasma cholesterol fractions, especially for E-alone. Our proteomic discovery work also suggests (Table 4; P = 0.03) higher B2M levels in stroke cases versus controls, so that this marker may help to understand adverse effects of hormone therapy on cardiovascular disease more generally. The B2M we identified in prediagnostic plasma samples likely differs from modified forms in non-osteotendinous fibrils or insoluble cardiac deposits [31]. However, B2M may provide a valuable focus for studies of disease mechanism and therapeutic intervention in spite of uncertainties about the relationship of plasma levels and pathophysiologic effects within tissue. The discovery and replication studies presented here also show IGFBP4 to be a risk marker for stroke in postmenopausal women, which appears to be a novel finding. Logistic regression analyses that include a linear term in log IGFBP4 along with the case-control matching variables, hormone therapy randomization assignment, systolic and diastolic blood pressure, body mass index, and indicator variables for cigarette smoking, diabetes, and prior hormone therapy use yield a P-value of 0.018 for an association of IGFBP4 with stroke risk. A 20% increase in IGFBP4, as is consistent with the effects of E-alone and E+P on IGFBP4, projects an odds ratio (95% CI) of 1.40 (1.06, 1.85) in these analyses, suggesting that this marker could contribute importantly to a mechanistic explanation for the approximate 40% higher incidence of stroke among E-alone and E+P users in the WHI randomized trial [3,4]. Also, it is interesting that four of the eleven top-ranked proteins for association with stroke risk (Table 2) are members of the IGF signaling pathway (IGFBP4, IGF2, IGFBP6, IGFBP2). There have been some previous reports of associations between IGF pathway proteins and stroke [32-34]. Increased IGF binding protein levels may result in decreased IGF protein concentrations. IGF1 has been proposed as a potential neuroprotective protein for stroke [35]. To more directly assess the role of B2M and IGFBP4 in mediating hormone therapy effects on CHD and stroke, respectively, we are currently carrying out ELISA analyses of baseline and 1-year plasma samples in the WHI hormone therapy trials. The effect of changes between baseline and 1-year on these proteins on subsequent hormone therapy hazard ratios for CHD and stroke will be examined. Other proteins having small FDRs for association with CHD (Table 1) or stroke (Table 2) will benefit from evaluation in replication studies. Some of these have previously received some consideration as vascular disease risk markers, including ORM1 [36-40], APOA2 [41-43], PPIA [44], and IGFALS [45-47]. In addition to protein set analyses based on KEGG pathways (described in Results), we also examined Gene Ontology [48] pathways related to inflammation. There was some evidence (P = 0.03) for a difference between CHD cases and controls for a cytokine activity pathway (CCL5, C5, PF4, and CCL16), and some (P = 0.04) for an acute inflammatory response pathway (ORM1, ORM2, C2, CFHR1, MBL2, AHSG), whereas there was no evidence of corresponding differences between stroke cases and controls.

Conclusions

We have identified B2M and IGFBP4 as novel risk markers for CHD and stroke, respectively. These markers have potential to help elucidate hormone therapy effects on these diseases as observed in the WHI randomized controlled trials. The IPAS platform [11-14] provides quantification only for proteins having cysteine residues, but otherwise our analyses benefit from the depth of the proteomic profiling. Concentration ratios associated with hormone therapy in our earlier IPAS studies agreed closely with ELISA-based ratios from the same samples [9], and IPAS concentration ratios for E-alone and E+P agreed closely with each other for many proteins identified as hormone-therapy related. These comparisons suggest that a number of additional proteins with small FDRs (for example, < 0.2) in Tables 1 and 2 are likely also to be disease risk markers, though it will be important for these associations to be replicated in independent samples.

Abbreviations

APOA2: apolipoprotein A-II precursor; B2M: beta-2 microglobulin; CHD: coronary heart disease; CI: confidence interval; E-alone: estrogen-alone; E+P: estrogen plus progestin; ELISA: enzyme-linked immunosorbent assay; FDR: false discovery rate; IGFALS: insulin-like growth factor-binding protein acid labile subunit; IGFBP4: insulin-like growth factor binding protein 4; IPAS: intact protein analysis system; IPI: International Protein Index; KEGG: Kyoto Encyclopedia of Genes and Genomes; MS/MS: tandem mass spectrometry; ORM1: alpha-1-acid glycoprotein 1; PPIA: peptidyl-prolyl isomerase A; WHI: Women's Health Initiative.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

RLP, LMA, LC, SJP (FHCRC), JH, RDJ, JER, JEM, CE, and SMH participated in drafting the manuscript. Data were collected, analyzed, and interpreted by RLP, SJP (University of Michigan), LMA, SJP (FHCRC), MM, TBB, KJ, and SMH. RLP and SMH were responsible for study design. Statistical analysis was performed by AA, LC, MM, PW, and RLP.

Additional file 1

Supplementary methods. Detailed methods for sample preparation, protein fractionation, and mass spectrometry analysis are described. Click here for file

Additional file 2

Table S1. Baseline characteristics for women developing coronary heart disease (CHD) or stroke and for corresponding disease-free controls, drawn from the Women's Health Initiative Observational Study. Click here for file

Additional file 3

Table S2. CHD case versus control log-transformed concentration ratios for all quantified proteins. Click here for file

Additional file 4

Table S3. Stroke case versus control log(base2)-transformed concentration ratios for all quantified proteins. Click here for file
  45 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

Review 2.  The acid-labile subunit (ALS) of the 150 kDa IGF-binding protein complex: an important but forgotten component of the circulating IGF system.

Authors:  Y R Boisclair; R P Rhoads; I Ueki; J Wang; G T Ooi
Journal:  J Endocrinol       Date:  2001-07       Impact factor: 4.286

Review 3.  Mining the plasma proteome for cancer biomarkers.

Authors:  Samir M Hanash; Sharon J Pitteri; Vitor M Faca
Journal:  Nature       Date:  2008-04-03       Impact factor: 49.962

4.  Serum amyloid A protein levels in haemodialysis patients.

Authors:  H Polat; M Yeksan; M Dalmaz; B Kaptanoglu; A Koşar; I Akkuş
Journal:  Nephrol Dial Transplant       Date:  1996-07       Impact factor: 5.992

5.  Inflammatory and hemostatic markers in relation to cardiovascular prognosis in patients with stable angina pectoris. Results from the APSIS study. The Angina Prognosis Study in Stockholm.

Authors:  C Held; P Hjemdahl; N Håkan Wallén; I Björkander; L Forslund; B Wiman; N Rehnqvist
Journal:  Atherosclerosis       Date:  2000-01       Impact factor: 5.162

Review 6.  Alpha-1-acid glycoprotein.

Authors:  T Fournier; N Medjoubi-N; D Porquet
Journal:  Biochim Biophys Acta       Date:  2000-10-18

7.  Design of the Women's Health Initiative clinical trial and observational study. The Women's Health Initiative Study Group.

Authors: 
Journal:  Control Clin Trials       Date:  1998-02

8.  Beta-2 microglobulin is an amyloidogenic protein in man.

Authors:  P D Gorevic; T T Casey; W J Stone; C R DiRaimondo; F C Prelli; B Frangione
Journal:  J Clin Invest       Date:  1985-12       Impact factor: 14.808

9.  Serum glycoproteins and severity of coronary atherosclerosis.

Authors:  T Mori; J Sasaki; H Kawaguchi; K Handa; Y Takada; A Matsunaga; S Kono; K Arakawa
Journal:  Am Heart J       Date:  1995-02       Impact factor: 4.749

10.  Relation of C-reactive protein and coronary heart disease in the MRFIT nested case-control study. Multiple Risk Factor Intervention Trial.

Authors:  L H Kuller; R P Tracy; J Shaten; E N Meilahn
Journal:  Am J Epidemiol       Date:  1996-09-15       Impact factor: 4.897

View more
  27 in total

1.  A regularized Hotelling's T2 test for pathway analysis in proteomic studies.

Authors:  Lin S Chen; Debashis Paul; Ross L Prentice; Pei Wang
Journal:  J Am Stat Assoc       Date:  2011-12       Impact factor: 5.033

Review 2.  Correlation of imaging and histopathology of thrombi in acute ischemic stroke with etiology and outcome: a systematic review.

Authors:  Waleed Brinjikji; Sharon Duffy; Anthony Burrows; Werner Hacke; David Liebeskind; Charles B L M Majoie; Diederik W J Dippel; Adnan H Siddiqui; Pooja Khatri; Blaise Baxter; Raul Nogeuira; Matt Gounis; Tudor Jovin; David F Kallmes
Journal:  J Neurointerv Surg       Date:  2016-05-10       Impact factor: 5.836

3.  Progress in mining the human proteome for disease applications.

Authors:  Sam Hanash
Journal:  OMICS       Date:  2011-03

Review 4.  Application of proteomics to cerebrovascular disease.

Authors:  Mingming Ning; Mary Lopez; Jing Cao; Ferdinando S Buonanno; Eng H Lo
Journal:  Electrophoresis       Date:  2012-12       Impact factor: 3.535

5.  Genome-wide association study identified the human leukocyte antigen region as a novel locus for plasma beta-2 microglobulin.

Authors:  Adrienne Tin; Brad C Astor; Eric Boerwinkle; Ron C Hoogeveen; Josef Coresh; Wen Hong Linda Kao
Journal:  Hum Genet       Date:  2013-02-16       Impact factor: 4.132

6.  Mass Spectrometry-based Proteomics and Peptidomics for Systems Biology and Biomarker Discovery.

Authors:  Robert Cunningham; Di Ma; Lingjun Li
Journal:  Front Biol (Beijing)       Date:  2012-08-01

7.  β2-Microglobulin and TIMP1 Are Linked Together in Cardiorenal Remodeling and Failure.

Authors:  Annamaria Vianello; Laura Caponi; Fabio Galetta; Ferdinando Franzoni; Marco Taddei; Marco Rossi; Pietro Pietrini; Gino Santoro
Journal:  Cardiorenal Med       Date:  2014-12-12       Impact factor: 2.041

8.  Protein biomarkers of new-onset cardiovascular disease: prospective study from the systems approach to biomarker research in cardiovascular disease initiative.

Authors:  Xiaoyan Yin; Subha Subramanian; Shih-Jen Hwang; Christopher J O'Donnell; Caroline S Fox; Paul Courchesne; Pieter Muntendam; Neal Gordon; Aram Adourian; Peter Juhasz; Martin G Larson; Daniel Levy
Journal:  Arterioscler Thromb Vasc Biol       Date:  2014-02-13       Impact factor: 8.311

9.  Plasma Retinol-Binding Protein 4 Levels and the Risk of Ischemic Stroke among Women.

Authors:  Pamela M Rist; Monik C Jiménez; Shelley S Tworoger; Frank B Hu; JoAnn E Manson; Qi Sun; Kathryn M Rexrode
Journal:  J Stroke Cerebrovasc Dis       Date:  2017-09-06       Impact factor: 2.136

10.  The Active Compounds of Yixin Ningshen Tablet and Their Potential Action Mechanism in Treating Coronary Heart Disease- A Network Pharmacology and Proteomics Approach.

Authors:  Xing Lv; Huijun Wang; Ruoming Wu; Xiaoyan Shen; Guan Ye
Journal:  Evid Based Complement Alternat Med       Date:  2020-01-25       Impact factor: 2.629

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.