| Literature DB >> 28770199 |
Ersoy Subasi1, Munevver Mine Subasi2, Peter L Hammer3, John Roboz4, Victor Anbalagan5, Michael S Lipkowitz6.
Abstract
The African American Study of Kidney Disease and Hypertension (AASK), a randomized double-blinded treatment trial, was motivated by the high rate of hypertension-related renal disease in the African-American population and the scarcity of effective therapies. This study describes a pattern-based classification approach to predict the rate of decline of kidney function using surface-enhanced laser desorption ionization/time of flight proteomic data from rapid and slow progressors classified by rate of change in glomerular filtration rate. An accurate classification model consisting of 7 out of 5,751 serum proteomic features is constructed by applying the logical analysis of data (LAD) methodology. On cross-validation by 10-folding, the model was shown to have an accuracy of 80.6 ± 0.11%, sensitivity of 78.4 ± 0.17%, and specificity of 78.5 ± 0.16%. The LAD discriminant is used to identify the patients in different risk groups. The LAD risk scores assigned to 116 AASK patients generated a receiver operating curves curve with AUC 0.899 (CI 0.845-0.953) and outperforms the risk scores assigned by proteinuria, one of the best predictors of chronic kidney disease progression.Entities:
Keywords: Boolean; biomarker; chronic kidney disease; combinatorics; glomerular filtration rate; logical analysis of data; proteinuria; proteomics
Year: 2017 PMID: 28770199 PMCID: PMC5516355 DOI: 10.3389/fmed.2017.00097
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Distribution of glomerular filtration rate (GFR) slope mL/min/year (blue bars) and proteinuria (red bars) in AASK.
Patient characteristics.
| Slow ( | Rapid ( | ||
|---|---|---|---|
| Glomerular filtration rate (GFR) slope | +2.18 ± 1.13 | −6.64 ± 1.38 | <0.0001 |
| GFR | 53.6 ± 11.6 | 45.3 ± 12.2 | <0.001 |
| Proteinuria | 0.10 ± 0.19 | 1.09 ± 1.37 | <0.0001 |
| Age | 53.4 ± 11.6 | 51.2 ± 12.1 | NS |
| BMI | 30.0 ± 6.0 | 32.0 ± 7.4 | NS |
| Male | 29 | 39 | <0.001 |
| Female | 30 | 18 |
Figure 2Mean surface-enhanced laser desorption ionization/time of flight (SELDI-TOF) mass spectra of slow (blue) vs rapid (red) progressors.
Figure 3Logical analysis of data pattern characteristics. Prevalence is the proportion of all rapid (slow) progressors covered by the pattern. Homogeneity is the percent of rapid (slow) progressors among all those patients covered by a positive (negative) pattern.
Figure 4Heatmap of the logical analysis of data model. Blue: slow progressors (negative class), red: fast progressors (positive class), and black: Pattern covers the observation.
Support set of surface-enhanced laser desorption ionization/time of flight masses whose intensities are used to create the logical analysis of data classification model.
| SELDI mass ( | Correlation coefficient | Correlation rank |
|---|---|---|
| 2,018 | 0.039 | 4,115 |
| 2,756 | 0.260 | 16 |
| 2,780 | 0.252 | 28 |
| 5,266 | 0.065 | 3,290 |
| 9,940 | 0.194 | 348 |
| 11,274 | 0.133 | 1,565 |
| 11,752 | 0.192 | 378 |
Figure 5Logical analysis of data model for predicting rapid vs slow progression in surface-enhanced laser desorption ionization/time of flight AASK data. Positive patterns (P) predict rapid progression and negative patterns (N) slow progression. Mass/charge ratio (m/z) is a measure of protein mass; data represent peak intensity at the individual masses. A patient’s sample fulfills fast progression positive pattern 1 (P1) if the intensity of the peak at m/z 9,940 < 0.575 and the intensity at m/z 11,274 > 0.055. Other patterns are interpreted in a similar fashion.
Cross-validation of the logical analysis of data classification model.
| Accuracy (%) | Sensitivity (%) | Specificity (%) | Hazard ratio |
|---|---|---|---|
| 80.6 ± 0.11 | 78.4 ± 0.17 | 78.5 ± 0.16 | 2.72 |
Logical analysis of data risk scores from lowest risk of progression (Group 1) to highest (Group 5).
| Risk group | # of observations | % Rapid progressors | Average risk score |
|---|---|---|---|
| 1 | 23 | 0.00 | 0.087 |
| 2 | 23 | 21.74 | 0.275 |
| 3 | 23 | 47.83 | 0.498 |
| 4 | 23 | 73.91 | 0.697 |
| 5 | 24 | 100.00 | 0.924 |
Urine protein/urine creatinine risk scores.
| Risk group | # of observations | % Rapid progressors | Average UP/UCr |
|---|---|---|---|
| 1 | 23 | 16.7 | 0.02 |
| 2 | 23 | 17.4 | 0.03 |
| 3 | 23 | 47.8 | 0.08 |
| 4 | 23 | 69.67 | 0.31 |
| 5 | 24 | 95.7 | 1.35 |
Figure 6Proportion of patients classified as rapid progressors either by proteinuria (gray bars) or logical analysis of data (LAD) risk score (red bars) in each risk quintile.