| Literature DB >> 25837352 |
Črt Ahlin1, Daša Stupica2, Franc Strle2, Lara Lusa3.
Abstract
In biomedical studies the patients are often evaluated numerous times and a large number of variables are recorded at each time-point. Data entry and manipulation of longitudinal data can be performed using spreadsheet programs, which usually include some data plotting and analysis capabilities and are straightforward to use, but are not designed for the analyses of complex longitudinal data. Specialized statistical software offers more flexibility and capabilities, but first time users with biomedical background often find its use difficult. We developed medplot, an interactive web application that simplifies the exploration and analysis of longitudinal data. The application can be used to summarize, visualize and analyze data by researchers that are not familiar with statistical programs and whose knowledge of statistics is limited. The summary tools produce publication-ready tables and graphs. The analysis tools include features that are seldom available in spreadsheet software, such as correction for multiple testing, repeated measurement analyses and flexible non-linear modeling of the association of the numerical variables with the outcome. medplot is freely available and open source, it has an intuitive graphical user interface (GUI), it is accessible via the Internet and can be used within a web browser, without the need for installing and maintaining programs locally on the user's computer. This paper describes the application and gives detailed examples describing how to use the application on real data from a clinical study including patients with early Lyme borreliosis.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25837352 PMCID: PMC4383594 DOI: 10.1371/journal.pone.0121760
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Graphical user interface in the web browser.
The screen is divided in two parts: the sidebar (left part, used to for inputs) and the main panel (right part, used for outputs). The outputs are accessible through various tabs on top of the main panel part of the screen.
Fig 2Data for first two patients of the demo data set.
Data for two Erythema migrans patients are displayed. It spans eight rows, as each of them was evaluated on four occasions. Not all recorded variables are displayed.
Main panel: tabs with statistical output.
|
|
|
|
|
|---|---|---|---|
|
| Basic overview and summary of the data. | Number of: observations in the data set, unique subjects, subjects evaluated and missing values at each evaluation occasion, subjects stratified by the grouping variable. | |
|
| Graphical exploration analysis tools. | Profile plots, Lasagna plots, Boxplots, Timeline plots | Lasagna plots, Timeline plots, Barplots |
|
| Graphs and tables with summary statistics of the outcome variables. | Medians (with 95% confidence intervals) and interquartile ranges | Proportions (with 95% CI) and number of subjects with positive outcomes for binary outcomes. |
|
| Graphs and tables with summary statistics for two groups defined by a binary grouping variable. | The two groups are compared with Mann-Whitney test | The two groups are compared with a two-sample test for equality of proportions with continuity correction. |
| Unadjusted and adjusted | |||
|
| Graphical display of hierarchical clustering results for a particular evaluation occasion. | Hierarchical clustering of the outcomes, pairwise Spearman’s correlations between outcomes, visualization of the complete data (rearranged using the grouping of outcomes and subjects obtained by their hierarchical clustering). | |
|
| Estimates univariate regression models for a particular evaluation occasion; the covariate included in the models can be numerical or categorical. | Estimates of slope coefficients obtained with univariate linear regression with their 95% confidence intervals and P values. Numerical covariates can be modelled flexibly using restricted cubic splines. | Estimates of odds ratios obtained with univariate logistic regression with their 95% confidence intervals and P values. Firth correction can be used. |
|
| Estimates mixed-effects regression models, allowing a different (random) intercept for each subject. Three types of models can be estimated: with a single covariate, with a covariate and evaluation occasion, with a covariate and time since first evaluation. | Uses linear regression mixed models; provides the estimated slopes with their 95% confidence intervals and P values. | Uses logistic regression mixed models; provides the estimated odds ratios with their 95% confidence intervals and P values. |
|
| A table with the uploaded data (or Demo data, if selected). | ||
Fig 3Summary tab output for binary variables.
The table displays the descriptive statistics for the presence of each symptom; the plot shows the observed proportions of patients that report the presence of the symptom, along with their 95% confidence intervals.
Fig 4Summary: grouping variable tab output for binary variables.
The table displays the summary statistics for the presence of symptoms at baseline for groups defined by the response to treatment at last evaluation. The proportions are compared, unadjusted and adjusted P values and Q values are provided (see text for details).
Fig 5Graphical exploration tab output for numerical variables—lasagna plot.
The heat map displays graphically the intensity of arthralgia for each patient (horizontal axis) and evaluation occasion (vertical axis). A dendrogram showing patient similarity is plotted on the vertical axis.
Fig 6Clustering tab output—heat map displaying the similarities of reported symptoms and of patients.
The colors represent the intensity of the symptoms at baseline (rows) for each patient (columns). Hierarchical clustering is used to group symptoms and patients.
Fig 7Regression model: one evaluation time tab output—estimation of non-linear associations.
The graphs display the estimated associations between the age of the patients and selected symptom intensities at baseline evaluation. Restricted cubic splines are used for modeling. See text for details.