| Literature DB >> 30452481 |
Alec Davies1, Mark A Green1, Alex D Singleton1.
Abstract
The availability alongside growing awareness of medicine has led to increased self-treatment of minor ailments. Self-medication is where one 'self' diagnoses and prescribes over the counter medicines for treatment. The self-care movement has important policy implications, perceived to relieve the National Health Service (NHS) burden, increasing patient subsistence and freeing resources for more serious ailments. However, there has been little research exploring how self-medication behaviours vary between population groups due to a lack of available data. The aim of our study is to evaluate how high street retailer loyalty card data can help inform our understanding of how individuals self-medicate in England. Transaction level loyalty card data was acquired from a national high street retailer for England for 2012-2014. We calculated the proportion of loyalty card customers (n ~ 10 million) within Lower Super Output Areas who purchased the following medicines: 'coughs and colds', 'Hayfever', 'pain relief' and 'sun preps'. Machine learning was used to explore how 50 sociodemographic and health accessibility features were associated towards explaining purchasing of each product group. Random Forests are used as a baseline and Gradient Boosting as our final model. Our results showed that pain relief was the most common medicine purchased. There was little difference in purchasing behaviours by sex other than for sun preps. The gradient boosting models demonstrated that socioeconomic status of areas, as well as air pollution, were important predictors of each medicine. Our study adds to the self-medication literature through demonstrating the usefulness of loyalty card records for producing insights about how self-medication varies at the national level. Big data offer novel insights that add to and address issues that traditional studies are unable to consider. New forms of data through data linkage may offer opportunities to improve current public health decision making surrounding at risk population groups within self-medication behaviours.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30452481 PMCID: PMC6242371 DOI: 10.1371/journal.pone.0207523
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of machine learning model performance.
| Coughs and colds | Hayfever | Pain relief | Sun preps | |||||
|---|---|---|---|---|---|---|---|---|
| Random Forests | XGBoost | Random Forests | XGBoost | Random Forests | XGBoost | Random Forest | XGBoost | |
| Training sample size | 70% | 70% | 70% | 70% | 70% | 70% | 70% | 70% |
| Learning Rate | 0.01 | 0.01 | 0.01 | 0.01 | ||||
| Gamma | 0 | 0 | 0 | 0 | ||||
| Minimum child weight | 1 | 1 | 1 | 1 | ||||
| Column subsample | .33 | .7 | .33 | .7 | .33 | .7 | .33 | .7 |
| Row subsample | .8 | .8 | .8 | .8 | ||||
| Maximum depth | 6 | 6 | 6 | 6 | ||||
| Rounds | 500 | 5000 | 500 | 5000 | 500 | 5000 | 500 | 5000 |
| R2 | .5030 | .5010 | .5881 | .5993 | .6010 | .6063 | .6148 | .6379 |
| RMSE | .0492 | .0493 | .0391 | .0388 | .0427 | .0423 | .0475 | .0460 |
| Run Time (minutes) | 10 | 2 | 10 | 2 | 10 | 2 | 10 | 2 |
Learning rate = step size shrinkage used to make model conservative; Gamma = minimum loss reduction to make further partition; Minimum child weight = minimum instance weight needed in a child; Maximum depth = maximum depth of a tree (number of splits) [28]; RMSE = Root Mean Squared Error
Fig 1Proportion per local authority level of self-medication products by gender.
Fig 2Proportion per local authority level of self-medication products.
(Top left coughs and colds, top right Hayfever, bottom left pain relief, bottom right sun preps).
Fig 3Rank comparison of feature importance.
(Note: ‘Decile’ refers to the decile of ranks from XGBoost).
Fig 4Partial dependency plots.
(Note: top 5 products from each XGBoost model).