| Literature DB >> 25560347 |
Hossein Joudaki, Arash Rashidian1, Behrouz Minaei-Bidgoli, Mahmood Mahmoodi, Bijan Geraili, Mahdi Nasiri, Mohammad Arab.
Abstract
Inappropriate payments by insurance organizations or third party payers occur because of errors, abuse and fraud. The scale of this problem is large enough to make it a priority issue for health systems. Traditional methods of detecting health care fraud and abuse are time-consuming and inefficient. Combining automated methods and statistical knowledge lead to the emergence of a new interdisciplinary branch of science that is named Knowledge Discovery from Databases (KDD). Data mining is a core of the KDD process. Data mining can help third-party payers such as health insurance organizations to extract useful information from thousands of claims and identify a smaller subset of the claims or claimants for further assessment. We reviewed studies that performed data mining techniques for detecting health care fraud and abuse, using supervised and unsupervised data mining approaches. Most available studies have focused on algorithmic data mining without an emphasis on or application to fraud detection efforts in the context of health service provision or health insurance policy. More studies are needed to connect sound and evidence-based diagnosis and treatment approaches toward fraudulent or abusive behaviors. Ultimately, based on available studies, we recommend seven general steps to data mining of health care claims.Entities:
Mesh:
Year: 2014 PMID: 25560347 PMCID: PMC4796421 DOI: 10.5539/gjhs.v7n1p194
Source DB: PubMed Journal: Glob J Health Sci ISSN: 1916-9736
Primary studies that used data mining for detecting health care fraud and abuse
| Study Topic (Country) | The first author(year) | Data mining approach | Type of detected fraud | Applied data mining technique (s) |
|---|---|---|---|---|
| Healthcare fraud detection: A survey and a clustering model incorporating Geo-location information (US) | Unsupervised | Insurance subscribers’ fraud | Clustering | |
| Application of Bayesian Methods in Detection of Healthcare Fraud (-) | Unsupervised | Conspiracy fraud which involves more than one party | Bayesian co-clustering | |
| Unsupervised labeling of data for supervised learning and its application to medical claims prediction (US) | Hybrid supervised and unsupervised | Provider fraud (Obstetrics claims) | Unsupervised data labeling and outlier detection, classification and regression | |
| Outlier based predictors for health insurance fraud detection within U.S. Medicaid (US) | Unsupervised | Provider fraud (Dental claim data) | Outlier detection | |
| A scoring model to detect abusive billing patterns in health insurance claims (Korea) | Supervised | Provider fraud (Outpatient clinics) | Six statistical techniques — correlation analysis, logistic regression and classification tree | |
| A fraud detection approach with data mining in health insurance (Turkey) | Supervised | Provider fraud | Support vector machine (SVM) | |
| Applying Business Intelligence Concepts to Medicaid Claim Fraud Detection (US) | Unsupervised | Provider fraud | Visualization by histogram | |
| A prescription fraud detection model (Turkey) | Hybrid supervised and unsupervised | Prescription fraud | Distance based correlation and risked matrices | |
| Unsupervised fraud detection in Medicare Australia (Australia) | Unsupervised | Insurance subscribers’ fraud | Clustering, feature selection and outlier detection | |
| Two models to investigate Medicare fraud within unsupervised databases (US) | Unsupervised | Provider fraud | Clustering algorithms, regression analysis, and various descriptive statistics | |
| Data mining to predict and prevent errors in health insurance claims processing (US) | Supervised | Error in providers claims | Support vector machine (SVM) | |
| Discovering inappropriate billings with local density based outlier detection method (Australia) | Unsupervised | Provider fraud (Optometrists Billing) | Local density based outlier detection | |
| Mining medical specialist billing patterns for health service management (Australia) | Unsupervised | Provider fraud (Specialist billing) | Association rules | |
| Detecting hospital fraud and claim abuse through diabetic outpatient services (Taiwan) | Supervised | Provider fraud (Diabetic outpatient services) | Logistic regression, neural network, and classification trees | |
| A process-mining framework for the detection of healthcare fraud and abuse (Taiwan) | Supervised | Provider fraud (Gynecology services) | Classification based on associations algorithm, feature selection by Markov blanket filter | |
| A medical claim fraud/abuse detection system based on data mining: a case study in Chile (Chile) | Supervised | Provider fraud | Neural network | |
| EFD: A Hybrid Knowledge/Statistical-Based System for the Detection of Fraud (US) | Hybrid supervised and unsupervised | Provider fraud | Outlier detection and rule extraction | |
| Application of Genetic Algorithms and k-Nearest Neighbour method in real world medical fraud detection problem (Australia) | Unsupervised | Provider fraud (General practitioners) | Genetic algorithm and K-Nearest Neighbor clustering | |
| Evolutionary Hot Spots data mining: architecture for exploring for interesting Discoveries (Australia). | Williams (1999) | Hybrid supervised and unsupervised | Insurance subscribers’ fraud | Clustering and rule induction |
| Mining the knowledge mine: The Hot Spots methodology for mining large real world databases (Australia) | Hybrid supervised and unsupervised | Insurance subscribers’ fraud | Clustering and C5.0 classification algorithm | |
| Application of neural networks to detection of medical fraud (Australia) | He (1997) | Supervised | Provider fraud (General practitioners) | Neural network |