| Literature DB >> 25302317 |
K R Seeja1, Masoumeh Zareapoor1.
Abstract
This paper proposes an intelligent credit card fraud detection model for detecting fraud from highly imbalanced and anonymous credit card transaction datasets. The class imbalance problem is handled by finding legal as well as fraud transaction patterns for each customer by using frequent itemset mining. A matching algorithm is also proposed to find to which pattern (legal or fraud) the incoming transaction of a particular customer is closer and a decision is made accordingly. In order to handle the anonymous nature of the data, no preference is given to any of the attributes and each attribute is considered equally for finding the patterns. The performance evaluation of the proposed model is done on UCSD Data Mining Contest 2009 Dataset (anonymous and imbalanced) and it is found that the proposed model has very high fraud detection rate, balanced classification rate, Matthews correlation coefficient, and very less false alarm rate than other state-of-the-art classifiers.Entities:
Mesh:
Year: 2014 PMID: 25302317 PMCID: PMC4180893 DOI: 10.1155/2014/252797
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Proposed credit card fraud detection model.
Algorithm 1Training algorithm.
Algorithm 2Testing algorithm.
Imbalanced data.
| Number of customers | Number of transactions in training set | Number of transactions in testing set | ||||
|---|---|---|---|---|---|---|
| Legal | Fraud | Total | Legal | Fraud | Total | |
| 200 | 652 | 25 | 677 | 489 | 17 | 506 |
| 400 | 1226 | 48 | 1274 | 864 | 30 | 894 |
| 600 | 1716 | 64 | 1780 | 1244 | 48 | 1292 |
| 800 | 2169 | 71 | 2240 | 1612 | 57 | 1669 |
| 1000 | 2604 | 131 | 2735 | 2002 | 102 | 2104 |
| 1200 | 3056 | 157 | 3113 | 2604 | 144 | 2748 |
| 1400 | 3440 | 158 | 3598 | 3083 | 147 | 3230 |
Figure 2Performance comparison of classifiers on sensitivity.
Figure 3Performance comparison of classifiers on false alarm rate.
Figure 4Performance comparison of classifiers on balanced classification rate.
Figure 5Performance comparison of classifiers on MCC.
Overlapped dataset.
| custAttr1 | amount | hour1 | zip1 | field1 | field2 | fielsd3 | field4 | indicator1 | indicator2 | flag1 | flag2 | flag3 | flag4 | flag5 | Class |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1234567890123867 | 12.95 | 9 | 432 | 3 | 0 | 5454 | 10 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 |
| 1234567890123867 | 12.95 | 9 | 432 | 3 | 0 | 5454 | 10 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |