Improved TrAdaBoost and its Application to Transaction Fraud Detection

Lutao Zheng, Guanjun Liu, Chungang Yan, Changjun Jiang, Mengchu Zhou, Maozhen Li

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

AdaBoost is a boosting-based machine learning method that assumes the data in the training and testing sets have the same distribution and input feature space. It increases the weights of instances that are wrongly classified during training. However, this assumption does not hold for many real-world data sets. Therefore, AdaBoost has been extended to transfer AdaBoost (TrAdaBoost), which can effectively transfer knowledge from one domain to another. TrAdaBoost decreases the weights of instances that belong to the source domain but are wrongly classified during training. It is better suited to cases in which the data come from different distributions. Can it be improved for some special transfer scenarios, e.g., when the data distribution changes slightly over time? We find that the distribution of credit card transaction data can change as users' transaction behaviors change, but the changes are slow most of the time. These changes are nevertheless important for detecting transaction fraud, since they result in a so-called concept drift problem. To make TrAdaBoost more suitable for this case, we propose an improved TrAdaBoost (ITrAdaBoost) in this article. It updates (i.e., increases or decreases) the weight of a wrongly classified instance in the source domain according to the distribution distance from the instance to the target domain, and the distance calculation is based on the theory of reproducing kernel Hilbert space. We conduct a series of experiments over five data sets, and the results illustrate the advantage of ITrAdaBoost.
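
The two ingredients named in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation. `tradaboost_update` follows the weight update of the original TrAdaBoost algorithm (Dai et al., 2007): misclassified source instances are down-weighted and misclassified target instances are up-weighted. `rkhs_distance_to_target` shows one common way to measure a distance in a reproducing kernel Hilbert space, namely the squared distance from an instance to the target domain's kernel mean embedding under an RBF kernel. The abstract does not give the ITrAdaBoost update rule or its exact distance definition, so the kernel choice, the `gamma` parameter, and all function names here are hypothetical.

```python
import numpy as np

def tradaboost_update(w_src, w_tgt, miss_src, miss_tgt, eps_t, n_rounds):
    """One round of the classic TrAdaBoost weight update (not ITrAdaBoost).

    miss_* are 0/1 arrays: 1 where the weak learner misclassified the instance.
    eps_t is the weighted error on the target data and must satisfy 0 < eps_t < 0.5.
    """
    w_src, w_tgt = np.asarray(w_src, float), np.asarray(w_tgt, float)
    miss_src, miss_tgt = np.asarray(miss_src, float), np.asarray(miss_tgt, float)
    n = len(w_src)
    beta_src = 1.0 / (1.0 + np.sqrt(2.0 * np.log(n) / n_rounds))  # fixed, < 1
    beta_tgt = eps_t / (1.0 - eps_t)                              # per round, < 1
    new_src = w_src * beta_src ** miss_src       # wrong source instance: weight shrinks
    new_tgt = w_tgt * beta_tgt ** (-miss_tgt)    # wrong target instance: weight grows
    return new_src, new_tgt

def rbf_kernel(a, b, gamma=1.0):
    """RBF kernel matrix k(a_i, b_j) = exp(-gamma * ||a_i - b_j||^2)."""
    sq = np.sum(a**2, axis=1)[:, None] + np.sum(b**2, axis=1)[None, :] - 2.0 * a @ b.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def rkhs_distance_to_target(x_src, x_tgt, gamma=1.0):
    """Squared RKHS distance from each source point to the target mean embedding.

    ||phi(x) - mu_T||^2 = k(x, x) - 2 * mean_j k(x, t_j) + mean_{j,l} k(t_j, t_l),
    where k(x, x) = 1 for the RBF kernel. (Assumed distance, for illustration only.)
    """
    k_st = rbf_kernel(x_src, x_tgt, gamma)
    k_tt = rbf_kernel(x_tgt, x_tgt, gamma)
    return 1.0 - 2.0 * k_st.mean(axis=1) + k_tt.mean()
```

In a scheme like the one the abstract describes, a per-instance distance of this kind could decide whether a misclassified source instance has its weight increased or decreased; how ITrAdaBoost actually maps distance to a weight change is specified in the article itself and is not reproduced here.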

Original language: English (US)
Article number: 9178971
Pages (from-to): 1304-1316
Number of pages: 13
Journal: IEEE Transactions on Computational Social Systems
Volume: 7
Issue number: 5
DOIs
State: Published - Oct 2020

All Science Journal Classification (ASJC) codes

  • Modeling and Simulation
  • Social Sciences (miscellaneous)
  • Human-Computer Interaction

Keywords

  • Boosting learning
  • E-commerce
  • transaction fraud detection
  • transfer learning
