TY - GEN
T1 - Decision tree rule-based feature selection for large-scale imbalanced data
AU - Liu, Haoyue
AU - Zhou, Mengchu
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/5/15
Y1 - 2017/5/15
N2 - A class imbalance problem often appears in many real world applications, e.g. fault diagnosis, text categorization, fraud detection. When dealing with a large-scale imbalanced dataset, feature selection becomes a great challenge. To confront it, this work proposes a feature selection approach based on a decision tree rule. The effectiveness of the proposed approach is verified by classifying a large-scale dataset from Santander Bank. The results show that our approach can achieve higher Area Under the Curve (AUC) and less computational time. We also compare it with filter-based feature selection approaches, i.e., Chi-Square and F-statistic. The results show that it outperforms them but needs slightly more computational efforts.
AB - A class imbalance problem often appears in many real world applications, e.g. fault diagnosis, text categorization, fraud detection. When dealing with a large-scale imbalanced dataset, feature selection becomes a great challenge. To confront it, this work proposes a feature selection approach based on a decision tree rule. The effectiveness of the proposed approach is verified by classifying a large-scale dataset from Santander Bank. The results show that our approach can achieve higher Area Under the Curve (AUC) and less computational time. We also compare it with filter-based feature selection approaches, i.e., Chi-Square and F-statistic. The results show that it outperforms them but needs slightly more computational efforts.
KW - Decision tree
KW - feature selection
KW - large-scale imbalanced data
UR - http://www.scopus.com/inward/record.url?scp=85021442647&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021442647&partnerID=8YFLogxK
U2 - 10.1109/WOCC.2017.7928973
DO - 10.1109/WOCC.2017.7928973
M3 - Conference contribution
AN - SCOPUS:85021442647
T3 - 2017 26th Wireless and Optical Communication Conference, WOCC 2017
BT - 2017 26th Wireless and Optical Communication Conference, WOCC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 26th Wireless and Optical Communication Conference, WOCC 2017
Y2 - 7 April 2017 through 8 April 2017
ER -