Locating Multiple Equivalent Feature Subsets in Feature Selection for Imbalanced Classification

Shoufei Han, Kun Zhu, Meng Chu Zhou, Hesham Alhumade, Abdullah Abusorrah

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

Feature selection can be used to solve imbalanced classification problems encountered in big data projects. There often exist multiple feature subsets achieving the same accuracy. These subsets tend to differ in acquisition difficulty and reliability, thus offering decision-makers multiple choices if they can be identified. This work formulates feature selection as a Multimodal Multiobjective Problem (MMOP), in which a point on the Pareto front in objective space corresponds to multiple equivalent feature subsets in decision space. To find more equivalent feature subsets, this work proposes a new multiobjective fireworks algorithm. It extends a recent single-objective fireworks algorithm to a multiobjective version suitable for solving MMOPs. An adaptive strategy and special archive guidance are newly designed to improve its performance. A weighted extreme learning machine is chosen to classify datasets and return classification accuracy because of its fast learning speed. Experimental results show that the proposed algorithm outperforms the compared algorithms on 15 imbalanced classification datasets, comprising 5 low-dimensional feature selection problems, 5 high-dimensional feature selection problems, and 5 large-scale problems with larger imbalance ratios, and that its runtime is the lowest among them. The proposed algorithm is also applied to fault diagnosis in self-organizing cellular networks, an important imbalanced classification problem, and the results show that it performs fault diagnosis well.
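The multimodal aspect described in the abstract can be illustrated with a minimal sketch. It assumes two minimized objectives, classification error and number of selected features; the abstract does not state the exact objectives, so these, the function names, and the toy evaluations are all hypothetical. The sketch is not the proposed fireworks algorithm. It only shows how several feature subsets in decision space can share one point on the Pareto front in objective space.

```python
from collections import defaultdict


def dominates(a, b):
    """Return True if objective vector a Pareto-dominates b (all objectives minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def equivalent_subsets_on_front(evaluated):
    """Group feature subsets by objective vector and keep the non-dominated groups.

    evaluated: iterable of (feature_subset, (error_rate, n_features)) pairs.
    Returns {objective_vector: [feature_subset, ...]} for points on the Pareto front.
    Objective vectors are matched by exact equality, which is enough for this toy data.
    """
    groups = defaultdict(set)
    for subset, objs in evaluated:
        groups[objs].add(frozenset(subset))

    front = {}
    for objs, subsets in groups.items():
        if not any(dominates(other, objs) for other in groups if other != objs):
            # Sort by the sorted feature indices for a stable, readable order.
            front[objs] = sorted(subsets, key=sorted)
    return front


# Hypothetical evaluations: feature indices and (error rate, subset size).
evaluated = [
    ({0, 1}, (0.10, 2)),
    ({2, 3}, (0.10, 2)),     # same objectives as {0, 1}: an equivalent subset
    ({0}, (0.20, 1)),
    ({4}, (0.20, 1)),        # same objectives as {0}: an equivalent subset
    ({0, 1, 2}, (0.10, 3)),  # dominated by (0.10, 2)
    ({1, 4}, (0.15, 2)),     # dominated by (0.10, 2)
]

front = equivalent_subsets_on_front(evaluated)
for objs, subsets in sorted(front.items()):
    print(objs, [sorted(s) for s in subsets])
```

In this toy data each Pareto-front point is reached by two different feature subsets, which is the situation the MMOP formulation is meant to expose to decision-makers.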

Original language: English (US)
Pages (from-to): 9195-9209
Number of pages: 15
Journal: IEEE Transactions on Knowledge and Data Engineering
Volume: 35
Issue number: 9
State: Published - Sep 1 2023

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Keywords

  • Feature selection
  • fault diagnosis
  • fireworks algorithm
  • imbalanced classification
  • multimodal multiobjective problem
  • self-organizing cellular networks
