Abstract
Feature selection can be used to solve imbalanced classification problems encountered in big data projects. There often exist multiple feature subsets achieving the same accuracy. These subsets tend to exhibit different acquisition difficulty and reliability, thus offering decision-makers with multiple choices if they can be well-identified. This work formulates feature selection as a Multimodal Multiobjective Problem (MMOP), where a point on Pareto front in objective space has multiple equivalent feature subsets in decision space. To seek more equivalent feature subsets, this work proposes a new multiobjective fireworks algorithm. It extends a latest single-objective fireworks algorithm to a multiobjective version such that it becomes suitable for solving MMOP. An adaptive strategy and special archive guidance are newly designed to improve its performance. A weighted extreme learning machine is chosen to classify datasets and return classification accuracy due to its fast learning speed. Experimental results show that the proposed algorithm outperforms its compared ones on 15 imbalanced classification datasets including 5 low-dimensional, 5 high-dimensional feature selection problems and 5 large-scale problems with larger imbalanced ratio, and its runtime is the least among them. Also, fault diagnosis in self-organizing cellular networks, as an important imbalance classification problem, is performed by the proposed algorithm and the results show that it can perform fault diagnosis well.
Original language | English (US) |
---|---|
Pages (from-to) | 9195-9209 |
Number of pages | 15 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 35 |
Issue number | 9 |
DOIs | |
State | Published - Sep 1 2023 |
All Science Journal Classification (ASJC) codes
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics
Keywords
- Feature selection
- fault diagnosis
- fireworks algorithm
- imbalanced classification
- multimodal multiobjective problem
- self-organizing cellular networks