A Latent Factor Analysis-Based Approach to Online Sparse Streaming Feature Selection

Di Wu, Yi He, Xin Luo, Meng Chu Zhou

Research output: Contribution to journalArticlepeer-review

108 Scopus citations

Abstract

Online streaming feature selection (OSFS) has attracted extensive attention during the past decades. Current approaches commonly assume that the feature space of fixed data instances dynamically increases without any missing data. However, this assumption does not always hold in many real applications. Motivated by this observation, this study aims to implement online feature selection from sparse streaming features, i.e., features flow in one by one with missing data as instance count remains fixed. To do so, this study proposes a latent-factor-analysis-based online sparse-streaming-feature selection algorithm (LOSSA). Its main idea is to apply latent factor analysis to pre-estimate missing data in sparse streaming features before conducting feature selection, thereby addressing the missing data issue effectively and efficiently. Theoretical and empirical studies indicate that LOSSA can significantly improve the quality of OSFS when missing data are encountered in target instances.

Original languageEnglish (US)
Pages (from-to)6744-6758
Number of pages15
JournalIEEE Transactions on Systems, Man, and Cybernetics: Systems
Volume52
Issue number11
DOIs
StatePublished - Nov 1 2022

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Human-Computer Interaction
  • Computer Science Applications
  • Electrical and Electronic Engineering

Keywords

  • Big data
  • computational intelligence
  • latent factor analysis (LFA)
  • missing data
  • online algorithm
  • online feature selection
  • sparse streaming feature
  • streaming feature

Fingerprint

Dive into the research topics of 'A Latent Factor Analysis-Based Approach to Online Sparse Streaming Feature Selection'. Together they form a unique fingerprint.

Cite this