Online bootstrap confidence intervals for the stochastic gradient descent estimator

Yixin Fang, Jinfeng Xu, Lei Yang

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

In many applications involving large dataset or online learning, stochastic gradient descent (SGD) is a scalable algorithm to compute parameter estimates and has gained increasing popularity due to its numerical convenience and memory efficiency. While the asymptotic properties of SGD-based estimators have been well established, statistical inference such as interval estimation remains much unexplored. The classical bootstrap is not directly applicable if the data are not stored in memory. The plug-in method is not applicable when there is no explicit formula for the covariance matrix of the estimator. In this paper, we propose an online bootstrap procedure for the estimation of confidence intervals, which, upon the arrival of each observation, updates the SGD estimate as well as a number of randomly perturbed SGD estimates. The proposed method is easy to implement in practice. We establish its theoretical properties for a general class of models that includes linear regressions, generalized linear models, M-estimators and quantile regressions as special cases. The finite-sample performance and numerical utility is evaluated by simulation studies and real data applications.

Original languageEnglish (US)
JournalJournal of Machine Learning Research
Volume19
StatePublished - Dec 1 2018

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence
  • Control and Systems Engineering
  • Statistics and Probability

Keywords

  • Bootstrap
  • Generalized linear models
  • Interval estimation
  • Large datasets
  • M-estimators
  • Quantile regression
  • Resampling methods
  • Stochastic gradient descent

Fingerprint

Dive into the research topics of 'Online bootstrap confidence intervals for the stochastic gradient descent estimator'. Together they form a unique fingerprint.

Cite this