Scalable statistical inference for averaged implicit stochastic gradient descent

Yixin Fang

Research output: Contribution to journal › Article › peer-review

13 Scopus citations

Abstract

Stochastic gradient descent (SGD) provides a scalable way to compute parameter estimates in applications involving large-scale data or streaming data. As an alternative version, averaged implicit SGD (AI-SGD) has been shown to be more stable and more efficient. Although the asymptotic properties of AI-SGD have been well established, statistical inferences based on it, such as interval estimation, remain unexplored. The bootstrap method is not computationally feasible because it requires repeatedly resampling from the entire data set. In addition, the plug-in method is not applicable when there is no explicit covariance matrix formula. In this paper, we propose a scalable statistical inference procedure, which can be used for conducting inferences based on the AI-SGD estimator. The proposed procedure updates the AI-SGD estimate, as well as many randomly perturbed AI-SGD estimates, upon the arrival of each observation. We derive some large-sample theoretical properties of the proposed procedure and examine its performance via simulation studies.
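The following is a minimal sketch of the general idea described in the abstract: a single pass over the data that maintains the AI-SGD estimate together with many randomly perturbed AI-SGD estimates, from whose spread interval estimates can be read off. It assumes a least-squares model (where the implicit SGD update has a closed form), i.i.d. Exp(1) perturbation weights, and a polynomially decaying learning rate; all function names, constants, and the exact perturbation scheme here are illustrative assumptions, not the paper's precise procedure.

```python
# Sketch: online perturbation-based inference for averaged implicit SGD (AI-SGD),
# illustrated on least-squares regression. Assumptions are noted in the lead-in.
import numpy as np

rng = np.random.default_rng(0)

def implicit_sgd_step(theta, x, y, gamma, w=1.0):
    """One (weighted) implicit SGD step for squared-error loss.

    Solves theta_new = theta + gamma * w * (y - x @ theta_new) * x in closed form.
    """
    resid = y - x @ theta
    scale = gamma * w / (1.0 + gamma * w * (x @ x))
    return theta + scale * resid * x

def ai_sgd_with_perturbations(X, Y, B=200, gamma0=1.0, alpha=0.6):
    """Single pass over the data: update the AI-SGD estimate and B randomly
    perturbed AI-SGD estimates upon the arrival of each observation."""
    n, p = X.shape
    theta = np.zeros(p)                 # implicit SGD iterate
    theta_bar = np.zeros(p)             # averaged iterate (AI-SGD estimate)
    thetas_pert = np.zeros((B, p))      # perturbed implicit SGD iterates
    thetas_pert_bar = np.zeros((B, p))  # perturbed AI-SGD estimates

    for i in range(n):
        gamma = gamma0 * (i + 1) ** (-alpha)     # decaying learning rate
        theta = implicit_sgd_step(theta, X[i], Y[i], gamma)
        theta_bar += (theta - theta_bar) / (i + 1)

        w = rng.exponential(1.0, size=B)         # random perturbation weights
        for b in range(B):
            thetas_pert[b] = implicit_sgd_step(thetas_pert[b], X[i], Y[i], gamma, w[b])
            thetas_pert_bar[b] += (thetas_pert[b] - thetas_pert_bar[b]) / (i + 1)

    return theta_bar, thetas_pert_bar

# Example: 95% componentwise intervals from the spread of the perturbed estimates.
n, p = 5000, 3
theta_true = np.array([1.0, -2.0, 0.5])
X = rng.normal(size=(n, p))
Y = X @ theta_true + rng.normal(size=n)
theta_bar, thetas_pert_bar = ai_sgd_with_perturbations(X, Y)
lo, hi = np.quantile(thetas_pert_bar, [0.025, 0.975], axis=0)
print("AI-SGD estimate:", theta_bar.round(3))
print("95% intervals:  ", list(zip(lo.round(3), hi.round(3))))
```

Because each perturbed estimate is updated in the same single pass as the main estimate, the memory and per-observation cost grow only with the number of perturbed copies B, not with the sample size, which is what makes this style of inference feasible for streaming data.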

Original language: English (US)
Pages (from-to): 987-1002
Number of pages: 16
Journal: Scandinavian Journal of Statistics
Volume: 46
Issue number: 4
DOIs
State: Published - Dec 1 2019
Externally published: Yes

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Keywords

  • big data
  • bootstrap
  • interval estimation
  • resampling
  • stochastic gradient descent
  • streaming data
