On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent

Scott Pesme; Aymeric Dieuleveut; Nicolas Flammarion

Conference Papers Year : 2020

Scott Pesme

Function : Author

Ecole Polytechnique Fédérale de Lausanne

Aymeric Dieuleveut

Function : Author
PersonId : 1109167
IdHAL : aymeric-dieuleveut
ORCID : 0009-0005-1848-1724

Centre de Mathématiques Appliquées - Ecole Polytechnique

Nicolas Flammarion

Function : Author
PersonId : 1257123

Ecole Polytechnique Fédérale de Lausanne

Abstract

Constant step-size Stochastic Gradient Descent exhibits two phases: a transient phase during which iterates make fast progress towards the optimum, followed by a stationary phase during which iterates oscillate around the optimal point. In this paper, we show that efficiently detecting this transition and appropriately decreasing the step size can lead to fast convergence rates. We analyse the classical statistical test proposed by Pflug (1983), based on the inner product between consecutive stochastic gradients. Even in the simple case where the objective function is quadratic, we show that this test cannot lead to an adequate convergence diagnostic. We then propose a novel and simple statistical procedure that accurately detects stationarity and we provide experimental results showing state-of-the-art performance on synthetic and real-world datasets.
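To make the kind of diagnostic discussed in the abstract concrete, here is a minimal sketch of a Pflug-style test: constant step-size SGD that accumulates the inner product between consecutive stochastic gradients and shrinks the step size once the running sum turns negative. It illustrates the classical test that the abstract says is analysed, not the new procedure proposed in the paper. The synthetic least-squares problem, the function name `sgd_with_pflug_restarts`, and the `burn_in` and `decay` parameters are all assumptions made for illustration.

```python
import math
import random


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def sgd_with_pflug_restarts(n_steps=5000, dim=5, step=0.05, decay=0.5,
                            burn_in=100, seed=0):
    """Constant step-size SGD with a Pflug-style stationarity check.

    Hypothetical setup: streaming least squares with Gaussian inputs
    and unit-variance Gaussian noise.
    """
    rng = random.Random(seed)
    theta_star = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # assumed optimum
    theta = [0.0] * dim
    prev_grad = None
    running_sum = 0.0      # sum of <g_k, g_{k-1}> since the last restart
    since_restart = 0
    step_history = [step]

    for _ in range(n_steps):
        # Draw one sample and form the stochastic gradient of 0.5 * (x.theta - y)^2.
        x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        y = dot(x, theta_star) + rng.gauss(0.0, 1.0)
        residual = dot(x, theta) - y
        grad = [residual * xi for xi in x]

        # Accumulate the inner product between consecutive gradients.
        if prev_grad is not None:
            running_sum += dot(grad, prev_grad)

        theta = [t - step * g for t, g in zip(theta, grad)]
        prev_grad = grad
        since_restart += 1

        # A negative running sum after the burn-in is read as stationarity:
        # shrink the step size and reset the statistic.
        if since_restart > burn_in and running_sum < 0.0:
            step *= decay
            step_history.append(step)
            prev_grad, running_sum, since_restart = None, 0.0, 0

    error = math.sqrt(sum((t - s) ** 2 for t, s in zip(theta, theta_star)))
    return theta, error, step_history
```

Calling `sgd_with_pflug_restarts()` returns the final iterate, its distance to the assumed optimum, and the sequence of step sizes used. Inspecting how early the step size gets reduced gives a feel for the behaviour of this test that the abstract questions.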

Domains

Statistics [math.ST]; Machine Learning [cs.LG]

Main file

pesme20a.pdf (800.98 KB)

Origin : Publisher files allowed on an open archive

Contributor : Aymeric Dieuleveut

https://hal.science/hal-04554421

Submitted on : Monday, April 22, 2024, 11:21:10 AM

Last modification on : Saturday, April 27, 2024, 3:10:04 AM

Dates and versions

hal-04554421 , version 1 (22-04-2024)

Identifiers

HAL Id : hal-04554421 , version 1

Cite

Scott Pesme, Aymeric Dieuleveut, Nicolas Flammarion. On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent. 37th International Conference on Machine Learning (ICML 2020), Jul 2020, Vienna (online), Austria. pp.119:7641-7651. ⟨hal-04554421⟩

Collections

X CNRS INSMI X-CMAP X-DEP-MATHA CMAP IP_PARIS
