Convergence rates of deep ReLU networks for multiclass classification
From MaRDI portal
Abstract: For classification problems, trained deep neural networks return probabilities of class memberships. In this work we study the convergence of the learned probabilities to the true conditional class probabilities. More specifically, we consider sparse deep ReLU network reconstructions minimizing cross-entropy loss in the multiclass classification setup. Interesting phenomena occur when the class membership probabilities are close to zero. Convergence rates are derived that depend on the near-zero behaviour via a margin-type condition.
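The loss studied in the abstract can be illustrated with a minimal sketch (this is an illustration of multiclass cross-entropy on softmax probabilities only, not the paper's estimator; the logits below stand in for the output of a hypothetical ReLU network):

```python
import math

def softmax(logits):
    # Map network outputs (logits) to class-membership probabilities.
    # Subtracting the max is the standard numerical-stability trick.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def cross_entropy(probs, label):
    # Multiclass cross-entropy: minus the log-probability assigned
    # to the true class. It blows up as probs[label] approaches zero,
    # which is why near-zero class probabilities need special care.
    return -math.log(probs[label])

# Example with three classes; logits are made up for illustration.
p = softmax([2.0, 0.5, -1.0])
loss = cross_entropy(p, 0)
```

The behaviour of this loss when `probs[label]` is close to zero is exactly the regime the paper's margin-type condition controls.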
Recommendations
- Fast convergence rates of deep neural networks for classification
- Error bounds for approximations with deep ReLU networks
- On the rate of convergence of fully connected deep neural network regression estimates
- Approximation of smoothness classes by deep rectifier networks
- Nonlinear approximation and (deep) ReLU networks
Cites work
- scientific article; zbMATH DE number 1739768
- scientific article; zbMATH DE number 1420699
- A moment bound for multi-hinge classifiers
- Convexity, Classification, and Risk Bounds
- Error bounds for approximations with deep ReLU networks
- Fast convergence rates of deep neural networks for classification
- Fast learning rates for plug-in classifiers
- Inequalities for quasiconformal mappings in space
- Information-theoretic determination of minimax rates of convergence
- Minimum contrast estimators on sieves: Exponential bounds and rates of convergence
- Mutual information, metric entropy and cumulative relative entropy risk
- Nonparametric Regression Based on Hierarchical Interaction Models
- Nonparametric regression using deep neural networks with ReLU activation function
- On deep learning as a remedy for the curse of dimensionality in nonparametric regression
- On maximum likelihood estimation in infinite dimensional parameter spaces
- On the rate of convergence of fully connected deep neural network regression estimates
- Optimal aggregation of classifiers in statistical learning.
- Probability Inequalities for the Sum of Independent Random Variables
- Probability inequalities for likelihood ratios and convergence rates of sieve MLEs
- Rate-optimal estimation for a general class of nonparametric regression models with unknown link functions
- Rényi Divergence and Kullback-Leibler Divergence
- Smooth discrimination analysis
- Weak convergence and empirical processes. With applications to statistics
Cited in (12 documents)
- Analysis of convolutional neural network image classifiers in a hierarchical max-pooling model with additional local pooling
- Adaptive novelty detection with false discovery rate guarantee
- On the convergence of formally diverging neural net-based classifiers
- Convergence of deep convolutional neural networks
- Statistical theory for image classification using deep convolutional neural network with cross-entropy loss under the hierarchical max-pooling model
- Learning Theory
- Optimal convergence rates of deep neural networks in a classification setting
- Multiclass classification for multidimensional functional data through deep neural networks
- Functional data analysis using deep neural networks
- Fast convergence rates of deep neural networks for classification
- Globally Convergent Multilevel Training of Deep Residual Networks
- Monotone learning with rectified wire networks
This page was built for publication: Convergence rates of deep ReLU networks for multiclass classification
(MaRDI item Q2137813)