Generalization Error in Deep Learning
DOI: 10.1007/978-3-319-73074-5_5
zbMATH Open: 1494.68240
arXiv: 1808.01174
OpenAlex: W2887344700
MaRDI QID: Q3296180
FDO: Q3296180
Authors: Daniel Jakubovitz, M. R. D. Rodrigues, Raja Giryes
Publication date: 7 July 2020
Published in: Applied and Numerical Harmonic Analysis
Abstract: Deep learning models have lately shown great performance in various fields such as computer vision, speech recognition, speech translation, and natural language processing. However, alongside their state-of-the-art performance, it remains generally unclear what the source of their generalization ability is. An important question is thus what makes deep neural networks able to generalize well from the training set to new data. In this article, we provide an overview of the existing theory and bounds for the characterization of the generalization error of deep neural networks, combining both classical and more recent theoretical and empirical results.
Full work available at URL: https://arxiv.org/abs/1808.01174
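For orientation, the central object of the surveyed work is the generalization error (or generalization gap). In a standard textbook formulation (stated here for context; the notation is ours and not quoted from the paper), for a learned hypothesis h, a loss function \ell, a data distribution \mathcal{D}, and an i.i.d. training sample \{(x_i, y_i)\}_{i=1}^{m}:

\[
\mathrm{gen}(h) \;=\; \mathbb{E}_{(x,y)\sim\mathcal{D}}\!\left[\ell(h(x),y)\right] \;-\; \frac{1}{m}\sum_{i=1}^{m}\ell(h(x_i),y_i).
\]

Classical uniform-convergence arguments of the kind the survey reviews bound this gap. As a standard illustration (not a result specific to this paper): for a finite hypothesis class \mathcal{H} and a loss bounded in [0,1], Hoeffding's inequality combined with a union bound gives, with probability at least 1-\delta over the draw of the sample,

\[
\sup_{h\in\mathcal{H}} \mathrm{gen}(h) \;\le\; \sqrt{\frac{\ln|\mathcal{H}| + \ln(1/\delta)}{2m}}.
\]

Much of the survey concerns how bounds of this type (VC, Rademacher, PAC-Bayes, stability, robustness) extend, or fail to extend, to deep networks whose parameter counts far exceed m.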
Recommendations
- Generalization in Deep Learning
- An analysis of training and generalization errors in shallow and deep networks
- Generalization errors of the simple perceptron
- Quantifying the generalization error in deep learning in terms of data distribution and neural network smoothness
- Generalization Error Analysis of Neural Networks with Gradient Based Regularization
- Generalization in Overparameterized Models
Cites Work
- PMTK
- Weak convergence and empirical processes. With applications to statistics
- Elements of Information Theory
- Machine learning. A probabilistic perspective
- DOI: 10.1162/153244302760200704
- Deep learning
- Understanding machine learning. From theory to algorithms
- Convolutional neural networks analyzed via convolutional sparse coding
- DOI: 10.1162/153244303321897690
- Neural Network Learning
- The sample complexity of dictionary learning
- Simplified PAC-Bayesian margin bounds
- Robustness and generalization
- Some PAC-Bayesian theorems
- Sample Complexity of Dictionary Learning and Other Matrix Factorizations
- Convergence radius and sample complexity of ITKM algorithms for dictionary learning
- Sparse and Spurious: Dictionary Learning With Noise and Outliers
- Robust Large Margin Deep Neural Networks
- The implicit bias of gradient descent on separable data
Cited In (12)
- Over-parametrized deep neural networks minimizing the empirical risk do not generalize well
- Generalization in Deep Learning
- Inference for the generalization error
- Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation
- An analysis of training and generalization errors in shallow and deep networks
- High-dimensional dynamics of generalization error in neural networks
- Quantifying the generalization error in deep learning in terms of data distribution and neural network smoothness
- Generality's price: Inescapable deficiencies in machine-learned programs
- Generalization error of GAN from the discriminator's perspective
- Title not available
- High-Dimensional Learning Under Approximate Sparsity with Applications to Nonsmooth Estimation and Regularized Neural Networks
- Optimization for deep learning: an overview