The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions

Recommendations

Cited in

(43)

Sparsity through evolutionary pruning prevents neuronal networks from overfitting
Learning long-term dependencies by the selective addition of time-delayed connections to recurrent neural networks
NSNO: Neumann series neural operator for solving Helmholtz equations in inhomogeneous medium
Bayesian framework for simulation of dynamical systems from multidimensional data using recurrent neural network
Generalised latent assimilation in heterogeneous reduced spaces with machine learning surrogate models
Unification of popular artificial neural network activation functions
A neural network ensemble approach for GDP forecasting
Robust High-Dimensional Regression with Coefficient Thresholding and Its Application to Imaging Data Analysis
scientific article; zbMATH DE number 7714085 (Why is no real title available?)
The Study of Architecture MLP with Linear Neurons in Order to Eliminate the “vanishing Gradient” Problem
Incorporating financial news for forecasting Bitcoin prices based on long short-term memory networks
Absence of Barren plateaus and scaling of gradients in the energy optimization of isometric tensor network states
Activation function design for deep networks: linearity and effective initialisation
State estimation with limited sensors -- a deep learning based approach
Multi-domain encoder-decoder neural networks for latent data assimilation in dynamical systems
A Taxonomy for Spatiotemporal Connectionist Networks Revisited: The Unsupervised Case
Application of neural network to model rainfall pattern of Ethiopia
Sensitivity -- local index to control chaoticity or gradient globally
Learning model predictive control with long short‐term memory networks
Deep learning applied to wind power forecasting: a spatio-temporal approach
Gradient explosion free algorithm for training recurrent neural networks
Learning deep implicit Fourier neural operators (IFNOs) with applications to heterogeneous material modeling
Successfully and efficiently training deep multi-layer perceptrons with logistic activation function simply requires initializing the weights with an appropriate negative mean
A homotopy gated recurrent unit for predicting high dimensional hyperchaos
Recurrent neural network-based internal model control design for stable nonlinear systems
Nonlocal kernel network (NKN): a stable and resolution-independent deep neural network
Application of Attention Technique for Digital Pre-distortion
Physics-informed online learning of gray-box models by moving horizon estimation
Data-driven subjective performance evaluation: An attentive deep neural networks model based on a call centre case
Learning on predictions: fusing training and autoregressive inference for long-term spatiotemporal forecasts
A turbulent eddy-viscosity surrogate modeling framework for Reynolds-averaged Navier-Stokes simulations
A Local Deep Learning Method for Solving High Order Partial Differential Equations
QSurfNet: a hybrid quantum convolutional neural network for surface defect recognition
LSTM-based approach for predicting periodic motions of an impacting system via transient dynamics
The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks
High-resolution urban air quality monitoring from citizen science data with echo-state transformer networks
Deep learning models for time-history prediction of vehicle-induced bridge responses: a comparative study
A mathematical perspective of machine learning
Stratified sampling algorithms for machine learning methods in solving two-scale partial differential equations
Efficient forecasting of chaotic systems with block-diagonal and binary reservoir computing
Anti-derivatives approximator for enhancing physics-informed neural networks
RHONN identifier-control scheme for nonlinear discrete-time systems with unknown time-delays
A two-stage deep learning architecture for model reduction of parametric time-dependent problems

Describes a project that uses

Uses Software

LSTM

This page was built for publication: The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5291707)