The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions
From MaRDI portal
Publication:5291707
DOI10.1142/S0218488598000094zbMath1087.68616WikidataQ56621169 ScholiaQ56621169MaRDI QIDQ5291707
Publication date: 19 May 2006
Published in: International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems (Search for Journal in Brave)
Related Items (max. 100)
Application of neural network to model rainfall pattern of Ethiopia ⋮ Activation function design for deep networks: linearity and effective initialisation ⋮ State estimation with limited sensors -- a deep learning based approach ⋮ A Taxonomy for Spatiotemporal Connectionist Networks Revisited: The Unsupervised Case ⋮ Learning deep implicit Fourier neural operators (IFNOs) with applications to heterogeneous material modeling ⋮ A homotopy gated recurrent unit for predicting high dimensional hyperchaos ⋮ Recurrent neural network-based internal model control design for stable nonlinear systems ⋮ Nonlocal kernel network (NKN): a stable and resolution-independent deep neural network ⋮ A two-stage deep learning architecture for model reduction of parametric time-dependent problems ⋮ LSTM-based approach for predicting periodic motions of an impacting system via transient dynamics ⋮ Sensitivity -- local index to control chaoticity or gradient globally ⋮ Learning model predictive control with long short‐term memory networks ⋮ Successfully and efficiently training deep multi-layer perceptrons with logistic activation function simply requires initializing the weights with an appropriate negative mean ⋮ Application of Attention Technique for Digital Pre-distortion ⋮ Physics-informed online learning of gray-box models by moving horizon estimation ⋮ QSurfNet: a hybrid quantum convolutional neural network for surface defect recognition ⋮ A mathematical perspective of machine learning ⋮ NSNO: Neumann series neural operator for solving Helmholtz equations in inhomogeneous medium ⋮ Robust High-Dimensional Regression with Coefficient Thresholding and Its Application to Imaging Data Analysis ⋮ Incorporating financial news for forecasting Bitcoin prices based on long short-term memory networks ⋮ Sparsity through evolutionary pruning prevents neuronal networks from overfitting ⋮ RHONN identifier-control scheme for nonlinear discrete-time systems with unknown time-delays ⋮ A turbulent eddy-viscosity surrogate modeling framework for Reynolds-averaged Navier-Stokes simulations ⋮ Bayesian framework for simulation of dynamical systems from multidimensional data using recurrent neural network ⋮ The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks ⋮ A Local Deep Learning Method for Solving High Order Partial Differential Equations ⋮ Generalised latent assimilation in heterogeneous reduced spaces with machine learning surrogate models ⋮ A neural network ensemble approach for GDP forecasting
Uses Software
This page was built for publication: The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions