scientific article; zbMATH DE number 1043533

From MaRDI portal
Publication:4346705

zbMath: 0914.60006; MaRDI QID: Q4346705

Harold J. Kushner, G. George Yin

Publication date: 4 August 1997


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items)

On Stochastic Approximation and Credibility
Approximation by quantization of the filter process and applications to optimal stopping problems under partial observation
Pairs Trading under Geometric Brownian Motion Models
Tuning positive feedback for signal detection in noisy dynamic environments
A Stochastic Approximation Method for Simulation-Based Quantile Optimization
Averaging analysis of a point process adaptive algorithm
Complexity Analysis of stochastic gradient methods for PDE-constrained optimal Control Problems with uncertain parameters
A universal procedure for parametric frailty models
Asymptotic Properties of Primal-Dual Algorithm for Distributed Stochastic Optimization over Random Networks with Imperfect Communications
Minibatch Forward-Backward-Forward Methods for Solving Stochastic Variational Inequalities
A distributed methodology for approximate uniform global minimum sharing
Continuous Assortment Optimization with Logit Choice Probabilities and Incomplete Information
The Stochastic Auxiliary Problem Principle in Banach Spaces: Measurability and Convergence
Stochastic approximation for uncapacitated assortment optimization under the multinomial logit model
Learning and equilibrium transitions: stochastic stability in discounted stochastic fictitious play
A unified stochastic approximation framework for learning in games
A model for data transmission and its optimization
Modeling and control of data transmission
Backward Importance Sampling for Online Estimation of State Space Models
Coordinating Pricing and Inventory Replenishment with Nonparametric Demand Learning
Online EM with Weight-Based Forgetting
An Asymptotically Optimal Set Approach for Simulation Optimization
Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes
Adaptive Learning Algorithm Convergence in Passive and Reactive Environments
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
Approximation of an analog diffusion network with applications to image estimation
Pole assignment for stochastic systems with unknown coefficients
Online surrogate problem methodology for stochastic discrete resource allocation problem.
Efficient discovery of overlapping communities in massive networks
Recursive online EM estimation of mixture autoregressions
Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes
Random-direction optimization algorithms with applications to threshold controls
Strong Convergence for URN Models with Reducible Replacement Policy
Central limit theorems for generalized Pólya urn models
Design and analysis of linear precoders under a mean square error criterion. II: MMSE designs and conclusions
An ellipsoid algorithm for probabilistic robust controller design
Linear stochastic approximation driven by slowly varying Markov chains
A dual purpose principal and minor component flow
Learning automata algorithms for pattern classification.
Stochastic approximation algorithms: overview and recent trends.
Convergence of least squares learning in self-referential discontinuous stochastic models.
Minimization algorithms based on supervisor and searcher cooperation
Annealing adaptive search, cross-entropy, and stochastic approximation in global optimization
A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
Consistent expectations equilibria and learning in a stock market
Approximating networks and extended Ritz method for the solution of functional optimization problems
Optimal quadratic quantization for numerics: the Gaussian case
Simulation Optimization Using Multi-Time-Scale Adaptive Random Search
Urn models and differential algebraic equations
Parallel and bootstrapped stochastic approximation
Asymptotically optimal quantization schemes for Gaussian processes on Hilbert spaces
Sampled Tikhonov regularization for large linear inverse problems
Control of singularly perturbed Markov chains: A numerical study
Convergence of conjugate gradient methods with constant stepsizes
Solving inverse problems using data-driven models
Stochastic Algorithms for the Estimation of an Optimal Solution of a LP Problem. Convergence and Central Limit Theorem
A law of the iterated logarithm for stochastic approximation procedures in \(d\)-dimensional Euclidean space.
Learning aspiration in repeated games
Convergence of the Wang-Landau algorithm
Stochastic adaptation of importance sampler
Workshop on statistical approaches for the evaluation of complex computer models
Importance sampling and statistical Romberg method for Lévy processes
Generalized neural networks for spectral analysis: dynamics and Liapunov functions
A Bayesian stochastic approximation method
Convergence rate of linear two-time-scale stochastic approximation.
Analysis of a high-resolution optical wave-front control system
A survey of randomized algorithms for control synthesis and performance verification
Perturbation analysis for production control and optimization of manufacturing systems
A new approach for analyzing the limiting behavior of the normalized LMS algorithm under weak assumptions
Reference points and learning
An information-theoretic analysis of return maximization in reinforcement learning
Multiscale Q-learning with linear function approximation
Estimating the position of a moving object based on test disturbance of camera position
Quantile estimation with adaptive importance sampling
A direct search method for unconstrained quantile-based simulation optimization
An adaptive design for clinical trials with non-dichotomous response and prognostic factors
Online calibrated forecasts: memory efficiency versus universality for learning in games
Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
Rapid decision threshold modulation by reward rate in a neural network
Control of multi-node mobile communications networks with time-varying channels via stability methods
Global optimization using diffusion perturbations with large noise intensity
Numerical methods for the pricing of swing options: a stochastic control approach
On the ergodicity properties of some adaptive MCMC algorithms
Convergence rate and averaging of nonlinear two-time-scale stochastic approximation algorithms
On asymptotic properties of a constant-step-size sign-error algorithm for adaptive filtering
Strong consistency of the regularized least-squares estimates of infinite autoregressive models
Stochastic approximation with series of delayed observations
A simulation optimization method that considers uncertainty and multiple performance measures
An online sequential algorithm for the estimation of transition probabilities for jump Markov linear systems
ASD+M: automatic parameter tuning in stochastic optimization and on-line learning
Scaling up Bayesian variational inference using distributed computing clusters
New stochastic approximation algorithms with adaptive step sizes
Stochastic Nelder-Mead simplex method -- a new globally convergent direct search method for simulation optimization
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
Error bounds for constant step-size \(Q\)-learning
Approximate stochastic annealing for online control of infinite horizon Markov decision processes
\(\alpha\)-variational inference with statistical guarantees
Learning in monotone Bayesian games
Perturbation analysis and optimization of stochastic hybrid systems
Constrained stochastic estimation algorithms for a class of hybrid stock market models



