Model selection in reinforcement learning (Q415618): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(7 intermediate revisions by 5 users not shown)
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 68T05 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 68Q32 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 90C40 / rank
 
Normal rank
Property / zbMATH DE Number
 
Property / zbMATH DE Number: 6031868 / rank
 
Normal rank
Property / zbMATH Keywords
 
reinforcement learning
Property / zbMATH Keywords: reinforcement learning / rank
 
Normal rank
Property / zbMATH Keywords
 
model selection
Property / zbMATH Keywords: model selection / rank
 
Normal rank
Property / zbMATH Keywords
 
complexity regularization
Property / zbMATH Keywords: complexity regularization / rank
 
Normal rank
Property / zbMATH Keywords
 
adaptivity
Property / zbMATH Keywords: adaptivity / rank
 
Normal rank
Property / zbMATH Keywords
 
offline learning
Property / zbMATH Keywords: offline learning / rank
 
Normal rank
Property / zbMATH Keywords
 
off-policy learning
Property / zbMATH Keywords: off-policy learning / rank
 
Normal rank
Property / zbMATH Keywords
 
finite-sample bounds
Property / zbMATH Keywords: finite-sample bounds / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: R-MAX / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: ElemStatLearn / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: PRMLT / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s10994-011-5254-7 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2006330826 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of cross-validation procedures for model selection / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3973919 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Model selection and error estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Local Rademacher complexities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5483032 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093261 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Memory-universal prediction of stationary random processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A distribution-free theory of nonparametric regression / rank
 
Normal rank
Property / cites work
 
Property / cites work: The elements of statistical learning. Data mining, inference, and prediction / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Complexity regularization via localized random penalties / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonparametric time series prediction through adaptive model selection / rank
 
Normal rank
Property / cites work
 
Property / cites work: Basis function adaptation in temporal difference reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Chains and Stochastic Stability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3394879 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Concentration of measure inequalities for Markov chains and \(\Phi\)-mixing processes. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms for Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3655724 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Oracle inequalities for multi-fold cross validation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Model selection in nonparametric regression / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 03:43, 5 July 2024

scientific article
Language Label Description Also known as
English
Model selection in reinforcement learning
scientific article

    Statements

    Model selection in reinforcement learning (English)
    0 references
    0 references
    0 references
    8 May 2012
    0 references
    reinforcement learning
    0 references
    model selection
    0 references
    complexity regularization
    0 references
    adaptivity
    0 references
    offline learning
    0 references
    off-policy learning
    0 references
    finite-sample bounds
    0 references
    0 references
    0 references
    0 references

    Identifiers