Model selection in reinforcement learning (Q415618): Difference between revisions

From MaRDI portal
(6 intermediate revisions by 4 users not shown)
Property / describes a project that uses: R-MAX / rank: Normal rank
Property / describes a project that uses: ElemStatLearn / rank: Normal rank
Property / describes a project that uses: PRMLT / rank: Normal rank
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Property / full work available at URL: https://doi.org/10.1007/s10994-011-5254-7 / rank: Normal rank
Property / OpenAlex ID: W2006330826 / rank: Normal rank
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank: Normal rank
Property / cites work: A survey of cross-validation procedures for model selection / rank: Normal rank
Property / cites work: Q3973919 / rank: Normal rank
Property / cites work: Model selection and error estimation / rank: Normal rank
Property / cites work: Local Rademacher complexities / rank: Normal rank
Property / cites work: Stochastic optimal control. The discrete time case / rank: Normal rank
Property / cites work: Q4257216 / rank: Normal rank
Property / cites work: Q5483032 / rank: Normal rank
Property / cites work: Q3093261 / rank: Normal rank
Property / cites work: Memory-universal prediction of stationary random processes / rank: Normal rank
Property / cites work: A distribution-free theory of nonparametric regression / rank: Normal rank
Property / cites work: The elements of statistical learning. Data mining, inference, and prediction / rank: Normal rank
Property / cites work: 10.1162/1532443041827907 / rank: Normal rank
Property / cites work: Complexity regularization via localized random penalties / rank: Normal rank
Property / cites work: Nonparametric time series prediction through adaptive model selection / rank: Normal rank
Property / cites work: Basis function adaptation in temporal difference reinforcement learning / rank: Normal rank
Property / cites work: Markov Chains and Stochastic Stability / rank: Normal rank
Property / cites work: Q3394879 / rank: Normal rank
Property / cites work: Concentration of measure inequalities for Markov chains and \(\Phi\)-mixing processes. / rank: Normal rank
Property / cites work: Algorithms for Reinforcement Learning / rank: Normal rank
Property / cites work: Q3655724 / rank: Normal rank
Property / cites work: Oracle inequalities for multi-fold cross validation / rank: Normal rank
Property / cites work: Model selection in nonparametric regression / rank: Normal rank
links / mardi / name
 

Latest revision as of 03:43, 5 July 2024

Language: English
Label: Model selection in reinforcement learning
Description: scientific article

    Statements

    Model selection in reinforcement learning (English)
    8 May 2012
    reinforcement learning
    model selection
    complexity regularization
    adaptivity
    offline learning
    off-policy learning
    finite-sample bounds

    Identifiers