Reinforcement learning with algorithms from probabilistic structure estimation (Q2165986): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
ReferenceBot (talk | contribs) Changed an Item |
||
(One intermediate revision by one other user not shown) | |||
Property / Wikidata QID | |||
Property / Wikidata QID: Q114204749 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3093180 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Ergodicity Coefficients Defined by Vector Norms / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3717970 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q2794334 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Adaptive control using multiple models / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4315289 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Non-negative matrices and Markov chains. / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4626283 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: \({\mathcal Q}\)-learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses / rank | |||
Normal rank |
Latest revision as of 23:14, 29 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Reinforcement learning with algorithms from probabilistic structure estimation |
scientific article |
Statements
Reinforcement learning with algorithms from probabilistic structure estimation (English)
0 references
23 August 2022
0 references
reinforcement learning
0 references
statistical testing
0 references
Markov decision process
0 references
machine learning
0 references
decision support system
0 references