Knows what it knows: a framework for self-aware learning
From MaRDI portal
Publication:413843
DOI10.1007/s10994-010-5225-4zbMath1237.68154OpenAlexW2488247662MaRDI QIDQ413843
Alexander L. Strehl, Thomas J. Walsh, Michael L. Littman, Lihong Li
Publication date: 8 May 2012
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10994-010-5225-4
computational learning theoryreinforcement learningactive learningexplorationknows what it knows (KWIK)mistake boundprobably approximately correct (PAC)
Computational learning theory (68Q32) Learning and adaptive systems in artificial intelligence (68T05)
Related Items (10)
Unnamed Item ⋮ Relational reinforcement learning with guided demonstrations ⋮ Adaptive aggregation for reinforcement learning in average reward Markov decision processes ⋮ Reducing reinforcement learning to KWIK online regression ⋮ Knows what it knows: a framework for self-aware learning ⋮ Consistency of plug-in confidence sets for classification in semi-supervised learning ⋮ On Version Space Compression ⋮ Learning with Rejection ⋮ Learning with Deep Cascades ⋮ Unnamed Item
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Knows what it knows: a framework for self-aware learning
- Reducing reinforcement learning to KWIK online regression
- Stochastic optimal control. The discrete time case
- The complexity of dynamic programming
- Efficient distribution-free learning of probabilistic concepts
- Toward efficient agnostic learning
- An upper bound on the loss from approximate optimal-value functions
- Selective sampling using the query by committee algorithm
- An empirical study of two approaches to sequence learning for anomaly detection
- Queries revisited.
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- Near-optimal reinforcement learning in polynomial time
- Apple tasting.
- Generalization bounds for averaged classifiers
- Queries and concept learning
- 10.1162/153244303765208377
- Minimizing Regret With Label Efficient Prediction
- A theory of the learnable
- 10.1162/153244303321897663
- Probability Inequalities for Sums of Bounded Random Variables
This page was built for publication: Knows what it knows: a framework for self-aware learning