Haochen Yang
From MaRDI portal
List of research outcomes
This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!
| Publication | Date of Publication | Type |
|---|---|---|
| Sample-efficient reinforcement learning from human feedback via information-directed sampling IEEE Transactions on Information Theory | 2025-10-06 | Paper |
Research outcomes over time
This page was built for person: Haochen Yang