title | section | abstract | layout | series | publisher | issn | id | month | tex_title | firstpage | lastpage | page | order | cycles | bibtex_author | author | date | address | container-title | volume | genre | issued | extras | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Stochastic Constrained Contextual Bandits via Lyapunov Optimization Based Estimation to Decision Framework |
Original Papers |
This paper studies the problem of stochastic constrained contextual bandits (CCB) under general realizability condition where the expected rewards and costs are within general function classes. We propose LOE2D, a Lyapunov Optimization Based Estimation to Decision framework with online regression oracles for learning reward/constraint. LOE2D establishes |
inproceedings |
Proceedings of Machine Learning Research |
PMLR |
2640-3498 |
guo24a |
0 |
Stochastic Constrained Contextual Bandits via Lyapunov Optimization Based Estimation to Decision Framework |
2204 |
2231 |
2204-2231 |
2204 |
false |
Guo, Hengquan and Liu, Xin |
|
2024-06-30 |
Proceedings of Thirty Seventh Conference on Learning Theory |
247 |
inproceedings |
|