搜索结果: 1-1 共查到“Uncontrolled Restless Bandits”相关记录1条 . 查询时间(0.234 秒)
Adaptive Learning of Uncontrolled Restless Bandits with Logarithmic Regret
Uncontrolled Restless Bandits Logarithmic Regret Optimization and Control
2011/9/15
Abstract: In this paper we consider the problem of learning the optimal policy for the uncontrolled restless bandit problem. In this problem only the state of the selected arm can be observed, the sta...