方略学科导航

搜索结果: 1-1 共查到“Uncontrolled Restless Bandits”相关记录1条 . 查询时间(0.234 秒)

Adaptive Learning of Uncontrolled Restless Bandits with Logarithmic Regret Uncontrolled Restless Bandits Logarithmic Regret Optimization and Control 2011/9/15

Abstract: In this paper we consider the problem of learning the optimal policy for the uncontrolled restless bandit problem. In this problem only the state of the selected arm can be observed, the sta...

存档附件原文地址

中国研究生教育排行榜-条

正在加载...

中国学术期刊排行榜-条

正在加载...

世界大学科研机构排行榜-条

正在加载...

中国大学排行榜-条

正在加载...

人　物-篇

正在加载...

课　件-篇

正在加载...

视听资料-篇

正在加载...

研招资料 -篇

正在加载...

知识要闻-篇

正在加载...

国际动态-篇

正在加载...

会议中心-篇

正在加载...

学术指南-篇

正在加载...

学术站点-篇

正在加载...