搜索结果: 1-1 共查到“统计学 Linear Payoffs”相关记录1条 . 查询时间(0.031 秒)
Thompson Sampling for Contextual Bandits with Linear Payoffs
Thompson Sampling Contextual Bandits Linear Payoffs
2012/11/23
Thompson Sampling is one of the oldest heuristics for multi-armed bandit problems. It is a randomized algorithm based on Bayesian ideas, and has recently generated significant interest after several s...