Tag: search
-
From Complete to Partial Feedback: Supervised Learning vs. Contextual Bandits
Reading Time: 15 minutes
At a high level, bandit algorithms and supervised learning models can look surprisingly similar. In both cases, the goal is to look at a given context and predict the best class or “arm” to choose. For a recommendation system, this might mean predicting which product a user is most likely to click on. But beneath…