News Research

Paper Accepted at AISTATS2020

08.01.2020: New paper accepted to AISTATS 2020 on “Laplacian-regularized graph bandits: Algorithms and theoretical analysis”. Well done Kaige!

ArXiv: https://arxiv.org/abs/1907.05632.

Abstract: We consider a stochastic linear bandit problem with multiple users, where the relationship between users is captured by an underlying graph and user preferences are represented as smooth signals on the graph. We introduce a novel bandit algorithm where the smoothness prior is imposed via the random-walk graph Laplacian, which leads to a single-user cumulative regret scaling as O(ΨdT‾‾√) with time horizon T, feature dimensionality d, and the scalar parameter Ψ∈(0,1) that depends on the graph connectivity. This is an improvement over O(dT‾‾√) in LinUCB, where user relationship is not taken into account. In terms of network regret (sum of cumulative regret over n users), the proposed algorithm leads to a scaling as O(ΨdnT‾‾‾√), which is a significant improvement over O(ndT‾‾√) in the state-of-the-art algorithm Gob.Lin. To improve scalability, we further propose a simplified algorithm with a linear computational complexity with respect to the number of users, while maintaining the same regret. Finally, we present a finite-time analysis on the proposed algorithms, and demonstrate their advantage in comparison with state-of-the-art graph-based bandit algorithms on both synthetic and real-world data.

0 comments on “Paper Accepted at AISTATS2020

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: