ßÉßɱ¬ÁÏ

This website stores cookies on your computer. These cookies are used to collect information about how you interact with our website and allow us to remember your browser. We use this information to improve and customize your browsing experience, for analytics and metrics about our visitors both on this website and other media, and for marketing purposes. By using this website, you accept and agree to be bound by UVic’s Terms of Use and Protection of Privacy Policy. If you do not agree to the above, you must not use this website.

Skip to main content

Andrea T.T. Nguyen

  • BSc (ßÉßɱ¬ÁÏ, 2021)

Notice of the Final Oral Examination for the Degree of Master of Science

Topic

Thresholded Linear Bandits

Department of Computer Science

Date & location

  • Thursday, April 17, 2025

  • 2:00 P.M.

  • Virtual Defence

Reviewers

Supervisory Committee

  • Dr. Nishant Mehta, Department of Computer Science, ßÉßɱ¬ÁÏ (Supervisor)

  • Dr. Yun Lu, Department of Computer Science, UVic (Member) 

External Examiner

  • Dr. Nidhi Hegde, Department of Computer Science, University of Alberta 

Chair of Oral Examination

  • Dr. Elisabeth Gugle, Department of Economics, UVic 

Abstract

Thresholded linear bandits is a novel bandit problem that lies in the intersection of several important multiarmed bandit (MAB) variants, including active learning, structured bandits, and learning halfspaces. To achieve sublinear regret in the presence of exponentially many arms, one method is to exploit the structure of the reward function. However, the presence of an unknown threshold component makes previously known algorithms for structured bandits unsuitable. Moreover, the threshold introduces a discontinuity to the reward function, making the problem significantly more difficult. In this thesis, we study the union of axis parallel halfspace variant of the thresholded linear bandits problem. We suggest an algorithm that achieves sublinear regret and provide theoretical guarantees on the performance of the algorithm.