Skip to main content

ORIGINAL RESEARCH article

Front. Artif. Intell.
Sec. Machine Learning and Artificial Intelligence
Volume 7 - 2024 | doi: 10.3389/frai.2024.1346700

Causal Contextual Bandits with One-Shot Data Integration

Provisionally accepted
  • 1 Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India
  • 2 Robert Bosch Centre for Data Science and Artificial Intelligence, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India

The final, formatted version of the article will be published soon.

    We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments -corresponding to potentially different context-action pairs -simultaneously in one-shot within a budget. This new formalism provides a natural model for several real-world scenarios where parallel targeted experiments can be conducted and where some domain knowledge of causal relationships is available. We propose a new algorithm that utilizes a novel entropy-like measure that we introduce. We perform several experiments, both using purely synthetic data and using a real-world dataset, and show that our algorithm performs better than baselines in all of them. In addition, we study sensitivity of our algorithm's performance to various aspects of the problem setting. We also show that the algorithm is sound; that is, as budget increases, the learned policy eventually converges to an optimal policy. Further, we theoretically bound our algorithm's regret under additional assumptions.Finally, we provide ways to achieve two popular notions of fairness, namely counterfactual fairness and demographic parity, with our algorithm.

    Keywords: causality, fairness, causal contextual bandits, causal bandits, contextual bandit algorithm

    Received: 29 Nov 2023; Accepted: 19 Nov 2024.

    Copyright: © 2024 Subramanian and Ravindran. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Chandrasekar Subramanian, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.