Research

My research interests include algorithm design and analysis, high dimensional statistics, inference over networks, sequential decision making under uncertainty, and online learning. In particular, many of my ongoing research projects are motivated by the following collection of research questions.

  • Reinforcement learning algorithms still lag behind carefully designed heuristics for real-world systems where we do not have access to infinite data. How can we design reinforcement learning algorithms that efficiently exploit known structure that arise in real-world systems? What are even the appropriate types of structure that are common to real-world systems yet lead to efficient and practical algorithms?

  • Matrix and tensor estimation algorithms have been widely used as part of the data preprocessing pipeline for handling high dimensional noisy and partially observed datasets. However all existing theoretical guarantees require strict conditions on the data generating process which are violated when the data is collected adaptively, which could introduce complex dependences and intentional non-uniformity to the sampling process. Can we develop optimal theory and algorithms for utilizing low rank models in sequential decision making scenarios?

  • The use of machine learning to design optimal policies for societal systems is in reality multi-objective, as we need to be conscientious of the computational resources consumed and ethical considerations of bias and fairness, in addition to the standard metrics of performance. Can we develop a fundamental theory for understanding optimal multi-objective tradeoffs in sequential decision making? Do there exist efficient algorithms that can aid a human decision maker to achieve any desired tradeoff along the Pareto frontier?

  • The majority of causal inference tools are built upon a naive assumption that applying a treatment to an individual does not affect others’ outcomes. However, this is clearly violated in scenarios where the treatment as well as the outcome are mediated by a network, e.g. public health campaigns, social media patforms, and epidemic modeling. Can we develop new theory and techniques for causal inference that strike a balance between efficient algorithms and flexible models?

If any of these peak your interests, I would love to connect!

Selected Recent Papers

  • Siddhartha Banerjee, Alankrita Bhatt, and Christina Lee Yu. ‘‘The SMART Approach to Instance-Optimal Online Learning.’’ Preprint.

Publications and Preprints by Topic

(If prefaced by * then authors are ordered alphabetically)

Causal Inference

Reinforcement Learning and Bandits

  • Siddhartha Banerjee, Alankrita Bhatt, and Christina Lee Yu. ‘‘The SMART Approach to Instance-Optimal Online Learning.’’ Preprint.

Fairness

High Dimensional Statistics

Efficient Local Computation for Large Scale Graphs and Matrices

Miscellaneous