Research

My research interests include algorithm design and analysis, high dimensional statistics, inference over networks, sequential decision making under uncertainty, and online learning. In particular, many of my ongoing research projects are motivated by the following collection of research questions.

Reinforcement learning algorithms still lag behind carefully designed heuristics for real-world systems where we do not have access to infinite data. How can we design reinforcement learning algorithms that efficiently exploit known structure that arise in real-world systems? What are even the appropriate types of structure that are common to real-world systems yet lead to efficient and practical algorithms?
Matrix and tensor estimation algorithms have been widely used as part of the data preprocessing pipeline for handling high dimensional noisy and partially observed datasets. However all existing theoretical guarantees require strict conditions on the data generating process which are violated when the data is collected adaptively, which could introduce complex dependences and intentional non-uniformity to the sampling process. Can we develop optimal theory and algorithms for utilizing low rank models in sequential decision making scenarios?
The use of machine learning to design optimal policies for societal systems is in reality multi-objective, as we need to be conscientious of the computational resources consumed and ethical considerations of bias and fairness, in addition to the standard metrics of performance. Can we develop a fundamental theory for understanding optimal multi-objective tradeoffs in sequential decision making? Do there exist efficient algorithms that can aid a human decision maker to achieve any desired tradeoff along the Pareto frontier?
The majority of causal inference tools are built upon a naive assumption that applying a treatment to an individual does not affect others’ outcomes. However, this is clearly violated in scenarios where the treatment as well as the outcome are mediated by a network, e.g. public health campaigns, social media patforms, and epidemic modeling. Can we develop new theory and techniques for causal inference that strike a balance between efficient algorithms and flexible models?

If any of these peak your interests, I would love to connect!

Selected Recent Papers

Siddhartha Banerjee, Alankrita Bhatt, and Christina Lee Yu. ‘‘The SMART Approach to Instance-Optimal Online Learning.’’ Preprint.

Xumei Xi, Christina Lee Yu, and Yudong Chen. ‘‘Entry-Specific Bounds for Low-Rank Matrix Completion under Highly Non-Uniform Sampling.’’ International Symposium on Information Theory, 2023.

Su Jia, Nathan Kallus, Christina Lee Yu. ‘‘Clustered Switchback Experiments: Near-Optimal Rates Under Spatiotemporal Interference.’’ Preprint.

Siddhartha Banerjee, Sean R. Sinclair, Milind Tambe, Lily Xu, Christina Lee Yu. ‘‘Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits.’’ Preprint.

Tyler Sam, Yudong Chen, and Christina Lee Yu. ‘‘Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure.’’ Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2023. Also presented at ACM SIGMETRICS, 2023. Best Student Paper Award.

Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve.’’ Operations Research, 2022.

Mayleen Cortez, Matthew Eichhorn, and Christina Lee Yu. ‘‘Exploiting Neighborhood Interference with Low Order Interactions under Unit Randomized Designs.’’ Journal of Causal Inference, 2023.

Mayleen Cortez, Matthew Eichhorn, and Christina Lee Yu. ‘‘Staggered Rollout Designs Enable Causal Inference Under Interference Without Network Knowledge.’’ NEURIPS, 2022.

Christina Lee Yu, Edo Airoldi, Christian Borgs, and Jennifer Chayes. ‘‘Estimating Total Treatment Effect in Randomized Experiments with Unknown Network Structure.’’ Proceedings of National Academy of Sciences, 2022.

Sean R. Sinclair, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Adaptive Discretization for Online Reinforcement Learning.’’ Operations Research, 2022.

Publications and Preprints by Topic

(If prefaced by * then authors are ordered alphabetically)

Causal Inference

Su Jia, Nathan Kallus, Christina Lee Yu. ‘‘Clustered Switchback Experiments: Near-Optimal Rates Under Spatiotemporal Interference.’’ Preprint.

Anish Agarwal, Sarah Cen, Devavrat Shah, and Christina Lee Yu. ‘‘Network Synthetic Interventions: A Framework for Panel Data with Network Interference.’’ Preprint.

Mayleen Cortez, Matthew Eichhorn, and Christina Lee Yu. ‘‘Exploiting Neighborhood Interference with Low Order Interactions under Unit Randomized Designs.’’ Journal of Causal Inference, 2023.

Mayleen Cortez, Matthew Eichhorn, and Christina Lee Yu. ‘‘Staggered Rollout Designs Enable Causal Inference Under Interference Without Network Knowledge.’’ NEURIPS, 2022.

Christina Lee Yu, Edo Airoldi, Christian Borgs, and Jennifer Chayes. ‘‘Estimating Total Treatment Effect in Randomized Experiments with Unknown Network Structure.’’ Proceedings of National Academy of Sciences, 2022.

Reinforcement Learning and Bandits

Siddhartha Banerjee, Alankrita Bhatt, and Christina Lee Yu. ‘‘The SMART Approach to Instance-Optimal Online Learning.’’ Preprint.

Xumei Xi, Christina Lee Yu, Yudong Chen. ‘‘Matrix Estimation for Offline Evaluation in Reinforcement Learning with Low-Rank Structure.’’ Preprint.

Siddhartha Banerjee, Sean R. Sinclair, Milind Tambe, Lily Xu, Christina Lee Yu. ‘‘Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits.’’ Preprint.

Tyler Sam, Yudong Chen, and Christina Lee Yu. ‘‘Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure.’’ Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2023. Also presented at ACM SIGMETRICS 2023. Received Best Student Paper Award. SNAPP seminar video.

Sean R. Sinclair, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Adaptive Discretization for Online Reinforcement Learning.’’ Operations Research, 2022.

*Christopher Archer, Siddhartha Banerjee, Mayleen Cortez, Carrie Rucker, Sean R. Sinclair, Max Solberg, Qiaomin Xie, and Christina Lee Yu. ‘‘ORSuite: Benchmarking Suite for Sequential Operations Models.’’ Reinforcement Learning Networks and Queues SIGMETRICS workshop, 2021.

Sean R. Sinclair, Tianyu Wang, Gauri Jain, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Adaptive Discretization for Model-Based Reinforcement Learning.’’ Advances in Neural Information Processing Systems, 2020.

Sean Sinclair, Siddhartha Banerjee, and Christina Lee Yu. “Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces.” Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2019. Also presented at ACM SIGMETRICS 2020.

*Nirandika Wanigasekara and Christina Lee Yu. “Nonparametric Contextual Bandits in an Unknown Metric Space.” Advances in Neural Information Processing Systems, 2019.

Fairness

Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve.’’ Operations Research, 2022.

Sean R. Sinclair, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve.’’ ACM SIGMETRICS, 2022.

Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, and Christina Lee Yu. ‘‘Sequential Fair Allocation of Limited Resources under Stochastic Demands.’’ Harvard CRCS AI for Social Good and Mechanism Design for Social Good, 2020.

High Dimensional Statistics

Xumei Xi, Christina Lee Yu, and Yudong Chen. ‘‘Entry-Specific Bounds for Low-Rank Matrix Completion under Highly Non-Uniform Sampling.’’ International Symposium on Information Theory, 2023.

Devavrat Shah and Christina Lee Yu. ‘‘Robust Max Entrywise Error Bounds for Sparse Tensor Estimation via Similarity Based Collaborative Filtering’’. IEEE Transactions on Information Theory, 2023.

Christina Lee Yu and Xumei Xi. ‘‘Tensor Estimation with Nearly Linear Samples Given Weak Side Information.’’ Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2022. Also presented at ACM SIGMETRICS, 2022.

Christina Lee Yu. ‘‘Nonparametric Matric Estimation with One-Sided Covariates.’’ International Symposium of Information Theory, 2022.

*Christian Borgs, Jennifer Chayes, Devavrat Shah, and Christina Lee Yu. ‘‘Iterative Collaborative Filtering for Sparse Matrix Estimation.’’ Operations Research, 2021.

*Yihua Li, Devavrat Shah, Dogyoon Song, Christina Lee Yu. “Nearest Neighbors for Matrix Estimation Interpreted as Blind Regression for Latent Variable Model.” IEEE Transactions on Information Theory, 2019.

*Devavrat Shah and Christina Lee Yu. “Iterative Collaborative Filtering for Sparse Noisy Tensor Estimation.” Proceedings of Allerton Conference on Communication, Control, and Computing, 2019. Also presented at International Symposium on Information Theory, 2019.

*Devavrat Shah and Christina Lee Yu. “Reducing Crowdsourcing to Graphon Estimation, Statistically.” International Conference on Artificial Intelligence and Statistics, 2018.

*Christian Borgs, Jennifer Chayes, Christina E. Lee and Devavrat Shah. “Thy Friend is My Friend: Iterative Collaborative Filtering for Sparse Matrix Estimation.” Advances in Neural Information Processing Systems, 2017. Short Video. Poster.

*Christina E. Lee, Yihua Li, Devavrat Shah, Dogyoon Song. “Blind Regression: Nonparametric Regression for Latent Variable Models via Collaborative Filtering.” Advances in Neural Information Processing Systems, 2016. Short Video.

Efficient Local Computation for Large Scale Graphs and Matrices

*Asuman Ozdaglar, Devavrat Shah, and Christina Lee Yu. “Asynchronous Approximation of a Single Component of the Solution to a Linear System.” IEEE Transactions on Network Science and Engineering, 2019.

*Christina E. Lee, Asuman Ozdaglar, and Devavrat Shah. “Computing the Stationary Distribution Locally.” Advances in Neural Information Processing Systems, 2013. Full version with supplement.

Miscellaneous

Chunyin (Alex) Siu, Gennady Samorodnitsky, Christina Lee Yu, and Rongyi He. ‘‘The Asymptotics of the Expected Betti Numbers of Preferential Attachment Clique Complexes.’’ Preprint.

Chunyin (Alex) Siu, Gennady Samorodnitsky, Christina Lee Yu, and Andrey Yao. ‘‘Detection of Small Holes by the Scale-Invariant Robust Density-Aware Distance (RDAD) Filtration.’’ Preprint.

Elizabeth Bodine-Baron, Christina Lee, Anthony Chong, Babak Hassibi, and Adam Wierman. “Peer effects and stability in matching markets.” Proceedings of Symposium on Algorithmic Game Theory, 2011.