Publications
Check out my Google Scholar.
2025
- [ICLR] Diffusing States and Matching Scores: A New Framework for Imitation Learning. In The Thirteenth International Conference on Learning Representations, 2025
- [ICML] Convergence of Consistency Model with Multistep Sampling under General Data Assumptions. In Forty-second International Conference on Machine Learning, 2025
- [ICML] Collaborative Mean Estimation Among Heterogeneous Strategic Agents: Individual Rationality, Fairness, and Truthful Contribution. In Forty-second International Conference on Machine Learning, 2025
- [arXiv] Avoiding exp(R) scaling in RLHF through Preference-based Exploration. arXiv preprint arXiv:2502.00666, 2025
- [arXiv] Efficient Controllable Diffusion via Optimal Classifier Guidance. arXiv preprint arXiv:2505.21666, 2025
- [arXiv] Scaling Offline RL via Efficient and Expressive Shortcut Models. arXiv preprint arXiv:2505.22866, 2025
- [arXiv] A Cramér-von Mises Approach to Incentivizing Truthful Data Sharing. arXiv preprint arXiv:2506.07272, 2025
2024
- [ICML] Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value. In Forty-first International Conference on Machine Learning, 2024
- [AAAI] Exact policy recovery in offline RL with both heavy-tailed rewards and data corruption. In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
2023
- [AISTATS] Byzantine-robust online and offline distributed reinforcement learning. In International Conference on Artificial Intelligence and Statistics, 2023
- [NeurIPS] Mechanism design for collaborative normal mean estimation. Advances in Neural Information Processing Systems, 2023 (Spotlight)
2022
- [AISTATS] Corruption-robust offline reinforcement learning. In International Conference on Artificial Intelligence and Statistics, 2022
- [arXiv] The game of hidden rules: A new kind of benchmark challenge for machine learning. arXiv preprint arXiv:2207.10218, 2022
2021
- [ICML] Robust policy gradient against strong data corruption. In International Conference on Machine Learning, 2021
2020
- [AAAI] Optimal attack against autoregressive models by manipulating the environment. In Proceedings of the AAAI Conference on Artificial Intelligence, 2020
2019
- [arXiv] Error lower bounds of constant step-size stochastic gradient descent. arXiv preprint arXiv:1910.08212, 2019