Preprints
Alignment of large language models with constrained learning
Botong Zhang, Shuo Li, Ignacio Hounie, Osbert Bastani, Dongsheng Ding*, Alejandro Ribeiro
Note: *corresponding author; submitted; also arXiv:2505.19387.
Composition and alignment of diffusion models using constrained learning
Shervin Khalafi, Ignacio Hounie, Dongsheng Ding, Alejandro Ribeiro
Note: submitted; to appear.
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Basar, and Mihailo R. Jovanovic
Note: J. Mach. Learn. Res. (accepted); a journal extension of the paper; also arXiv:2206.02346.
Refereed Papers
2025
2024
2023
2022
2021
2020
2019
2018
2016
2015
2014
2012
2011
|