Publications
Preprints
Provably efficient generalized Lagrangian policy optimization for safe multi-agent reinforcement learning
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
Note: 5th Annual Conference on Learning for Dynamics & Control (submitted); a full version with appendices.
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Basar, and Mihailo R. Jovanovic
Note: J. Mach. Learn. Res. (submitted); a journal extension of the paper; also arXiv:2206.02346.
Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
Note: IEEE Trans. Autom. Control (submitted); also arXiv:1908.02805.
Refereed Papers
2022
2021
2020
2019
2018
2016
2015
2014
2012
2011
|