Publications

Preprints

  • Provably efficient generalized Lagrangian policy optimization for safe multi-agent reinforcement learning
    Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
    Note: 5th Annual Conference on Learning for Dynamics & Control (submitted); a full version with appendices.

  • Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
    Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Basar, and Mihailo R. Jovanovic
    Note: J. Mach. Learn. Res. (submitted); a journal extension of the paper; also arXiv:2206.02346.

  • Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
    Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
    Note: IEEE Trans. Autom. Control (submitted); also arXiv:1908.02805.

Refereed Papers

2022

2021

2020

2019

2018

2016

2015

2014

2012

2011