Publications
Preprints
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Basar, and Mihailo R. Jovanovic
Note: J. Mach. Learn. Res. (under review); a journal extension of the paper; also arXiv:2206.02346.
Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
Note: IEEE Trans. Control. Netw. Syst. (under revision); also arXiv:1908.02805.
Refereed Papers
2024
Resilient constrained reinforcement learning
Dongsheng Ding, Zhengyan Huan, and Alejandro Ribeiro
27th International Conference on Artificial Intelligence and Statistics, 2024.
Note: to appear; also arXiv:2312.17194.
2023
2022
2021
2020
2019
2018
2016
2015
2014
2012
2011
|