Dongsheng Ding - Publications

School of Engineering and Applied Science, University of Pennsylvania

Preprints

Deterministic policy gradient primal-dual methods for continuous-space constrained MDPs
Sergio Rozada, Dongsheng Ding, Antonio G. Marques, Alejandro Ribeiro
Note: submitted; also arXiv:2408.10015.

Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Basar, and Mihailo R. Jovanovic
Note: J. Mach. Learn. Res. (under review); a journal extension of the paper; also arXiv:2206.02346.

Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
Note: IEEE Trans. Control. Netw. Syst. (under revision); also arXiv:1908.02805.

Refereed Papers

2024

Constrained diffusion models via dual training
Shervin Khalafi, Dongsheng Ding, and Alejandro Ribeiro
Advances in Neural Information Processing Systems, 2024.
Note: to appear; also arXiv:2408.15094.

One-shot safety alignment for large language models via optimal dualization
Xinmeng Huang, Shuo Li, Edgar Dobriban, Osbert Bastani, Hamed Hassani, Dongsheng Ding
Advances in Neural Information Processing Systems, 2024. (spotlight)
Note: to appear; poster; also arXiv:2405.19544.

Resilient constrained reinforcement learning
Dongsheng Ding, Zhengyan Huan, and Alejandro Ribeiro
27th International Conference on Artificial Intelligence and Statistics, 2024.

2023

Last-iterate convergent policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding*, Chen-Yu Wei*, Kaiqing Zhang*, and Alejandro Ribeiro
Advances in Neural Information Processing Systems, 2023.
Note: *alphabetical order; poster, slides, video.

Provably efficient generalized Lagrangian policy optimization for safe multi-agent reinforcement learning
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
5th Annual Conference on Learning for Dynamics and Control, 2023.
Note: poster.

2022

Policy gradient primal-dual mirror descent for constrained MDPs with large state spaces
Dongsheng Ding and Mihailo R. Jovanovic
61st IEEE Conference on Decision and Control, 2022.
Note: slides.

Independent policy gradient for large-scale Markov potential games: sharper rates, function approximation, and game-agnostic convergence
Dongsheng Ding*, Chen-Yu Wei*, Kaiqing Zhang*, and Mihailo R. Jovanovic
39th International Conference on Machine Learning, 2022. (long talk)
Note: *alphabetical order; poster, slides, video.

Convergence and optimality of policy gradient primal-dual method for constrained Markov decision processes
Dongsheng Ding, Kaiqing Zhang, Tamer Basar, and Mihailo R. Jovanovic
2022 American Control Conference, 2022.
Note: slides.

2021

Provably efficient safe exploration via primal-dual policy optimization
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
24th International Conference on Artificial Intelligence and Statistics, 2021. (oral presentation)
Note: poster, slides, video.

Byzantine-resilient distributed learning under constraints
Dongsheng Ding, Xiaohan Wei, Hao Yu, and Mihailo R. Jovanovic
2021 American Control Conference, 2021.
Note: slides.

Discounted online Newton method for time-varying time series prediction
Dongsheng Ding, Jianjun Yuan, and Mihailo R. Jovanovic
2021 American Control Conference, 2021.
Note: slides.

2020

Natural policy gradient primal-dual method for constrained Markov decision processes
Dongsheng Ding, Kaiqing Zhang, Tamer Basar, and Mihailo R. Jovanovic
Advances in Neural Information Processing Systems, 2020.
Note: poster, slides, video.

Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian: A Lyapunov-based approach
Dongsheng Ding and Mihailo R. Jovanovic
59th IEEE Conference on Decision and Control, 2020.
Note: slides.

2019

Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual method
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo R. Jovanovic
Optimization Foundations for Reinforcement Learning Workshop, 2019.
Note: poster.

Distributed robust statistical learning: Byzantine mirror descent
Dongsheng Ding, Xiaohan Wei, and Mihailo R. Jovanovic
58th IEEE Conference on Decision and Control, 2019
Note: poster, slides.

Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian
Dongsheng Ding and Mihailo R. Jovanovic
2019 American Control Conference, 2019.
Note: slides.

2018

An exponentially convergent primal-dual algorithm for nonsmooth composite minimization
Dongsheng Ding, Bin Hu, Neil K. Dhingra, and Mihailo R. Jovanovic
57th IEEE Conference on Decision and Control, 2018.
Note: slides.

A primal-dual Laplacian gradient flow dynamics for distributed resource allocation problems
Dongsheng Ding and Mihailo R. Jovanovic
2018 American Control Conference, 2018.
Note: slides.

2016

Adaptive Mittag-Leffler stabilization of a class of fractional order uncertain nonlinear systems
Qiao Wang, Jianliang Zhang, Dongsheng Ding, and Donglian Qi
Asian J. Control, 2016.

2015

Nonlinear Mittag-Leffler stabilization of commensurate fractional-order nonlinear systems
Dongsheng Ding, Donglian Qi, and Qiao Wang
IET Control Theory Appl., 2015

Asymptotic pseudo-state stabilization of uncertain fractional-order nonlinear systems with additive disturbance
Dongsheng Ding, Donglian Qi, and Qiao Wang
Nonlinear Dyn., 2015.

Mittag-Leffler synchronization of uncertain fractional order chaotic systems
Qiao Wang, Dongsheng Ding, and Donglian Qi
Chinese Physics B, 2015.

2014

Adaptive Mittag-Leffler stabilization of commensurate fractional-order nonlinear systems
Dongsheng Ding, Donglian Qi, Yao Meng, and Li Xu
53rd IEEE Conference on Decision and Control, 2014.

Strategy analysis of an evolutionary spectrum sensing game
Dongsheng Ding, Guoyue Zhang, Donglian Qi, and Huhu Zhang
Intelligent Computing and Applications (LSMS & ICSEE), 2014.

Alternative LMI characterizations for fractional-order linear systems
Dongsheng Ding, Donglian Qi, and Qiao Wang
33rd Chinese Control Conference, 2014.

Fractional-order integral state space modeling and quasi state analysis via block operational matrix scheme
Dongsheng Ding, Donglian Qi, and Qiao Wang
26th Chinese Control and Decision Conference, 2014.

2012

Convergence analysis and performance of an extended central force optimization algorithm
Dongsheng Ding, Donglian Qi, Xiaoping Luo, Jinfei Chen, Xuejie Wang, and Pengying Du
Appl. Math. Comput., 2012

2011

A convergence proof and parameter analysis of central force optimization algorithm
Dongsheng Ding, Xiaoping Luo, Jinfen Chen, Xuejie Wang, Pengying Du, and Yunfei Guo
J. Convergence Inf. Technol., 2011.