Preprints

  • Convergence of natural policy gradient primal-dual methods for constrained convex MDPs
    Dongsheng Ding
    Note: under review.

  • Where to spend rollouts: Hit-utility optimal rollout allocation for group-based RLVR
    Tao Wang, Shuo Li, Yan Sun, Dongsheng Ding, Edgar Dobriban
    Note: arXiv:2605.07114v1; under review.

Refereed Papers

2026

  • Unlearning in diffusion models: A unified framework with KL divergence and likelihood constraints
    Shervin Khalafi*, Alejandro Ribeiro, Dongsheng Ding*
    43rd International Conference on Machine Learning, 2026.
    Note: *corresponding authors; accepted and to appear.

2025

2024

2023

2022

2021

2020

2019

2018

2016 and earlier