Second-Order Mirror Descent: Convergence in Games Beyond Averaging and Discounting

Bolin Gao, Lacra Pavel

correcthigh confidence

Category: math.DS
Journal tier: Strong Field
Processed: Sep 28, 2025, 12:56 AM
arXiv Links: Abstract ↗PDF ↗

Audit review

The paper proves that MD2 converges to an interior mere variationally stable state under either (i) C2 Legendre + supercoercive regularizers or (ii) compact domain with strongly convex regularizers, for all α,β,γ,ε>0, via a Lyapunov function built from a Fenchel/Bregman coupling plus a quadratic term and LaSalle’s invariance principle; see the MD2 definition and Theorem 1 statement, and the proof details for cases (i) and (ii) . The candidate solution follows the same structure: identical Lyapunov functional (up to constant scaling), the same derivative calculation d/dt F(x*,z)=⟨x−x*,ż⟩, the same use of variational stability to make V̇≤0, and LaSalle to locate the ω-limit set. The model additionally gives a standard Bregman-divergence argument to rule out multiple equilibrium cluster points, which strengthens (but does not contradict) the paper’s sketch. Overall, the model’s solution matches the paper’s main argument and assumptions.

Referee report (LaTeX)

\textbf{Recommendation:} minor revisions

\textbf{Journal Tier:} strong field

\textbf{Justification:}

The paper presents a clear, broadly applicable continuous-time scheme (MD2) that overcomes known limitations of first-order MD in games, proving exact convergence to interior mere VSS under standard mirror-map regularity, and extending to rates via a vanishing-perturbation variant. The core Lyapunov analysis (Fenchel/Bregman coupling plus quadratic term) is sound. Some proof details (e.g., selection/uniqueness of the limit among several interior VSS) could be sharpened, and certain assumptions/regularity points might be stated more explicitly. Overall the contribution is technically correct and significant within the game dynamics literature.