Robert Hawkins recommended on 1/23/2024

The paper introduces a policy-regularized hedge algorithm that searches for equilibria relatively close to a specified 'anchor' policy (via a KL term). I mostly liked the theoretical contribution, but the empirical results are also nice.
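To make the role of the KL term concrete, here is a minimal sketch of a single KL-anchored policy update. It is an illustration of the general idea only, not the paper's algorithm: the function name `kl_anchored_policy`, the parameter `lam`, and the example numbers are all assumptions. It uses the standard closed form for maximizing expected utility minus `lam` times KL(policy || anchor), which gives a policy proportional to `anchor(a) * exp(u(a) / lam)`.

```python
import math


def kl_anchored_policy(utilities, anchor, lam):
    """Return the policy maximizing <pi, u> - lam * KL(pi || anchor).

    Hypothetical helper for illustration. Assumes every anchor
    probability is strictly positive and lam > 0.
    Closed form: pi(a) is proportional to anchor(a) * exp(u(a) / lam).
    """
    logits = [math.log(p) + u / lam for p, u in zip(anchor, utilities)]
    top = max(logits)  # subtract the max for numerical stability
    weights = [math.exp(x - top) for x in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Made-up example: the anchor prefers action 0, the utilities prefer action 1.
anchor = [0.7, 0.2, 0.1]
utilities = [0.0, 1.0, 0.0]

near_anchor = kl_anchored_policy(utilities, anchor, lam=1e6)   # strong regularization
near_greedy = kl_anchored_policy(utilities, anchor, lam=0.01)  # weak regularization
```

A large `lam` keeps the result near the anchor, while a small `lam` lets the utilities dominate, which is the trade-off the regularization is meant to control.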