算法分类-人机对抗智能

对抗空间表示

特征表示

奖励函数

态势评估推理

策略生成优化

行动协同控制

非完美信息博弈

Neural Fictitious Self-Play

开源实现

CFR-BR

CFR against a best responder

开源实现

EVA

Hansen, Steven, et al. "Fast deep reinforcement learning using online adjustments from the past." Advances in Neural Information Processing Systems. 2018.

开源实现

TRPO

Trust Region Policy Optimization

开源实现

CFR

反事实后悔最小化算法（Counterfactual Regret Minimization）

开源实现

External sampling MCCFR

External sampling Monte Carlo CFR

开源实现

Best Response

开源实现

Outcome sampling Monte Carlo CFR

Outcome sampling MC CFR

Regret Policy Gradient

开源实现