技术共享
TECHNOLOGY SHARING
提供开源算法 高水平基准AI
训练及复盘数据以及AI开发包等
MADDPG
Mean Field Multi-Agent Reinforcement Learning
Neural Fictitious Self-Play
Friend-or-Foe Q-Learning
Nash Q-Learning
DRON
反事实后悔最小化算法(Counterfactual Regret Minimization)
Regret Policy Gradient