技术共享
TECHNOLOGY SHARING
提供开源算法 高水平基准AI
训练及复盘数据以及AI开发包等
Minimax Q-Learning
Deep Q-Network
Neural Fictitious Self-Play
Monte Carlo Tree Search
Deep Deterministic Policy Gradient
Actor-Critic
Trust Region Policy Optimization
Dueling DQN
State-Action-Reward-State-Action
Sarsa (Lambda)
Prioritized Experience Replay
Local Generative Actor-Critic(浙江大学)
博弈算法(清华大学)