TECHNOLOGY SHARING
We provide open-source algorithms, high-level baseline AI agents,
training and replay data, AI development kits, and more.
Proximal Policy Optimization
Deep Q-Network
Double Deep Q-Networks
Deep Deterministic Policy Gradient
Actor-Critic
Trust Region Policy Optimization
Dueling DQN
State-Action-Reward-State-Action (SARSA)
Sarsa (Lambda)
Prioritized Experience Replay
Local Generative Actor-Critic (Zhejiang University)