RL_sysu_homework
Reinforcement Learning homework for SYSU graduate students
homework1
- Value iteration and policy iteration (policy evaluation and policy improvement)
- Tabular Q-learning
- Sarsa and Q-learning
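
As a rough illustration of how the Sarsa and Q-learning items above differ, here is a minimal sketch of the two tabular updates. It is not taken from the homework code: the function names, the `alpha` and `gamma` defaults, and the assumption that `Q` is a NumPy array indexed by `[state, action]` are all hypothetical.

```python
import numpy as np

def q_learning_update(Q, s, a, r, s_next, done, alpha=0.1, gamma=0.99):
    """Off-policy TD update: bootstrap from the greedy action in s_next."""
    # Terminal transitions have no bootstrap term.
    target = r if done else r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])
    return Q

def sarsa_update(Q, s, a, r, s_next, a_next, done, alpha=0.1, gamma=0.99):
    """On-policy TD update: bootstrap from the action actually taken in s_next."""
    target = r if done else r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (target - Q[s, a])
    return Q

# Hypothetical 2-state, 2-action table used only for demonstration.
Q_demo = np.zeros((2, 2))
Q_demo[1] = [1.0, 2.0]
q_table = q_learning_update(Q_demo.copy(), 0, 0, 1.0, 1, False, alpha=0.5, gamma=0.5)
s_table = sarsa_update(Q_demo.copy(), 0, 0, 1.0, 1, 0, False, alpha=0.5, gamma=0.5)
```

The only difference is the bootstrap term: Q-learning uses the maximum over next actions, while Sarsa uses the value of the next action chosen by the behavior policy.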
homework2
- DQN for PongNoFrameskip-v4
- REINFORCE and REINFORCE with baseline for CartPole-v0
- DDPG (TD3) for LunarLanderContinuous-v2
- DDPG (TD3) for BipedalWalker-v2
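
For the REINFORCE items above, the quantity that weights each log-probability is the discounted return from that time step. The sketch below computes it, and subtracts the episode mean as a simple constant baseline. This is an assumption for illustration only: the homework may use a learned value-function baseline instead, and the function names and `gamma` default are hypothetical.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + ... for every step of one episode."""
    returns = []
    g = 0.0
    # Accumulate from the last step backwards so each G_t reuses G_{t+1}.
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    return returns

def subtract_mean_baseline(returns):
    """Subtract the episode-mean return (a constant baseline, assumed here)."""
    mean = sum(returns) / len(returns)
    return [g - mean for g in returns]

# Hypothetical three-step episode with reward 1 at every step.
demo_returns = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
demo_advantages = subtract_mean_baseline(demo_returns)
```

Subtracting a baseline leaves the expected policy gradient unchanged while typically reducing its variance, which is the motivation for the "with baseline" variant.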
homework3