Commit b28160bc authored by Gallouedec Quentin

Add link to hf cours on reinforce

parent 4768fd97
@@ -70,6 +70,8 @@ Repeat 500 times:
Update the policy using an Adam optimizer and a learning rate of 5e-3
```
To learn more about REINFORCE, you can refer to [this unit](https://huggingface.co/blog/deep-rl-pg).
> 🛠 **To be handed in**
> Use PyTorch to implement REINFORCE and solve the CartPole environment. Share the code in `reinforce_cartpole.py`, and share a plot showing the total reward across episodes in the `README.md`.
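
The policy update step mentioned above (Adam optimizer, learning rate 5e-3) could be sketched as follows. This is a minimal illustration under stated assumptions, not the reference solution: the network layout, the discount factor `gamma=0.99`, the return normalization, and the helper names `discounted_returns` and `reinforce_update` are all hypothetical choices, and random observations stand in for a real CartPole episode.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical policy network: 4 observation features in, 2 action
# probabilities out (the sizes match CartPole; the hidden width is assumed).
policy = nn.Sequential(
    nn.Linear(4, 128), nn.ReLU(), nn.Linear(128, 2), nn.Softmax(dim=-1)
)
# Adam with the learning rate given in the text.
optimizer = torch.optim.Adam(policy.parameters(), lr=5e-3)


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns, running = [], 0.0
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return torch.tensor(returns, dtype=torch.float32)


def reinforce_update(log_probs, rewards, gamma=0.99):
    """Apply one REINFORCE gradient step from a single finished episode."""
    returns = discounted_returns(rewards, gamma)
    # Assumption: normalize returns to reduce variance (needs >= 2 steps).
    returns = (returns - returns.mean()) / (returns.std() + 1e-8)
    # REINFORCE loss: negative log-probabilities weighted by the returns.
    loss = -(torch.stack(log_probs) * returns).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


# Stand-in for one episode: random observations instead of a real environment.
log_probs, rewards = [], []
for _ in range(10):
    obs = torch.randn(4)
    dist = torch.distributions.Categorical(probs=policy(obs))
    action = dist.sample()
    log_probs.append(dist.log_prob(action))
    rewards.append(1.0)  # CartPole gives a reward of 1 per surviving step

loss_value = reinforce_update(log_probs, rewards)
```

In an actual training loop, the stand-in episode would be replaced by rollouts from the environment, and `reinforce_update` would be called once per episode inside the "Repeat 500 times" loop.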