diff --git a/README.md b/README.md
index bee8ea4f1ff6a0693bb93efa8ce5c6e9f928d791..6c39c1b74069d350aa74aee62e842044d401916c 100644
--- a/README.md
+++ b/README.md
@@ -119,7 +119,7 @@ Repeat 500 times:
 Save the model weights
 ```
 
-To learn more about REINFORCE, you can refer to [this unit](https://huggingface.co/learn/deep-rl-course/unit4/introduction).
+To learn more about REINFORCE, you can refer to [this unit](https://huggingface.co/learn/deep-rl-course/unit4/policy-gradient).
 
 > 🛠**To be handed in**
 > Use PyTorch to implement REINFORCE and solve the CartPole environement. Share the code in `reinforce_cartpole.py`, and share a plot showing the total reward accross episodes in the `README.md`. Also, share a file `reinforce_cartpole.pth` containing the learned weights. For saving and loading PyTorch models, check [this tutorial](https://pytorch.org/tutorials/beginner/saving_loading_models.html#saving-loading-model-for-inference)