From b28160bcdd3a949bd1c582ab6b012fc0a337f168 Mon Sep 17 00:00:00 2001 From: Gallouedec Quentin <quentin.gallouedec@ec-lyon.fr> Date: Sun, 5 Feb 2023 19:54:51 +0000 Subject: [PATCH] Add link to hf cours on reinforce --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index 366f6a3..535143c 100644 --- a/README.md +++ b/README.md @@ -70,6 +70,8 @@ Repeat 500 times: Update the policy using an Adam optimizer and a learning rate of 5e-3 ``` +To learn more about REINFORCE, you can refer to [this unit](https://huggingface.co/blog/deep-rl-pg). + > 🛠**To be handed in** > Use PyTorch to implement REINFORCE and solve the CartPole environement. Share the code in `reinforce_cartpole.py`, and share a plot showing the total reward accross episodes in the `README.md`. -- GitLab