diff --git a/README.md b/README.md index 92e7adee6d90a9229893ff56262eab926a7ae0c4..bedab60c91db49a191ad723ec4413142f095d35b 100644 --- a/README.md +++ b/README.md @@ -12,7 +12,7 @@ The REINFORCE algorithm (also known as Vanilla Policy Gradient) is a policy grad > 🛠**To be handed in** > Use PyTorch to implement REINFORCE and solve the CartPole environement. Share the code in `reinforce_cartpole.py`, and share a plot showing the total reward accross episodes in the `README.md`. Also, share a file `reinforce_cartpole.pth` containing the learned weights. For saving and loading PyTorch models, check [this tutorial](https://pytorch.org/tutorials/beginner/saving_loading_models.html#saving-loading-model-for-inference) - + ## Model Evaluation