Commit 284be624 authored by Benyahia Mohammed Oussama

Edit README.md

parent c1bb033e
@@ -13,8 +13,12 @@ This repository contains my individual work for the **Hands-On Reinforcement Lea
 ### Training Results
 - The model was trained for **500 episodes**, showing a steady increase in total rewards. The goal (total reward = 500) was reached consistently after **400 episodes**, confirming successful learning.
 - **Training Plot:**
-<p align="center"> ![Training Plot](/images/train_rewards.png)
-*(Figure: Total rewards increase per episode, indicating successful learning.)* </p>
+<p align="center">
+![Training Plot](/images/train_rewards.png)
+</p>
+<p align="center">
+*(Figure: Total rewards increase per episode, indicating successful learning.)*
+</p>
 ### Model Saving
 - The trained model is saved as: `reinforce_cartpole.pth`.