@@ -13,8 +13,12 @@ This repository contains my individual work for the **Hands-On Reinforcement Lea
...
### Training Results
- The model was trained for **500 episodes**, and the total reward per episode increased steadily over training. The goal (total reward = 500) was reached consistently after **400 episodes**, which indicates that the agent learned the task.
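
One way to make "reached consistently" checkable is to look for the first episode after which every remaining episode meets the goal. The snippet below is a minimal sketch of that check, not the code used in this repository: the function name `first_consistent_episode`, the `min_run` parameter, and the synthetic reward list are all assumptions made for illustration.

```python
from typing import Optional, Sequence


def first_consistent_episode(
    rewards: Sequence[float],
    goal: float = 500.0,
    min_run: int = 20,
) -> Optional[int]:
    """Return the 1-based episode from which the goal is met until the end.

    `rewards` holds one total reward per episode, in training order.
    Returns None if the final streak of goal-reaching episodes is
    shorter than `min_run` (hypothetical threshold, tune as needed).
    """
    start = len(rewards)
    # Walk backwards from the last episode while the goal is still met.
    while start > 0 and rewards[start - 1] >= goal:
        start -= 1
    run_length = len(rewards) - start
    if run_length < min_run:
        return None
    return start + 1


if __name__ == "__main__":
    # Synthetic rewards for illustration only, not the actual training log.
    example_rewards = [100.0] * 400 + [500.0] * 100
    print(first_consistent_episode(example_rewards))
```

In practice, `rewards` would be the list of per-episode totals logged during training. A moving average over a fixed window is a common alternative if occasional dips below the goal should be tolerated.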