@@ -14,9 +14,7 @@ This repository contains my individual work for the **Hands-On Reinforcement Lea
- The model was trained for **500 episodes**, showing a steady increase in total rewards. The goal (total reward = 500) was reached consistently after **400 episodes**, confirming successful learning.
-**Training Plot:**
<palign="center">

</p>
<pstyle="text-align: center;">
<br>
<b>(Figure: Total rewards increase per episode, indicating successful learning.)</b>