Commit 78ca6fcb authored by Benyahia Mohammed Oussama

Edit README.md

parent 803d78ea
@@ -107,11 +107,20 @@ This repository contains my individual work for the **Hands-On Reinforcement Lea
 [A2C Panda-Reach Model](https://huggingface.co/oussamab2n/a2c-panda-reach)
 ### Evaluation
 - **Evaluation Process:**
-- The model was evaluated over **100 episodes**.
-- An episode is considered **successful** if it reaches a total reward of **-0.25**.
-- **Evaluation Results:**
-- **97%** of episodes successfully met the reward threshold, indicating strong performance.
+- The model was evaluated over 100 episodes.
+- An episode is considered successful if it reaches a total reward of -0.25.
+- **Updated Evaluation Results:**
+- **Total episodes with truncation:** 99/100
+- **Average reward at truncation:** -7.68
+- **Percentage of episodes meeting the reward threshold:** 97%, indicating strong performance.
 - **Evaluation Plot:**
 ![Evaluation Plot](/images/panda_sb3_eval.png)
 *(Figure: The robotic arm’s performance on the PandaReachJointsDense-v3 environment.)*