@@ -24,6 +24,7 @@ Stable-Baselines3 (SB3) is a high-level RL library that provides various algorit
...
Trained model on Hugging Face: [model](https://huggingface.co/Younes-hands-on-rl/a2c_sb3_cartpole/tree/main)
Weights & Biases: [Wandb](https://wandb.ai/younes-rl/my-hands-on-rl?workspace=user-younes-rl)
...
@@ -32,5 +33,6 @@ Weights and Bias: [Wandb](https://wandb.ai/younes-rl/my-hands-on-rl?workspace=us
...
The objective is to learn to reach any point in 3D space by directly controlling the robot's joints, using the `PandaReachJointsDense-v2` environment.
Trained model on Hugging Face: [model](https://huggingface.co/Younes-hands-on-rl/a2c_sb3_panda_reach/tree/main/)
Weights & Biases: [Wandb](https://wandb.ai/younes-rl/PandaReachJointsDense-v2?workspace=user-younes-rl)