#reinforcementlearning Development Tutorials & Tips | Roast Dev

100daysofcode 100daysofdevops 100pay 10mistakes 10yearworkanniversary 11 11labs 11tly 11ty 127001

Tic Tac Toe with AI!

What's up guys! Today I have created a AI using RL(Reinforcement Learning) that plays Tic Tac Toe with you. Using a Q-Network, we train the AI using the Adam optimizer and we train on 10,000 Episodes ...

14.04.2025 0 Read More

Proximal Policy Optimization (PPO) and Generalized Reinforcement Learning with Proximal Optimizer (GRPO)

Introduction Both Proximal Policy Optimization (PPO) and Generalized Reinforcement Learning with Proximal Optimizer (GRPO) are the algorithm of Reinforcement Learning (RL). In this blog, I am...

19.04.2025 0 Read More

Tic Tac Toe with AI!

Proximal Policy Optimization (PPO) and Generalized Reinforcement Learning with Proximal Optimizer (GRPO)

#reading

#popular