Beyond AlphaZero, where is neural network reinforcement learning now?

Avatar of drmrboss

Although the AlphaZero experiments ended with chess, Go and shogi, neural network training is still going on in many other games, such as StarCraft and Dota.

This is one of the achievements where neural networks learn and evolve just like humans do.

https://youtu.be/kopoLzvh5jY

Avatar of redghost101
No
Avatar of redghost101
This is reinforcement learning. The AI isn’t actually learning; it just tries different strategies and gets rewarded if (in the hiders’ case) it is not seen.
Avatar of redghost101
A neural network is much more complex
Avatar of KovenFan
redghost101 wrote:
A neural network is much more complex

If you go through the paper the video is based on, you'll see that they used deep reinforcement learning, which combines deep learning and reinforcement learning principles. So they actually did use a neural net.
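
To make the "both at once" point concrete, here is a toy sketch, not the paper's code. It assumes a made-up two-action task where action 1 stands for "stay hidden" and is the only one rewarded, and it uses a single softmax layer as the smallest possible stand-in for a deep network. The update is the REINFORCE policy-gradient rule, swapped in for illustration and not necessarily what the paper used. The names `softmax` and `train_policy` are hypothetical.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_policy(episodes=2000, lr=0.1, seed=0):
    """Train a tiny softmax policy by trial and reward (REINFORCE)."""
    rng = random.Random(seed)
    # Trainable parameters: one score per action (a one-layer "network").
    logits = [0.0, 0.0]
    for _ in range(episodes):
        probs = softmax(logits)
        # Try an action by sampling from the current policy.
        action = 0 if rng.random() < probs[0] else 1
        # Assumed reward: 1 if the agent "stayed hidden" (action 1), else 0.
        reward = 1.0 if action == 1 else 0.0
        # Gradient of log pi(action) w.r.t. the logits is onehot(action) - probs.
        for i in range(2):
            grad = (1.0 if i == action else 0.0) - probs[i]
            logits[i] += lr * reward * grad
    return softmax(logits)
```

Calling `train_policy()` returns the two action probabilities. After training, most of the probability sits on the rewarded action: the parameters changed because of the rewards, which is the sense in which trial and reward counts as learning.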

Avatar of KovenFan
redghost101 wrote:
This is reinforcement learning. The AI isn’t actually learning; it just tries different strategies and gets rewarded if (in the hiders’ case) it is not seen.

But that is learning though.

Avatar of Pawned_064

The more advanced the opponents it plays, the stronger it gets.

Therefore, AlphaZero can get, uhhh... unpredictable, as the neural network depends on the opponents it plays. If it plays a 500-rated player over and over again, it learns from each match after playing. The computer's logic: I just need to beat that particular human, not the world. AlphaZero can't just play weak opponents, or else it becomes weak. The more it plays against itself... the better it gets at drawing against itself.

Avatar of redghost101
I'm no professional, but as far as I know, reinforcement learning is when they try every possible move until they get the highest reward, then continue on to the next stage. Neural networks work out how to find the next move, making them more efficient than reinforcement learning.
Avatar of Pawned_064

AI is a bit funny.

Avatar of redghost101
The thing is, neural networks don’t learn from playing 500s; they learn by playing themselves, one NN as Black and one as White. Once the training phase finishes, they then challenge much higher-rated people or AIs.
Avatar of redghost101
It can take from 10 hours to 10 days to train, but once it does, it’s almost unbeatable.
Avatar of Luxferre

Good point!

Avatar of redghost101
Ta very much
Avatar of redghost101
@MarcoDiazz it is not learning, this is an algorithm. A neural network is not
Avatar of KovenFan
redghost101 wrote:
@MarcoDiazz it is not learning, this is an algorithm. A neural network is not

?

Avatar of redghost101
Flip it
Avatar of Pawned_064
redghost101 wrote:
It can take from 10 hours to 10 days to train, but once it does, it’s almost unbeatable.

According to common logic, playing by yourself does improve your ability, i.e. if you play with others ranked lower than you, YOUR rank drops.

Avatar of redghost101
It plays itself, each time trying a new strategy. Both sides try it, letting the NN see its weaknesses and improve over time.
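
A rough sketch of that self-play loop, under loud assumptions: instead of chess it uses a tiny take-away game (take 1 to 3 stones, whoever takes the last stone wins), and instead of a neural network it uses a plain lookup table shared by both sides. The update is a simple Q-learning-style rule chosen for illustration, not AlphaZero's actual training method. All names (`legal_moves`, `best_move`, `train_self_play`) are hypothetical.

```python
import random


def legal_moves(stones):
    """A player may take 1, 2 or 3 stones, but never more than remain."""
    return range(1, min(3, stones) + 1)


def best_move(q, stones):
    """Pick the move the shared table currently rates highest."""
    return max(legal_moves(stones), key=lambda a: q[(stones, a)])


def train_self_play(max_stones=12, episodes=20000, alpha=0.5,
                    epsilon=0.3, seed=0):
    """Both sides use and update the same table while playing each other."""
    rng = random.Random(seed)
    q = {(s, a): 0.0
         for s in range(1, max_stones + 1) for a in legal_moves(s)}
    for _ in range(episodes):
        stones = rng.randint(1, max_stones)  # random starting position
        while stones > 0:
            # Sometimes try a new strategy, otherwise play the best known move.
            if rng.random() < epsilon:
                action = rng.choice(list(legal_moves(stones)))
            else:
                action = best_move(q, stones)
            remaining = stones - action
            if remaining == 0:
                target = 1.0  # the mover took the last stone and wins
            else:
                # The opponent (same table) moves next; their best outcome
                # is the mover's worst, hence the minus sign.
                target = -max(q[(remaining, a)]
                              for a in legal_moves(remaining))
            q[(stones, action)] += alpha * (target - q[(stones, action)])
            stones = remaining
    return q
```

After `q = train_self_play()`, calling `best_move(q, 5)` asks the table what to do with 5 stones left. No outside opponent is ever involved; each side's mistakes are punished by the other side, and the shared table improves from that.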
Avatar of Pawned_064
redghost101 wrote:
It plays itself, each time trying a new strategy. Both sides try it, letting the NN see its weaknesses and improve over time.

hmmmmmmmmmm....

Avatar of Pawned_064

That modifies its previous strategy, right?