
Leela Zero (Stockfish without human knowledge), like Alpha 0

drmrboss

Btw, if you would like to contribute to testing Leela, I can help you set up your machine. We need a lot of volunteers (same as the Stockfish project).

Elroch
drmrboss wrote:

Today Leela reached generation 14 (my game was played against generation 12). Leela is fearless and aggressive, without caring about king safety.

Nice. g4 was a surprise that turned out well!

But what was it up to at the end? I believe this could be fixed by encouraging it to win quickly in the design of the reward calculation. This is very straightforward if it is only told the value of a win, a draw or a loss: score a win on move N as gamma^N, a loss on move N as -gamma^N and a draw as zero, where gamma is (of course) less than 1.

I know little about the details of the design of the system, but it certainly looks as though it gives no value to winning more quickly. Any discounting would nudge it towards being efficient, which might also provide a better return on computational resources for learning and playing.
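To make the discounting idea concrete, here is a minimal Python sketch. The function name `discounted_outcome` and the default gamma = 0.99 are illustrative assumptions only; nothing here is taken from Leela's actual training code.

```python
def discounted_outcome(result: str, move_number: int, gamma: float = 0.99) -> float:
    """Score a finished game from one side's point of view.

    result      -- "win", "loss" or "draw"
    move_number -- N, the move on which the game ended
    gamma       -- discount factor, strictly between 0 and 1 (assumed value)
    """
    if not 0.0 < gamma < 1.0:
        raise ValueError("gamma must be strictly between 0 and 1")
    if result == "draw":
        return 0.0
    # A win scores +gamma^N, a loss scores -gamma^N.
    sign = {"win": 1.0, "loss": -1.0}[result]
    return sign * gamma ** move_number


if __name__ == "__main__":
    # Compare a game that ends on move 5 with one that ends on move 50.
    for n in (5, 50):
        print(n, discounted_outcome("win", n), discounted_outcome("loss", n))
```

Under this scheme a quicker win scores higher than a slower one, a delayed loss is penalised less than an immediate one, and the ordering win > draw > loss is preserved for any N.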

nighteyes1234
drmrboss wrote:

A neural network is like a learning child. If a child touches a spike, he gets feedback that hurts him, and he will always avoid it in the future. This is the simple cycle of input --> output --> recognizing the result --> applying it in the future. (However, we don't know how his brain works inside.) This link explains how neural networks work: http://www.explainthatstuff.com/introduction-to-neural-networks.html

 

A true AI won't be told what a "spike" is, apart from the outcome of the game. Humans telling the computer what a spike is, that's SF... or Komodo... or Houdini. Sure, with its 80,000 legit SF9 games already in, it will get to 3400 once it reaches those positions... they will boast how good it is, but it won't beat up on Komodo. So then they will add 100,000 Komodo games, and it becomes SF-Komodo and they boast again. Then they add 100,000 Houdini games and it will be SF-Komodo-Houdini... which will be impressive. But not more so than AlphaZero. With a new, higher Elo for any of those, it will have to play another 100,000 games.

Elroch

 Presumably Leela was named after the Futurama character?

AntonioEsfandiari

Leela means divine play

drmrboss
 


I am starting to lose to Leela. I lost several games on time at 3'+2" blitz.

[Event "Computer chess game"]
[Site "DESKTOP-U9VVTOA"]
[Date "2018.03.23"]
[Round "?"]
[White "Nay Lin Tun"]
[Black "Play [001]"]
[Result "1-0"]
[BlackElo "2000"]
[ECO "A07"]
[Opening "Reti Opening"]
[Time "23:16:36"]
[Variation "KIA, 2.g3"]
[WhiteElo "2400"]
[TimeControl "180+2"]
[Termination "normal"]
[PlyCount "111"]
[WhiteType "human"]
[BlackType "program"]

 

drmrboss

Leela running on my Arena 3.5.1 GUI.

drmrboss
DeirdreSkye wrote:
drmrboss wrote:

@Attacker5000, Leela already saw mate in 5 at move 50, and from then on M4, M3, M2 in all evaluations (check the evaluations). But for some reason Leela didn't care whether it was M5 or M50, as long as the result ended as a win. It is the same with humans: we don't search for the shortest line to mate. This is probably due to the 50-move rule, where there is no bonus for winning in 5 moves vs 50 moves.

Humans don't always care about the shortest route to a win; they often choose the simpler one.

But that is not the case with Leela and seems more like a serious programming flaw.

Sorry, I checked the wrong eval: it was not Leela that saw M5 through M1, it was Stockfish. But Leela will fix this on its own.

Elroch

I pointed out earlier (#27) that the odd characteristic of not trying to finish a game off could surely be fixed with a slight change to the reward function, and this might even be beneficial to performance.

drmrboss

Leela progress!

We have 100 active contributors and 76k games have been played, for a gain of 3500+ Elo over random play. (No one knows the initial Elo of random play; it might arbitrarily be put at -2000 Elo.)

Estimated Elo of Leela is 1800 now.

drmrboss
Elroch wrote:

I pointed out earlier (#27) that the odd characteristic of not trying to finish a game off could surely be fixed with a slight change to the reward function, and this might even be beneficial to performance.

@Elroch, thanks for your feedback. It would be a good idea to discuss this with the Leela programmers. Here is a link if you would like to discuss it directly: https://groups.google.com/forum/#!forum/lczero

Elroch

Thanks for the suggestion. As you may have guessed, I have spent quite a lot of time on related technology for other applications.

Elroch
drmrboss wrote:

Leela progress!

Looks very healthy, with more to come. I suspect you are slightly underestimating its strength.

drmrboss

Leela is growing very fast. A lot of people are starting to complain that they can no longer beat Leela. For comparison:

Leela has been trained for 838k games.

Alpha 0 has been trained for 42 million games.

Stockfish has been tested for 705 million games. 

AntonioEsfandiari

how do we play her?

drmrboss
AntonioEsfandiari wrote:

how do we play her?

http://play.lczero.org

Elroch

My first game against Leela with Lichess annotation using Stockfish 9.

Move 41 by Leela was surprising (though not the critical error).

 

AntonioEsfandiari

I'm noticing she plays exceptionally badly once the position is worse or lost for her.

drmrboss

OK, I am pretty sure that Leela will be 2000+ within the next month and you won't beat her anymore! (She is just a one-month-old baby!)
