It trained itself to this level of play in only 4 hours.
When you wake up next morning, it will already be 4000, chess will be solved and this forum will shut down.
It appears not. Take a look at the graph of the standard of play versus the stage of learning. AlphaZero's standard of play seemed to have plateaued after it got about 100 points stronger than Stockfish.
That is what I am saying: this project is dead.
From 0 to 2000 elo it was extremely easy to tune their parameters: a 2000 player, no matter if human or machine, needs to know only about piece values, psqt, weak pawns, shelter attacks and passers.
From 2000 to around 2600 you need further positional terms, like all imaginable tactical checks and pins, connected passers, some imbalances, etc.
Beyond 2600, however, it gets really difficult, as one has to constantly refine and widen one's evaluation, and that evaluation is mostly unobvious. They reached 2800(3200-400 for hardware) on single core, and then they plateaued. Further on, the evaluation is extremely unobvious.
And I would say, almost impossible to tune automatically.
That is why we will not see a stronger version in the next very long period of time.
Again, they reached 2800 on single core, and that is full 400 points below SF, so a random middle tier engine evaluation.
That is why I am upset by their claims: it was all hardware.
The claim is they enhanced intelligence, be it artificial or not: if the evaluation(intelligence) of that engine is a middle-tier one, what kind of an intelligence breakthrough is this?
The only real connection between number of threads and number of cores is that Stockfish recommends one thread per core. Other settings are possible but sub-optimal.
If that setting was used, Stockfish was running on 64 cores, which is a very powerful computer. 32 cores is almost as impressive.
Not impressive at all next to what Alpha had.
I keep referring to the graphs from the DeepMind paper.
These show that if Stockfish was given thirty times longer per move it would have gained surprisingly few Elo points, and not done much better.
This is already bogus.
Doubling time is usually more important than doubling speed, and this is significantly more time.
I guess they have measured something wrong. We should not believe everything as they wrote it.