tygxc, in response to someone asking for mathematical rigor, responded with the claim that a mathematician working at a weather station wouldnt deal in absolutes and would just give the estimate of the weather.
he really thinks that math is just a bunch of guesses that we think are accurate. LMFAO
That's (corrupt)stats not maths even in stats you don't assume anything so it isn't really guessing but stats can be confused as guesswork if done poorly
I did the same thing stockfish drew itself every game
When the dev team wants to make new release, how do they test it?
- Does the new engine beat the old engine over 50% of the time?
- Does the new engine beat other popular engines more often than the old engine did?
- Does the new engine draw itself consistently?
These are the characteristics that produce a TCEC winner. TCEC winning engines and their handful of close competitors survive, all other engines go on the scrap heap.
The very process of "evolution" for computer engines ensures that they are tuned heavily towards their own current playing ability, with no regard to the future, only the past, and certainly no thought towards perfect play or solving chess. They are designed to win most often against other engines, but more importantly to not lose against other engines. An engine that does not draw itself the vast majority of the time is an engine that is vulnerable to losses.
Now, add the other very important factor here:
Stockfish is the king of the heap, but is also open source. Stockfish's competitors can always steal from it in their own development (indirectly if not openly lifting code line for line if the other engine is a commercial one), and if Stockfish does not draw itself at at even higher rate than most engines, then the opposing dev teams can figure out why, and improve their own engines to exploit blind spots in Stockfish's playing. The *only* advantage that Stockfish maintains is the privacy of their current beta version whose source has not been released yet. So, each release *must* change in some way, even if mostly a lateral change in playing ability, to stay ahead of the open source issue.
This creates a neverending cycle where engines play only the same handful of opponents and are tuned to (1) never lose, first priority, and (2) to have as big a shot at winning as they can manage without compromising (1).
So, is it a surprise that engines draw more and more frequently? Is it a surprise that they draw amongst each other? Not at all. Engines have evolved into an incestuous little band of siblings, and this affects their possibilities of approaching "perfect play" at lot more than people think.
In the computer chess community they use the term "incesttesting" for testing programs against close relatives of themselves. Johan de Koning invented the term.