Chess will never be solved, here's why

Sort:
Avatar of Intellectual_26

Waiting for 8 posts after this to post, and 21 after that.

Avatar of MARattigan
tygxc wrote (#8853):

@8838

"wheres your data on the number of errors"
++ Another case: Tata Steel Masters 2023
91 games: 60 draws + 31 decisive games

Assuming Chess a white or black win:
60/91 = Poisson(1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
No lambda satisfies this equation.
Thus Chess cannot be a white or black win.
Thus Chess is a draw.

Assuming Chess a draw:
31/91 = Poisson(1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
lambda = 0.57 errors / game average satisfies this.

Games with 0 errors: 51
Games with 1 error: 29
Games with 2 errors: 9
Games with 3 errors: 2
Games with 4 or more errors: 0

I posted earlier (#8819)


...

You don't need to guess what the distribution of blunders would be using SF15 by looking at games where the theoretical result of the starting position is unknown, the player is not SF15 and there's no chance of deciding which moves are blunders.

We have actually weakly solved competition rules chess (amended to make the game soluble) for positions with 7 or fewer men on the board, no castling rights and no positions since the previous ply count 0 position that are considered the same for the purposes of the triple repetition rule. 

So you only need to look at a series of games in some of those positions using SF15 itself. Then there's no guesswork.

To make it simple I suggest this series which I've already gone to the trouble of producing for you. Twelve games, twelve draws.

@cobra91 has already checked these games with the Syzygy tablebase and produced a table of blunders.


Now be a good lad and tell us what your Poisson distribution is for those games using only the results without any info from Syzygy and we can see how closely it matches - you've been avoiding it for months. While you're about it, you claim to be able to say what the theoretical result of the initial position is from that distribution, so tell us what that says as well.

For the record, I predict you will continue to do neither because you're interested only in posting crap.

My prediction has so far turned out to be spot on.

So we'll have to do it for you (as usual).

Applying your reasoning to the games shown.

12 games: 12 draws + 0 decisive games

Assuming the initial position a white or black win:
12/12 = Poisson(1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
No lambda satisfies this equation.
Thus the initial position cannot be a white or black win.
Thus the initial position is a draw.

Assuming the initial position a draw:
0/12 = Poisson(1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
lambda = 0 errors / game average satisfies this.

Games with 0 errors: 12

But Syzygy tells us (with the aid of @cobra91):

The position is a Black win.

Games with 0 errors: 0

Games with 1 error: 4
Games with 2 errors: 0
Games with 3 errors: 3
Games with 4 or more errors: 5

(error = @tygxc error = half point blunder or full point blunder counted twice.)

So what went wrong?

Answer: your reasoning.

You have used the hypothesis that blunders per game occur with probabilities that form a Poisson distributution. You gave no reasons why they should and the hypothesis is intuitively unlikely. Obviously they don't.

So you can now check those results and stop posting that argument.

For the record, I predict you will continue to do neither because you're interested only in posting crap.

 

Avatar of MARattigan
Intellectual_26 wrote:

Waiting for 8 posts after this to post, and 21 after that.

Is the "26" bit what you scored?

Avatar of MEGACHE3SE

ey yo tygxc you know whats funny is that i can PROVE that chess is a win with your logic.

assume a poisson distribution of errors.

use the dataset of my games.

there are more decisive games than draws, so chess must be a win.

Avatar of doctordooface

hi

 

Avatar of DiogenesDue
DesperateKingWalk wrote:

You can not use Stockfish to determine if a chess game had zero errors with perfect play. 

For the simple fact that we know for a fact that Stockfish does not play perfect chess. 

How do we know that Stockfish does not play perfect chess?

For the simple fact that Stockfish is still improving! And bring out better performing Stockfish versions. 

It just means the Stockfish version could not detect an error. 

This has been pointed out to him a dozen times happy.png.  It's ignored.

Avatar of tygxc

@8875

"If you instead assume chess to be a win, then what are the number of errors per game?"
++ Then no consistent numbers of errors per game can be found.

Take this tournament, which is larger and stronger that Tata Steel Masters 2023.
https://www.iccf.com/event?id=79897 
136 games = 121 draws + 15 decisive games

Assume Chess to be a win for either white or black.
Then there must be 121 / 136 probability for an odd number of errors.
121 / 136 = Poisson (1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
There exists no lambda that satisfies this equation.
Thus Chess can be no white or black win.
Thus Chess is a draw.

Now verify: assume Chess a draw.
Then there must be 15 / 136 probability for an odd number of errors
15 / 136 = Poisson (1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
lambda = 0.124 errors / game average satisfies this

Thus:
Games with 0 errors: 120
Games with 1 error: 15
Games with 2 errors: 1
Games with 3 or more errors: 0

Avatar of tygxc

@8883
"use the dataset of my games"
++ That is neither a sufficiently large nor a sufficiently strong tournament.
As a 1462 player you probably err every move and so do your opponents.
That is why you get more decisive games than draws.

Avatar of tygxc

@8882

"You can not use Stockfish to determine if a chess game had zero errors with perfect play."
++ I do not use Stockfish,
I use statistics on a sufficiently large and sufficiently strong tournament.

"Stockfish does not play perfect chess" ++ Indeed Stockfish does not play perfect chess.
At 17 s / move on a 10^9 nodes/s cloud engine it errs once every 100,000 positions.

"Stockfish is still improving"
++ Yes, with new versions, better hardware, more time per move, it makes fewer errors.

Avatar of MARattigan
tygxc wrote (#8886):

@8875

"If you instead assume chess to be a win, then what are the number of errors per game?"
++ Then no consistent numbers of errors per game can be found.

Take this tournament, which is larger and stronger that Tata Steel Masters 2023.
https://www.iccf.com/event?id=79897 
136 games = 121 draws + 15 decisive games

Assume Chess to be a win for either white or black.
Then there must be 121 / 136 probability for an odd number of errors.
121 / 136 = Poisson (1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
There exists no lambda that satisfies this equation.
Thus Chess can be no white or black win.
Thus Chess is a draw.

Now verify: assume Chess a draw.
Then there must be 15 / 136 probability for an odd number of errors
15 / 136 = Poisson (1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
lambda = 0.124 errors / game average satisfies this

Thus:
Games with 0 errors: 120
Games with 1 error: 15
Games with 2 errors: 1
Games with 3 or more errors: 0

I posted earlier (#8880)

I posted earlier (#8819)

...

You don't need to guess what the distribution of blunders would be using SF15 by looking at games where the theoretical result of the starting position is unknown, the player is not SF15 and there's no chance of deciding which moves are blunders.

We have actually weakly solved competition rules chess (amended to make the game soluble) for positions with 7 or fewer men on the board, no castling rights and no positions since the previous ply count 0 position that are considered the same for the purposes of the triple repetition rule. 

So you only need to look at a series of games in some of those positions using SF15 itself. Then there's no guesswork.

To make it simple I suggest this series which I've already gone to the trouble of producing for you. Twelve games, twelve draws.



@cobra91 has already checked these games with the Syzygy tablebase and produced a table of blunders.


Now be a good lad and tell us what your Poisson distribution is for those games using only the results without any info from Syzygy and we can see how closely it matches - you've been avoiding it for months. While you're about it, you claim to be able to say what the theoretical result of the initial position is from that distribution, so tell us what that says as well.

For the record, I predict you will continue to do neither because you're interested only in posting crap.

My prediction has so far turned out to be spot on.

So we'll have to do it for you (as usual).

Applying your reasoning to the games shown.

12 games: 12 draws + 0 decisive games

Assuming the initial position a white or black win:
12/12 = Poisson(1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
No lambda satisfies this equation.
Thus the initial position cannot be a white or black win.
Thus the initial position is a draw.

Assuming the initial position a draw:
0/12 = Poisson(1; lambda; 0) + Poisson (3; lambda; 0) + Poisson (5; lambda; 0) + ...
lambda = 0 errors / game average satisfies this.

Games with 0 errors: 12

But Syzygy tells us (with the aid of @cobra91):

The position is a Black win.

Games with 0 errors: 0

Games with 1 error: 4
Games with 2 errors: 0
Games with 3 errors: 3
Games with 4 or more errors: 5

(error = @tygxc error = half point blunder or full point blunder counted twice.)

So what went wrong?

Answer: your reasoning.

You have used the hypothesis that blunders per game occur with probabilities that form a Poisson distribution. You gave no reasons why they should and the hypothesis is intuitively unlikely. Obviously they don't.

So you can now check those results and stop posting that argument.

For the record, I predict you will continue to do neither because you're interested only in posting crap.

Again my prediction was spot on. Maybe it's talking to @Optimissed too much, but I'm beginning to think I've got precognition. Nostradamus was an amateur.

Let me reiterate. The above proves

YOUR ASSUMPTION THAT THE NUMBER OF HALF POINT BLUNDERS PER SF15 v SF15 GAME WILL FOLLOW A POISSON DISTRIBUTION IS WRONG. CAN YOU STOP POSTING IT PLEASE?

(I predict the answer will be "++No. I'm only interested in posting crap.")

Avatar of tygxc

@8889

You post crap.

Your position cannot result from the initial position by optimal play from both sides
and is thus not relevant to weakly solving Chess.

This is no sufficiently large and sufficiently strong tournament starting from the initial position, so that is not what my reasoning about distribution or errors applies to.

Avatar of Exploding_hamster

N

Avatar of MARattigan
tygxc wrote:

@8889

You post crap.

Only because I need to quote you to make my points.

Your position cannot result from the initial position by optimal play from both sides
and is thus not relevant to weakly solving Chess.

You're not planning to use optimal play. You're planning to use SF15. That's why I used SF15 in the examples. 

This is no sufficiently large and sufficiently strong tournament starting from the initial position, so that is not what my reasoning about distribution or errors applies to.

If that's the case, why why are you applying it to the Tata Steel Masters 2023 and ICCF WC31?

There's no reason to think it applies to any series of practical chess games. 

 

Avatar of tygxc

@8894

"And how would you know this to be a fact." ++ Because it is a won position.
Weakly solving Chess only hops from drawn position to drawn position. 

"It is a legal chess position that can be calculated like any other positions."
++ Strongly solving Chess needs all 10^44 legal positions.
Weakly solving Chess only needs the 10^18 relevant positions, all draws.

Avatar of MARattigan
tygxc wrote:

@8894

"And how would you know this to be a fact." ++ Because it is a won position.
Weakly solving Chess only hops from drawn position to drawn position. 

Not if your weakly solving involves hopping from position to position with SF15 it doesn't.

If you were to say a weak solution, once found, only hops from drawn position to drawn position, it's possible you're right. Nobody knows - at least not those of us without big red telephones.

"It is a legal chess position that can be calculated like any other positions."
++ Strongly solving Chess needs all 10^44 legal positions.
Weakly solving Chess only needs the 10^18 relevant positions, all draws.

That is just a tad adrift.

The fact remains, it is a legal chess position that can be calculated like any other position. Don't know what point you thought you were answering.

 

Avatar of tygxc

@8896

"Nor if your weakly solving involves hopping from position to position with SF15 it doesn't."
++ It does. At 17 s/move on a 10^9 nodes/s it hops from draw to draw as becomes clear once it reaches the 7-men endgame table base draw or a prior 3-fold repetition. If it would occasionally err, as it is expected to do once in 10^5 positions, then that will show by not reaching the 7-men endgame table base draw and can be easily repaired by retracting that move.
If you want to emulate 17 s/move on a 10^9 nodes/s cloud engine using a 10^6 nodes/s desktop, then you need 4.7 h/move.
If you want to reach a win in N moves, then you need time to calculate at 2N depth.
Hopping from draw to draw is easier as it does not require looking ahead to 2N.

"it is a legal chess position that can be calculated like any other position."
++ Yes, it is a legal chess position. Yes, it can be calculated. No, it is not relevant to weakly solving Chess. What you present is a set of move sequences from a won position. No, it is not a sufficiently large, sufficiently strong tournament, where the Poisson distribution applies.

Avatar of ardutgamersus

wat

Avatar of MARattigan
tygxc wrote:

@8896

"Nor if your weakly solving involves hopping from position to position with SF15 it doesn't."
++ It does. At 17 s/move on a 10^9 nodes/s it hops from draw to draw as becomes clear once it reaches the 7-men endgame table base draw or a prior 3-fold repetition. If it would occasionally err, as it is expected to do once in 10^5 positions,

I've pointed out numerous time that you calculation of blunder rates based on an extrapolation that assumes an exponential relationship between blunder rates and think time doesn't work, especially when you don't know what values you're extrapolating from.

I've asked you to stop posting the calculation, which you appear to have done, but I'll ask you now if you would also kindly stop posting the result of your calculation.

Try once again. Here is @cobra91's complete table.


At 2048 seconds a move on my desktop I'm around 7½ times short of 17 seconds on a cloud engine according to your calculation. The nearest values in the table that are 7½ times short of the 2048 second values would be the 256 second values.

If we were to use your exponential assumption we could estimate the blunder rates at 17 seconds on a cloud engine as

(blunder rate at 2048 seconds)²/(blunder rate at 256 seconds)

giving (11/290)²/(4/215) blunders per ply ≈ 1 blunder in 13 ply

(not close to 1 blunder in 10⁵ ply)

then that will show by not reaching the 7-men endgame table base draw and can be easily repaired by retracting that move.

Er, which move exactly? Are you going to solve chess before you begin not solving chess?
If you want to emulate 17 s/move on a 10^9 nodes/s cloud engine using a 10^6 nodes/s desktop, then you need 4.7 h/move.
If you want to reach a win in N moves, then you need time to calculate at 2N depth.

Time to calculate exhaustively at 2N depth, which you're obviously going to get nowhere near at 17 seconds on a cloud engine if N>1000 ply, say. 
Hopping from draw to draw is easier as it does not require looking ahead to 2N.

@Elroch can you send me a copy of your laughing peanut?

"it is a legal chess position that can be calculated like any other position."
++ Yes, it is a legal chess position. Yes, it can be calculated. No, it is not relevant to weakly solving Chess. What you present is a set of move sequences from a won position.

Only up to the point where SF15 blunders into a draw, which it does in every game. It's very relevant to your procedure for weakly not solving chess.

Here is another series from the set I posted for you in #5898  (October last year) which you've studiously ignored ever since. This one is a drawn position in KRPP v KRP specially for you since your big red telephone apparently tells you that all games finish up in such positions.

and @cobra91's table


If you're asserting that that SF15 will always hop from drawn position to drawn position if the initial position is a draw, how come 9 of the 11 games get into theoretical wins?

No, it is not a sufficiently large, sufficiently strong tournament, where the Poisson distribution applies.

You're right; it's SF15 v SF15 as you say you plan in your process to weakly not solve chess.

If you could find a set of games with perfect play then a Poisson process with λ=0 would naturally apply, but for the moment SF15 v SF15 is the best we have. (And as I pointed out earlier a Poisson process doesn't apply then.)

 

 

Avatar of tygxc

@8923

"extrapolation that assumes an exponential relationship between blunder rates and think time doesn't work" ++ I fail to see what is wrong. I have 3 data points: decisive games at 1 s/move, decisive games at 1 min/move and 0 decisive games at unlimited time.

"Here is @cobra91's complete table." ++ Of the 4 cases only the drawn KRPP vs. KRP is relevant.

There clearly are anomalies in your data. The error rate should descend monotoneously and it does not. How come it jumps up and down erraticly? There is something wrong with your data.
What are your sequences for this KRPP vs. KRP position? Please repeat or link.

"Er, which move exactly?" ++ The principle is easy. White tries to win, black tries to draw. If all white tries fail, then Chess is weakly solved. Whenever a draw or a white lossis reached, a white move needs retracting. Whenever a white win is reached, a black move needs retracting. According to my calculations the latter case is exceptional as I calculate being right 99,999 out of 100,000 positions. You doubt that result, OK. I doubt your data. Let us look into your data.

i"f N>1000 ply" ++ Chess is not 1000 ply deep. If there are w non-transposing choices per move, then all white moves and all black moves d moves deep lead to w^(2d) positions. There can be no more positions than there are legal positions, so w^(2d) < 10^44.

Avatar of MARattigan
tygxc wrote:

@8923

"extrapolation that assumes an exponential relationship between blunder rates and think time doesn't work" ++ I fail to see what is wrong. I have 3 data points: decisive games at 1 s/move, decisive games at 1 min/move and 0 decisive games at unlimited time.

Then you obviously also can't tell the difference between 1 in 13 and 1 in 10⁵. Perhaps you could work on that first.

"Here is @cobra91's complete table." ++ Of the 4 cases only the drawn KRPP vs. KRP is relevant.

Relevant has to be relevant to something. To what? Your big red telephone?

There clearly are anomalies in your data. The error rate should descend monotoneously and it does not. How come it jumps up and down erraticly? There is something wrong with your data.

No. There's something wrong with your understanding. Try it yourself; it may reduce the crap rate on the thread for a while.
What are your sequences for this KRPP vs. KRP position? Please repeat or link.

"Er, which move exactly?" ++ The principle is easy. White tries to win, black tries to draw.

I usually try to win as Black, SF15 just follows it's algorithm.

If all white tries fail, then Chess is weakly solved. Whenever a draw or a white lossis reached, a white move needs retracting. Whenever a white win is reached, a black move needs retracting. According to my calculations the latter case is exceptional as I calculate being right 99,999 out of 100,000 positions. You doubt that result, OK. I doubt your data. Let us look into your data.

Well, amend you calculations. I've pointed out they don't work. Never was any reason to think they would. By all means look at my data. You might have tried producing your own before filling the thread with crap.

i"f N>1000 ply" ++ Chess is not 1000 ply deep. If there are w non-transposing choices per move, then all white moves and all black moves d moves deep lead to w^(2d) positions. There can be no more positions than there are legal positions, so w^(2d) < 10^44.

Here is a 269 move game Ivan Nikolic vs. Goran Arsovic (538 ply)

 

According to you they've reached every single legal chess position many times over. As an exercise you might try locating this one somewhere in the game.