Chess will never be solved, here's why

Sort:
Avatar of tygxc

#2036
"If White wins in any of these lines, it would require no less time, because the engine should check White's strategy against all the reasonable alternatives for Black (you say that), not just one."
++ That does not happen: white does not win in any of these lines. If you just let Stockfish autoplay at reasonable time even on your desktop then you end with... a draw. That is empirical evidence, not proof. So the setup can assume it does not happen. In the unlikely case it would happen, the calculation must stop and the good assistants must step in and investigate what is happening.

Avatar of tygxc

#2060

"By including three more candidate moves - errors thus get divided by 1000 trillion ??"
For 1 candidate move:   1 error in 10^5 positions.
For 2 candidate moves: 1 error in 10^5 * 10^5 = 10^10 positions.
For 3 candidate moves: 1 error in 10^5 * 10^10 = 10^15 positions.
For 4 candidate moves: 1 error in 10^5 * 10^15 = 10^20 positions.

'"error" might be poorly defined too.'
++ error = move that turns a drawn position into a lost position

"because the computer is only following formulaic instructions from human programmers."
++ No, AlphaZero got no other instructions but the rules of the game

"In tactical situations - supercomputers would be great.  Fantastic."But what about 'positional' ?"
++ Positional = tactical at more depth

"There may be several 'best' moves in such positions. And several 'second rate' moves."
++ There are only two kinds of moves: good moves and errors.
Good move = move that turns a drawn position into another drawn position
Error = move that turns a drawn position into a lost position

"The computer is assigning numerical differences to evaluate positional moves according to obscure minor technical points which may have little to do with 'best move'."
++ The computer essentially counts the material in pawn units with some tweeks. This is a good approximation for the middle game. It is totally flawed in endgames, like KNN vs. KP.

Avatar of tygxc

#2068
weakly solved means that for the initial position a strategy has been determined to achieve the game-theoretic value against any opposition
https://web.archive.org/web/20170912011410/https://pdfs.semanticscholar.org/8296/bc0ab855841088b31190c9f2923951853d7b.pdf 

Avatar of Robotofdeath395

Your mum will never be solved

 

Avatar of MARattigan
tygxc wrote:

#2068
weakly solved means that for the initial position a strategy has been determined to achieve the game-theoretic value against any opposition
https://web.archive.org/web/20170912011410/https://pdfs.semanticscholar.org/8296/bc0ab855841088b31190c9f2923951853d7b.pdf 

No it doesn't.

 
White to move

The White strategy

         Play 1.Qa7+

achieves the game theoretic value against any opposition. Is that a weak solution of that initial position?

I already pointed that out. Do you read the posts addressed to you?

If you're proposing to find to find a weak solution of "chess" it would probably be a good idea to find out first what "weak solution" means.

Edit: Fixed position.

Avatar of haiaku
tygxc wrote:

#2036
"If White wins in any of these lines, it would require no less time, because the engine should check White's strategy against all the reasonable alternatives for Black (you say that), not just one."
++ That does not happen: white does not win in any of these lines. If you just let Stockfish autoplay at reasonable time even on your desktop then you end with... a draw. That is empirical evidence, not proof. So the setup can assume it does not happen. In the unlikely case it would happen, the calculation must stop and the good assistants must step in and investigate what is happening.

As I already said, I was just following your reasoning:

tygxc wrote:

#1999

"So, you check only one candidate move for Black... and that should be enough to find her optimal strategy?"
++ Yes. Black tries to draw, white tries to win.
If a game ends in a draw, then [ . . . ]
If on the contrary the game would end in a white win, then none of the white moves need changeing and it would be necessary to look at all reasonable alternatives for black and if all those also lead to a white win, then chess would be solved to be a win for white and that in retrospect all white moves were optimal: fit to win.

Did you change your mind? Fine, it's a draw, then. But that does not answer the question I asked the last time (and thrice before):

haiaku wrote:

So, in the hypothesis that the game ends in a draw, how many hours would occur (at 60 hours per position) to generate all these candidates between the starting point (which I understood is not the initial position) and the endgame?

 

Avatar of haiaku

@MARattigan, an objection can be made to your last post, but I will leave it to @tygxc 😁

Avatar of MARattigan

Yes, will fix.

Avatar of Elroch
tygxc wrote:

#2035

"You think that Stockfish makes no more than one losing error in every 100,000 moves"
++ I infer that by extrapolation from the peer reviewed Alpha Zero paper. I know the published error rate at 1 s/move and 1 min/move and extrapolate to 1 h/move and 60 h/move and thus arrive at 1 error of the top 1 move per 10^5 positions.

Extrapolation is not necessarily at all reliable. There is not even a unique way to do it, especially from (absurdly) two data points! (Transform the values using a wide range of monotone functions to get pretty much any answer you want).

But neither would it be reliable with 100 data points unless you know the form of the relationship. You don't.

Thanks for clarifying that those numbers of yours must be ignored.

Avatar of MARattigan
tygxc wrote:

#2035

"You think that Stockfish makes no more than one losing error in every 100,000 moves"
++ I infer that by extrapolation from the peer reviewed Alpha Zero paper. I know the published error rate at 1 s/move and 1 min/move ...

Fascinating. 

How did they work out the error rate? Have they beaten you to it?

(And were the games played under basic rules with the addition of a double repetition rule. If not that would have a significant effect on the error rate.)

Avatar of Elroch
MARattigan wrote:
tygxc wrote:

#2068
weakly solved means that for the initial position a strategy has been determined to achieve the game-theoretic value against any opposition
https://web.archive.org/web/20170912011410/https://pdfs.semanticscholar.org/8296/bc0ab855841088b31190c9f2923951853d7b.pdf 

No it doesn't.

 
White to move

The White strategy

         Play 1.Qa7+

achieves the game theoretic value against any opposition. Is that a weak solution of that initial position?

I already pointed that out. Do you read the posts addressed to you?

If you're proposing to find to find a weak solution of "chess" it would probably be a good idea to find out first what "weak solution" means.

Edit: Fixed position.

When one side is winning, a strategy is only required for that side to have a weak solution. That is the case here, so there is no need for a strategy for white.

A weak solution requires a black strategy that will reach a win against any white defense.

Avatar of MARattigan
Elroch wrote:
MARattigan wrote:
tygxc wrote:

#2068
weakly solved means that for the initial position a strategy has been determined to achieve the game-theoretic value against any opposition
https://web.archive.org/web/20170912011410/https://pdfs.semanticscholar.org/8296/bc0ab855841088b31190c9f2923951853d7b.pdf 

No it doesn't.

 
White to move

The White strategy

         Play 1.Qa7+

achieves the game theoretic value against any opposition. Is that a weak solution of that initial position?

I already pointed that out. Do you read the posts addressed to you?

If you're proposing to find to find a weak solution of "chess" it would probably be a good idea to find out first what "weak solution" means.

Edit: Fixed position.

When one side is winning, a strategy is only required for that side to have a weak solution. That is the case here, so there is no need for a strategy for white.

A weak solution requires a black strategy that will reach a win against any white defense.

I know that and you know that, but going by his definition @tygxc doesn't.

It should read:

weakly solved means that for the initial position a strategy has been determined for a player with a best game-theoretic value to achieve the game-theoretic value against any opposition

But since you don't know the game-theoretic value until you have a real solution, that's not very useful.

Better:

weakly solved means that for the initial position either a timely strategy has been determined for one player that achieves a win against any opposition, or a timely strategy has been determined for both players that avoids a loss against any opposition. 

Edit: Added the word "timely" to the definition in the last paragraph. Otherwise there is the obvious solution of just trying out all possible continuations up to a maximum length of the number of possible positions. 

Avatar of tygxc

#2079
I used the definition by Prof. van den Herik and I gave a reference to his paper.
Personally I prefer a simpler, more practical definition but then I get reproached of using my own definitions instead of the generally accepted ones, which is what you do here.
You even find fault with his generally accepted definition.
I am glad I am not the only one being critiqued.
You should take your criticism to prof. van den Herik this time.

Avatar of tygxc

#2077

"How did they work out the error rate?"
++ They did not: they gave the draw rate at 1 s/move and at 1 min/move. Games that do not draw are sure to contain 1 error. Further I assumed 40 moves hence 80 ply per game. Then I extrapolated from 1 s/move and 1 min/move to 1 h/move and to 60 h/move.

"were the games played under basic rules with the addition of a double repetition rule"
++ That does not matter at all. The result would be exactly the same.

Avatar of MARattigan
tygxc wrote:

#2079
I used the definition by Prof. van den Herik and I gave a reference to his paper.
Personally I prefer a simpler, more practical definition but then I get reproached of using my own definitions instead of the generally accepted ones, which is what you do here.
You even find fault with his generally accepted definition.
I am glad I am not the only one being critiqued.
You should take your criticism to prof. van den Herik this time.

With respect to Prof. van den Herik, the definition is flawed.

I think I showed that in #2072 - can you find any fault with it?

You'd do better to use the definition I gave in the last para. of #2079. I think everyone could agree to that one.

Avatar of Elroch
tygxc wrote:

#2077

"How did they work out the error rate?"
++ They did not: they gave the draw rate at 1 s/move and at 1 min/move. Games that do not draw are sure to contain 1 error. Further I assumed 40 moves hence 80 ply per game. Then I extrapolated from 1 s/move and 1 min/move to 1 h/move and to 60 h/move in an entirely unreliable and almost certainly wrong way.

[snip]

 

Avatar of MARattigan
tygxc wrote:

#2077

"How did they work out the error rate?"
++ They did not: they gave the draw rate at 1 s/move and at 1 min/move. Games that do not draw are sure to contain 1 error. Further I assumed 40 moves hence 80 ply per game. Then I extrapolated from 1 s/move and 1 min/move to 1 h/move and to 60 h/move.

Games that draw are sure to contain any number of errors that sum to -1/2 point for White if the starting position is a win for White, any number of errors that sum to 0 points if the starting position is a draw or any number of errors that sum to +1/2 point for White if the starting position is a win for Black.

The only thing it tells you about the number of errors is that it's at most the number of ply that were played in the game. 

"were the games played under basic rules with the addition of a double repetition rule"
++ That does not matter at all. The result would be exactly the same.

If you look at my SF14 v SF14 examples here , they contain a total of 30 errors in the basic rules game and 27 errors in the competition rules game. Only 4 of those are errors in both games.

That strongly suggests it does matter at all.

 

Avatar of tygxc

#2084
I work on the generally accepted hypothesis that chess is a draw. Hence each decisive game must contain 1 error. As the error rate is that low, the occurence of 2 or more errors is even rarer and can be neglected.
Your examples KNN vs. KP only highlight the failure of the evaluation function. KNN vs. KP is +5. KNN vs. K is +6, but is a draw. The evaluation function prefers a +6 draw over a +5 win.

Avatar of MARattigan
tygxc wrote:

#2084
I work on the generally accepted hypothesis that chess is a draw. Hence each decisive game must contain 1 error. As the error rate is that low, the occurence of 2 or more errors is even rarer and can be neglected.
Your examples KNN vs. KP only highlight the failure of the evaluation function. KNN vs. KP is +5. KNN vs. K is +6, but is a draw. The evaluation function prefers a +6 draw over a +5 win.

As the error rate is what low?

You think SF14 can't handle a couple of knight's and a pawn, but when it gets to 4 rooks, 4 knights, 4 bishops, 2 queens and 16 pawns that should be ok then?

(And when SF14 converted from  KNN vs. KP to KNN vs. K in most cases it wasn't a +5 win in the competition rules game for which SF14 is designed. It was alread a draw under competition rules, so taking the pawn to ensure no Black win and give the remote possibility of a KNNK vs K win for White was the most sensible option.)

Avatar of tygxc

#2086

"As the error rate is what low?"
++ At 1 s/move: 88.2% draw = 11.8% error / game = 1 error / 679 positions
At 1 min/move: 97.7% draw = 2.3% error / game = 1 error / 3478 positions
Extrapolating: at 60 h/move: 1 error / 10^5 positions

"You think SF14 can't handle a couple of knight's and a pawn, but when it gets to 4 rooks, 4 knights, 4 bishops, 2 queens and 16 pawns that should be ok then?"
++ It is not Stockfish, it is its evaluation function that is flawed.
If Stockfish just can calculate to the 7-men endgame table base then its result is exact.
KNN vs. KP highlights the failure of the evaluation function, not of Stockfish.
We know that the evaluation function is unsuitable for KNN vs. KP.
Stockfish itself is overwhelmed by positions with more than 26 men, when chess is most difficult. That is why the good assistants should prepare 26-men tabiya as starting points.
If you want to verify that the table base exact move is within the top 4 Stockfish candidate moves, then KRPP vs. KRP may be better.