Engine-Match Analysis of FVF Games

Sort:
Avatar of stephen_33

I intend to post a summary of the most recent engine-match analysis of our completed VC games here. Games that timed out, finished within book, or were of a different variant (Chess960) are excluded.

 

# Game Total Book Other Net 1st. 2nd. 3rd. T1% T2% T3% Last Book Move Moves Analysed
  Totals:       731 430 120 65 59% 75% 84%    
1 FVF Vs SPL 47 11 14 22 12 4 3 55% 73% 86% 11.Qd2 12-33
2 Serious Applicants Only 30 11 8 11 6 0 1 55% 55% 64% 11.Bd3 12-20
3 Italian Game: Classical Variation 1 29 12 16 1 1 0 0 100% 100% 100% 12.Bh4 13 (only)
4 Italian Game: Classical Variation 2 30 12 4 14 6 1 3 43% 50% 71% 12...c6 13-26
5 FVF vs Carpe Diem 68 19 29 20 13 5 1 65% 90% 95% 20.axb5 20-39
6 Nimzo-Indian Defense 18 5 6 7 2 1 1 29% 43% 57% 5...c5 6-12
7 Ruy Lopez - Berlin Defense 40 9 18 13 8 0 2 62% 62% 77% 9...Re8 10-22
8 FVF vs Intl 10 min - KID 32 9 4 19 16 1 1 84% 89% 95% 9...Ng4 10-28
9 benko 40 14 11 15 10 3 1 67% 87% 93% 14.e4 15-18 & 26-36
10 A few against the fanatics 33 14 11 8 4 2 0 50% 75% 75% 14.Rc1 15-22
11 1...Nc6 35 12 0 23 9 7 1 39% 70% 74% 12.Nb3 13-35
12 Raiders of the Long Dark II: Polyp'frs Wars 46 11 11 24 17 4 0 71% 88% 88% 11...Nd5 12-35
13 Top Ten Vote Chess Team Battle 42 12 8 22 14 2 3 64% 73% 86% 12.Ne2 13-34
14 FVF open challenge - King's Pawn Opening 40 8 11 21 15 3 1 71% 86% 90% 8.h3 9-29
15 DAVID AND GOLIATH, OR ARE YOU AFRAID :-) 12 11 0 1 1 0 0 100% 100% 100% 11.Bf4 12 (only)
16 VCL Open Swiss V R1: FVF vs Team Brazil 33 11 0 22 11 7 2 50% 82% 91% 11.e5 12-33
17 VCL Open Swiss V R2: LEGION vs FVF 31 10 12 9 6 0 2 67% 67% 89% 10...Bd7 11-19
18 UU vs FVF - The fair rematch! 22 11 2 9 6 0 1 67% 67% 78% 11...Nde5 12-20
19 Shaky start, but strong finish! ;) 54 14 8 32 15 4 3 47% 59% 69% 14...Nf5 15-33 & 41-53
20 FVF vs Bobby Fischer Group 29 12 5 12 9 1 0 75% 83% 83% 12...Bxf6 13-24
21 FVF vs THE SEVEN-UPS 30 14 0 16 10 2 2 62% 75% 88% 15.Ne2 15-30
22 CD vs FVF re-match 64 9 7 48 25 11 6 52% 75% 88% 10.exd6 10-57
23 French training game #2 Black 34 15 0 19 10 4 1 53% 74% 79% 15...Bd7 16-34
24 FVF vs Adults only group 44 12 10 22 16 1 1 73% 77% 82% 13.Be2 13-34
25 VCL Open Swiss V R4: Team Hungary - Magyar 34 13 5 16 9 4 1 56% 81% 88% 14.Qd2 14-29
26 VCL Open Swiss V R3: Fantastic Voting Fanat 57 15 23 19 11 2 3 58% 68% 84% 15...dxc5 16-34 (44 omitted)
27 The Reindeer Sleeps 30 11 8 11 6 3 2 55% 82% 100% 11.Be2 12-22
28 FVF vs BHARAT 35 9 16 10 6 2 1 60% 80% 90% 9.bxc3 10-19
29 FVF vs Bengal United 27 7 4 16 8 3 1 50% 69% 75% 7.Bd3 8-23
30 Reindeer Rematch 52 19   15 10 1 1 67% 73% 80% 19...Qh5 29-34 & 36-44
31 Smith-Morra Gambit: Accepted 73 14   53 28 12 4 53% 75% 83% 14...Bg4 15-28, 30-31, 33-38, 40-67, 69-71
32 KNOCKOUT VOTE CHESS R1: Fantastic Voting Fa 26 10 13 3 3 0 0 100% 100% 100% 10.Bd3 11-13
33 KNOCKOUT VOTE CHESS R1: BLACK KNIGHT CHESS 26 8 10 8 5 0 0 62% 62% 62% 9.a3 9-16
34 Last Rites vs FVF 38 12   23 13 5 2 57% 78% 87% 12...Bg4 13-27, 29, 31-33, 35-38
35 KNOCKOUT VOTE CHESS S1 QF: FVF vs LATINO 23 9   8 2 3 2 25% 62% 88% 9.Bh4 11-18
36 KNOCKOUT VOTE CHESS S1 QF: LATINO vs FVF 48 12   20 15 1 0 75% 80% 80% 12...Ne5 14-18, 20-24, 26-30, 32, 38, 40-42
37 Best Of Three? 21 14   2 2 0 0 100% 100% 100% 14.dxe6 15-16
38 KNOCKOUT VOTE CHESS S1 SF: OCD vs FVF 27 17   9 6 0 3 67% 67% 100% 18.fxe4 18-19, 21-27
39 KNOCKOUT VOTE CHESS S1 SF: FVF vs OCD 36 25   7 6 1 0 86% 100% 100% 25.Ra2 26-31, 33
40 FVF vs Last Rites 34 15   12 8 3 0 67% 92% 92% 15...Nd8  
41 Evans Gambit #2 33 11   14 10 2 2 71% 86% 100% 11.Qd1 12-20, 22, 24-27
42 Evan's Gambit #1 46 12   22 15 3 3 68% 82% 95% 13.Rae1 13-14, 16, 18-22, 24-31, 33, 35-39
43 KNOCKOUT VC S1 Final: FVF vs Lewis Chessmen 57 25   24 11 7 0 46% 75% 75% 25...Bb5 26-35, 37-44, 46-49, 54-55
44 Lewis v FVF Knockout S1 Final 37 4   29 14 5 4 48% 66% 79% 4...O-O 5-18, 20-26, 28, 31-37
-                          

*

1) 'Total' moves is the number made by FVF in the game

2) 'Book' moves are those available from most opening databases such as Explorer & are normally  excluded from analysis

3) 'Other' are those such as forced moves or moves in which there's a large difference between the engine's first & nth. choice. Any moves in which any of the engine choice moves exceed 2 pawns in strength are excluded, because a competent player would be expected to be able to see this for themselves

4) 'Net' moves is the number of qualifying moves used to calculate the T1, T2 & T3 figures

5) 1st, 2nd & 3rd give the counts of those moves we made that corresponded to the engine's assessed first, second & third best moves in the position. Sometimes we made moves that weren't in any of those categories, so the three counts won't always add up to the Net figure

6) T1, T2 & T3 represent the proportion of moves we made that correspond to the engine's best choices, as a percentage & are calculated like this:-

  • T1 = (1st / Net) * 100%
  • T2 = ((1st + 2nd) / Net) * 100%
  • T3 = ((1st + 2nd + 3rd) / Net) * 100%
Avatar of stephen_33

These are the T3 figures calculated for all our qualifying games but on a 20-game rolling basis:-

.

# Last Game Net 1st. 2nd. 3rd. T1% T2% T3%
20 FVF vs Bobby Fischer Group 305 181 45 26 59% 74% 83%
21 FVF vs THE SEVEN-UPS 299 179 43 25 60% 74% 83%
22 CD vs FVF re-match 336 198 54 30 59% 75% 84%
23 French training game #2 Black 354 207 58 31 58% 75% 84%
24 FVF vs Adults only group 362 217 58 29 60% 76% 84%
25 VCL Open Swiss V R4: Team Hungary - Magyar 358 213 57 29 59% 75% 84%
26 VCL Open Swiss V R3: Fantastic Voting Fanat 370 222 58 31 60% 76% 84%
27 The Reindeer Sleeps 368 220 61 31 60% 76% 85%
28 FVF vs BHARAT 359 210 62 31 58% 76% 84%
29 FVF vs Bengal United 360 208 62 31 58% 75% 84%
30 Reindeer Rematch 367 214 61 32 58% 75% 84%
31 Smith-Morra Gambit: Accepted 397 233 66 35 59% 75% 84%
32 KNOCKOUT VOTE CHESS R1: Fantastic Voting Fa 376 219 62 35 58% 75% 84%
33 KNOCKOUT VOTE CHESS R1: BLACK KNIGHT CHESS 362 210 60 32 58% 75% 83%
34 Last Rites vs FVF 364 208 62 33 57% 74% 83%
35 KNOCKOUT VOTE CHESS S1 QF: FVF vs LATINO 371 209 65 35 56% 74% 83%
36 KNOCKOUT VOTE CHESS S1 QF: LATINO vs FVF 369 213 59 33 58% 74% 83%
37 Best Of Three? 362 209 59 31 58% 74% 83%
38 KNOCKOUT VOTE CHESS S1 SF: OCD vs FVF 362 209 59 33 58% 74% 83%
39 KNOCKOUT VOTE CHESS S1 SF: FVF vs OCD 337 200 56 30 59% 76% 85%
40 FVF vs Last Rites 337 199 58 30 59% 76% 85%
41 Evans Gambit #2 335 199 58 30 59% 77% 86%
42 Evan's Gambit #1 309 189 50 27 61% 77% 86%
43 KNOCKOUT VC S1 Final: FVF vs Lewis Chessmen 314 190 53 26 61% 77% 86%
44 Lewis v FVF Knockout S1 Final 321 188 57 29 59% 76% 85%

*

Avatar of stephen_33

.

Avatar of stephen_33

.

Avatar of stephen_33

.

Avatar of stephen_33

I'm going to post some T3 results for one or more 'trusted VC teams' here as soon as I've obtained them. That will be useful I think as a comparison with our own figures.

I've been asking around & MGleason (always helpful in these matters) has suggested one of his own groups, The Ultimate Training Center. He wrote this about it...

"The Ultimate Training Center is mostly pretty clean; we have had the occasional cheater sneak through, but most of them aren't involved in vote chess. We've got NMs @Impractical and @CedrHask involved (and @canwedoit has joined our most recent game). We don't have any FMs or above involved in vote chess, though"

"I would assume that appropriate benchmarks for vote chess would be a bit above the best plausible correspondence stats for the strongest player. A well-organised team involving 1-2 FMs and additional players a bit below that level could likely be a match for an IM, but probably not for a GM"

Avatar of stephen_33

It's been quite a headache getting that data into some form that I could post & was readable, so I'm hoping it looks o/k for both of you?

If so, I'll just point out something that struck me as soon as I'd produced the T3 figures: While many of our games over the last two years exceed the benchmark by a worrying margin, the final two games are just within. This corresponds with the closing of Dalephilly's account, although that may simply be coincidental.

To remind you both, the usual benchmarks used for this kind of analysis is:-

  • T1 = 60%
  • T2 = 75%
  • T3 = 85%

In the short term we still have a difficult issue to deal with & that's the problem of d-d.

Joe, I received a message from the admin of Last Rites a while ago, drawing our attention to the high T1/T2 results for that game (T2 = 92%!). The benchmark is usually set at 75%. They also examined our discussion in that game & came to the conclusion that d-d was probably using engine assistance. I pointed out that we'd had a banned player in that game as well (Dalephilly) but all of his comments had been erased of course.

I didn't think they were convinced so I asked Stephen for his considered opinion & he's of the view that d-d most probably is using assistance. My next task is to write to him to ask if he can explain the strange gap between his Daily & shorter time control games. I don't intend to make accusations just yet.

There's some useful additional information here in posts #11 & #12....

https://www.chess.com/clubs/forum/view/engine-use-in-vote-chess