2014.06.06 Stockfish 5 with 4pc Syzygy bases added. The engine gained 6 Elo (in the one on one TOP 16 comparision) and just passed H4. With this the IPON-RRRL has a new No.1!
It is that close that the next new entry might change this! New anchor for the lists is
Stockfish 5 Syzygy with 3113 Elo.
For the "experts": I added Bayes, Ordo and Elostat results in the IPON Archive.
2014.06.01
Stockfish 5 entered the complete list with a very impressiv plus of 40 Elo to Stockfish DD and
ended just 5 Elo below H4. As a little remark I have to say that ELOSTAT would have Stockfish in
front in the Complete List and in the RRRL. It is very close.
2014.05.31
A new No. 12 entered the IPON-RRRL. Texel 1.04 with a rating of 2835 Elo. Congratulations!
As Texel pushed out my reference engine of the TOP 16 I will try a new method of "reference".
It is always the No. 1 which is giving the reference point. Today it is Houdini 4 with 3111 Elo. As
soon as a an engine passes the No.1 it initial rating became the new reference point. With this I
always have a reference and the ratings hopefully doesn't change to much.
My "theoretical" gut feeling tell me that over time (which means 2, 3, 4 new No1 engines) the
overall rating will drop a bit e.g. the old reference, Shredder 12, will drop below 2800 Elo. Most
important is the difference of the engines and that is not different than with any other reference
point.
2014.05.26
Komodo 7a now with all games in the Main List. A new No.2 with a very impressive 32 Elo jump!
I got requests about statistics for individual results. Please check the Archive section. It is
all available there.
2014.05.25
Komodo 7a included in the Complete List. 31 Elo increase over K-TCECr
Statistics and Main List will be updated after the last games.
2014.05.04
Protector 1.6.0 added to all lists. 33 Elo better than the last release. Rank 11 in the IPON-RRRL!
2014.04.20
Finished the missing games of Gull 3 vs DF14 and made a new main list.
No. 2, 3 and 4 are all within one SD basically they are that close that their playing strength is
hard to distinguish.
2014.04.19
Gull 3 added to the complete list. Nice increase of 40 Elo and a new No. 3.
The IPON-RRRL will follow as soon as the missing games are finished.
2014.03.20
Senpai 1.0 included. An entry of 2841 Elo for the main list. Impressive first release!
As Senpai is pushing a number 16 to 17 the last engine will be excluded out of the TOP 16 enignes. It was very
close but it is Depp Sjeng c't 2010, an engine which I really liked to have in my list.
This would be a TOP 17 List of the IPON:
1 Houdini 4 3119 9 9 3520 77% 2906 29%
2 Stockfish DD 3072 8 8 3520 72% 2909 41%
3 Komodo TCECr 3057 8 8 3520 70% 2909 38%
4 Gull 2.8 3023 8 8 3520 65% 2912 40%
5 Critter 1.4a 2982 8 8 3520 59% 2914 46%
6 Equinox 2.02 2978 8 8 3520 59% 2914 46%
7 Deep Rybka 4.1 2968 8 8 3520 57% 2915 47%
8 Deep Fritz 14 2901 8 8 3520 47% 2919 45%
9 Chiron 2 2893 8 8 3520 46% 2920 45%
10 Hannibal 1.4b 2875 8 8 3520 44% 2921 43%
11 Senpai 1.0 2843 8 8 3520 39% 2923 41%
12 Naum 4.2 2838 8 8 3520 39% 2923 41%
13 Protector 1.5.0 2836 8 8 3520 38% 2923 43%
14 HIARCS 14 WCSC 32b 2822 8 8 3520 36% 2924 40%
15 Jonny 6.00 2805 8 8 3520 34% 2925 37%
16 Deep Shredder 12 2800 8 8 3520 33% 2926 38%
17 Deep Sjeng c't 2010 32b 2798 8 8 3520 33% 2926 39%
2014.03.17
Gull 2.8 included. 3023 Elo in the main list. Ranked 4th!
2014.03.14
All counters removed - No vanity, no liabilities!
2014.01.05
1. Equinox 2.02 included. 2978 Elo in the MAIN lst.
2. For historical reasons the 4 year old Robbolito 0.085g is included. It is possible to compare it with the
alleged source but keep in mind that Robbo did not play the same opponents. It's average opponent elo is much
higher. As "the source" is playing with a contempt there is the possibility that it would lose more points
when playing the same strong engines.
3. The IPON is fixed to 2800 elo for Shredder 12. This is an offset of 2783 over ALL engines. I applied this
offset to the three ECO based lists. This is more accurate than having a fixed engine there. I will do that
principle with all ECO lists in the future.
2014.01.03
The three rating lists sorted by opening system have to be recalculated. It makes no seance to fix this with
a certain engine as this might cover an in- or decrease of another engine in case the fixed engines is weaker
or stronger in one of the lists. It is better to give the lists the same average elo value as all list have
the same engines. This way it is just the distribution which counts, not the relative distance to a specific
engine.
Unfortunately I deleted the individual PGN data for the lists. I will remodel the list with the next engine
I include.
2014.01.01
I restarted the IPON 2014 with some changes.
1. The old 75 opening positions set is increased by 35 new position to 110 opening positions. The ECO
distribution is 21% open games (ECO C20-C99), 30% half open games (ECO B + C00-C19) and 49% closed
games (ECO A,D, E).
This is less 'GM tourney' practice but more suited to the average chess player and his analytical needs.
2. Only 16 top engines are tested. The reasons are a smaller width of opponent elo and less games to play.
Additionally the error bar is below 10 Elo now for all top 16 engines in the IPON-RRRL. These 10 elo are
the "border of irrelevance" for me, as no one can feel or distinguish a + or - of 10 elo. (Having smaller
error margins in a testing environment is a different thing of course).
3. I will provide live games from time to time but not necessarily the day an engine is released ...
With 110 opening positions and a O20/H30/C50 distribution the individual comparison of 220 games becomes
interesting to a certain extend. It should not be stretched to much, but 220 games can be decisive in
some cases.
In the rating section I offer three new lists sorted by closed, half open and open openings. This is just an
experiment to show some interesting trends. It should not be taken too serious as the number of games as well
as the number of used openings is limited. Nonetheless, in some cases it might give a hint what can happen if
a certain opening distribution is used (here and somewhere else). It shows too, that there is no 'right' or
'wrong' set of openings. Everything is correct and the observer has to ask if a particular set-up suits his
needs!
This is a free service. If you don't like it please have a look at some of the other excellent lists. There
are plenty available to satisfy anyone!
Bye
Ingo