On the increase of the K-factor (UPDATE)
Since FIDE announced her own concerns about the planned increase of the K-factor, there is suddenly a hot debate around the subject. It seems as if supporters like GM Bartlomiej Macieja and opponents like GM John Nunn both have strong arguments. This might be confusing for anyone who never really put any thought into the subject, but even if you did, it is not surprising. In a guest post, statistician Daan Zult sheds some light on the discussion. Updated with another Macieja piece.By Daan ZultLast week FIDE expressed her concerns about the effects of the new K-Factor which will be in order as from July 1st, and decided to publish two parallel rating lists for a year and then review the results of the different K-factors in effect. GM Bartlomiej Macieja then made strong appeal to not delay the decision but increase the K-factor immediately. Subsequently FIDE sent a reaction which was followed by another reply by Macieja, all to be found in this article. Meanwhile, GM John Nunn expressed a different point of view, saying "there seems no real evidence that K=20 will result in a more accurate rating system, while there are a number of risks and disadvantages."I have been working with Elo’s model as a statistician for psychological purposes and I have been confronted with choosing a correct K-factor many times. The choice is always one between accuracy and adjustment speed: when you give the K-factor a low value, ratings will change slowly and fluctuate less, and when you give the K-factor a high value, ratings will adjust faster and fluctuate more. This can be a good thing if a chess player stops making progress in real chess strength. For instance, every chess player probably knows a player who was quite talented as a junior, but for some reason stopped making progress at a certain point. A stop in progress should result in a rating that stabilizes around a certain point, and not in a rating that makes crazy fluctuations. The choice of the K-factor is therefore both a choice between two evils and between two goods, like a glass can be half full or half empty. To shed some light on the current discussion, I will discuss some arguments that I encountered recently.
GM John Nunn

GM Bartlomiej Macieja
Meanwhile we also received another piece by Macieja:
The K-factor - here comes the proof!I couldn't believe my eyes when I read GM John Nunn's opinion: "The K-factor and the frequency of rating lists are unrelated to one another. Rating change depends on the number of games you have played. If you have played 40 games in 6 months, it doesn't make any difference whether FIDE publishes one rating list at the end of six months or one every day; you've still played the same number of games and the change in your rating should be the same.".It does make a significant difference how often rating lists are published. To understand this effect it is enough to imagine a player rated 2500 playing one tournament a month. With 2 rating lists published yearly, if he wins 10 points in every tournament, his rating after half a year will be 2500+6*10=2560. If rating lists are published 4 times a year, after 3 months his rating becomes 2500+3*10=2530 so it gets more difficult for him to gain rating points in further tournaments. After 3 more tournaments the player reaches the final rating of only about 2500+3*10+3*6=2548. With 6 rating lists published yearly, the final rating of the player (after half a year) is only about 2500+2*10+2*7+2*5=2544. Obviously it is only an approximation, the exact values may slightly differ, however the effect is clear. The rating change, contrary to GM John Nunn's opinion, is not the same. And that's what I meant by: "The higher frequency of publishing rating lists reduces the effective value of the K-factor, thus the value of the K-factor needs to be increased in order not to make significant changes in the whole rating system.".There are many possible ways to establish the correct value of the K-factor. For sure the following approach desires attention: Let's imagine 2 players with different initial ratings, let's say 2500 and 2600, achieving exactly the same results against exactly the same opponents for a year. The main idea of the ELO system is that if two players do participate in tournaments and show exactly the same results, their ratings should be the same. You can think about it also as "forgetting about very old results". Please note that it is far not the same approach as used in many other sports, for instance in tennis. In the ELO system, if a player doesn't participate in tournaments, his rating doesn't change (I don't want to discuss now if it is correct or not). But if he does, there is no reason why his rating should be different from the rating of another player achieving exactly the same results against exactly the same opponents.With one rating list published yearly, as was initially done by FIDE, the value of at least K=700/N was needed to reach the goal. As the majority of professional players play more than 70 rated games per year, the value of K=10 would play its role. However, with more rating lists published yearly, the initially higher rated player will always have higher rating than his initially lower rated colleague (if both achieve exactly the same results), unless the K-factor is extremely high. For this reason it is better to ask the question, which value of the K-factor will reduce the initial difference of ratings by 100 (for instance from 100 points to only 1 point) in a year?In a good approximation, the answer is K = (m*700/N)*[1-(0,01)(1/m)], where m is the number of lists published per year. For N=80 (suggestion of GM John Nunn), we get: if m=2 -> K should be 16, if m=4 -> K should be 24, if m=6 -> K should be 28. Otherwise, an initially higher rated player may still have a higher rating a year later even if he was achieving worse results than an initially lower rated player. It would not only be strange, but also unfair, as for many competitions, including the World Championship Cycle, the participants are qualified by rating. Please note, that if N is lower, the K-factor should be even bigger.Some people suggest that 12 months in a row of showing identical results may still not be enough to consider 2 players to be equally strong (or, to be more precise, to have their initial rating difference reduces by 100). Let's calculate which value of the K-factor will reduce the initial difference of ratings by 100 in 2 years. For N=80 (160 games in 2 years) we get K=17, for N=70 (140 games in 2 years) we get K=19.I believe that out of last 100 games (it is even more than professor ELO recommended) a sound judgement can be made. It means, that the value of the K-factor accepted in Dresden during the General Assembly (K=20) was a wise choice.Best regards Bartlomiej Macieja 3rd of May 2009