Upgrade to Chess.com Premium!

Ratings Comparison: Chess.com Blitz v USCF OTB

(Personal opinion below, and does not necessarily represent the views of chess.com management or anyone else with authority or insight.)

Hardly a day goes by in Live Chess Main Hall chat without at least one member asking some form of the following question: "How does my (insert rating number here) Chess.com Blitz rating compare to an over-the-board or official USCF rating?"

Good question. The unsatisfying but truthful answer is that since the player pools, ratings formulas, and playing conditions are not the same, no universal and direct comparison is fair, even if possible. Nevertheless, we all know that strong players in the over-the-board world usually translate to strong players here at Chess.com as well. So, is there anything at all that can be said quantitatively?

Some years back, a not-to-be-named competitor conducted a survey amongst its players to try to put a numerical spin on this off-asked-but-seldom-satisfyingly-answered question. Results of that survey indicated that - on average - online Blitz ratings were roughly 100 points higher than USCF ratings.

So, the $64K question is this: Is that also true here at Chess.com?

In search of an answer, I recently gathered some data from the very wonderful chess.com Live Chess players statistics database here - one of the more esoteric benefits of my new-acquired Diamond premium membership. The chart below illustrates the results of the analysis of these data and makes it possible to infer - albeit loosely - some interesting findings:

 

 

These data represent an instantaneous snapshot, taken over Dec/16-17/2010, of the Chess.com Blitz ratings for a sampling of Chess.com members in the 1500-1699 rating cohort who also has a claimed USCF OTB (over-the-board) rating - roughly 120+ members. The mean Chess.com Blitz rating of these players was 1592, whilst their mean USCF rating was 1665. This difference of 73 ratings points was statistically significant. The three players whose Chess.com Blitz rating and USCF rating differed by more than 800 points were not included in determining those mean values. On the face of it - though this sampling is quite small compared to membership counts here or the survey counts 'there' - we seem to be seeing a similar phenomenon: online Blitz ratings generally, if only slightly, understate OTB strength.

Another possible inference from these data: Although you can expect on average that your next Chess.com Blitz challenge against a 1500-to-1699 rated player will be with someone roughly comparable to that rating in OTB, the range of strengths of possible opponents is extremely wide - ranging from below 1000 USCF-strength to well above 2000 USCF-strength. See the figure below.

 

 

One can easily see that many of the claimed USCF ratings are indeed within the study's range for Chess.com ratings (roughly 1500 to 1700), but even more of them are in the next higher rated bin from 1700 to 1900!

Some disclaimers should be mentioned straightaway. The sample here is relatively small and only covers a 200-point rating range (though this range brackets the usual adult mean USCF rating average). It assumes that the claimed USCF rating is in fact both real and represents current OTB strength. No minimum number of Chess.com Blitz games was required for inclusion.

In closing, it's doubtful that this little study will answer the $64K question for all time or for all members, but it's interesting nonetheless, I believe. It leads me to conclude, on at least a semi-quantitative basis, that there is a reasonable correlation between and not a huge difference between average ratings and strength here in Blitz and those in USCF OTB chess - at least in the heartcut range of 1500-1700 rating - though members' Chess.com ratings may slightly understate USCF-asserted playing strength. The closer one gets to 1200, the more likely the provisional starting rating will bias findings, I suspect. The closer one gets to the top end of the ratings the less data available and the more suspect any conclusions will be, from a sheer insufficient sampling point of view.

Hey, it was fun to do the study, whether it means anything or not. 

Comments


  • 5 months ago

    magfirefox

    Well, my USCF Quick Chess rating is 2226, and the best I can do on Chess.com in 3 minute games is about 1750 to 1775.  I have the same issue with ICC where my 3 minute rating is about 1850.

  • 9 months ago

    Ziryab

    I post my peak USCF rating, which is reasonably current as it has changed (gone up) four times since January 2010. Once, after my rating dropped from my peak through several events, it equalled my peak once more. I am currently 12 below my peak reached in early summer. 1982-1970=12. My current blitz rating here is substantially lower than my current USCF, but it was slightly higher when this study was conducted.

  • 12 months ago

    SouL_109

    Thank you for this interesting study :)

    @FrankEngel (and the others) : this doesn't change anything unless you consider that for some reasons people who report uscf ratings tend to be less concentrate that players who don't (seems quite weird)

  • 12 months ago

    DrawMaster

    Yes, there are many whose Blitz is higher ... and my little study only covers a short range. At the extremes (where Eilysium resides), the data are sparce and correlations much poorer, I'm sure.

  • 12 months ago

    Elubas

    Of course, there are blitz players like FM Eilyisum, who actually have a blitz rating higher than their USCF rating.

  • 19 months ago

    FrankEngelGambit

    When I play chess online, I'm usually watching my two little kids...how does that factor in? or answering the phone.Smile

    I often wonder that when I win a game what was going on at the other players house where he was playing.

    Thanks for sharing your research!

  • 22 months ago

    Ghostly14

    Great research!

  • 23 months ago

    DrawMaster

    You make a valid point, GooDWiLL - although I actually under-report my USCF rating. However, if you take your point as fact here and presume there will be a bias upward in the USCF numbers reported, we probably could still make the point that the two ratings compare reasonably well - which is probably the key question most ask. And, indeed, I think trying to imply that we can quantitatively estimate with great precision any difference between the two might be a stretch.

    Thanks for commenting ...

  • 23 months ago

    GooD_WiLL

    DrawMaster my new friend!

    This was a topic I was very interested in, so I eventually found this thread after some discussion in the main hall.

    Nice statistical analysis, but have you considered the fact that maybe the reason you are observing these differences is because individuals tend to over-report their actual ratings?  My career is in experimental psychology, and over-reporting is actually quite commonly documented.  People tend to want to give off a good impression, and this holds even when they are told that they will remain anonymous because they have to convey that impression to themselves (strange right?).  Reply back and let me know what you think.  Maybe there were some updates.

  • 2 years ago

    DrawMaster

    Good point, deepak, but there are VERY few FIDE ratings listed by members, nearly none in the 1500 to 1700 area.

  • 2 years ago

    deepak19921993

    ya very graet job,,but it will be more better if u compare with fide rating,,,,,still very good post

  • 2 years ago

    Fire_PenguiN

    Great job.

  • 2 years ago

    Hydra

    Fantastic Post, very detailed and informative :)

  • 2 years ago

    RabbitCold

    My OTB is much higher than my standard on here...but that is b/c I am usually focused OTB. Here I can play late at night or unfocused. Great article!

  • 2 years ago

    DrawMaster

    The difference is significant when either conducting a non-equal variance unpaired t-test OR a paired data t-test. In fact, it is MORE significant (i.e., lower p-value) for the paired test - which make great sense because the two ratings for a single player are obviously correlated, at least to some extent.

    Thanks for your insightful comment, Jpatrick.

    See data below (using StatPlus in Excel 2011 for Mac):

  • 2 years ago

    Jpatrick

    Seems to me like the best way to test this is to pair the data and test for Ruscf - Rchess.com equal or not equal zero.    I'd be a little surprised if you had to conclude that the difference was zero in view of the different formulas in place.

  • 2 years ago

    david1995

    Great blog! Thanks for sharing

Back to Top

Post your reply: