Maths was never my strong suit but this looks like it should make sense.
However, I am not entirely sure how useful this method is for the chess side of things. In the words of two wise men:
"There are three kinds of lies: lies, damned lies, and statistics." - Mark Twain
"On the chessboard lies and hypocrisy do not survive long. The creative combination lays bare the presumption of lies; the merciless fact, culmination in checkmate, contradicts the hypocrites." - Emanuel Lasker
My point being, I don't think statistics are particularly relevant to chess, and only serve as a mildly-useful at best guide for ones search for the best move.
Better is to potz down the lines (with an engine, or with a keen eye) to see the type of different middlegame plans Black will undertake.
Let's compare the main lines:
It seems to me there is some potential to Black's ...Nc6/...Na5 forray - the bishop is quite important in many lines - but I would say I trust the ...Bb7/...d6/...Nd7 setup a little bit more on principle.
Ultimately it is up to you. Both are playable, and best way to find out is to experiment with both. Or just play 3... g6
I would like to get a better understanding of chess statistics and their meanings.
For example take this position:
Using the chess.com master games database, we want to figure out the best move for black (without resorting to any use of any engine).
In this position, at the master level, white wins 34.2%, 17.7% draw, 48.1% win for white.
There are 80 games in the chess.com master games database: https://www.chess.com/games/search?fen=rnbqkbnr/3p1ppp/p3p3/1p6/4P3/1BN2N2/PP3PPP/R1BQK2R%20b%20KQkq%20-%201%207
By far the most popular move is Bb7. There are 59 games in this position:
https://www.chess.com/games/search?fen=rn1qkbnr/1b1p1ppp/p3p3/1p6/4P3/1BN2N2/PP3PPP/R1BQK2R%20w%20KQkq%20-%202%208
White wins 32.2%, Draw 18.6%, Black wins 49.1%
The second most popular move is Nc6
There are 35 games in this position:
https://www.chess.com/games/search?fen=r1bqkbnr/3p1ppp/p1n1p3/1p6/4P3/1BN2N2/PP3PPP/R1BQK2R%20w%20KQkq%20-%202%208
White wins 28.6%, Draw 14.3%, Black wins 57.1%
There are also some other rare moves, but to avoid this exercise getting even more complicated I'll discount them.
So if you only look at the win rates the move Nc6 looks better, but that might be more to do with luck/chance than the strength of this particular move.
I did not do much statistics at school, but I read that you can find confidence levels based on different sample sizes, for example on this site there is an example for finding 95% confidence level of amount of time watching TV per week https://www.scribbr.com/statistics/confidence-interval/
Well chess is quite different from watching TV, but if we say a win for white is 1 point, a draw is 0.5 and a win for black is 0, then we can get mean values for each move.
So if the mean value is close to 0 that suggests the move is better for black and if it is closer to 1 that suggest the move is better for white.
This is an idea I came up with this morning and thought I should post to get opinions on whether it makes any sense or how it is flawed.