I just wrote a blog that covers this question of why puzzles often don't choose "challenging" responses. See Understanding soundness and motivations in chess puzzles, problems, and studies.
The underlying reason is that puzzle responses need to provoke unique winning moves, for the puzzle to work at all. The more "challenging" responses will likely allow you to win in more than one way, making the puzzle unsound. I used a clear-cut example in the post, but the OP puzzle also illustrates this. So according to Stockfish evaluation (depth 38), the more testing 24.Nxd7 enables two winning replies, 24...Nxd7 (-3.81) and 24...Nxe2+ (-2.82).
The same seems to be true of the chess lessons.
The demonstration of mating with bishop and knight here https://www.chess.com/terms/checkmate-chess?ncc=1#king-bishop-and-knight
starts from this (Black to play) position.
Black plays 1...Kf8 but 1...Kxh7 would have been much more challenging.
Edit: Don't bother looking - they fixed it at last.