...surely you can hold something in your short term memory for longer than a week? ....
It is just a point of terminology, in psychology or neurology, that the "short-term memory" is much shorter than a week. It's about 15 minutes I think (But I'm wrong, according to Wikipedia - it's even shorter. https://en.wikipedia.org/wiki/Short-term_memory). If anyone remembers anything longer than this, then it's becoming a long-term memory. That is just the way psychologists use the term "short term memory".
I tried to read all the responses but I stopped about page 7. But I wanted to see if anyone had said "compression" or "machine learning" ("deep learning").
A few people wrote about mathematical rules that could be found to determine if a position is won. Basically the best hope for solving chess is to drastically (more than the square root) reduce the amount of actual thinking that has to be done, by finding some clear patterns.
For humans, the questions in chess concern games where the two sides are nearly equal. What if White simply blunders away a Queen on move 4? I hope we can prove Black would win. Then if we compare random moves to moves that are considered pretty good, we see there are many more awful moves than moves that seem close to winning. So when I picture the space of all games, I think of a large continent of games where some idiotic moves are made, and a much smaller area of games, the boundary, where all the moves are really decent.
To me, some hope lies in the thought that there are some clear patterns, based on material and position, so a computer could mathematically eliminate a great number of move choices, and then the game could somehow, someday be small enough to solve.
The obvious drawback in this approach is that it's very difficult to teach the poor computers to avoid miscategorizing bad moves and exceptional-looking positions. If Black gives away all but 1 piece and the result is something like a composed puzzle where Black has a forced smothered mate then this requires extra work to compute or detect.
Unfortunately in most positions with even material there are usually 2-7 or more interesting moves that don't just throw away material or positional value, so the combinatorial explosion is huge. Basically the only hope, given the numbers, is to find some strong improvement of the evaluation function. The basic idea might be to compress the tablebases drastically.
I think it's hopeful for the solution to ever be completed that computers are learning to improve their own software. Stockfish has a program called Fishtest that is used to evaluate whether changes to its evaluation function are valuable, using crowdsourced computing power, and if the wins are increased in a large sample of games then it is included in the software. This is a little bit like a "deep learning" approach I think. Chess is too complicated for humans to figure out all the patterns, but if we leave the real programming to computers, it could be another area where they excel humans.