Guys, did Google just crack the Alberta Plan? Continual learning during inference?

Y'all seeing this too???

https://arxiv.org/abs/2501.00663

in 2025 Rich Sutton really is vindicated with all his major talking points (like search time learning and RL reward functions) being the pivotal building blocks of AGI, huh?