The week ahead: 20251229

December 29, 2025

Industry

[Blog] Karpathy’s EOY review. Verifiable rewards turned out to be a better deal for a fixed compute budget, which led to all sorts of philosophical questions this year.
[Company] MVP Studio. Builds your MVP for you.
[WeChat] The business logic of medical data.

Paper

[Paper] ReaSeq. Recommendation systems based on in-log IDs have two limitations: (1) the noise and sparsity of co-occurrence statistical signals and (2) the invisibility of user behavior outside of the logs. This work uses an LLM reasoning setup over the logs to uncover user demands and product attributes, addressing the first limitation, and a diffusion-based approach to predict invisible user actions, compensating for the second.
[Paper] JustRL. Simple, single-stage RL training without any so-called fancy tricks. The approach is shown to be transferable between models, uses less compute, and matches SOTA on math reasoning tasks.
[Paper] Estimating human body pose from Wi-Fi signals. Relevant GitHub implementation.

Code

Have a great week!
