The week ahead: 20250915
-
[Blog] First glimpse into the work of Thinking Machines Lab. Inference systems can be made deterministic by using “batch-invariant” GPU kernels (GitHub implementation).
-
[Paper] Alternative sampling strategies are shown to be conceptually a better fit if using a text space LM on numerical regression tasks.
-
[Blog] Post-training primer.
-
[Blog] Anthropic’s new blog shows substantial gains from simply using Claude to optimize MCP servers.
-
[HuggingFace] 4B model trained on Obsidian-style markdown operations pairs performance with much larger models on memory tasks - and can be used as an MCP server to a bigger model.
-
[GitHub] Crew AI - YAAF (Yet Another Agent Framework)
-
[GitHub] ROMA - YAAF (Yet Another Agent Framework)
-
[GitHub] term.everything - Run GUI apps in terminal.
Have a great week!
Comments