The week ahead: 20251006
[Blog] Thinking Machines Lab: LoRA works best when applied to all weights, including MLP and MoE. On RL, LoRA performs on par with full training even with...
Writing is an important way to organize one’s thoughts. If you can’t express something, you probably haven’t thought it through.
I write with two goals:
With these, I consider brevity a writer’s highest virtue.
[Announcement] ChatGPT Pulse. ChatGPT can now proactively push personalized content.
[Blog][Tweet] Gemini (10/12) and GPT (12/12) compete at ICPC.
[Blog] First glimpse into the work of Thinking Machines Lab. Inference systems can be made deterministic by using “batch-invariant” GPU kernels (GitHub im...
In NYC for Gemini summit.
[Blog] GPT-oss from the ground up.
Traveling in Brazil this week to join the indigenous Xingu people for Quarup.
[Reuters] Perplexity makes a $34.5B bid for Google’s Chrome.
[HuggingFace] OpenAI’s open-weight model. Technical report from OpenAI. Onboarding guide from HuggingFace.
[Blog] Utkarsh’s insights on the mathematical impossibility of some expectations of AI agents in 2025. TLDR: error compounding, token economics, tool engi...
A maniacal sense of urgency
[Report] Alphabet Q2 earnings. CapEx increases for the second time this year.
[Blog] Perplexity to launch its own browser, with rumors of OpenAI’s upcoming release of a similar product, signaling challenges to Google Chrome’s contro...