The week ahead: 20251006
[Blog] Thinking Machines Lab: LoRA is better to be applied to all weights, including MLP and MoE. On RL, LoRA performs on par with full training even with...
Every Monday, I publish a brief note that captures my top of mind. These could include:
[Blog] Thinking Machines Lab: LoRA is better to be applied to all weights, including MLP and MoE. On RL, LoRA performs on par with full training even with...
[Announcement] ChatGPT Pulse. ChatGPT can now proactively push personalized content.
[Blog][Tweet] Gemini (10/12) and GPT (12/12) compete at ICPC.
[Blog] First glimpse into the work of Thinking Machines Lab. Inference systems can be made deterministic by using “batch-invariant” GPU kernels (GitHub im...
In NYC for Gemini summit.
[Blog] GPT-oss from the ground up.
Traveling in Brazil this week to join the indigenous Xingu people for Quarup.
[Reuters] Perplexity makes $34.5B bid on Google’s Chrome.
[HuggingFace] OpenAI’s open-weight model. Technical report from OpenAI. Onboarding guide from HuggingFace.
[Blog] Utkarsh’s insights on the mathematical impossibility of some expectations of AI agents in 2025. TLDR: error compounding, token economics, tool engi...
[Report] Alphabet Q2 earnings. CapEx increases for the 2nd time in this year.
[Blog] Perplexity to launch its own browser, with rumors of OpenAI’s upcoming release of a similar product, signaling challenges to Google Chrome’s contro...