Digest
This week in open AI: on-device inference meets agentic breakthroughs
Sparse experts land on smartphones, web agents learn from live sites, and uncensored fine-tunes proliferate—while Anthropic files to go public and Nvidia preps 120B-parameter laptops.
ByAlex Sokoloff·
Two trends collided this week: pushing frontier-scale models onto consumer hardware and teaching agents to navigate messier, real-world environments. Nvidia and Liquid AI both announced architectures that fit double-digit-billion-parameter models into phone and laptop memory, while researchers demonstrated that reinforcement learning on live websites and fairness-aware policy synthesis can unlock new agent capabilities. Meanwhile, the open fine-tune ecosystem continued its rapid expansion.
In this digest
- OpenWebRL-4B reaches 67% success on live-web tasks with minimal supervised data — Training web agents on real sites with only 400 seed examples just hit 67% success—online RL may finally beat the sim-to-real gap.