+10Sample: day-long coding tasks completed end to end in a public coding agent
A publicly available coding agent now completes multi-hour software tasks from a spec. Small teams can hand off whole features, not just snippets. What still breaks: Ambiguous specs still produce confidently wrong code.
Text milestone · primary · source 1 +5Sample: vision-grounded agents read screenshots and act on them in a shipped product
A public agent now reliably reads on-screen content and takes the corresponding action. Software help becomes 'do it for me', not 'tell me how'. What still breaks: Dense dashboards and tiny controls still confuse it.
Agents notable · primary · source 1 +5Sample: physics realism improves measurably on a tracked video benchmark
A shipped video model posted a clear gain on a public benchmark for motion and physics. Fewer uncanny artifacts in everyday generated clips. What still breaks: Object permanence across cuts is still weak.
Video notable · primary · source 1 +2Sample: studio-quality dubbing expands to ten more languages
An existing dubbing product added broad language coverage. Creators can localize content for new markets cheaply. What still breaks: Regional accents within a language remain hit-or-miss.
Audio minor · primary · source 1 0Sample: research demo shows ten-minute coherent video (not yet released)
A lab demoed long-form video generation with consistent characters. If it ships, long-form AI video stops being theoretical. What still breaks: It is a demo; nobody outside the lab can use it.
Video unverified - not yet publicly usable · primary · source 1 0Sample: reports of a real-time singing-voice translator (unconfirmed)
Newsletter chatter describes live translated singing in a consumer app. Would extend voice translation into music and performance. What still breaks: No primary source yet confirms it exists.
Audio unverified - pending primary verification · discovery-only · source 1