devtake.dev
Topic

Agentic coding

Every major lab now ships a coding-first agent. Anthropic has Claude Code and Routines; OpenAI is pushing Codex into the OS; Google and Apple are racing to catch up. We track the shipping features, the model-vs-model benchmark fights, and the practical question underneath it all: which tasks you can actually hand off today, and which ones still need a human in the loop.

45 articles in this topic

Gemini Intelligence interface on an Android phone
Android

Gemini Intelligence turns Android 17 into an agent that drives your apps

Google's Android Show pitched Gemini Intelligence and AppFunctions, an MCP-style way for the assistant to call functions inside your apps. Here's how it works and what to watch.

A MacBook Pro beside a Surface Book, both open on a white surface, USB-C ports in view
AI

Running a coding agent fully on Apple Silicon, no cloud, is now an off-the-shelf stack

A popular Hacker News how-to walked through a fully local coding agent on Apple Silicon. Here's the realistic 2026 stack: runner, model, and harness.

Abstract cybersecurity illustration of a glowing padlock over a circuit board, representing data protection
AI

OpenAI added a Lockdown Mode to ChatGPT to blunt prompt-injection attacks

OpenAI shipped Lockdown Mode in ChatGPT to cut off the data-exfiltration step of prompt-injection attacks. Here's what it actually restricts and who should turn it on.

Rows of server racks inside a data center, the kind of infrastructure that runs Starlette-based AI agent endpoints
Security

One bad Host header bypassed auth in Starlette, the routing core under millions of AI agents

A flaw in Starlette, downloaded 325M times a week, let a single Host-header character bypass path-based auth across FastAPI, vLLM, and MCP servers.

OpenAI's Codex branding over a code background, illustrating Codex expanding across the ChatGPT app.
AI

OpenAI is putting Codex in every ChatGPT app, with six business plugins for non-coders

On June 2 OpenAI said Codex is coming to the ChatGPT app everywhere within weeks, and shipped six role-specific plugins for sales, analytics, design, and finance teams.

Anthropic's announcement artwork for Claude Opus 4.8, a soft gradient panel with the Claude wordmark.
AI

Claude Opus 4.8 flags the bugs it writes four times more often than Opus 4.7

Anthropic's Opus 4.8 posts 69.2% on SWE-Bench Pro, lets code flaws slip 4x less often, and ships parallel subagents in Claude Code. Here's what matters.

A developer's Emacs session in a Linux terminal, editing C source alongside a shell
AI

Hacker News is obsessed with durable Postgres workflows and a game about clicking yes

Six dev-tooling and AI posts that climbed Hacker News in late May 2026: durable execution on plain Postgres, LLM code smells, a permission-fatigue game, Rust 1.96, and more.

A software engineer at a laptop, the kind of AI-assisted coding workflow whose token costs blew through Uber's annual budget.
AI

Uber blew its entire 2026 AI coding budget in four months. Its COO can't prove it paid off.

Uber exhausted its full-year Claude Code budget by April. Adoption hit 84%, heavy users burn $2,000 a month, and COO Andrew Macdonald can't connect the spend to shipped features.

Aisle of dense server racks inside the CERN Computer Center
Web

Cloudflare rebuilt Browser Run on its own Containers. Concurrency went from 30 to 120.

Cloudflare moved Browser Run off shared isolation infrastructure on May 13. The agentic-coding crowd gets four times the headless-Chrome ceiling and half the latency.

Microsoft building exterior sign on a clear day.
AI

Microsoft is canceling Claude Code for its engineers. They have until June 30 to switch to Copilot CLI.

Internal Claude Code licenses end June 30, 2026, for Microsoft's Experiences + Devices group. Engineers move to GitHub Copilot CLI instead.

Portrait of Andrej Karpathy, whose January 26 X thread on agentic coding was distilled into the viral CLAUDE.md file.
AI

Karpathy posted four notes about Claude Code. The CLAUDE.md they spawned has 110K GitHub stars.

Forrest Chang turned Andrej Karpathy's January coding thread into a 70-line CLAUDE.md. It now has 110,000+ stars and has trended on GitHub for 28 weeks.

An Alibaba booth at a Chinese technology trade expo, with the company's logo above a display floor.
Hardware

Alibaba's new Zhenwu M890 chip is 3x faster and aimed straight at agent workloads

Alibaba showed the Zhenwu M890 at its Cloud Summit on May 19. 144 GB of memory, 800 GB/s interchip bandwidth, and Qwen3.7-Max riding on top.

An illustration of the Claude Code deeplink vulnerability, showing a malicious URL handler triggering a shell prompt.
Security

A bad command-line parser turned every claude-cli:// link into a remote shell

Joernchen of 0day.click found a deeplink RCE in Claude Code. Anthropic shipped the fix in 2.1.118 the same week.

OpenAI's Codex inside the ChatGPT mobile app, showing a Codex review on a phone screen.
AI

OpenAI's Codex moved into the ChatGPT mobile app. You can approve a diff from the train now.

OpenAI shipped Codex remote control inside the ChatGPT app for iPhone, iPad, and Android on May 14. Pair via QR; the agent runs on your laptop, the review moves to your phone.

GitHub Open Graph card for oven-sh/bun pull request #30412, the Rust rewrite merge.
Open Source

Bun's million-line Rust rewrite is now mainline. 99.8% of tests pass and 13,000 unsafe blocks remain.

Jarred Sumner merged the Bun-in-Rust PR on May 14, ending Zig as Bun's runtime language. Binary shrinks 3-8 MB; one analysis counted 13,000 unsafe blocks.

Anthropic Object Store opengraph illustration in clay tones
AI

Anthropic shipped Claude for Small Business with 15 prebuilt agents. Daniela Amodei is pitching the corner-store owner.

Anthropic announced Claude for Small Business on May 13 with QuickBooks, HubSpot, Canva, and DocuSign hooks. The pitch: 15 ready-to-run agents and a 10-city tour.

Google Googlebook laptop promotional thumbnail showing the device and Gemini branding
AI

Google's Magic Pointer turns the cursor into a Gemini prompt. The first Googlebooks ship this fall.

Google announced Googlebook on May 12: a premium laptop tier above Chromebook, with a Gemini-aware cursor called Magic Pointer. Acer, ASUS, Dell, HP, and Lenovo are in.

Airbnb office building exterior
AI

Airbnb says AI writes 60% of its new code. Nobody has explained what that means.

Brian Chesky dropped the 60% figure on an earnings call without defining how Airbnb measures it. Google claims 75%. The independent average is 27%.

GitLab Act 2 blog post header graphic
Web

GitLab is cutting staff and killing its CREDIT values. The CEO calls it 'Act 2.'

CEO Bill Staples announced a restructuring he frames around agentic AI, retiring GitLab's six core values for three new operating principles. Exact layoff numbers come June 2.

Apple AirPods Pro shown in their charging case, the current generation design that the camera-equipped model will build on
Apple

Apple delayed its camera AirPods because Siri wasn't ready. Gurman says they're back in late testing.

Bloomberg reports Apple's infrared-camera AirPods have reached advanced testing. The earbuds feed visual context to Siri but can't take photos or video.

Illustration representing DOGE and government technology
Policy

A judge killed DOGE's grant purge. The 'review process' was asking ChatGPT 'Is this DEI?'

A federal judge restored $100M+ in grants after two DOGE staffers used ChatGPT to flag 97% of NEH grants as DEI, including an HVAC repair and Holocaust research.

Abstract visualization of data exposure through code
Security

380,000 vibe-coded apps are sitting on the open web. 5,000 of them are leaking real data.

RedAccess found that AI coding tools like Lovable, Base44, and Replit default to public hosting, leaving medical records, bank internals, and corporate secrets indexed by Google.

Illustration of Cloudflare layoffs with company logo and downward trend
Web

Cloudflare cut 1,100 jobs on its best earnings day. Revenue grew 34% and the stock dropped 18%.

Cloudflare laid off 20% of its workforce on May 7 while reporting record Q1 revenue of $639.8M. The stock dropped 18% after hours.

Cartoon Claude Code terminal flexing two muscular arms against a terracotta background
AI

Anthropic doubled Claude Code's limits by renting 220,000 GPUs from xAI

Anthropic doubled Claude Code's 5-hour limits, killed peak-hours throttling, and raised Opus API tiers. The capacity comes from xAI's Colossus 1, via a SpaceX deal.

Illustration of a Git commit message stamped with a Copilot co-author trailer.
Web

VS Code shipped 'Co-Authored-by Copilot' on every commit by default. Microsoft is reverting it.

A two-line PR flipped the AI co-author flag from off to all in April. Hand-typed commits started getting Copilot attribution. The maintainer apologized and promised a fix in 1.119.

Stylized GitHub Copilot mascot melting into glowing puddles in front of a wall of flames, a visual metaphor for the steep multiplier hike on annual plans.
AI

GitHub Copilot's Claude Opus multiplier jumps to 27x on June 1. Monthly plans dodge the hike.

GitHub's new model multiplier table for Copilot Pro and Pro+ annual plans lands June 1. Opus 4.6 goes 3 to 27. Sonnet 4.6 goes 1 to 9.

DHS senior official Kristie Canegallo presenting awards at the CISA Annual Award Ceremony in Arlington, Virginia.
Security

Five Eyes intel agencies publish first joint agentic AI security guide. Their advice: slow down.

CISA, NSA, GCHQ, ASD, CSE and NCSC-NZ jointly tell organizations agentic AI isn't ready for fast rollout. The 23-page guide names five risk categories.

Architecture diagram from Cloudflare's Dynamic Workflows launch post, showing a host Worker dispatching durable execution to per-tenant Workers.
Web

Cloudflare shipped Dynamic Workflows. Multi-tenant agent platforms finally get durable per-tenant code.

A 300-line MIT library lets one Worker route durable execution to every tenant's own workflow. The piece Cloudflare's Agents Week was missing.

Title card for Boris Cherny's 'Mastering Claude Code in 30 Minutes' Anthropic workshop talk.
AI

Anthropic just dropped its Claude Code workshop tapes. The playbook is better than the marketing.

Boris Cherny on Claude Code, Applied AI on prompting, Erik Schluntz on vibe coding in prod. Three Code with Claude tapes hit YouTube ahead of the 2026 conference.

The Zed 1.0 launch graphic in dark mode with the Zed wordmark and a stylized cursor.
Open Source

Zed 1.0 ships its agentic editor. The Atom team's Rust rewrite finally has a stable label.

Zed Industries shipped 1.0 on April 29 after five years of Rust and GPU work. Free forever for humans, with $10/month hosted AI and an open Agent Client Protocol.

Warp terminal product screenshot from the company's website.
Open Source

Warp's terminal is now open source. The cloud agent platform Oz is the actual product.

Warp released its 36k-star Rust client on GitHub under AGPLv3 on April 28. OpenAI is the founding sponsor and Oz keeps the bills paid.

AWS marketing illustration of an interconnected machine-learning workflow.
AI

OpenAI's models are on AWS Bedrock the day after Microsoft lost exclusivity

Amazon shipped Bedrock Managed Agents powered by OpenAI on April 28, plus Codex on Bedrock. Altman tells Stratechery the runtime matters as much as the model.

Anthropic Claude generic brand graphic shown in promotional material for enterprise customers.
AI

Disney built an AI leaderboard. One employee called Claude 460,000 times in nine days.

Leaked internal Disney screenshots show 4,800 product and tech staff burning 3.1 billion Claude tokens and 13.3 billion Cursor tokens across nine April workdays.

GitHub Octocat mark on a dark gradient, the cover graphic on the GitHub Blog post announcing the Copilot billing change.
AI

GitHub Copilot kills premium requests on June 1. Token billing arrives, fallback models do not.

On June 1 every Copilot plan switches to GitHub AI Credits priced per token. Code completions stay free. Fallback models and credit rollover do not.

Illustration of an AI-driven chip design process from IEEE Spectrum's coverage.
AI

An AI agent built a working RISC-V CPU from a 219-word prompt in 12 hours. Here's what it actually did.

Verkor's Design Conductor agent went from a 219-word spec to a tape-out-ready RISC-V core called VerCore in 12 hours. The catch: it's still a Celeron.

Anthropic Engineering postmortem cover image.
AI

Anthropic admits three Claude Code bugs quietly tanked quality for six weeks

Anthropic's April 23 postmortem names three bugs that degraded Claude Code between March 4 and April 20. Usage limits are being reset for every subscriber.

OpenAI's GPT-5.5 model launch with ChatGPT and Codex interfaces
AI

OpenAI shipped GPT-5.5 seven weeks after 5.4. API tokens now cost twice as much.

OpenAI released GPT-5.5 (codename Spud) on April 23. The API runs at $5/$30 per million tokens, double GPT-5.4, with Pro at $30/$180.

OpenAI workspace agents launch graphic
AI

OpenAI's Workspace Agents kill Custom GPTs and take the fight straight to Claude Code

Workspace Agents for ChatGPT Business, Enterprise, Edu, plus Teachers launched April 22. Team-shared, cloud-run, Codex-powered. Free until May 6, then credit-based.

GitHub Copilot announcement cover graphic
AI

GitHub Copilot paused new signups and kicked Opus out of Pro. Here's what actually changed.

GitHub froze Copilot Pro/Pro+/Student signups on April 20 and moved Claude Opus 4.7 behind the $39 Pro+ tier. Agent workflows broke the old math.

Anysphere founder Michael Truell, the CEO behind the Cursor AI code editor
AI

Cursor wants $50B for an AI editor that's burning cash on individuals

Cursor is in talks to raise $2B at a $50B valuation, nearly double its September mark. Revenue is up, but it's still losing money per indie seat.

Screenshot of the updated OpenAI Codex Mac app with background computer-use panel
AI

OpenAI's Codex now drives your Mac, not just your code

OpenAI shipped a Codex update that can pilot desktop apps with a cursor, generate images in-line, and run parallel agents. It's the opening move in a real Claude Code fight.

Siri icon on an iPhone display showing Apple's AI assistant interface
Apple

Apple is sending 200 Siri engineers to an AI coding bootcamp. That tells you everything.

A report from The Information reveals Apple is retraining Siri staff on AI tools like Claude Code. With WWDC two months away, Apple's AI gap has never been more visible.

Google Gemini app running on a Mac desktop showing the mini chat interface
AI

Google Gemini finally has a Mac app, and it's gunning for ChatGPT's desktop lead

Google shipped a native Swift Gemini app for macOS with screen sharing, voice, and Deep Research. Here's what it does, what it doesn't, and how it stacks up.

Adobe Firefly expansion announcement social image from Adobe's blog
AI

Adobe's Firefly AI Assistant can now drive Photoshop, Premiere, and Lightroom for you

Adobe renamed Project Moonlight to Firefly AI Assistant and opened a public beta. It runs multi-step workflows across Photoshop, Premiere, Lightroom, and more.

Claude wordmark on Anthropic's introducing-Routines announcement
AI

Claude Code Routines: what they actually do, and when to use them over GitHub Actions

Anthropic just shipped Routines: Claude Code sessions as cron jobs, webhooks, and GitHub-event reactors. Here's what they replace, what they don't, and one rule to follow.
