Blog

Field notes from inside the builds.

What we're learning about voice agents, evals, multi-tenant platforms, and the infrastructure that holds them together.

Latest · FrontierJul 22, 2026 · 4 min

OpenAI and Hugging Face disclosed a model evaluation breach — here's what the supply chain risk actually looks like

OpenAI and Hugging Face published joint disclosure of a security incident during model evaluation. An attacker uploaded a malicious model that attempted to extract API keys when OpenAI's eval pipeline downloaded it.

Read full piece

StackJul 21, 2026

Cloudflare just shipped Internal DNS for private networks — here's what it means for agent infrastructure

Cloudflare Internal DNS brings authoritative and recursive DNS for private networks to the same global network that runs Zero Trust. For agentic systems that coordinate across VPCs, it's the missing piece.

4 min readRead

FrontierJul 15, 2026

Codex hit 7M users while Claude Code went silent — here's what the usage gap tells us

OpenAI's Codex added 1M users in roughly 24 hours and now sits at 7M total. Anthropic hasn't reported Claude Code numbers since launch. The silence is the signal.

OpenAI and Hugging Face disclosed a model evaluation breach — here's what the supply chain risk actually looks like

Cloudflare just shipped Internal DNS for private networks — here's what it means for agent infrastructure

Codex hit 7M users while Claude Code went silent — here's what the usage gap tells us

GPT-5.6 Sol Ultra reportedly solved a 50-year-old math problem in under an hour — here's what the proof actually shows

Cloudflare just shipped Meerkat: a global consensus service built on QuePaxa — here's what it means for agent coordination

Claude Opus 4.8 invents tool arguments that don't exist — Armin Ronacher reports the newer model is worse at schema adherence than older versions

Mistral's Leanstral 1.5 found 5 real bugs scanning open-source repos — here's what formal verification in production looks like

Cloudflare just shipped a monetization gateway for the agentic internet — here's what x402 means for AI traffic

Commerce lifted export controls on Fable 5 and Mythos 5 — here's what the block actually was

Ornith-1.0 just shipped: a self-scaffolding coding model that builds its own tooling — here's what makes it different

Cloudflare just shipped 60-minute ephemeral accounts — here's what it means for agent tooling

AWS just shipped Context and Continuum — two services that tackle the same agent problem from opposite ends

Cloudflare just opened the Agents SDK to any framework — here's what Flue brings to the runtime

Cloudflare just shipped DNS routing to private origins — no public IPs, no extra connectors

Anthropic's Fable 5 ships with a clause letting it degrade service to competitors — here's what that means

Latent.space just dropped FrontierCode, a benchmark for code quality over slop — here's what it measures

Uber capped Claude Code usage after blowing four months of AI budget — here's what that means for enterprise rollout

Cloudflare cut core server boot time from 4 hours to minutes by fixing UEFI timeouts — here's the diff

SQLite shipped an AGENTS.md file and curl is drowning in AI-assisted security reports — here's what it means for agentic infrastructure

The Pope just published an encyclical on AI ethics — and it reads like Anthropic's Constitutional AI doc

Google I/O 2026: Gemini 3.5 Flash, Omni, and Spark — here's what shipped

Abridge just hit 100M doctor visits and cut prior auth from days to minutes — here's what production healthcare AI actually looks like

GitLab just announced a 30% country reduction for "the agentic era" — here's what the math actually says

Mozilla used a Claude preview to harden Firefox. The numbers are worth looking at.

Mozilla used Claude Mythos Preview to find hundreds of Firefox vulnerabilities — here's what changed

Anthropic's Claude Code team just published a case for HTML over Markdown — here's why it matters for production tooling

Versioned filesystems for agent sandboxes: a quick note on Tilde.run

Anthropic launched a finance-agent suite. What does it actually mean?

OpenAI's voice latency write-up: a four-layer read for production deployments

Codex /goal, the OpenClaw drama, and what coding agents look like now

Cloudflare just made agents first-class customers

The six-layer agentic stack we deploy for SMBs

OpenAI on AWS Bedrock, one day after the Microsoft split

Mistral Medium 3.5 ships remote agents — a quick note on why we won't route production through them

Evals on day zero

Migrating Goldie from Retell to ElevenLabs in four days

A Sunday email from the workshop.

Tell us what you'd like built.