Fly.io - Code And Let Live: Why AI Agents Need Persistent Cloud Computers, Not Ephemeral Sandboxes

2026-01-09 Rafe Colburn

Kurt Mackey argues that ephemeral sandboxes are the wrong approach for AI agents. Instead, agents work better with persistent environments where they can maintain context across sessions and avoid rebuilding their workspace repeatedly.

Simon Willison’s 2025 recap: reasoning models, agents, and the rise of coding CLIs

2026-01-01 Rafe Colburn

#llms #ai-agents #reasoning-models #coding-agents #developer-tools

Simon Willison’s 2025 recap maps the year’s biggest LLM shifts, from inference-scaled reasoning models and tool-using agents to the breakout of coding CLIs like Claude Code. It’s a dense, opinionated timeline linking major releases to what actually changed for developers and everyday workflows.

Sam Rose: Prompt caching explained

2025-12-30 Rafe Colburn

#llms #transformers #attention #optimization

A deep dive into how LLM prompt caching works under the hood, focusing on the transformer attention mechanism and the exact data providers reuse between requests. This is also one of the most accessible explanations of how LLMs work that I’ve encountered. The visuals are clear, and the step-by-step walkthrough is easy to follow. Via Simon Willison.

Robin Sloan: An app can be a home-cooked meal

2025-12-29 Rafe Colburn

#programming

Via kottke.org, Robin Sloan describes himself as the programming equivalent of a home cook. I’ve been working in professional kitchens for a long time, but lately I’ve rediscovered the joy of home cooking myself.

A better way to view Claude Code transcripts

2025-12-29 Rafe Colburn

#coding-agents #claude #tools

Simon Willison released a Python CLI tool that converts Claude Code sessions into shareable HTML pages with more detail than Claude Code itself provides, including hidden thinking traces.

Georgi Arnaudov: How I Think About Kubernetes

2025-12-28 Rafe Colburn

#kubernetes #infrastructure #devops

A compelling reframing of Kubernetes as ‘a runtime for declarative infrastructure with a type system’ rather than just a container orchestrator.

The State of AI Security

2025-12-22 Rafe Colburn

#llms #security

A reflection on Sander Schulhoff’s “AI security crisis” discussion, framing LLM and agent threats as social engineering at scale. It explains why simple-sounding mitigations like restricting access and capabilities are hard to implement in practice—and why agentic tools (especially browsers) demand extra caution.

Essentially Free Semantic Search

2025-11-30 Rafe Colburn

#embeddings #search #semantic-search

A practical walkthrough of adding semantic search to a Hugo site hosted on Cloudflare Pages while spending as close to $0 as possible. It covers the core pieces—embeddings, indexing, a vector database, and a small backend wrapper—plus notes from building it with Claude Code.

Hello World

2025-11-25 Rafe Colburn

#meta

Welcome to my tech blog.

Posts by Rafe Colburn