Jeremy Tregunna - Into the Stack

Orbit

The Airlock

Every system needs a front door, and the front door is where you decide what "allowed in" means before anything inside has to care. In Orbit that's the Airlock. Nothing enters a Constellation without passing it: the client proves who they are, the mission proves it&

Orbit

Many Schedulers, One Commit Loop

A scheduler is where an orchestrator earns its keep, and it's also where Borg admitted it had painted itself into a corner. The scheduling policy lived in one place, it grew over years to serve every workload Google had, and the paper is candid that the single scheduler

Orbit

The Firehose

The telemetry plane is the easy half of the state split to describe and the easy half to get quietly wrong. It carries everything the cluster observes about itself: usage, health, pressure, liveness, GPU temperature. It's high-volume, it's allowed to be stale, and the entire

Orbit

The Commitment Ledger

The Ephemeris is the part of Orbit I was most nervous about, because it's the part that has to be correct. Telemetry can drop a sample and shrug. The ledger cannot drop a placement, can't double-book a node, and can't disagree with itself

Orbit

Illegal States Won't Compile

Before Orbit can place a workload or reclaim a core, it has to know how to say what a workload is. That sounds like the boring part. It isn't. The shape of the object model is where Borg and Kubernetes both left scars, in opposite directions, and getting

Orbit

Two Planes, Not One Store

Last post I said the most important decision in Orbit is splitting cluster state into two planes, and then I made you wait for it. Here it is. If you only read one post in this series, read this one, because almost everything else is downstream of this single cut.

Orbit

Why I'm Building Another Orchestrator

Everyone tells you not to build your own cluster manager. They're mostly right. Kubernetes exists, it works, it used all over the place...better job prospects if you use K8s. Building another one is the kind of decision that gets you eyerolls in conversation. So let me undermine

The Log is the Database

foldb has one source of truth: an append-only, Raft-replicated log. Everything else, the storage layer, the indexes, the materialized query results, is just a fold(log). A pure function applied to a sequence of committed entries in order. A transaction commits when its entry is durable in the

Deep dive: TrueTime

Out of the gate, what is TrueTime? It's an algorithm used in Google Spanner, giving it the ability to do something no distributed database had done before...globally consistent transactions without a centralized timestamp authority. Not only is it a breakthrough algorithm for the time, it's

Thoughts: Building Tools that Serve, not Extract

Every software product pretends to be built for its users. But most are actually built on them... extracting value in the form of data, attention, or compliance. We see this everywhere: * CRMs that sales people hate but must use * Social platforms that drain users to sell to advertisers * Productivity tools

How to Lose Data on Purpose and Still Access It Later

Everyone tells you not to build your own distributed storage system. "Just use S3," they'll say. "Use Ceph. Use MinIO. Real engineers don't reinvent wheels." They're wrong, real engineers build whatever interests them, with expectations in reality. So let'

NUMA-Aware Allocation: Making Memory Local Again

I've been building a storage system on CXL-attached persistent memory, and one component that took more iteration than expected was the NUMA-aware allocator. The concept is simple--bind memory to specific NUMA nodes--but the implementation has some sharp edges that aren't obvious until