Learn

Engineering Articles

Real posts from engineering blogs — broken down for system design interviews, with a quiz to test what you learned.

CloudflareSeptember 26, 2025

How Cloudflare Cut Cold Starts 10x: From TLS Pre-Warming to Consistent Hash Sharding

Cloudflare Workers started with 5ms cold starts that were hidden behind TLS handshakes. As Workers grew to support full applications (10 MB scripts, 400ms startup budgets), cold starts outgrew TLS — and the original trick stopped working. This post covers both generations of their solution: the TLS SNI pre-warming trick and the consistent hash ring sharding system that ultimately cut eviction rates 10x and pushed warm request rates to 99.99%.

14 minAdvancedServerlessSystem DesignDistributed Systems
OpenAIJanuary 22, 2026

Scaling PostgreSQL to power 800 million ChatGPT users

OpenAI runs ChatGPT for 800 million users on a single-primary PostgreSQL instance with ~50 read replicas — no sharding. Over the past year, database load grew 10x. This post covers every optimization they made to keep it running: connection pooling, cache stampede prevention, workload isolation, rate limiting, and safe schema management.

12 minAdvancedDatabasesSystem DesignPostgreSQL
OpenAIJanuary 29, 2026

Inside OpenAI's In-House Data Agent: From Question to Insight in Minutes

OpenAI built a bespoke internal AI data agent that lets any employee — not just data engineers — go from natural language question to verified insight in minutes. The agent is powered by GPT-5.2, uses Codex to deeply understand table semantics from source code, retrieves context via RAG over 70k datasets (600 PB), and continuously self-improves through a layered memory system. The post breaks down its six-layer context architecture, conversational reasoning loop, eval-driven quality assurance, and key lessons in agent design.

14 minAdvancedAI AgentsSystem DesignData Engineering