Many machines, one system

Distributed Systems

A distributed system is one where a machine you didn’t know existed can break yours. This track covers the fallacies you inherit the moment you cross a network, the fundamentals of consistency and scale, and the reusable patterns — sidecar, sharding, scatter-gather — that tame them.

Foundations

The hard truths — the fallacies of distributed computing, communication models, consistency and CAP, time and ordering, and consensus.

1 · Start here The Fallacies of Distributed Computing

The dangerous assumptions every distributed system must unlearn — why networks fail partially, clocks lie, and 'it worked on one machine' stops being true.

✦ Complete · ⏱ 5 min

2 · Beginner Communication and Consistency: CAP and the Models

What CAP really forces you to choose, and the spectrum of consistency models from eventual consistency to strict serializability, so you pick a consistency model on purpose, not by accident.

✦ Complete · ⏱ 5 min

3 · Intermediate Time and Ordering in Distributed Systems

Why wall-clock timestamps can't order events across machines, and how logical clocks, version vectors, and anti-entropy reason about order instead.

✦ Complete · ⏱ 5 min

4 · Advanced Consensus and Coordination

How nodes agree on one value despite failures — two-phase commit, Raft, Paxos, and the ownership-election primitives that make leaders safe.

✦ Complete · ⏱ 5 min
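
As a taste of the logical clocks named in the Time and Ordering topic, here is a minimal sketch of a Lamport clock. The class and method names (`LamportClock`, `tick`, `receive`) are illustrative assumptions, not taken from the track's pages; the rule itself is the standard one: increment on local events, and on receipt jump past the sender's timestamp.

```python
from dataclasses import dataclass


@dataclass
class LamportClock:
    """Logical clock: a counter that only ever moves forward."""
    time: int = 0

    def tick(self) -> int:
        # Local event or message send: advance the counter by one.
        self.time += 1
        return self.time

    def receive(self, sender_time: int) -> int:
        # Message receipt: move past everything the sender had seen.
        self.time = max(self.time, sender_time) + 1
        return self.time


a, b = LamportClock(), LamportClock()
sent = a.tick()              # a sends a message stamped 1
b.tick()
b.tick()                     # b performs two unrelated local events
received = b.receive(sent)   # max(2, 1) + 1 = 3
```

The receive is always stamped later than the send that caused it, with no wall clock involved. The converse does not hold: a larger Lamport timestamp alone does not prove one event caused another, which is the gap version vectors address.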

Scalability & Data

Scaling out — load balancing and statelessness, caching, distributed databases, replication and partitioning, and asynchronous messaging at scale.

5 · Beginner Scalability Fundamentals

What scaling actually means — scale up vs out vs down, the twin principles of replication and optimization, statelessness, and why Amdahl's law caps your gains.

✦ Complete · ⏱ 5 min

6 · Intermediate Load Balancing and Elasticity

How a load balancer spreads requests across stateless replicas — Layer 4 vs Layer 7, distribution policies, health checks, elastic autoscaling, and the cascading-failure defenses that keep it all standing.

✦ Complete · ⏱ 5 min

7 · Intermediate Distributed Caching

How caching buys capacity by not asking the database — cache-aside vs read/write-through, TTLs and eviction, hit-rate economics, and HTTP/CDN caching at the edge.

✦ Complete · ⏱ 5 min

8 · Advanced Distributed Databases: Replication and Sharding

Scaling the data tier — read replicas, partitioning and sharding, leader-follower vs leaderless replication, NoSQL data models, and the consistency knobs real engines expose.

✦ Complete · ⏱ 5 min

9 · Intermediate Asynchronous Messaging at Scale

Decoupling producers from consumers with queues and logs — persistence and delivery guarantees, pub/sub, competing consumers, dead-letter queues, and the event-log shift Kafka makes.

✦ Complete · ⏱ 5 min
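
To make the dead-letter queue idea from the messaging topic concrete, the sketch below uses an in-memory `deque` as a stand-in for a real broker. The names `drain`, `MAX_ATTEMPTS`, and the `"poison"` message are hypothetical, and a real broker would track delivery attempts itself; this only shows the control flow.

```python
from collections import deque

MAX_ATTEMPTS = 3  # hypothetical retry budget per message


def drain(queue, handler, dead_letters):
    """Process messages; requeue failures and park repeat offenders."""
    while queue:
        body, attempts = queue.popleft()
        try:
            handler(body)
        except Exception:
            if attempts + 1 >= MAX_ATTEMPTS:
                # Out of retries: move it aside so it stops blocking the queue.
                dead_letters.append(body)
            else:
                queue.append((body, attempts + 1))


processed = []


def handler(body):
    if body == "poison":
        raise ValueError("cannot parse message")
    processed.append(body)


queue = deque([("a", 0), ("poison", 0), ("b", 0)])
dead_letters = []
drain(queue, handler, dead_letters)
```

After the run, `processed` holds the two healthy messages and `dead_letters` holds the one that failed three times. Without the dead-letter step, the poison message would be retried forever.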

System Patterns

Reusable building blocks — single-node patterns (sidecar, ambassador, adapter), serving patterns (replication, sharding, scatter-gather), and batch patterns.

10 · Intermediate Single-Node Patterns: Sidecar, Ambassador, Adapter

The three co-located container patterns — sidecar augments, ambassador brokers, adapter normalizes — that turn one machine's containers into reusable distributed-system building blocks.

✦ Complete · ⏱ 5 min

11 · Advanced Serving Patterns: Replicated, Sharded, Scatter/Gather

The three multi-node serving topologies — replicate to scale requests, shard to scale data, scatter/gather to scale time — plus the readiness, hot-sharding, and straggler realities that govern them.

✦ Complete · ⏱ 5 min

12 · Advanced Batch Computational Patterns: Work Queues, Event-Driven, Coordinated

Patterns for short-lived, parallel data processing — the work queue, the event-driven coordination primitives (copier, filter, splitter, sharder, merger), and the coordinated join/reduce that produces aggregates.

✦ Complete · ⏱ 6 min
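
The scatter/gather topology described in the serving patterns topic can be sketched in a few lines. This is an illustration under stated assumptions: the `SHARDS` data, `search_shard`, and `scatter_gather` are invented for the example, and threads in one process stand in for leaf servers on separate machines.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical shards: each one holds a slice of the documents.
SHARDS = [
    ["raft paper", "paxos notes"],
    ["cache design", "raft tutorial"],
    ["sharding guide"],
]


def search_shard(shard, term):
    # Leaf work: each shard scans only its own data.
    return [doc for doc in shard if term in doc]


def scatter_gather(term):
    with ThreadPoolExecutor(max_workers=len(SHARDS)) as pool:
        # Scatter: one request per shard, issued in parallel.
        futures = [pool.submit(search_shard, shard, term) for shard in SHARDS]
        # Gather: the root waits for every leaf before answering.
        partials = [f.result() for f in futures]
    return sorted(doc for part in partials for doc in part)


results = scatter_gather("raft")
```

Because the root waits on every leaf, the slowest shard sets the response time. That is the straggler problem the topic refers to, and it grows with the number of shards.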

🌐 The network is not reliable — design for partial failure

The eight fallacies of distributed computing all reduce to one: the network will fail in ways a single process never does. Latency is nonzero, messages get lost and duplicated, and parts of the system go dark while others stay up. Idempotency, timeouts, retries and backpressure aren’t extras — they’re the baseline.
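
A minimal sketch of how two of those baseline tools fit together, retries and idempotency. `FlakyServer`, `deposit`, and `call_with_retries` are hypothetical names, and the "network" is simulated: the server does the work, then loses the first reply. The client cannot tell a lost reply from a lost request, so it retries with the same idempotency key and the server refuses to apply the operation twice.

```python
import random
import time
import uuid


class FlakyServer:
    """Hypothetical server whose first reply is lost after the work is done."""

    def __init__(self):
        self.balance = 0
        self.seen = {}   # idempotency key -> cached result
        self.calls = 0

    def deposit(self, key, amount):
        self.calls += 1
        if key not in self.seen:
            # Apply each idempotency key at most once.
            self.balance += amount
            self.seen[key] = self.balance
        if self.calls == 1:
            raise TimeoutError("reply lost in the network")
        return self.seen[key]


def call_with_retries(op, attempts=4, base_delay=0.01):
    for attempt in range(attempts):
        try:
            return op()
        except TimeoutError:
            if attempt == attempts - 1:
                raise
            # Exponential backoff with jitter, so retries do not arrive in lockstep.
            time.sleep(base_delay * (2 ** attempt) * random.random())


server = FlakyServer()
key = str(uuid.uuid4())  # one key per logical operation, reused across retries
balance = call_with_retries(lambda: server.deposit(key, 100))
```

The server is called twice but the balance ends at 100, not 200. Drop the idempotency key and the same retry loop silently double-charges, which is why retries without idempotency are unsafe for anything that changes state.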