Cassandra shows up when the system design problem has a specific shape: high write volume, large data size, multi-region availability, and predictable access patterns.
It is a good candidate when high availability matters more than rich querying, and when the team is willing to model data around the exact reads the application will perform.
Cassandra makes a clear trade-off: it sacrifices query flexibility for write performance, horizontal scale, and availability. Writes go through an append-oriented path, there is no primary node for the whole cluster, and data is replicated across nodes and data centers.
Those choices work well for the right workload, but they also change how you model data and operate the system.
In interviews, the important part is explaining both sides of the trade-off. Cassandra is a poor fit for ad-hoc analytics, complex joins, and transactional workflows across partitions. It is a strong fit when the workload is write-heavy, access patterns are known, partition keys are well distributed, and eventual or quorum-based consistency is acceptable.
For interviews, focus on query-first data modeling, partition key design, consistency tuning, and the anti-patterns that make Cassandra fail in production.
The diagram traces how client requests flow through a masterless Cassandra ring, how token ranges map each partition to a specific node, and how data replicates asynchronously to a second data center.