Introduction to CAP Theorem

The CAP Theorem is a fundamental result in distributed systems that describes the trade-offs involved in designing a distributed database. Eric Brewer proposed it in 2000. The theorem states that a distributed system can guarantee at most two of the following three properties at the same time:

Consistency (C)
Availability (A)
Partition Tolerance (P)

All three properties matter for distributed data storage systems, but according to the CAP theorem no system can guarantee all of them at once. When designing a distributed system, you must therefore decide which two to prioritize based on your requirements.

Key Concepts of the CAP Theorem

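Consistency (C): Every read returns the most recent successful write or an error, so all nodes appear to hold the same data at the same time.

Availability (A): Every request sent to a non-failing node receives a non-error response, although that response may not reflect the most recent write.

Partition Tolerance (P): The system keeps operating even when the network drops or delays messages between nodes.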

Trade-offs in CAP Theorem

Based on the CAP Theorem, distributed systems can only guarantee two of the three properties. Let’s break down the different combinations:

1. Consistency + Availability (CA)

If you prioritize both Consistency and Availability, the system cannot tolerate network partitions. Such systems work well as long as every node can reach every other node. If a partition occurs, the system can no longer guarantee both properties, and it will typically stop responding rather than risk serving inconsistent data.

Use Case: This combination is suitable for systems within a single data center where the likelihood of network partitions is minimal.

Example:

Traditional relational databases (e.g., a single-node MySQL or PostgreSQL deployment) are usually placed in this category, but they do not cope well when partition tolerance is required.

2. Consistency + Partition Tolerance (CP)

If you prioritize Consistency and Partition Tolerance, you may sacrifice availability. When a partition occurs, the system rejects or delays some requests rather than return data that might be stale.

Use Case: Suitable for systems where consistency is critical, such as financial systems or banking applications where every transaction must be accurate and up-to-date.

Example:

Systems like HBase or MongoDB can sacrifice availability in certain configurations to ensure consistency even when partitions occur.
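To make the CP behaviour concrete, here is a minimal Python sketch of a replicated register that refuses to serve requests unless it can reach a majority of replicas. It is an illustration only, not how HBase or MongoDB are implemented: the names CPRegister and QuorumUnavailable, the in-memory replica list, and the reachable flags that simulate a partition are all assumptions made for this example.

```python
class QuorumUnavailable(Exception):
    """Raised when a majority of replicas cannot be reached."""


class CPRegister:
    """Toy single-value store that prefers consistency over availability."""

    def __init__(self, replica_count):
        # Each replica holds a (version, value) pair.
        self.replicas = [(0, None)] * replica_count
        # Setting an entry to False simulates a replica cut off by a partition.
        self.reachable = [True] * replica_count

    def _quorum(self):
        """Return the ids of reachable replicas, or fail without a majority."""
        ids = [i for i, ok in enumerate(self.reachable) if ok]
        if len(ids) < len(self.replicas) // 2 + 1:
            raise QuorumUnavailable("no majority of replicas reachable")
        return ids

    def write(self, value):
        ids = self._quorum()
        # Any majority overlaps the previous write's majority,
        # so the highest version seen here is the latest one.
        version = max(self.replicas[i][0] for i in ids) + 1
        for i in ids:
            self.replicas[i] = (version, value)

    def read(self):
        ids = self._quorum()
        # Return the value carrying the highest version among reachable replicas.
        return max((self.replicas[i] for i in ids), key=lambda r: r[0])[1]
```

With three replicas, losing one still leaves a majority, so reads and writes continue and always return the latest value. Losing two leaves a minority, and the register raises QuorumUnavailable instead of answering. That error is the availability being given up.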

3. Availability + Partition Tolerance (AP)

If you prioritize Availability and Partition Tolerance, you may sacrifice consistency. These systems remain available and partition-tolerant, but may return outdated or inconsistent data when there are network issues. This is often referred to as eventual consistency, where the system will eventually become consistent but not necessarily immediately.

Use Case: Ideal for systems where availability is more important than consistency, such as social media feeds, caching systems, or shopping cart systems in e-commerce platforms.

Example:

NoSQL databases like Cassandra and DynamoDB prioritize availability and partition tolerance, allowing for eventual consistency in exchange for handling partitions and maintaining availability.
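The AP trade-off can be sketched in a few lines of Python. The example below uses last-write-wins reconciliation, which is only one of several possible strategies and is not claimed to be what Cassandra or DynamoDB do internally. The class name APReplica, the caller-supplied timestamps, and the merge_from method are assumptions made for illustration.

```python
class APReplica:
    """Toy replica that always accepts writes and reconciles later."""

    def __init__(self, name):
        self.name = name
        self.store = {}  # key -> (timestamp, value)

    def write(self, key, value, timestamp):
        # Assumption: timestamps are unique logical clock values supplied by the caller.
        current = self.store.get(key)
        if current is None or timestamp > current[0]:
            self.store[key] = (timestamp, value)

    def read(self, key):
        # Always answers from local state, even if that state is stale.
        entry = self.store.get(key)
        return None if entry is None else entry[1]

    def merge_from(self, other):
        """Pull another replica's entries, keeping the newest write per key."""
        for key, (timestamp, value) in other.store.items():
            self.write(key, value, timestamp)
```

If two replicas are partitioned and each accepts a different write for the same key, both keep answering requests, but one of them serves stale data. Once the partition heals and each replica calls merge_from on the other, both converge on the write with the higher timestamp. Note that last-write-wins silently discards the older of two concurrent writes, which may be unacceptable for some data.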

CAP Theorem in System Design Interviews

In system design interviews, CAP theorem is an important concept because it helps you understand trade-offs in designing distributed systems. When discussing a system, you should be able to explain:

What kind of data consistency the system requires (e.g., strong or eventual consistency).
How important availability is: whether the system must serve requests under all conditions, or whether temporary unavailability is acceptable.
How the system handles network partitions, and whether it should continue to function while a partition lasts.

When asked about CAP in a system design interview, follow this approach:

1. Clarify Requirements

Ask clarifying questions to understand the business or technical requirements. Does the system need high availability (e.g., a social media feed that should always load)? Or is consistency more critical (e.g., a financial system, or a booking system where double bookings must be avoided)?

2. Identify Trade-offs

Explain how the system you propose will handle trade-offs between consistency, availability, and partition tolerance. For example, if you’re designing a social media platform, you might prioritize availability and partition tolerance, with eventual consistency being acceptable.

3. Use Real-World Examples

Reference real-world systems and databases like Cassandra, DynamoDB, or traditional SQL systems to highlight how different distributed systems make trade-offs based on CAP theorem.

4. Discuss Failures and Recovery

Address what happens in the event of network partitions and how your system would recover. Mention concepts like eventual consistency, leader election, and replication strategies to strengthen your explanation.
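One replication strategy that is useful to be able to reason about aloud is quorum replication: with N replicas, a write waits for W acknowledgements and a read consults R replicas. When R + W > N, every read quorum overlaps every write quorum, so a read sees the latest acknowledged write. The helper functions below are a small sketch of that arithmetic. Their names are hypothetical and they do not model any particular database.

```python
def quorum_is_strong(n, r, w):
    """Return True when every read quorum overlaps every write quorum."""
    if not (1 <= r <= n and 1 <= w <= n):
        raise ValueError("r and w must be between 1 and n")
    return r + w > n


def tolerated_failures(n, r, w):
    """Number of replicas that can be unreachable while both reads and writes still succeed."""
    if not (1 <= r <= n and 1 <= w <= n):
        raise ValueError("r and w must be between 1 and n")
    return n - max(r, w)
```

For example, N = 3 with R = 2 and W = 2 gives overlapping quorums and survives one unreachable replica. N = 3 with R = 1 and W = 1 survives two unreachable replicas, but reads may return stale data. This is the consistency versus availability trade-off expressed as configuration.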

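Cloud Databases and the CAP Theorem

The databases below are commonly classified as follows. These labels are simplifications: several of these systems let you tune consistency, and the classification of a given deployment depends on how it is configured.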

Amazon DynamoDB (AP)
Google Cloud Spanner (CP)
Amazon Aurora (CA)
Cassandra (AP)
Firebase Realtime Database (AP)
Amazon RDS (CA)
