Chat Application : WhatsApp


Functional Requirements

Non-Functional Requirements:

Estimation:

As of early 2025, WhatsApp reportedly handles 100 billion messages per day.This figure includes text messages, media, and voice/video calls.

Storage Estimation: Number of server assumption:

High level design:

cache
  1. Connection with a WebSocket server:
  2. Send or receive messages:
  3. The system performs the following steps to send messages from user A to user B:

  4. Send or receive media files:
  5. Support for group messages:
    • User A Sends a Message to the WebSocket Server:
      • User A is connected to a WebSocket server, which maintains an active connection for the user.
      • When User A sends a message intended for Group/A, the WebSocket server forwards it to the message service.
      • The message service handles the initial processing and routing of the message.
    • Message Service Sends the Message to Kafka:
      • The message service packages the message with metadata such as group ID (Group/A), sender ID, and timestamp.
      • The service publishes the message to Kafka for further processing.
    • Kafka's Responsibility (Message Broker):
      • Topic Management: Kafka treats each group (e.g., Group/A) as a "topic." Topics are logical channels for organizing messages, where producers (senders) write messages and consumers (receivers) read them.
      • Message Storage: Kafka temporarily stores the message in a partitioned log associated with the Group/A topic, ensuring durability even during failures or delays.
      • Scalability: Kafka partitions topics for parallel processing, enabling the handling of millions of messages efficiently.
      • Producer and Consumer Decoupling: Kafka decouples senders (producers) and receivers (consumers), simplifying the architecture and allowing dynamic scaling of recipients.
      • Consumer Coordination: Kafka ensures that all intended recipients receive messages in the correct order, coordinating delivery between producers and consumers.
    • Group Service Retrieves Group Metadata:
      • The group service maintains metadata for groups, including:
        • User IDs in the group.
        • Group ID and status (active/inactive).
        • Group attributes like the group icon and number of members.
      • Metadata is stored in a MySQL database cluster with:
        • Secondary Replicas: Geographically distributed replicas ensure high availability and reduce latency for read operations.
        • Redis Cache: Frequently accessed group data is cached to improve response times and reduce database load.
    • Group Message Handler Fetches Group Data:
      • The group message handler communicates with the group service to retrieve Group/A user data and statuses (online/offline).
      • It retrieves the message for Group/A from Kafka and processes it.
    • Message Delivery to Group Members:
      • The group message handler acts as the consumer for Kafka messages.
      • For each user in Group/A:
        • If the user is online, the handler forwards the message to the WebSocket server they are connected to for real-time delivery.
        • If the user is offline, the message may be stored temporarily or forwarded to a push notification system.

Non-functional Requirements