LeetDezine — System Design and Internals
Data Processing
Initializing search
    • Home
    • Start Here
    • Concepts
    • Back of Envelope
    • Foundation
    • Ascent
    • Expedition
    • Summit
    • Battleground
    • Home
    • Start Here
    • Concepts
      • Fundamentals
      • Distributed Systems
      • Caching
      • Storage & Databases
      • Database Types
      • Messaging
      • Event Broker
      • Event-Driven Patterns
      • Data Processing
        • Stream Processing
        • Batch Processing
        • Lambda & Kappa
        • Schema Evolution
    • Back of Envelope
    • Foundation
    • Ascent
    • Expedition
    • Summit
    • Battleground
    1. Home
    2. Concepts
    3. Data Processing
    09

    Concepts

    Data Processing

    How systems move, transform, and aggregate large volumes of data — in real time and in bulk.

    Stream Processing
    Windows, watermarks, statefulness, and crash recovery. Processing infinite streams of events without losing data or blowing memory.
    WindowsWatermarksStatefulnessCrash Recovery
    Open topic →
    Batch Processing
    MapReduce internals, Spark's DAG execution model, and how batch systems handle hot keys and skewed data.
    MapReduceSparkDAGHot Keys
    Open topic →
    Lambda & Kappa
    Two architectures for mixing batch and stream. Lambda's complexity vs Kappa's simplicity — and when each one wins.
    Lambda ArchitectureKappa ArchitectureBatch LayerSpeed Layer
    Open topic →
    Schema Evolution
    Schema registries, Protobuf, and Avro — how to change data formats in a live pipeline without breaking producers or consumers.
    Schema RegistryProtobufAvroCompatibility
    Open topic →