This document discusses Airbnb's data infrastructure and its use of AirStream. It describes how AirStream provides a unified platform for streaming and batch data processing, built on Spark SQL and a shared state store in HBase. Case studies show AirStream in use for real-time data ingestion from Kafka to HBase, streaming exports from databases to HBase, and point-in-time queries. The document also covers how AirStream scales jobs using YARN, provides fault tolerance through checkpointing and job restarts, and monitors jobs with AirStream listeners.
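
To make the idea of point-in-time queries against a shared state store more concrete, here is a minimal sketch in Python. It is not AirStream's implementation and does not use the HBase client API: `VersionedStore`, `put`, and `get_as_of` are hypothetical names, and the store is an in-memory stand-in that assumes each write carries a timestamp and that older versions are retained, much as a versioned table would. Streaming and batch writers would both call the same `put`, which is the sense in which the state is shared.

```python
import bisect
from collections import defaultdict


class VersionedStore:
    """In-memory stand-in for a versioned key-value table.

    Hypothetical illustration only; this is not the HBase or AirStream API.
    """

    def __init__(self):
        # Per key: a sorted list of timestamps and a parallel list of values.
        self._timestamps = defaultdict(list)
        self._values = defaultdict(list)

    def put(self, key, timestamp, value):
        """Record a value for `key` as of `timestamp`.

        Writes may arrive out of order (for example, a batch backfill after
        streaming writes); the version lists are kept sorted by timestamp.
        """
        idx = bisect.bisect_right(self._timestamps[key], timestamp)
        self._timestamps[key].insert(idx, timestamp)
        self._values[key].insert(idx, value)

    def get_as_of(self, key, timestamp):
        """Return the latest value written at or before `timestamp`, or None."""
        versions = self._timestamps.get(key, [])
        idx = bisect.bisect_right(versions, timestamp)
        if idx == 0:
            return None  # key unknown, or no version that early
        return self._values[key][idx - 1]


if __name__ == "__main__":
    store = VersionedStore()
    # "Streaming" writes, applied as events arrive.
    store.put("listing:1", 100, {"price": 80})
    store.put("listing:1", 300, {"price": 95})
    # A late "batch" write filling in an earlier point in time.
    store.put("listing:1", 200, {"price": 90})

    print(store.get_as_of("listing:1", 250))  # {'price': 90}
```

Keeping every version sorted by timestamp is what lets a query ask for the state "as of" an earlier moment rather than only the latest value; a real deployment would rely on the storage layer's own versioning and retention settings instead of Python lists.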