1. Flink's Notions of Time:
  2. Why Event Time is Crucial:
  3. Watermarks Explained:
  4. Lateness and Watermarks:
  5. Trade-offs in Stream Processing:
  6. Approaches to Watermarking:

What is Flink? Apache Flink® — Stateful Computations over Data Streams

Flink is a stream processor → Data has to come from somewhere (databases, logs, transactions, etc) passes thru Flink and goes to somewhere else

Screenshot 2023-07-25 at 1.48.55 PM.png

Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams

Screenshot 2023-07-25 at 2.04.06 PM.png

Apache Flink is a distributed processing engine that requires compute resources in order to execute applications. It integrates with common cluster resource managers (Hadoop, YARN, Kubernetes), but can also run as a standalone cluster. When deploying applications, Flink automatically identifies required resources, requests them from the resource manager, and replaces failed containers by requesting new resources.

Apache Flink Benefits