Guide: Stream Processing Frameworks

Stream processing frameworks

  • Apache Apex
  • Apache Storm
  • Apache Flink
  • Apache Spark
  • Heron from Twitter
  • Kafka Streams

Apache Apex

The project claims that “Apache Apex is industry’s only open-source enterprise-grade engine capable of handling batch data as well as streaming data needs”.

Like MapReduce, Apex does not provide its own resource scheduler and manager, distributed file system, security setup, or the other common utilities of a distributed operating system. Instead, Apex leverages YARN's features without overlapping with YARN, and it uses HDFS as its default persistent state store.
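To make the "external persistent state store" idea concrete, here is a minimal sketch in plain Python. It is not Apex's API: FileStateStore, CountingOperator, and run_demo are hypothetical names, and a local temporary directory stands in for HDFS. It only illustrates the division of labour described above, where the operator keeps no durable state of its own and recovers from whatever the external store holds.

```python
import json
import os
import tempfile


class FileStateStore:
    """Stand-in for an external persistent store such as HDFS (hypothetical helper)."""

    def __init__(self, directory):
        self.path = os.path.join(directory, "checkpoint.json")

    def save(self, state):
        # Write to a temporary file first, then rename it into place, so a
        # crash mid-write never leaves a half-written checkpoint behind.
        tmp_path = self.path + ".tmp"
        with open(tmp_path, "w") as f:
            json.dump(state, f)
        os.replace(tmp_path, self.path)

    def load(self):
        if not os.path.exists(self.path):
            return None
        with open(self.path) as f:
            return json.load(f)


class CountingOperator:
    """Counts words; all durable state lives in the store (hypothetical operator)."""

    def __init__(self, store, checkpoint_every=2):
        self.store = store
        self.checkpoint_every = checkpoint_every
        saved = store.load()
        # Resume from the last checkpoint if one exists, otherwise start empty.
        self.counts = saved["counts"] if saved else {}
        self.processed = saved["processed"] if saved else 0

    def process(self, word):
        self.counts[word] = self.counts.get(word, 0) + 1
        self.processed += 1
        # Persist state every `checkpoint_every` tuples.
        if self.processed % self.checkpoint_every == 0:
            self.store.save({"counts": self.counts, "processed": self.processed})


def run_demo():
    stream = ["a", "b", "a", "c", "a"]
    with tempfile.TemporaryDirectory() as directory:
        store = FileStateStore(directory)
        first = CountingOperator(store)
        for word in stream[:3]:
            first.process(word)
        # Simulate a crash: drop the first operator and rebuild from the store.
        second = CountingOperator(store)
        # The last checkpoint covered two tuples, so replay from that offset.
        for word in stream[second.processed:]:
            second.process(word)
        return second.counts
```

Calling run_demo() processes three words, simulates a failure, and then rebuilds the operator from the last checkpoint before replaying the rest of the stream. The replay offset comes from the checkpoint itself, which is an assumption of this sketch; a real engine coordinates replay with its input sources.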

For more information: Flink