Distributed event streaming platform capable of handling trillions of events per day.
Links:
• Official Docs: kafka.apache.org
• GitHub: github.com/apache/kafka
• Quickstart: kafka.apache.org/quickstart
Stateful computations over data streams with exactly-once semantics and event-time processing.
Links:
• Official Docs: flink.apache.org
• Training: flink.apache.org/training
• Playgrounds: flink.apache.org/try-flink
Cloud-native distributed messaging and streaming platform with multi-tenancy and geo-replication.
Links:
• Official Docs: pulsar.apache.org
• Tutorials: pulsar.apache.org/docs/next
• Community: pulsar.apache.org/community
Scalable stream processing with micro-batch architecture and unified batch/stream API.
Links:
• Official Docs: spark.apache.org/streaming
• Examples: spark.apache.org/examples
• Structured Streaming: spark.apache.org/docs/latest/structured-streaming
Fully managed streaming service on AWS for real-time data ingestion and processing.
Links:
• Official Docs: aws.amazon.com/kinesis
• Developer Guide: docs.aws.amazon.com/kinesis
• Tutorials: aws.amazon.com/kinesis/getting-started
Real-time distributed computation system for processing unbounded streams of data.
Links:
• Official Docs: storm.apache.org
• Tutorial: storm.apache.org/releases/current/Tutorial.html
• GitHub: github.com/apache/storm
| Library | Language | Platform | Description |
|---|---|---|---|
| KafkaJS | JavaScript/TypeScript | Node.js | Modern Kafka client for Node.js with async/await support |
| confluent-kafka-python | Python | Any | High-performance Python client based on librdkafka |
| kafka-go | Go | Any | Pure Go Kafka client with zero dependencies |
| kafka-clients | Java | JVM | Official Apache Kafka Java client |
| rdkafka | C/C++ | Any | High-performance C library used by many language bindings |
| Sarama | Go | Any | Pure Go client with consumer group support |
Authors: Tyler Akidau, Slava Chernyak, Reuven Lax
Comprehensive guide to stream processing concepts, windowing, watermarks, and state management.
Authors: Neha Narkhede, Gwen Shapira, Todd Palino
Complete reference for Apache Kafka covering architecture, operations, and use cases.
Author: Ben Stopford
Patterns and concepts for building event-driven architectures with Kafka.
Authors: Fabian Hueske, Vasiliki Kalavri
Practical guide to building streaming applications with Apache Flink.
| Tool | Purpose | Features |
|---|---|---|
| Confluent Control Center | Kafka Management | Monitoring, alerting, cluster management, stream processing |
| Kafka Manager (CMAK) | Cluster Management | Topic management, consumer groups, cluster monitoring |
| Kafdrop | Web UI | Browse topics, view messages, monitor consumers |
| Schema Registry | Schema Management | Avro/Protobuf/JSON schema versioning and validation |
| Kafka Connect | Data Integration | Source/sink connectors for databases, cloud services, etc. |
| ksqlDB | Stream Processing | SQL-like queries on Kafka streams |
| Burrow | Monitoring | Consumer lag monitoring and alerting |
| Cruise Control | Operations | Automated cluster rebalancing and anomaly detection |
Event serialization formats, schemas, and data modeling for streaming systems.
Producer/consumer APIs, admin operations, and client interaction patterns.
Wire protocols, communication patterns, and network optimization strategies.
Connectors, adapters, and integration patterns with databases and cloud services.