kafka
distributed event streaming platform
TLDR
SYNOPSIS
kafka-server-start.sh config
DESCRIPTION
Apache Kafka is a distributed event streaming platform. It provides high-throughput, low-latency message handling for real-time data pipelines and streaming applications.Kafka organizes messages into topics, with partitions for parallelism and replication for fault tolerance. Producers send messages; consumers read them.
CONFIGURATION
node.id=1
process.roles=broker,controller
controller.quorum.voters=1@localhost:9093
listeners=PLAINTEXT://:9092,CONTROLLER://:9093
log.dirs=/var/kafka-logs
broker.id=0
listeners=PLAINTEXT://:9092
log.dirs=/var/kafka-logs
zookeeper.connect=localhost:2181
KEY CONCEPTS
- Topic: Category for messages- Partition: Ordered, immutable sequence- Producer: Sends messages to topics- Consumer: Reads messages from topics- Broker: Kafka server node- Consumer Group: Coordinated consumers
CAVEATS
ZooKeeper support was removed in Kafka 4.0; KRaft mode is now required for new deployments. Memory and disk intensive. Topic configuration affects retention and storage. Consumer group rebalancing can cause temporary processing delays.
HISTORY
Kafka was developed at LinkedIn and open-sourced in 2011. Named after author Franz Kafka, it became an Apache project and is now fundamental infrastructure for event-driven architectures.
SEE ALSO
kafka-topics(1), kafkacat(1)
