Kafka is an open source stream processing platform developed by the Apache Software Foundation and written in Scala and Java. Kafka is a high-throughput distributed publish-subscribe messaging system that can process all action streaming data of consumers in the website. This action is a key factor in many social functions on the modern web.
Kafka is an open source stream processing platform developed by the Apache Software Foundation and written in Scala and Java. Kafka is a high-throughput distributed publish-subscribe messaging system that can process all action streaming data of consumers in the website. Such actions (web browsing, searches and other user actions) are a key factor in many social functions on the modern web. This data is typically addressed by processing logs and log aggregation due to throughput requirements. This is a feasible solution for log data and offline analysis systems like Hadoop, but requiring real-time processing constraints. The purpose of Kafka is to unify online and offline message processing through Hadoop's parallel loading mechanism, and to provide real-time messages through the cluster.
Kafka is a high-throughput distributed publish-subscribe messaging system with the following characteristics:
Provides message persistence through O(1) disk data structures ization, this structure can maintain stable performance for a long time even for message storage of several terabytes.
High throughput: Even very modest hardware Kafka can support millions of messages per second.
Supports partitioning messages through Kafka server and consumer machine clusters.
Support Hadoop parallel data loading.
The above is the detailed content of What does kafka mean?. For more information, please follow other related articles on the PHP Chinese website!