This is a summary of the Kafka white paper, which is available at this link.
1. Abstract
- Kafka is a distributed messaging system developed for collecting and delivering high volumes of log data with low latency.
- In the paper's experiments, Kafka outperforms two popular messaging systems (Apache ActiveMQ and RabbitMQ).
 
2. Introduction
Any sizable internet company generates a large amount of log data, which includes user activity and operational metrics.
- User activity
  - Events corresponding to logins, page views, clicks, comments, and search queries.
- Operational metrics
  - Service call stack, call latency, errors, and system metrics.
  - System metrics include CPU, memory, network, or disk utilization on each machine.
 
- Activity data is used for:
  - search relevance
  - recommendations
  - ad targeting and reporting
  - security
 
- Kafka provides an API similar to that of a messaging system and allows applications to consume log events in real time.
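
To make the publish/consume idea concrete, here is a minimal in-memory sketch in Python. It is not Kafka's API: the class `ToyLog`, its `publish` and `consume` methods, the topic names, and the event fields are all hypothetical, chosen only to show how applications might append log events to a named stream and read them back.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToyLog:
    """Hypothetical in-memory stand-in for a log-messaging service (not Kafka's API)."""
    topics: Dict[str, List[dict]] = field(default_factory=lambda: defaultdict(list))

    def publish(self, topic: str, event: dict) -> None:
        # Append the event to the end of the topic's list.
        self.topics[topic].append(event)

    def consume(self, topic: str, start: int = 0) -> List[dict]:
        # Return every event published to the topic from position `start` onward.
        return self.topics[topic][start:]


log = ToyLog()

# User activity events (field names are assumptions for illustration).
log.publish("user_activity", {"type": "page_view", "user": "u1", "page": "/home"})
log.publish("user_activity", {"type": "click", "user": "u1", "target": "signup"})

# An operational metric event.
log.publish("operational_metrics", {"type": "call_latency_ms", "service": "search", "value": 42})

activity = log.consume("user_activity")             # all activity events so far
new_events = log.consume("user_activity", start=1)  # only events after the first
```

A consumer that remembers how far it has read can pass that position as `start` on its next call and receive only newer events, which is one simple way to model reading a log as it grows.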
 