This article is awesome. It explains every capability of the tool so that we can go deeper as needed. As a beginner in Kafka, I find this very helpful. Thanks!
I currently work on a team that uses and builds heavily on top of Kafka infrastructure (Kafka itself, Kafka Streams and Kafka Connect), and nowhere until this article had I come across such a crisp, succinct and comprehensive overview. I have already bookmarked it as my go-to article moving forward! Thank you Stanislav!
This article is so comprehensive. One can find their way around Kafka with this.
This article is so helpful and easy to follow.
One suggestion: allow click-to-zoom on the images for mobile users.
🙇‍♂️
One of the best high-level overviews of Kafka.
Awesome article...
The relevant consumer timeouts are `session.timeout.ms` and `max.poll.interval.ms`; I wrote about them back in the day here: https://www.confluent.io/blog/apache-kafka-data-access-semantics-consumers-and-membership/
Basically, as long as the consumer heartbeats and actively polls for messages, it’ll remain in the group. It can accumulate consumer lag in that time.
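To make that rule concrete, here is a rough sketch in Python. It is a simplified model, not Kafka client code: `ConsumerState` and `stays_in_group` are hypothetical names made up for illustration, and it assumes the two timeouts in play are `session.timeout.ms` (heartbeats) and `max.poll.interval.ms` (polling). The default values are the ones I believe recent Kafka versions ship with, so treat them as assumptions too.

```python
from dataclasses import dataclass


@dataclass
class ConsumerState:
    """Hypothetical snapshot of one consumer; all times in milliseconds."""
    last_heartbeat_ms: int  # when the consumer last sent a heartbeat
    last_poll_ms: int       # when the application last called poll()


def stays_in_group(state: ConsumerState, now_ms: int,
                   session_timeout_ms: int = 45_000,
                   max_poll_interval_ms: int = 300_000) -> bool:
    """Return True if neither timeout has expired (simplified model)."""
    # Heartbeat check: models session.timeout.ms.
    heartbeating = now_ms - state.last_heartbeat_ms <= session_timeout_ms
    # Poll check: models max.poll.interval.ms.
    polling = now_ms - state.last_poll_ms <= max_poll_interval_ms
    return heartbeating and polling


# A consumer that heartbeated and polled 29 seconds ago is still a member.
print(stays_in_group(ConsumerState(1_000, 1_000), now_ms=30_000))
```

Note that nothing in this check looks at how far behind the consumer is, which is why it can stay in the group while its lag keeps growing.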
If you’re surprised how much S3 can save you, you’ll be blown away by how much you can save by eliminating networking: https://topicpartition.io/blog/kip-1150-diskless-topics-in-apache-kafka
For small-scale use cases, you should just use Postgres: https://topicpartition.io/blog/postgres-pubsub-queue-benchmarks