February 10, 2015

SE Radio 219: Apache Kafka with Jun Rao

Venue: Internet
Jeff Meyerson talks to Jun Rao, a software engineer and researcher (formerly of LinkedIn). Jun has spent much of his time researching MapReduce, scalable databases, query processing, and other facets of the data warehouse. For the past three years, he has been a committer to the Apache Kafka project. Jeff and Jun first compare streaming to messaging, and the frameworks that support each. Kafka is a big data messaging or pub/sub system. Traditionally, these are two different types of systems, but the lines have become blurred recently. Kafka can also be looked at as a distributed commit log. Next, they discuss the vocabulary of Kafka, including producers and consumers. They wrap up by exploring Kafka from the perspective of durability and reliability and discuss some failure cases.

Show Notes

Related Links

Apache Kafka: http://kafka.apache.org
Original Kafka paper: http://research.microsoft.com/en-us/um/people/srikanth/netdb11/netdb11papers/netdb11-final12.pdf
Kafka Basic Training: http://www.slideshare.net/miguno/apache-kafka-08-basic-training-verisign
Building LinkedIn’s Real-time Activity Data Pipeline: http://sites.computer.org/debull/A12june/pipeline.pdf
Kafka: A Little Introduction: https://speakerdeck.com/pingles/kafka-a-little-introduction
Apache Storm: https://storm.apache.org
Apache Samza: http://samza.incubator.apache.org
Apache Zookeeper: http://zookeeper.apache.org

Join the discussion

You must be logged in to post a comment.

6 comments

Ivan Muzzolini says:

February 11, 2015 at 11:28 am

Hi!
It feels like the Download link is broken… I cannot download the mp3. Could you please have a look?
Thanks!
Ivan
cdman says:

February 19, 2015 at 12:43 pm

It works now. Could you please try it again? Perhaps libsyn was down momentarily?
Interesting Java related Links. Week 7, 2015 | My Technical Life says:

February 20, 2015 at 5:48 pm

[…] Apache Kafka (podcast) – very good introduction into Apache Kafka project. AT least I know know where it’s applicable and some internals of it. […]
hartror comments on “Time-Series Database Requirements” | blog.offeryour.com says:

March 9, 2015 at 1:57 am

[…] [3] http://www.se-radio.net/2015/02/episode-219-apache-kafka-wit… […]
tekkiesuk says:

April 11, 2015 at 11:02 pm

I’ve recently been enlightened by a few of your podcasts. Zookeeper keeps coming up in the conversation. It would be good to hear a bit more about it.
Podcast for developers: A few recommendations says:

July 4, 2015 at 12:31 pm

[…] Apache Kafka with Jun Rao […]

SE Radio 219: Apache Kafka with Jun Rao

Show Notes

Related Links

Join the discussion

6 comments

More from this show

SE Radio 730: Birgitta Boeckeler on Harness Engineering for AI Agents

SE Radio 729: Garth Mollett on AI Supply Chain Security

SE Radio 728: Clare Liguori on the AWS Strands SDK for AI Agents

Menu

Recent posts

Search

Search

SE Radio 219: Apache Kafka with Jun Rao

Show Notes

Related Links

Join the discussion

6 comments

More from this show

SE Radio 730: Birgitta Boeckeler on Harness Engineering for AI Agents

SE Radio 729: Garth Mollett on AI Supply Chain Security

SE Radio 728: Clare Liguori on the AWS Strands SDK for AI Agents

Menu

Recent posts