Venue: Internet
Fangjin Yang, creator of the Druid real-time analytical database, talks with Robert Blumen. They discuss the OLAP (online analytical processing) domain, OLAP concepts (hypercube, dimension, metric, and pivot), types of OLAP queries (roll-up, drill-down, and slicing and dicing), use cases for OLAP by organizations, the OLAP store’s position in the enterprise workflow, what “real time” means in the analytics context, and the relationship between real-time analytics and the Lambda architecture. They then move on to Druid itself: a high-level view, the challenges of implementing real-time analytics, and major application domains for Druid. They also discuss ad-tech and real-time analytics and real-time analytics as an operational tool. Yang then addresses Druid internals: node types, the special handling of the time dimension, the Druid query language, and the relationship between SQL and OLAP. Closing topics are the Druid open source project, community contributions, and the size and scale of some of the larger Druid clusters.
Show Notes
Related Links
- Fangjin Yang on Twitter: http://twitter.com/fangjin
- Introduction to OLAP http://www.dwreview.com/OLAP/Introduction_OLAP.html
- Druid http://druid.io
- The Druid project on Twitter https://twitter.com/druidio
- “Druid: A Real-Time Analytical Data Store” http://static.druid.io/docs/druid.pdf
- MetaMarkets—Introduction to Druid by Fangjin Yang https://www.youtube.com/watch?v=hgmxVPx4vVw
- The Strata talk, “Real Time Analytics with Open Source Technologies” https://www.youtube.com/watch?v=kJMYVpnW_AQ
Thanks a lot for the presentation. The Introduction to OLAP and Druid links dont work.
/V
It seems to work fine now: http://www.dwreview.com/OLAP/Introduction_OLAP.html
Perhaps it was some kind of temporary internet glitch or one of my colleagues already fixed it.