Real-time applications have long been considered off-limits for Hadoop clusters. This talk will show how latencies of less than 5 seconds can be achieved for a sample metrics application while still retaining years of data with high availability and durability. This is done using a hybrid system consisting of Storm plus Hadoop that cooperate to do something neither can do alone.

This talk will provide a theory of operations, systems description and possibly a demo of a live system. All code will be made available on GitHub.


Speaker: Ted Dunning

Ted Dunning has been involved with a number of startups, with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.

Loading more stuff…

Hmm…it looks like things are taking a while to load. Try again?

Loading videos…