Real-time applications have long been considered off-limits for Hadoop clusters. This talk will show how latencies of less than 5 seconds can be achieved for a sample metrics application while still retaining years of data with high availability and durability. This is done using a hybrid system consisting of Storm plus Hadoop that cooperate to do something neither can do alone.
This talk will provide a theory of operations, systems description and possibly a demo of a live system. All code will be made available on GitHub.
Speaker: Ted Dunning
Ted Dunning has been involved with a number of startups, with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.