The Collector - A Tool to Have Multi-Writer Appends into HDFS
by Bryan Duxbury, Software Engineer at Rapleaf
Bryan covers The Collector, a tool built by Rapleaf that facilitates multi-writer appends into Hadoop Distributed Filesystem. This talk details why this is an important workflow component, along with the performance characteristics and some gotchas surrounding the implementation of such a system.
Link to the presentation: docs.google.com/Present?docid=dgz78tv5_10gpjhnvg9