1. Aaron Beppu - Profiling and performance-tuning your Hadoop pipelines


    from newthinking / Added

    191 Plays / / 0 Comments

    In the Hadoop ecosystem, there are now several tools which allow developers to quickly produce pipelines of MapReduce jobs without descending to the verbose level of the Java MapReduce apis. Unfortunately, these concise, higher-level tools often produce pipelines which are initially slow, and difficult to optimize. This talk will describe Etsy's pipeline of hundreds of Cascading flows (and thousands of daily Hadoop jobs), and our approach to profiling and performance-tuning them. More info: http://berlinbuzzwords.de/sessions/profiling-and-performance-tuning-your-hadoop-pipelines

    + More details
    • A Billion Records a Month Isn't Exactly a


      from Wes Hunt / Added

      17 Plays / / 0 Comments

      Colt and Leif will present a high-level overview of a production big data system that uses Hadoop MapReduce, HDFS, and HBase. As well as describe how to use those technologies to create a horizontally scalable system to continually process lots of data and make it available for nearly instant access.

      + More details
      • A billion records a month isn't exactly a lot of data these days


        from Wes Hunt / Added

        26 Plays / / 0 Comments

        For our main February meetup Colt and Leif presented a high-level overview of a production bigdata system that uses Hadoop MapReduce, HDFS, and HBase. As well as described how to use those technologies to create a horizontally scalable system to continually process lots of data and make it available for nearly instant access. www.montanaprogrammers.org

        + More details
        • About Scality RING


          from Scality / Added

          27 Plays / / 0 Comments

          The Scality story. About Scality Scality is an industry leader in petabyte-scale storage. Scality’s RING software offers an award winning, scale-out storage solution that operates seamlessly on any commodity server hardware. It provides outstanding scalability and data persistence, while the end-to-end parallel architecture provides unsurpassed performance. Delivering billions of files, to tens of millions of users each day with 100% availability. Scality’s impressive customer list includes telcos, media companies, web 2.0, government and more. scality.com or follow @Scality on Twitter.

          + More details
          • Accelerating Hadoop with In-Memory Computing


            from GridGain Systems / Added

            136 Plays / / 0 Comments

            Dmitriy Setrakyan, GridGain’s co-founder and CTO, will walk through the technology and use cases behind in-memory accelerator for Hadoop. GridGain’s In-Memory Accelerator for Hadoop is based upon the industry’s first dual-mode, high-performance in-memory file system that is 100% compatible with Hadoop HDFS – and an in-memory MapReduce implementation. In-memory HDFS and in-memory MapReduce provide easy to use extension to disk-based HDFS and traditional MapReduce delivering up to 100x faster performance.

            + More details
            • Actian Corporation - Accelerating Big Data 2.0


              from Vladimir Perlovich / Added

              2 Plays / / 0 Comments

              The world is generating more data than ever before and creating new opportunities to innovate but until now we haven't all had the same ability to exploit that big data. Learn how Actian is accelerating big data 2.0 and bringing big data to the rest of us.

              + More details
              • Actian Customer Success Story - Express Analytics


                from Actian Corporation / Added

                3 Plays / / 0 Comments

                + More details
                • Actian's CTO, Mike Hoskins - Welcome to the Age of Data


                  from Vladimir Perlovich / Added

                  9 Plays / / 0 Comments

                  our every digital interaction with the world generates data, more data than ever before. Finding meaning in these volumes of data will transform the way we live, the way we work, and the way we play, now and in the future.

                  + More details
                  • Actian SQL on Hadoop


                    from Actian Corporation / Added

                    2 Plays / / 0 Comments

                    Transform Hadoop from a data lake into a high performance, fully functional analytics platform with the Actian Analytics Platform -- Hadoop SQL Edition. The platform is the first end-to-end analytics platform to run 100% natively in Hadoop. Run sophisticated data science analytics natively in Hadoop up to 30 times faster. Give business users interactive SQL access to Hadoop data with the highest performing SQL in Hadoop capability.

                    + More details
                    • Adatao In 90 Seconds


                      from Adatao / Added

                      1,255 Plays / / 0 Comments

                      Visual, Real-Time, Predictive Analytics for Business and Data Science, on One Unified Big-Data Solution

                      + More details

                      What are Tags?


                      Tags are keywords that describe videos. For example, a video of your Hawaiian vacation might be tagged with "Hawaii," "beach," "surfing," and "sunburn."