1. In the previous episode (vimeo.com/73849021), we saw how to transfer some file data into Hadoop. To query the data easily, the next step is to create some Hive tables. This will enable quick interaction with high-level languages like SQL and Pig.

    We experiment with the SQL queries, then parameterize them and insert them into a workflow so that they run together in parallel. Including Hive queries in an Oozie workflow is a pretty common use case with recurring pitfalls, as seen on the user group. We can do it with Hue in a few clicks.

    More info here: gethue.tumblr.com/post/60937985689/hadoop-tutorials-ii-2-execute-hive-queries-and


    # vimeo.com/74215175 Uploaded 10.9K Plays / 0 Comments
  2. Slides can be found here: slideshare.net/PyData/pydata-talk

    In this talk I'll show how a number of tools from the pandas library can be used to quickly wrangle raw data into shape for analysis. Techniques for structured and semi-structured data manipulation, cleaning and preparation, reshaping, and other common tasks will be the main focus.

    # vimeo.com/63295598 Uploaded 1,414 Plays / 1 Comment
  3. Michael Becker

    # vimeo.com/73628112 Uploaded 594 Plays / 0 Comments
  4. In this episode, Pedro shows how to serve your Node.js app using Nginx.

    # vimeo.com/25731567 Uploaded 6,747 Plays / 1 Comment
  5. Sandro Hawke, June 8, 2010 - MIT Cambridge, MA
    World Wide Web Consortium w3.org

    Although the first Semantic Web standards are more than ten years old, only recently have we begun to actually see machines sharing data on the Web. The key turning point was the acceptance of the core Linked Data principle, that object identifiers should also work with Web protocols to access useful information. This talk will cover the basic concepts and techniques of publishing and using Linked Data, assuming some familiarity with programming and the Web. No prior knowledge of Semantic Web technologies is required.

    Slides: files.meetup.com/1336198/LinkedDataPresentation-SandroHawke.pdf

    # vimeo.com/12444260 Uploaded 1,917 Plays / 2 Comments
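
The Linked Data principle described in item 5, that an object identifier should also work with Web protocols to return useful information, can be illustrated with a minimal sketch. It is not taken from the talk: the identifier `http://example.org/id/alice` and the helper `linked_data_request` are hypothetical, and asking for `text/turtle` through the `Accept` header is just one common convention, assumed here for illustration. The request is built but deliberately not sent.

```python
from urllib.request import Request


def linked_data_request(identifier: str, media_type: str = "text/turtle") -> Request:
    """Build (but do not send) an HTTP GET request for data about `identifier`.

    The identifier doubles as the address to fetch; the Accept header asks
    the server for a machine-readable representation rather than HTML.
    """
    return Request(identifier, headers={"Accept": media_type}, method="GET")


# Hypothetical identifier, used only for illustration.
req = linked_data_request("http://example.org/id/alice")
print(req.get_method(), req.full_url)
print("Accept:", req.get_header("Accept"))
```

Passing `req` to `urllib.request.urlopen` would actually perform the lookup; whether a given server returns structured data for a given media type depends on how its publisher has set it up.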

Dev Stuff

Colin Bielen

Development Bookmarks
