This is a demonstration showing the Cloud Stenography web application being used to control a Hadoop cluster via Apache Pig - to do analytics on Apache web server logs.
In this case, Pig 0.3.0 is being run in local mode. In the first test I simply load 10, then 100 records. In the next test I create a dataflow to give me the top 25 referring urls, with hit counts for each. This data is sent to Excel for additional analysis.
Loading more stuff…
Hmm…it looks like things are taking a while to load. Try again?