Data focused computing involves many stages: exploration, visualization, production mode computing, collaboration, debugging, development, presentation and publication. The IPython Notebook is a web based interactive computing environment that can carry the data scientist through all of these stages. The Notebook enables users to build documents that combine live, runnable code with text, LaTeX formulas, images and videos. These documents are version controllable/sharable and preserve a full record of a computation, its results and accompanying material. In this talk I will introduce the Notebook, show how to configure and run it, illustrate its main features and discuss its future.
Introduction to business intelligence, data warehousing and online analytical processing with Cubes. Cubes is a lightweight Python framework and OLAP server that provides business point of view modeling for multidimensional data analysis.
Python's use in analytical settings is well-established and impressive. Most of the discussion though is confined to a few settings: web; finance; the sciences. In this talk, I'll share some of the things I have learned from bringing Python into traditional business groups, pitfalls to avoid, and how to shine if you are a Pythonista looking for a career in the rapidly growing job role of Data Scientist. Along the way I'll share examples of how large scale statistical analyses are used in retail marketing.