Obviously, there’s a huge amount of interest and use around Hadoop for processing large amounts of data given its scalability and cost/performance. Its great for a lot of Big Data needs but it was never designed to replace RDBMS systems. Hadoop is batch oriented, it doesn’t supports queries and its environment can best be described as “ideal” for programmers but abysmal for business users. Additional technologies such as Hive and Pig are helping but there is still a long way to go.
This session will outline the advantages and challenges of using Hadoop, both from a technology and user perspective. The session will discuss what additional approaches and technologies are needed to leverage Hadoops scalability and cost/performance and will share experiences from more than 5 years building Hadoop based platforms for Startups.