As the complexity of data based products increase, along with the need to make smart, data driven decision – the amount of data we need to collect increase rapidly. This is true for business intelligence solutions, but also for much simpler data driven products.
We have started to examine big data solutions a while ago, for example IBM’s Netezza and apache’s hadoop, but using such solutions has its price, and you usually trade-off scalability with ease of use.
Here is a very friendly article explaining about hadoop -what it is, and its design principles. The article is relatively simple and clear, and I think it is worth reading, and getting to know this technology, which we will surely encounter if not sooner then later.
ReadWriteWeb: Hadoop: What It Is And How It Works: http://readwrite.com/2013/05/23/hadoop-what-it-is-and-how-it-works
After you read this, check out Intel’s distribution of Hadoop: http://hadoop.intel.com/