Aspiring Data Analyst, Meet Your New Best Friend: Excel
In general, people want to associate themselves with cool job titles and one that indirectly says both that you’re clever and you get paid...
Making the Most of Your Hadoop Data Lake, Part 2: Optimized File Formats
One major factor of making the conversion to Hadoop is the concept of the Data Lake. That idea suggests that users keep as much...
The Mysteries of Big Data and the Orient … DB
Mapping the world of big data must be a lot like demystifying the antiquated concept of the Orient, trying to decipher a mass of...
How to Build a Recommender by Running Mahout on Spark
Mahout on Spark: Recommenders
There are big changes happening in Apache Mahout. For several years it was the go-to machine learning library for Hadoop. It...
Python Data Stack
The Python programming language has grown significantly in popularity and importance, both as a general programming language and as one of the most advanced...
Top 5 NoSQL Databases
NoSQL has seen a sharp rise in both adoption and migration from the tried and tested relational database management systems. The open source world...
Reducing Cost in Big Data using Statistics and In-memory Technology – Part 1
The world is shifting from private, dedicated data centers to on-demand computing in the cloud. This shift moves the onus of cost from the...
Top 4 Business Intelligence Tools
With the boom of data analytics, Business Intelligence has taken something of a front stage in recent years, and as a result, a number...
Reducing Cost in Big Data using Statistics and In-memory Technology – Part 2
In the first part of this two-part blog series, we learned that using statistical algorithms gives us a 95 percent accuracy rate for big...
Big Data Is More Than Just a Buzz Word!
We all agree big data sounds cool (well I think it does!), but what is it?
Put simply, big data is the term used to...