Big Data Analysis
(For more resources related to this topic, see here.)
Counting distinct IPs in weblog data using MapReduce and Combiners
This recipe will walk you through creating...
Comparative Study of NoSQL Products
(For more resources related to this topic, see here.)
Comparison
Choosing a technology does not merely involve a technical comparison. Several other factors related to documentation,...
Advanced Hadoop MapReduce Administration
(For more resources related to this topic, see here.)
Tuning Hadoop configurations for cluster deployments
Getting ready
Shut down the Hadoop cluster if it is already running,...
Line, Area, and Scatter Charts
(For more resources related to this topic, see here.)
Introducing line charts
First let's start with a single series line chart. We will use one of...
Obtaining a binary backup
Getting ready
Next we need to modify the postgresql.conf file for our database to run in the proper mode for this type of backup. Change...
Ease the Chaos with Automated Patching
(For more resources related to this topic, see here.)
We have seen how the provisioning capabilities of the Oracle Enterprise Manager's Database Lifecycle Management (DBLM)...
Follow the Money
(For more resources related to this topic, see here.)
It starts with the Cost Worksheet
In PCM, the Cost Worksheet is the common element for all...
Generating Reports in Notebooks in RStudio
(For more resources related to this topic, see here.)
A very important feature of reproducible science is generating reports. The main idea of automatic report...
Creating the first Circos diagram
(For more resources related to this topic, see here.)
Getting ready
Let's start with the simple task of graphing a relationship between a student's eye and...
Extending Your Structure and Search
(For more resources related to this topic, see here.)
Indexing data that is not flat
Not all data is flat. Of course if we are building...