Big Data News

Get the latest Big Data news, insights and updates from the Packt Hub. We’ll help you get to grips with NoSQL databases like MongoDB. With the vast quantities of data that businesses now store, it’s becoming increasingly important to manage that data in logical and secure ways. The Hub therefore provides news and insights on technologies like Hadoop and AWS to help keep you up to date.

Visualization of Big Data

Data visualization is nothing but a representation of your data in graphical form. It is...

Processing Tweets with Apache Hive

Extracting hashtags: in this part and the following one, we'll see how to extract data efficiently from...

The EMR Architecture

This article is written by Amarkant Singh and Vijay Rayapati, the authors of Learning Big Data with Amazon Elastic MapReduce. The goal of this...

Apache Solr and Big Data – integration with MongoDB

In this article by Hrishikesh Vijay Karambelkar, author of the book Scaling Big Data with Hadoop and Solr - Second Edition, we will go...

Identifying Big Data Evidence in Hadoop

In this article by Joe Sremack, author of the book Big Data Forensics, we will cover the following topics: An overview of how to identify...

Data Governance in a Data Lake

In this article by Pradeep Pasupuleti and Beulah Salome Purra, authors of the book Data Lake Development with Big Data, we will see the...

Spark – Architecture and First Program

In this article by Sumit Gupta and Shilpi Saxena, the authors of Real-Time Big Data Analytics, we will discuss the architecture of Spark and...

Getting Started with Apache Hadoop and Apache Spark

In this article by Venkat Ankam, author of the book Big Data Analytics with Spark and Hadoop, we will understand the features of Hadoop...

Introduction to R Programming Language and Statistical Environment

In this article by Simon Walkowiak, author of the book Big Data Analytics with R, we will have the opportunity to learn some of the most important...

Context – Understanding your Data using R

In this article by James D Miller, the author of the book Big Data Visualization, we will explore the idea of adding context to the...

Must Read in Cloud & Networking

ServiceNow Partners with IBM on AIOps from

ServiceNow and IBM this week announced that the Watson artificial intelligence for IT operations (AIOps) platform from IBM will be integrated with the IT...

Must Read in Data

Distributed training in TensorFlow 2.x

TensorFlow 2 is a rich development ecosystem composed of two main parts: Training and Serving. Training consists of a set of libraries for dealing...

How to Create Tensors in PyTorch