Here we show how to retrieve data from ElasticSearch using Apache Pig. The reason for doing that is...
Author - Walker Rowe
Using ElasticSearch with Apache Spark
ElasticSearch is a JSON database popular with log processing systems. For example, organizations...
Using Spark with Hive
Here we explain how to use Apache Spark with Hive. That means instead of Hive storing data in...
Using Hive Advanced User Defined Functions with Generic and...
Previously we showed how to write user defined functions that can be called from Hive. You can write...
How to write a Hive User Defined Function (UDF) in Java
Here we show how to write user defined functions (UDFs) in Java and call them from Hive. You can...
What is Apache HCatalog? HCatalog Explained
Here we explain what HCatalog is and why it is useful to Hadoop programmers. Basically, HCatalog...
Apache Hive Beeline Client, Import CSV File into Hive
Beeline has replaced the Hive CLI in what was formerly called HiveServer1. Now Hive is called...
Graphing Spark Data with HighCharts
Here we look at how to use HighCharts with Spark. HighCharts is a charting framework written in...
Basics of Graphing Streaming Big Data
Imagine creating a live chart that updates as data flows in. With this you could watch currency...
K-means Clustering with Apache Spark
Here we show a simple example of how to use k-means clustering. We will look at crime statistics...