In the last two posts, we explained how to read data streaming from Twitter into Apache...
Author - Walker Rowe
Reading Streaming Twitter feeds into Apache Spark
In part 1 of this blog post we explained how to read Tweets streaming off Twitter into Apache Kafka...
Working with Streaming Twitter Data Using Kafka
Here we show how to read messages streaming from Twitter and store them in Kafka. In Part 2 we...
Using Zeppelin with Big Data
Zeppelin is an interactive notebook. It lets you write code into a web page, execute it, and...
Spark Decision Tree Classifier
Here we explain how to use the Decision Tree Classifier with Apache Spark ML (machine learning). We...
Using Logistic Regression, Scala, and Spark
Here we explain how to do logistic regression with Apache Spark. Logistic regression (LR) is...
SGD Linear Regression Example with Apache Spark
This article explains how to do linear regression with Apache Spark. It assumes you have some basic...
Hadoop Interview Questions
Q: Is Hadoop a database? A: No. Hadoop is a write-once file system. But...
Hadoop Clusters: An Introduction
Hadoop clusters 101. In talking about Hadoop clusters, we first need to define two terms: cluster...
An Introduction to Hadoop Administration
Here we explain some of the most common Hadoop administrative tasks. There are many, so we only...