Channel: YMC » Data Mining
Browsing all 5 articles

Log Data Analysis – What is the Most Popular Apache Webserver Version?

We operate a Hadoop cluster for one of our customers, MeMo News AG. They monitor online news and social media as a service for their customers. Monitoring here is defined by retrieving various content...

Case Study: Retail WiFi Log-file Analysis with Hadoop and Impala, Part 1

This week we were inspired to do some research, driven by an idea: It must be possible to bring the concepts of tracking users in the online world to retail stores. We are not the experts in retail but...

Case Study: Retail WiFi Log-file Analysis with Hadoop and Impala, Part 2

Following on from Jean-Pierre’s introduction to this experiment in part 1, I will now expand on the technical details of the data ingestion process using Flume. As you can see in figure 2 from the...

Case Study: Retail WiFi Log-file Analysis with Hadoop and Impala, Part 3

In the previous article we described how to collect WiFi router logs with Flume to store in HDFS. This article will describe how we did the transformation, parsing, filtering and finally loading into...

Case Study: Retail WiFi Log-file Analysis with Hadoop and Impala, Part 4

In the previous article we explained how to parse, transform and finally load data into Hive’s warehouse. Now it’s time to talk about querying the data. Before we start, here is how a sample of the...

