real-time analytics – DataScience Hacks

Real-time analytics using distributed computing system Storm: Part II

February 20, 2014October 10, 2014 Pavan3 Comments

How to setup a Storm cluster ? We will look at how to set up a single node cluster of storm project. The following are the prerequisites for setting up: Java 6 or above Python 2.6 Zookeeper ZeroMQ JZMQ any other dependencies (unzip, git, etc...) Zookeeper: Apache zookeeper project gives you a set of tools… Continue reading Real-time analytics using distributed computing system Storm: Part II

Real-time analytics using distributed computing system Storm: Part I

February 18, 2014October 10, 2014 Pavan1 Comment

We know that using data we can produce analytics and create reports that would help us in decision-making, the usual story. The constraint here is in order to create analytics reports right now, data should have been collected, cleansed and pre-processed earlier. Only after spending considerable amount of time doing these tasks, we can use… Continue reading Real-time analytics using distributed computing system Storm: Part I