Apache Storm and Samza

Project Information

Discipline
Computer Science (401) 
Orientation
Research 
Abstract

There are Distributed Stream Processing engines like Apache Storm, Apache S4 and Apache Samza created for processing large amounts of streaming data. Sensors are a source of large amounts of stream data generation. We would like to study and use the capabilities of the Dsitrbuted Stream processing systems in regards to Sensor Data Processing. 

Intellectual Merit

We will look at running Distributed Stream Processing engines on FutureGrid nodes and develop stream processing algorithms for sensor data processing.

Broader Impacts

Contribute to Open Source projects in the Apache Big Data Analytic stack.

Project Contact

Project Lead
Supun Kamburugamuva (skamburu) 
Project Manager
Supun Kamburugamuva (skamburu) 
Project Members
Oliver Lewis, Leif Christiansen  

Resource Requirements

Hardware System
  • I don't care (what I really need is a software environment and I don't care where it runs)
 
Use of FutureGrid

Running the Distributed Stream Engines.

Scale of Use

About 3 nodes

Project Timeline

Submitted
01/24/2014 - 15:20