Course: Trial run - BUS S523 Large Scale Data Analysis

Project Information

Discipline
Social Sciences, n.e.c. (910) 
Orientation
Education 
Abstract

This project is for the graduate course BUS S-523. The course will cover how to use MapReduce to analyze data to gain insights for data not well suited to traditional RDBMS or datawarehouses.

Intellectual Merit

FutureGrid will provide students a hands-on environment to learn about MapReduce, including Hive and Pig. Activities students will complete will include in-class demonstrations, take home labs, homework, and potentially end of the semester group projects. These hands-on activities will allow students to engage in active learning while providing them with a better understanding of the class material.

Broader Impacts

At the completion of the project, the students will have a greater understanding of grid computing and its role in analyzing data.

Project Contact

Project Lead
Binny Samuel (bmsamuel) 
Project Manager
Binny Samuel (bmsamuel) 
Project Members
Meenakshi Kaleeswaran  

Resource Requirements

Hardware Systems
  • Not sure
  • I don't care (what I really need is a software environment and I don't care where it runs)
 
Use of FutureGrid

HPC (MyHadoop, SalsaHadoop)

Scale of Use

This is a trial run, I intend to use FutureGrid resources to teach a class on Hadoop (including Pig and Hive) to approximately 90 graduate (Masters) students at Indiana University in the Spring 2013 semester. I anticipate that each student will only need modest resources.

Project Timeline

Submitted
10/03/2012 - 13:51