Big Data for Science Summer School July 26-30 2010

Abstract

Support Big Data workshop with over 200 attendees from 10 sites across the USA. This summer school has selected hands on session on MapReduce, Eucalyptus and Appliances plus many presentations from leading researchers in data intensive computing

Intellectual Merit

Data Intensive computing is rapidly growing in importance and this summer school will advance broad knowledge of this field. We will develop tutorials for Hadoop and Twister on FutureGrid that can be used by other activities. We will also use FutureGrid for introductory tutorial using Eucalyptus and demonstrations of Virtual Appliance technology.

In the “Big Data for Science” workshop, 200 students across 10 institutions (Arkansas High Performance Computing Center, University of Arkansas, Fayetteville; Electronic Visualization Laboratory, University of Illinois at Chicago; Indiana University, Bloomington; Institute for Digital Research and Education, University of California, Los Angeles; Michigan State University, East Lansing; Pennsylvania State University, University Park; University of Iowa, Iowa City; University of Minnesota Supercomputing Institute, Minneapolis; University of Notre Dame, Notre Dame, Indiana; and University of Texas at El Paso). Students in the workshop used FutureGrid in hands-on activities that covered, among others, Hadoop/MapReduce, Twister, Grid Appliance, and GroupVPN.

Broader Impact

This project involves many students who will be able to get an understanding of modern data intensive computing

Use of FutureGrid

Support hands on laboratory sessions during Big Data summmer school

Scale Of Use

Essentially all nodes on Sierra and India needed

Publications


Results

The workshop was successfully delivered with good reviews. In this “Big Data for Science” workshop, over 200 students across 10 institutions (Arkansas High Performance Computing Center, University of Arkansas, Fayetteville; Electronic Visualization Laboratory, University of Illinois at Chicago; Indiana University, Bloomington; Institute for Digital Research and Education, University of California, Los Angeles;  Michigan State University, East Lansing; Pennsylvania State University, University Park; University of Iowa, Iowa City; University of Minnesota Supercomputing Institute, Minneapolis; University of Notre Dame, Notre Dame, Indiana; and University of Texas at El Paso). Additionally 100 additional students attended via streaming video. Students in the workshop used FutureGrid in hands-on activities that covered, among others, Hadoop/MapReduce, Twister, Grid Appliance, and GroupVPN.
See http://salsahpc.indiana.edu/tutorial/index.html

FG-70
Judy Qiu
Indiana University
Closed

Timeline

4 years 10 weeks ago
1 year 13 weeks ago