Workshop: A Cloud View on Computing

Abstract

June 6-10, 2011

Cloud computing provides elastic compute and storage resources to solve data intensive science and engineering problems, but the number of students from under-represented universities who are involved and exposed to this area is minimal. In order to attract underserved students, we intend to train faculty members and graduate students from the Association of Computer/Information Sciences and Engineering Departments at Minority Institutions (ADMI) in the area of cloud computing through a one-week workshop conducted on the campus of Elizabeth City State University. This workshop will enable faculty members and graduate students from underserved institutions, who are involved with minority undergraduate students to gain information about various aspects of cloud computing while serving as a catalyst in propagating their knowledge to their students.

Intellectual Merit

The desired competencies for faulty and graduate students to acquire and/or refine in cloud computing
are:
• Understand and articulate the challenges associated with distributed solutions to large-scale problems,
e.g., scheduling, load balancing, fault tolerance, memory and bandwidth limitations, etc.
• Understand and explain the concepts behind MapReduce
• Understand and express well-known algorithms in the MapReduce framework.
• Understand and reason about engineering tradeoffs in alternative approaches to processing large
datasets.
• Understand how current solutions to the particular research problem can be cast into the MapReduce
framework.
• Explain the advantages in using a MapReduce framework over existing approaches.
• Articulate how adopting the MapReduce framework can potentially lead to advances in the state of
the art by enabling processing not possible before.

Broader Impact

The curricula and tutorials can be reused in other cloud computing educational activities

Use of FutureGrid

FutureGrid will serve as a platform for communicating concepts of cloud computing

Scale Of Use

15 generic users will need modest resources

Results

The hands-on workshop was June 6-10, 2011. Participants were immersed in a “MapReduce boot camp”, where ADMI faulty members sought introduction to the MapReduce programming framework. The following were themes for five boot camp sessions:

  • Introduction to parallel and distributed processing
  • From functional programming to MapReduce and the Google File System (GFS)
  • “Hello World” MapReduce Lab
  • Graph Algorithms with MapReduce
  • Information Retrieval with MapReduce

An overview of parallel and distributed processing provided a transition into the abstractions of functional programming, which introduces the context of MapReduce along with its distributed file system. Lectures focused on specific case studies of MapReduce, such as graph analysis and information retrieval. The workshop concluded with a programming exercise (PageRank or All-Pairs problem) to ensure faculty members have a substantial knowledge of MapReduce concepts and the Twister/Hadoop API.

For more information, please visit http://salsahpc.indiana.edu/admicloudyviewworkshop/

FG-120
Jerome Mitchell
PTI
Closed

Project Members

Candice Adams
Constance Bland
deshea simon
F Doswell
Mohammad Hasan
Natarajan Meghanathan
Timothy Holston
Willie Fuller
yenhung hu

Timeline

1 year 29 weeks ago
1 year 13 weeks ago