Course: CIIC 8995: Mining Massive Datasets
Project Information
- Discipline
- Computer Science (401)
- Subdiscipline
- 11.01 Computer and Information Sciences, General
- Orientation
- Education
This course provides a introduction to the analysis of massive datasets using Hadoop and MapReduce. The emphasis in the use of distributed computation paradigms to carry out algorithms for massive datasets.
Intellectual MeritIn this course, students will be exposed to the state-of-the-art computation models for analyzing massive datasets. Massive datasets are appearing now very often in the scientific community.
Broader ImpactsThis project will provide research and educational opportunities to graduate students. Software, curriculum materials and publications attained under this project will be distributed to the scientific community as free, open source materials.
Project Contact
- Project Lead
- Edgar Acuna (eacunaf)
- Project Manager
- Edgar Acuna (eacunaf)
Resource Requirements
- Hardware Systems
-
- alamo (Dell optiplex at TACC)
- hotel (IBM iDataPlex at U Chicago)
- india (IBM iDataPlex at IU)
- sierra (IBM iDataPlex at SDSC)
We will use the Future Grid resources as testbed for projects given in the course
Scale of UseThe class size will be about 10 students
Project Timeline
- Submitted
- 09/03/2013 - 14:01