Cloud Technologies for Bioinformatics Applications

Project Information

Discipline
Computer Science (401) 
Orientation
Research 
Abstract

Test the performance variation of couple of Bioinformatics applications running on Apache Hadoop on Linux Virtual Machines and on Microsoft DryadLINQ on Windows HPCS cluster over multiple runs.

Intellectual Merit

Analyzing the performance and viability of cloud technologies to conduct bioinformatics data analyses.

Broader Impacts

We analyze the feasibility of cloud environments to conduct bioinformatics data analyses, providing recommendations and guidance for the bio-informatics scientists.

Project Contact

Project Lead
Thilina Gunarathne (thilina) 
Project Manager
Thilina Gunarathne (thilina) 
Project Members
Tak-Lon Wu  

Resource Requirements

Hardware Systems
  • india (IBM iDataPlex at IU)
  • sierra (IBM iDataPlex at SDSC)
 
Use of FutureGrid

We are going to test the performance variation of couple of Bioinformatics applications running on Apache Hadoop on Linux Virtual Machines and on Microsoft DryadLINQ on Windows HPCS cluster over multiple runs.

Scale of Use

"33 Nodes (8 cores per node) on the Windows HPCS 2008 cluster.33 Nodes (8 cores per node) Xen VM cluster with the instances running Linux with access to local disks and a shared file system."

Project Timeline

Submitted
12/02/2010 - 16:42