Collaborative Research: North East Cyberinfrastructure Consortium

Project Information

Discipline
Biology (603) 
Orientation
Research 
Abstract

Under the North East Cyberinfrastructure Consortium, the bioinformatics cores of the five partner states have formed a virtual organization, the North East Bioinformatics Collaborative (NEBC), to develop collaborative activities such as shared workflows and promote the development of protocols for a new Shared Data Center for the movement, life cycle management, storage and recovery of data that are simultaneously viewed/analyzed/worked on by multiple users across the region. We have implemented the Shared Data Center in a cloud infrastructure (Amazon) and have begun developing on-demand, cloud enabled workflows. We would like to extned this work to encompass directly NSF resources such as FutureGrid.

Intellectual Merit

The NEBC is currently carrying out two large scale collaborative research projects under the NECC to develop expertise and experience in the implementation of large scale collaborative research projects. These research projects, the sequencing of the Little Skate Genome and the Metagenomics of Alagal Blooms, provide a foundation for a variety of other research projects across the NECC region.

Broader Impacts

The current implementation and support of the NECC Shared Data Center has been leveraged to secure NSF funding for a large scale metagenetics projects. We anticipate continued development of data management and workflow processes within the NECC will provide the necessary cyberinfrastructure for other large scale collaborative projects.

Project Contact

Project Lead
James Vincent (jjv5) 
Project Manager
James Vincent (jjv5) 

Resource Requirements

Hardware System
  • Not sure
 
Use of FutureGrid

"I plan to use Futuregrid to develop on-demand workflows for processing massively parallel sequencing data. A five state consortium (NECC) maintains a shared data center for MPS data and other projects at a utility computing provider (Amazon Web Services). We have begun implementing on demand workflows there but would like to utilize NSF resources directly for this purpose. "

Scale of Use

A small number of VMs (8-32) to develop and test with.

Project Timeline

Submitted
10/22/2010 - 03:30