Hierachical Multidimensional Scaling process Massive Metageonomics data

Abstract

Using various algorithms to process massive metagenomics data through the Twister pipeline.

Intellectual Merit

Advanced Multidimensional Scaling interpolation algorithm which makes clustering dozens of millions sequence possible

Broader Impact

Introduced a new way to do the clustering for metagenomics data

Use of FutureGrid

User FutureGrid resources to do the large data processing

Scale Of Use

dozens of nodes which can be reserved for a few days now and then

FG-114
Yang Ruan
Indiana University
Active

Timeline

3 years 21 weeks ago