Hierachical Multidimensional Scaling process Massive Metageonomics data

Abstract

Using various algorithms to process massive metagenomics data through the Twister pipeline.

Advanced Multidimensional Scaling interpolation algorithm which makes clustering dozens of millions sequence possible

Introduced a new way to do the clustering for metagenomics data

User FutureGrid resources to do the large data processing

dozens of nodes which can be reserved for a few days now and then

Project Number: FG-114

Project Lead: Yang Ruan

Institution: Indiana University

Project Status: Active

Updated: 3 years 21 weeks ago