FG-114
Hierarchical Multidimensional Scaling to Process Massive Metagenomics Data
[ruan2012dacidr] Ruan, Y., S. Ekanayake, M. Rho, H. Tang, S.-H. Bae, J. Qiu, and G. Fox,
"DACIDR: deterministic annealed clustering with interpolative dimension reduction using a large collection of 16S rRNA sequences",
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM, pp. 329–336, 2012.
Project Details
- Project Lead: Yang Ruan
- Project Manager: Yang Ruan
- Institution: Indiana University, Pervasive Technology Institute
- Discipline: Computer Science (401)
Abstract
This project uses various algorithms to process massive metagenomics data through the Twister pipeline.
Intellectual Merit
An advanced multidimensional scaling (MDS) interpolation algorithm that makes it possible to cluster tens of millions of sequences.
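To illustrate what MDS interpolation means in general, the sketch below places one new (out-of-sample) point among neighbours that already have low-dimensional coordinates, so a full MDS run is needed only for a small sample. It uses a standard majorization-style update for a single point. The function name `interpolate_point`, its parameters, and the toy data are assumptions made for illustration, not the project's actual code or the Twister API.

```python
import math


def interpolate_point(neighbor_coords, target_dists, iters=100, eps=1e-9):
    """Place one out-of-sample point among already-embedded neighbours.

    neighbor_coords: low-dimensional coordinates of the k nearest
                     in-sample points (hypothetical input format).
    target_dists:    original-space dissimilarities from the new point
                     to each of those neighbours.
    Returns the interpolated low-dimensional coordinates.
    """
    k = len(neighbor_coords)
    dim = len(neighbor_coords[0])
    centroid = [sum(p[a] for p in neighbor_coords) / k for a in range(dim)]

    # Start slightly off the centroid so no direction vector is exactly zero.
    x = [c + 1e-3 * (a + 1) for a, c in enumerate(centroid)]

    for _ in range(iters):
        # Majorization update:
        #   x_new = centroid + (1/k) * sum_i (delta_i / d_i) * (x - p_i)
        # where d_i is the current embedded distance from x to p_i.
        new_x = centroid[:]
        for p, delta in zip(neighbor_coords, target_dists):
            d = math.dist(x, p)
            if d < eps:
                continue  # skip a coincident neighbour to avoid dividing by ~0
            scale = delta / (k * d)
            for a in range(dim):
                new_x[a] += scale * (x[a] - p[a])
        x = new_x
    return x


if __name__ == "__main__":
    # Toy data: three embedded neighbours and the distances a new point
    # at (1, 1) would have to them.
    anchors = [(0.0, 0.0), (4.0, 0.0), (0.0, 3.0)]
    dists = [math.sqrt(2), math.sqrt(10), math.sqrt(5)]
    print(interpolate_point(anchors, dists))  # should land close to (1, 1)
```

Each new point depends only on its own neighbours, so points can be interpolated independently of one another, which is what makes this style of approach attractive for very large sequence collections.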
Broader Impacts
Introduces a new way to cluster metagenomics data.
Scale of Use
Dozens of nodes, reserved occasionally for a few days at a time.