CLUTO is a software package for clustering low- and high-dimensional datasets and for analyzing the characteristics of the various clusters. CLUTO is well-suited for clustering data sets arising in many diverse application areas including information retrieval, customer purchasing transactions, web, GIS, science, and biology.
CLUTO's distribution consists of both stand-alone programs (
vcluster) and a library (
libcluto.a) via which an application program can access directly the various clustering and analysis algorithms implemented in CLUTO.
A simple perl script called
doc2mat that convertes a set of documents into the vector-space format used by CLUTO is also provided.
cluto-2.1.2a: latest stable release, includes the Perl script
$ module add cluto-2.1.2a $ doc2mat [options] doc-file mat-file $ vcluster [optional parameters] MatrixFile NClusters $ scluster [optional parameters] GraphFile NClusters
- CLUTO docs: provider's webpage or locally within the program directory (
doc2matdocs: provider's webpage or locally within the program directory (
Freely available for educational and research purposes by non-profit institution.
Tom Rebok email@example.com