CLUTO is a software package for clustering low- and high-dimensional datasets and for analyzing the characteristics of the resulting clusters. It is well suited to datasets from many application areas, including information retrieval, customer purchasing transactions, the web, GIS, science, and biology.

CLUTO's distribution consists of both stand-alone programs (scluster and vcluster) and a library (libcluto.a) through which an application program can directly access the clustering and analysis algorithms implemented in CLUTO.

A simple Perl script called doc2mat, which converts a set of documents into the vector-space format used by CLUTO, is also provided.


Module cluto-2.1.2a: latest stable release, includes the Perl script doc2mat


$ module add cluto-2.1.2a
$ doc2mat [options] doc-file mat-file
$ vcluster [optional parameters] MatrixFile NClusters
$ scluster [optional parameters] GraphFile NClusters
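
As a minimal sketch of the vcluster workflow, the commands below write a tiny matrix file by hand and cluster its rows into two groups. The file name toy.mat and its contents are made up for illustration. The layout shown (a header line with the number of rows, columns, and non-zero entries, followed by one line per row holding pairs of 1-based column index and value) is our reading of the sparse matrix format described in the CLUTO manual, so check manual.pdf before relying on it. The vcluster call is skipped if the module has not been loaded.

```shell
# Hypothetical 3 x 4 sparse matrix with 5 non-zero entries.
# Header: <rows> <columns> <non-zeros>
# Each following line: <column> <value> pairs for one row (columns count from 1).
cat > toy.mat <<'EOF'
3 4 5
1 1.0 3 2.0
2 1.5
1 0.5 4 3.0
EOF

# Run the clustering only when vcluster is on the PATH
# (i.e. after "module add cluto-2.1.2a").
if command -v vcluster >/dev/null 2>&1; then
    vcluster toy.mat 2
else
    echo "vcluster not found; load the cluto module first" >&2
fi
```

With default parameters, vcluster should write the cluster assignment of each row to a file named after the matrix file and the cluster count (here toy.mat.clustering.2); the manual lists the optional parameters that change this behavior.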


  • CLUTO docs: provider's webpage or locally within the program directory (/software/cluto-2.1.2a/doc/manual.pdf)
  • doc2mat docs: provider's webpage or locally within the program directory (/software/cluto-2.1.2a/doc/doc2mat.html)


Freely available to non-profit institutions for educational and research purposes.

