It is possible to do cluster analysis on text files. (in fact, anytime you can caclulate some metric between elements in a set, you can cluster that set)

modules that do both metric+cluster

metric modules, when doing metric+cluster

clustering modules, when doing metric+cluster

similar fields

How do we do text-file clustering, but FASTER? Random ideas: