What is the broad outlines of how data mining / quant finance / etc done?

wikipedia

dataset gathering

Data cleanup / [[Data integration|integration]]

techniques

quant-specific stuff

lower level

software