paperlined.org
document updated 17 years ago, on Apr 18, 2007

Good tools

Advice

Compressed-file indexing

Compressed files have two benefits: 1) they save disk space, and 2) when there's plenty of CPU to spare, they can be read faster, because disk I/O is usually the tighter constraint and there are fewer bytes to pull off the disk.
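As a rough back-of-the-envelope check on the second benefit, here is a minimal sketch of the trade-off. The function name, its parameters, and the assumption that disk reads and decompression overlap (so the slower stage sets the pace) are all mine, not measurements of any real system.

```python
def sequential_read_seconds(raw_bytes, disk_bytes_per_s,
                            ratio=1.0, decompress_bytes_per_s=None):
    """Estimate seconds to read raw_bytes of data sequentially.

    ratio is compressed size / raw size (1.0 means stored uncompressed).
    Assumption: disk reads and decompression run in parallel, so the
    slower of the two stages determines the total time.
    """
    disk_time = raw_bytes * ratio / disk_bytes_per_s
    if decompress_bytes_per_s is None:
        return disk_time  # uncompressed: only the disk matters
    cpu_time = raw_bytes / decompress_bytes_per_s
    return max(disk_time, cpu_time)
```

With made-up numbers: 1000 bytes at 100 bytes/s takes 10 s uncompressed. Stored at a 0.25 ratio with a decompressor that emits 1000 bytes/s, the disk stage takes 2.5 s and the CPU stage 1 s, so the read takes about 2.5 s. If the decompressor only manages 200 bytes/s, the CPU becomes the limit at 5 s, which is still faster than the uncompressed read in this example.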

However, I think random access might undermine the second benefit, especially when you end up needing to read a fair bit of extra data before and after the desired data because the file is compressed in blocks. So if you regularly access a file randomly and have enough spare disk space, maybe it's easier to just index it uncompressed?
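To make the block overhead concrete, here is a minimal sketch of an indexed, block-compressed file held in memory. It assumes independent zlib blocks of a fixed uncompressed size; the 64 KiB block size, the index layout, and the function names are hypothetical choices for illustration, not how any particular dump format works.

```python
import bisect
import zlib

BLOCK_SIZE = 64 * 1024  # hypothetical uncompressed block size


def compress_in_blocks(data, block_size=BLOCK_SIZE):
    """Compress data as independent zlib blocks.

    Returns (compressed_bytes, index), where index is a list of
    (uncompressed_start, compressed_start, compressed_length) tuples.
    """
    out = bytearray()
    index = []
    for start in range(0, len(data), block_size):
        block = zlib.compress(data[start:start + block_size])
        index.append((start, len(out), len(block)))
        out += block
    return bytes(out), index


def read_range(compressed, index, offset, length):
    """Return (data[offset:offset+length], bytes_decoded).

    Only the blocks overlapping the requested range are decompressed,
    but each of those blocks has to be decompressed in full.
    """
    if not index:
        return b"", 0
    starts = [entry[0] for entry in index]
    # Find the last block whose uncompressed start is <= offset.
    first = max(bisect.bisect_right(starts, offset) - 1, 0)
    end = offset + length
    pieces = []
    i = first
    while i < len(index) and index[i][0] < end:
        _, c_start, c_len = index[i]
        pieces.append(zlib.decompress(compressed[c_start:c_start + c_len]))
        i += 1
    decoded = b"".join(pieces)
    skip = offset - index[first][0]
    return decoded[skip:skip + length], len(decoded)
```

Asking for 100 bytes in the middle of a block means decoding the whole 64 KiB block, and a 20-byte read that straddles a block boundary decodes two blocks (128 KiB). With an uncompressed file, the same lookup is a seek to the offset followed by a read of just the bytes wanted, which is the trade-off being weighed here.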