document updated 11 years ago, on Mar 28, 2013
What are some utilities available if you want to grep through .doc, .xlsx, .zip, etc files?
all-in-one solutions
- Recoll
(Xapian.org)
- Docfetcher
- Google Desktop
- Strigi
- Terrier Search Engine
- PowerGrep
- $160, 30 day trial period
- more
tools that extract the raw text from .docx/.xlsx/etc
- antiword
- antiword-xp-rb will handle .docx
- Apache POI
tools that search inside .tar/.zip/etc