Hachoir version 0.1 was the first prototype and has been replaced by wiki:hachoir-core and wiki:hachoir-parser. It has a nice GTK user interface and can parse approxamitivly 30 file formats.

Features

  • GTK user interface ;
  • Support stream decompression ;
  • Read and process on demand ;
  • Filters are recursive ;
  • Automatic choice of the filter using MIME type detection ;
  • It's easy to write new filter (it's one of the project goals) ;
  • It's possible to dump field content into a file ;
  • Scriptable with Python (command line option --script).

See the parser list (supported file format).

Download using SubVersion

svn co http://hachoir.org/svn/tags/old-hachoir/0.1

You can also browse the source code

Depends on

  • Python 2.4
  • pyGTK 2.0 and pyglade
  • magic module for Python (optionnal, internal fallback does exist):
    • If you don't have it, download magic.so.bz2 (decompress it in the same directory than Hachoir)
    • Comes from file project (see python sub-directory in source code)
    • Debian package: python-magic

Instructions for Linux distributions

  • Gentoo: emerge pytgtk pyxml
    • Moreover, for libmagic : use « USE="python" », and install sys-apps/file
  • Ubuntu: apt-get install python-gtk2 python-glade2 python-xml
  • Debian: apt-get install python-gtk2 python-magic python-xml

Under Windows

Stream types

  • File (FileStream)
  • Binary string (StringStream)
  • Part of another stream (LimitedStream)
  • Decompression deflate (DeflateStream), use module zlib of Python
  • Decompression gzip (GunzipStream), use module gzip of Python
  • Decompression bzip2, use module bz2 of Python
  • Decoder base64, use module base64 of Python