Categories
libextractor
'libextractor' extracts meta-data from files of arbitrary type. It uses helper-libraries to perform the actual extraction, and is trivially extendable by linking against external extractors for additional file types. Its goal is to provide developers of file-sharing networks, file managers, and WWW-indexing bots with a universal library to obtain meta-data about files.
'libextractor' includes the command "extract" that can extract meta-data from a file and print the results to stdout. Currently, it supports the formats HTML, PDF, PS, OLE2 (doc, xls, ppt), StarOffice, OpenOffice, MAN, DVI, MP3 (ID3v1, ID3v2), OGG, WAV, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, Real, QT, MPEG, RIFF (AVI), ASF, and ELF. It also detects various MIME types, and can compute hash functions (SHA-1, MD5, ripemd160). A Java binding (JNI) is available.
Last updated 14 Aug, 2005
About
Leadership
- Christian Grothoff - Maintainer
Requirements
- zlib (Use Requirement)
- libvorbis (Weak Prerequisite)
Related Projects
Versions
0.5.3
0.5.3 stable released 2005-08-14
- Released: 14 Aug, 2005
- Code Maturity: Stable
- Source Archive: http://gnunet.org/libextractor/download/libextr...
- Licenses: GPLv2orlater
- Interfaces: Library




