'libextractor' extracts meta-data from files of arbitrary type. It uses helper libraries to perform the actual extraction, and it is easily extended to additional file types by linking against external extractors. Its goal is to provide developers of file-sharing networks, file managers, and WWW-indexing bots with a universal library for obtaining meta-data about files. 'libextractor' includes the command "extract", which extracts meta-data from a file and prints the results to stdout. Currently, it supports the formats HTML, PDF, PS, OLE2 (doc, xls, ppt), StarOffice, OpenOffice, MAN, DVI, MP3 (ID3v1, ID3v2), OGG, WAV, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, Real, QT, MPEG, RIFF (AVI), ASF, and ELF. It also detects various MIME types and can compute hash functions (SHA-1, MD5, RIPEMD-160). A Java binding (JNI) is available.
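To make the kind of output described above concrete, here is a minimal Python sketch that produces a few (keyword, value) pairs for a byte buffer: a MIME type guessed from leading magic bytes, plus SHA-1 and MD5 digests. It is an illustration only and does not use libextractor or reproduce its API. The names `MAGIC`, `sniff_mime`, and `extract_metadata`, and the keyword strings, are hypothetical. RIPEMD-160 is left out because its availability in Python's `hashlib` depends on the local OpenSSL build.

```python
import hashlib
from typing import List, Optional, Tuple

# Hypothetical magic-number table for this sketch only; a real detector
# covers far more formats and inspects more than the leading bytes.
MAGIC = [
    (b"\x89PNG\r\n\x1a\n", "image/png"),
    (b"\xff\xd8\xff", "image/jpeg"),
    (b"GIF87a", "image/gif"),
    (b"GIF89a", "image/gif"),
    (b"%PDF-", "application/pdf"),
]


def sniff_mime(data: bytes) -> Optional[str]:
    """Return a MIME type guessed from the leading bytes, or None."""
    for prefix, mime in MAGIC:
        if data.startswith(prefix):
            return mime
    return None


def extract_metadata(data: bytes) -> List[Tuple[str, str]]:
    """Collect (keyword, value) pairs for a buffer of file contents."""
    items: List[Tuple[str, str]] = []
    mime = sniff_mime(data)
    if mime is not None:
        items.append(("mimetype", mime))
    # Digests are computed over the whole buffer.
    items.append(("sha1", hashlib.sha1(data).hexdigest()))
    items.append(("md5", hashlib.md5(data).hexdigest()))
    return items
```

Calling `extract_metadata(open(path, "rb").read())` on a file returns a list of pairs that can be printed one per line, loosely mirroring what a command-line extractor writes to stdout.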
This is a GNU package: libextractor
released on 23 December 2013
|License||Verified by||Verified on||Notes|
|GFDLv1.3orlater||mtjm||26 September 2012||For the manual.|
|GPLv3orlater||mtjm||20 October 2004|| |
Leaders and contributors
Resources and communication
|Bug Tracking, Developer, Support||Mailing List Info/Archive||https://lists.gnu.org/mailman/listinfo/bug-libextractor|
This entry (in part or in whole) was last reviewed on 5 January 2014.