Docx2txt

From Free Software Directory
 
Jump to: navigation, search


[edit]

Docx2txt

http://docx2txt.sourceforge.net
Convert Microsoft OOXML files to plain text

docx2txt is a tool that attempts to generate equivalent (ASCII) text files from Microsoft .docx documents, preserving some formatting and document information (which MS text conversion drops) along with appropriate character conversions for a good (ASCII) text experience. It is a platform independent solution consisting of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file to control the output text appearance to fair extent. It can very conveniently be used to build a Web based docx document conversion service. Some Makefiles and Windows batch files are provided for easy installation of the scripts. With unzippers like CakeCmd that can deal with corrupt Zip archives, this tool can extract text from corrupt docx documents in many cases, where MS word processor fails to even open them.



Download




Categories





Licensing

License

Verified by

Verified on

Notes

License

GPLv3

Verified by

Debian

Verified on

20 March 2013

Notes

License: GPL-3

GPL-3 This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see <http://www.gnu.org/licenses/>.

On Debian systems, the complete text of the GNU General Public

License can be found in /usr/share/common-licenses/GPL-3 file.

Verified by

Debian

Verified on

20 March 2013

Notes

License: GPL-2+

GPL-2+ This package is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This package is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see <http://www.gnu.org/licenses/>

On Debian systems, the complete text of the GNU General Public License version 2 can be found in "/usr/share/common-

licenses/GPL-2".




Leaders and contributors

Resources and communication

AudienceResource typeURI
Ruby (Ref)https://rubygems.org/gems/docx2txt
Python (Ref)https://pypi.org/project/docx2txt
Downloadhttp://docx2txt.sourceforge.net
Debian (Ref)https://tracker.debian.org/pkg/docx2txt


Software prerequisites




Entry










"Python (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.


"Ruby (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.


"Debian (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.








Date 2013-03-20
Source Debian import
Source link http://packages.debian.org/sid/docx2txt

[[Category:]]



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.