Broaden your selection: Category/Localization
- 'Enca' (Extremely Naive Charset Analyser) detects the encoding of text files, based on knowledge of their language. It can also convert them to other encodings, letting you to recode files without knowing their current encoding. It supports most of Central and East European languages, and a few Unicode variants, independently of language.
- FriBidi is a free implementation of the Unicode Bidirectional (BiDi) Algorithm. It also provides utility functions to aid in the development of interactive editors and widgets that implement BiDi functionality. The BiDi algorithm is a prerequisite for supporting right-to-left scripts such as Hebrew, Arabic, Syriac, and Thaana.
- htmlatex does on-the-fly rendering of LaTeX source to HTML documents. htmlatex is a mod_python application that uses memcached to reduce the massive overhead of repeatedly rendering the same equation. It has an option to sanitize the LaTeX source, removing any potentially dangerous code. It is fairly generous about the HTML and LaTeX it accepts. LaTeX source code is typed directly into an HTML file. The file is left unchanged, and the output is filtered, replacing raw source with images.
- Html2ps html2pdf
- Convert nearly any URL or HTML document to PostScript or PDF using this PHP system. PDF converter may use Ghostscript, FPdf, or PDFLib; supports all common PDF versions. Over 200 CSS and HTML properties are supported, including floating elements (DHTML). Advanced API and complete documentation included. Freely distributed with 100% of source code on http://www.sourceforget.net and http://www.tufat.com
- 'htmlrecode' applies modifications to a HTML file. For example, you can completely change the character set you are using without making any of the characters unreadable.
- JGloss is an application for adding reading and translation annotations to words in a Japanese text document. This can be done automatically and manually. When a text document is first opened, kanji words will be looked up in a dictionary and the first reading and translation (if any) used to annotate the word. The user can then edit the annotations: choose among the readings and translations found in the dictionaries, enter your own readings and translations, remove annotations, and add new annotations. The document can be exported as plain text with annotations, HTML, or LaTeX.
- Leet-Generator converts plaintext to leettext.
- 'libtranslate' is a library for translating text and Web pages between natural languages. Its modular infrastructure lets users implement new translation services separately from the core library. It is shipped with a generic module that supports Web-based translation services (ie, BabelFish) and lets new services be added simply by adding a few lines to an XML file. The distribution also includes a command line interface.
- libuninum is a library for converting Unicode strings to integers and integers to Unicode strings. Internal computation is done using arbitrary precision arithmetic, so there is no limit on the size of the integer that can be converted. Values are passed and returned as ASCII decimal strings, GNU MP mpz_t objects, or unsigned long integers. Auto-detection of the number system is provided. Group delimitation for output strings is fully controllable. Virtually all known number systems are supported.
- otl is intended to convert a text file to a HTML or XHTML file. It is different than many other text-to-HTML programs in that the input format (by default a simple highly readable plain text format) can be customized by the user, and the output format (by default XHTML) can be user-defined. It can process complex structures such as ordered and unordered lists (nested or not), and add custom "headers" and "footers" to documents. The conversion utilizes Perl regex, adding quite a bit of flexibility and power to the conversion process. Since both the syntax of the source file and of the output can be readily customized, otl in theory can be used for many types of conversions. The package also includes tag-remove, a script for stripping HTML/XHTML-ish tags from documents.