Unidecode
https://pypi.org/project/Unidecode
ASCII transliterations of Unicode text (PyPy module)
It often happens that you have text data in Unicode, but you need to represent it in ASCII for display. One could represent non-Roman Unicode characters as "???" or "\\15BA\\15A0\\1610", but neither is useful to the user reading the text.
Unidecode tries to represent it in ASCII characters (i.e., the universally displayable characters between 0x00 and 0x7F), where the compromises taken when mapping between two character sets are chosen to be near what a human with a US keyboard would choose.
This module generally produces better results than simply stripping accents from characters (which can be done in Python with built-in functions). It is based on hand-tuned character mappings that also contain, for example, ASCII approximations for symbols and non-Latin alphabets.
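To make the comparison concrete, here is a minimal sketch of the accent-stripping approach using only the standard library, with an optional call to `unidecode` when the package happens to be installed. The helper names `strip_accents` and `to_ascii_lossy` are hypothetical and chosen for illustration; they are not part of Unidecode.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Decompose characters (NFKD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def to_ascii_lossy(text: str) -> str:
    """Strip accents, then silently discard anything still outside ASCII."""
    return strip_accents(text).encode("ascii", "ignore").decode("ascii")


# Accented Latin letters survive accent stripping.
print(to_ascii_lossy("kožušček"))  # kozuscek

# Non-Latin text has no ASCII base letters, so nothing is left.
print(repr(to_ascii_lossy("北京")))  # ''

# The "???" style of output mentioned above: one "?" per unencodable character.
print("北京".encode("ascii", "replace"))  # b'??'

# Optional: compare with Unidecode if it is available in this environment.
try:
    from unidecode import unidecode
except ImportError:
    unidecode = None

if unidecode is not None:
    print(unidecode("kožušček"))
    print(unidecode("北京"))
```

The standard-library route handles accented Latin text but yields nothing for scripts without an ASCII decomposition, which is the gap the hand-tuned mappings are meant to fill.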
unidecode is a Python port of the Text::Unidecode Perl module; this package provides it for PyPy.
Download
http://ftp.debian.org/debian/pool/main/u/unidecode/unidecode_0.04.16.orig.tar.gz
Categories
Licensing
License | Verified by | Verified on | Notes |
---|---|---|---|
| | Debian: Stefano Rivera <stefanor@debian.org> | 17 October 2014 | License: gpl-2+ |
Leaders and contributors
Contact(s) | Role |
---|---|
Tomaz Solc | contact |
Resources and communication
Audience | Resource type | URI |
---|---|---|
| | Python (Ref) | https://pypi.org/project/Unidecode |
| | Debian (Ref) | https://tracker.debian.org/pkg/unidecode |
| | Ruby (Ref) | https://rubygems.org/gems/unidecode |
Software prerequisites
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.
The copyright and license notices on this page apply only to the text on this page. Any software, copyright licenses, or other similar notices described in this text have their own copyright notices and licenses, which can usually be found in the distribution or license text itself.