Unidecode

From Free Software Directory
 
Jump to: navigation, search


[edit]

Unidecode

https://pypi.org/project/Unidecode
ASCII transliterations of Unicode text (PyPy module)

It often happens that you have text data in Unicode, but you need to represent it in ASCII for display. One could represent non- roman Unicode characters as "???" or "\\15BA\\15A0\\1610", but neither is useful to the user reading the text.

Unidecode tries to represent it in ASCII characters (i.e., the universally displayable characters between 0x00 and 0x7F), where the compromises taken when mapping between two character sets are chosen to be near what a human with a US keyboard would choose.

This module generally produces better results than simply stripping accents from characters (which can be done in Python with built-in functions). It is based on hand-tuned character mappings that for example also contain ASCII approximations for symbols and non-Latin alphabets.

unidecode is a Python port of the Text::Unidecode Perl module for PyPy.





Licensing

License

Verified by

Verified on

Notes

Verified by

Debian: Stefano Rivera <stefanor@debian.org>

Verified on

17 October 2014

Notes

License: gpl-2+




Leaders and contributors

Contact(s)Role
Tomaz Solc contact


Resources and communication

AudienceResource typeURI
Python (Ref)https://pypi.org/project/Unidecode
Debian (Ref)https://tracker.debian.org/pkg/unidecode
Ruby (Ref)https://rubygems.org/gems/unidecode


Software prerequisites




Entry








"contact" is not in the list (Maintainer, Contributor, Developer, Sponsor, Unknown) of allowed values for the "Role" property.


"Python (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.


"Ruby (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.


"Debian (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.






Date 2015-07-17
Source Debian
Source link http://packages.debian.org/sid/unidecode

[[Category:]]



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.