43👍
✅
I recommend using Unidecode module:
>>> from unidecode import unidecode
>>> unidecode(u'ıöüç')
'iouc'
Note how you feed it a unicode string and it outputs a byte string. The output is guaranteed to be ASCII.
7👍
It all depends on how far you want to go in transliterating the result. If you want to convert everything all the way to ASCII (αβγ
to abg
) then unidecode
is the way to go.
If you just want to remove accents from accented letters, then you could try decomposing your string using normalization form NFKD (this converts the accented letter á
to a plain letter a
followed by U+0301 COMBINING ACUTE ACCENT
) and then discarding the accents (which belong to the Unicode character class Mn
— “Mark, nonspacing”).
import unicodedata
def remove_nonspacing_marks(s):
"Decompose the unicode string s and remove non-spacing marks."
return ''.join(c for c in unicodedata.normalize('NFKD', s)
if unicodedata.category(c) != 'Mn')
- [Django]-Django delete superuser
- [Django]-Django, Models & Forms: replace "This field is required" message
- [Django]-How to deploy django under a suburl behind nginx
- [Django]-Hadoop and Django, is it possible?
- [Django]-Per-transaction isolation level in Django ORM
- [Django]-Django: Not Found static/admin/css
0👍
import unicodedata
unicodedata.normalize()
- [Django]-Malformed Packet: Django admin nested form can't submit, connection was reset
- [Django]-How to reset migrations in Django 1.7
- [Django]-Delete method on Django Rest Framework ModelViewSet
Source:stackexchange.com