Encrypt/Decrypt Mailbox urls #198

lologf · 2019-01-18T11:04:27Z

No description provided.

…-soc-email FIX - Django settings fix to unicode

…soc-email FIX - Fix default charset

Update models.py

lologf · 2019-01-18T11:11:38Z

This fix works in python 2.7, django 1.11 and database with charset utf8 and collation ut8_general_ci

…subject Update models.py

coddingtonbear · 2019-01-18T13:45:00Z

django_mailbox/models.py

        if 'subject' in message:
            msg.subject = (
-                utils.convert_header_to_unicode(message['subject'])[0:255]
+                utils.convert_header_to_unicode(unicode(message['subject']).decode('utf-8'))[0:255]


This is a rather surprising change; could you elaborate on how this helps, exactly?

I am working in a app with python 2.7, django 1.11 and production database in 'utf8' charset. And i need to use django-mailbox to receive emails. If an email have a 'emoji' in subject, Django return a OperationalError. I should not change the character set to 'utf8mb4' in production. This fix (I don't know another way to do it, in utils.convert_header_to_unicode perhaps?) allow receive emails with emojis in django 1.11, python 2.7 and utf8 charset and collation

Before this fix: Django return a OperationalError
After this fix: Email subject with unicode emojis: "Resume of your a\xc3\xb1o with \xf0\x9f\x9a\x80"

Oh, I understand that you believe that this fixes the issue you're encountering, but what I meant was, specifically, how does the above change help that, really; consider this:

There are two possibilities here; one is that message['subject'] is a unicode object and the other is that it's bytes; given your example emoji of 🚀, that means we have two possibilities:

If it's bytes:

value = unicode('\xf0\x9f\x9a\x80') # Will raise the following exception: # Traceback (most recent call last): # File "<stdin>", line 1, in <module> # UnicodeDecodeError: 'ascii' codec can't decode byte 0xf0 in position 0: ordinal not in range(128)

If it's unicode:

value = unicode(u'\U0001f680') # Now let's try running 'decode' value.decode('utf-8') # Will raise the following exception: # Traceback (most recent call last): # File "<stdin>", line 1, in <module> # File "/var/www/envs/latestrevision/lib/python2.7/encodings/utf_8.py", line 16, in decode # return codecs.utf_8_decode(input, errors, True) # UnicodeEncodeError: 'ascii' codec can't encode character u'\U0001f680' in position 0: ordinal # not in range(128)

There are a couple things to be learned from the above:

Using unicode without supplying an encoding to use will attempt to interpret the provided string using your default encoding (sys.getdefaultencoding()). In most peoples' cases, that encoding is going to be ascii, and that is certainly not going to work for codepoints above 127.

decode is intended to be used for converting bytes into unicode objects -- not for converting unicode objects into anything at all -- so when you run decode on a unicode object, you're actually asking python to re-interpret your object into your default encoding, then to decode those bytes using the encoding you've selected. This is also not going to help you get the result you want, but is one of the more common misunderstandings of how unicode and bytes objects work in Python.

Okay. If I explain how I got to this point we can better understand the solution to the problem.

I use "(message ['subject]).decode(' utf-8 ')" to force the utf-8 encoding, which is the encoding that I have configured by default in my django app and in my production database.

I thought that the variable 'DJANGO_MAILBOX_default_charset' contained in utils.get_settings () could help me, but I saw that being lowercase django does not detect it as settings. I made a fix to capitalize it and force the 'default_charset' to be utf-8, but it still gave the same OperationalError.

I read several articles where they indicated that I had to change all the tables and columns of the production databases to 'utf8mb4', since the 'emojis' use 4 bytes to represent it in unicode.

But I can not change that encoding in my production database and I do not care that the emoji is represented as bytes in the subject.

My intention is to use django-mailbox to automate actions when receiving emails, and I do not care that the emoji is not represented correctly. What I want is that django does not return an OperationalError if I do not have the encoding to 'utf8mb4'.

I understand that this conversion from header to unicode should be done by the function utils.covert_header_to_unicode(), but I made the fix in _models.Mailbox.process_message() as workaround.

When making the decode, it returns a string "'=?Utf-8?Bxxxxxxxxxxxx ...'" which is a MIME header. This string is converted to a readable string with "email.header.decode_header (msg.subject)".

And at this point my question is, is there any way to use django-mailbox without the encoding 'utf8mb4' in the production database if i received an email with a "emoji"?. Thanks for everything

Encrypt uri

Update models.py

Remove fix UTF-8

lologf added 7 commits January 16, 2019 12:07

FIX - Django settings fix to unicode

bb5fba8

Merge pull request #1 from invisiblebits/STC-668__create-tickets-from…

c706e48

…-soc-email FIX - Django settings fix to unicode

FIX - Fix default charset

724e804

Merge pull request #2 from invisiblebits/STC-668__create-ticket-from-…

fe1367c

…soc-email FIX - Fix default charset

Update models.py

b797216

Merge pull request #3 from invisiblebits/FIX_unicode_subject_message

6ba66ed

Update models.py

Update models.py

3f22f48

lologf mentioned this pull request Jan 18, 2019

Fix default_charset and _process_message() to accept emojis in utf-8 encode #199

Closed

lologf added 2 commits January 18, 2019 12:38

Update models.py

b390b28

Merge pull request #4 from invisiblebits/FIX_remove-single-quotes-in-…

fd586e8

…subject Update models.py

coddingtonbear reviewed Jan 18, 2019

View reviewed changes

lologf added 16 commits April 9, 2019 12:07

Update admin.py

72582ab

Encrypt, decrypt and padding in model methods

11a05a8

Pycryto added in requirements

850f86a

Remove URI from list_display

e073a8c

Help text in uri form

cd1e4de

Decrypt URI in _protocol_info() method

6f23c59

Merge pull request #5 from invisiblebits/encrypt-uri

66f1254

Encrypt uri

Update setup.py

934c799

Fix decrypt_uri() and _protocol_info() methods

0d5f395

Update models.py

6c6b23c

Merge pull request #6 from invisiblebits/lologf-patch-raw-unicode

edc27af

Update models.py

Update models.py

e713b5c

Update models.py

6136dea

Update models.py

204d67a

Update __init__.py

625e166

Remove fix UTF-8

be3163e

Remove fix UTF-8

pfouque changed the title ~~Fix default_charset and _process_message() to accept emojis in utf-8 encode~~ Encrypt/Decrypt Mailbox urls Dec 17, 2023

pfouque added the enhancement label Dec 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encrypt/Decrypt Mailbox urls #198

Encrypt/Decrypt Mailbox urls #198

lologf commented Jan 18, 2019

lologf commented Jan 18, 2019

coddingtonbear Jan 18, 2019

lologf Jan 19, 2019 •

edited

Loading

coddingtonbear Jan 21, 2019

lologf Jan 22, 2019 •

edited

Loading

Encrypt/Decrypt Mailbox urls #198

Are you sure you want to change the base?

Encrypt/Decrypt Mailbox urls #198

Conversation

lologf commented Jan 18, 2019

lologf commented Jan 18, 2019

coddingtonbear Jan 18, 2019

Choose a reason for hiding this comment

lologf Jan 19, 2019 • edited Loading

Choose a reason for hiding this comment

coddingtonbear Jan 21, 2019

Choose a reason for hiding this comment

lologf Jan 22, 2019 • edited Loading

Choose a reason for hiding this comment

lologf Jan 19, 2019 •

edited

Loading

lologf Jan 22, 2019 •

edited

Loading