fix String encoding #235

tisdall · 2015-06-16T20:14:37Z

fixes #233

tseaver · 2015-06-20T14:34:48Z

colander/__init__.py

+                if self.encoding:
+                    result = text_type(appstruct).encode(self.encoding)
+                else:
+                    result = text_type(appstruct)


I prefer that the result = text_type(appstruct) to be outside the if, with no else. E.g.:

result = text_type(appstruct) if self.encoding: result = result.encode(self.encoding) return result

tisdall · 2015-06-22T14:46:21Z

@tseaver - updated as requested...

tseaver · 2015-06-22T17:31:33Z

colander/tests/test_colander.py

@@ -1557,6 +1557,13 @@ def test_serialize_string_with_high_unresolveable_high_order_chars(self):
        e = invalid_exc(typ.serialize, node, not_utf8)
        self.assertTrue('cannot be serialized' in e.msg)

+    def test_serialize_encoding_with_non_string_type(self):
+        utf8 = '123'.encode('utf-8')


Hmm, this one should start as text and then encode to bytes (Python2 lets us get away with being sloppy, but that doesn't mean we shouldn't be explicit).

Sorry, I develop everything in Python 3 and then deal with Python 2 only when I need backwards compatibility. I know this is already doing what your describing in Py3, but I don't understand how it's not doing this in Py2. In Py2, should this be u'123'.encode('utf-8') to be explicit?

Are you saying this should be utf8 = text_type('123').encode('utf-8')?

That would be the better spelling, yes.

Or just b'123', in fact (ASCII and UTF-8 are identical encodings for 7-bit values).

tisdall · 2015-06-23T13:32:13Z

@tseaver - I went with using text_type('123').encode('utf-8') so it's totally explicit that the result is a utf-8 encoded string.

tisdall · 2015-06-23T13:35:30Z

@tseaver - should I add a note in the CHANGES.rst? Is there anything else I need to change to complete this?

tseaver · 2015-06-23T15:34:51Z

A note in CHANGES.rst would be great, thanks!

tisdall · 2015-06-23T20:09:33Z

done

fix String encoding

tseaver · 2015-06-23T20:30:09Z

Thanks!

fix String encoding

e1e8762

fixes Pylons#233

tseaver reviewed Jun 20, 2015
View reviewed changes

pulled text_type() out of if statement

be63b55

tseaver reviewed Jun 22, 2015
View reviewed changes

be more explicit about string type

a941784

noted string encoding fix in CHANGES

7b8e44b

tseaver added a commit that referenced this pull request Jun 23, 2015

Merge pull request #235 from tisdall/string_encoding

ccc31c1

fix String encoding

tseaver merged commit ccc31c1 into Pylons:master Jun 23, 2015

tisdall deleted the string_encoding branch July 3, 2015 13:26

pyup-bot mentioned this pull request Jun 30, 2020

Pin colander to latest version 1.7.0 camptocamp/c2cgeoportal#6618

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix String encoding #235

fix String encoding #235

tisdall commented Jun 16, 2015

tseaver Jun 20, 2015

tisdall commented Jun 22, 2015

tseaver Jun 22, 2015

tisdall Jun 22, 2015

tisdall Jun 22, 2015

tseaver Jun 22, 2015

tseaver Jun 22, 2015

tisdall commented Jun 23, 2015

tisdall commented Jun 23, 2015

tseaver commented Jun 23, 2015

tisdall commented Jun 23, 2015

tseaver commented Jun 23, 2015

fix String encoding #235

fix String encoding #235

Conversation

tisdall commented Jun 16, 2015

tseaver Jun 20, 2015

Choose a reason for hiding this comment

tisdall commented Jun 22, 2015

tseaver Jun 22, 2015

Choose a reason for hiding this comment

tisdall Jun 22, 2015

Choose a reason for hiding this comment

tisdall Jun 22, 2015

Choose a reason for hiding this comment

tseaver Jun 22, 2015

Choose a reason for hiding this comment

tseaver Jun 22, 2015

Choose a reason for hiding this comment

tisdall commented Jun 23, 2015

tisdall commented Jun 23, 2015

tseaver commented Jun 23, 2015

tisdall commented Jun 23, 2015

tseaver commented Jun 23, 2015