Skip to content
This repository was archived by the owner on Jul 7, 2023. It is now read-only.

Add to_unicode_utf8() to text_encoder.py #1321

Merged
merged 1 commit into from
Jan 4, 2019

Conversation

cclauss
Copy link
Contributor

@cclauss cclauss commented Dec 21, 2018

tensor2tensor.data_generators currently has two different versions of to_unicode(), one in text_encoder.py and the other in wiki_revision_utils.py. This PR proposes renaming the second version --> to_unicode_utf8() and moving it into text_encoder.py with the other Unicode utility functions.

@googlebot googlebot added the cla: yes PR author has signed CLA label Dec 21, 2018
@cclauss cclauss force-pushed the add-to_unicode_utf8 branch from 014c055 to ea74871 Compare December 21, 2018 12:50
@afrozenator
Copy link
Contributor

thanks a lot @cclauss for caring to clean up the code -- this is much much appreciated!

@afrozenator afrozenator merged commit 89052b0 into tensorflow:master Jan 4, 2019
tensorflow-copybara pushed a commit that referenced this pull request Jan 4, 2019
PiperOrigin-RevId: 227918987
@cclauss cclauss deleted the add-to_unicode_utf8 branch January 4, 2019 22:59
@cclauss cclauss mentioned this pull request Jan 4, 2019
kpe pushed a commit to kpe/tensor2tensor that referenced this pull request Mar 2, 2019
kpe pushed a commit to kpe/tensor2tensor that referenced this pull request Mar 2, 2019
PiperOrigin-RevId: 227918987
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
cla: yes PR author has signed CLA
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants