Skip to content

Conversation

@ambv
Copy link
Contributor

@ambv ambv commented Aug 4, 2022

+suffixes are removed for all emails except for @users.noreply.github.com special addresses, in which case prefixes+ are removed as the usernames are used by the CLA bot.

Dot characters are removed from gmail.com addresses where they are irrelevant.

Note: changing ContributorLicenseAgreement.normalized_email to actually use the normalize_email() function is left in a separate commit because an action that deduplicates the database might be necessary to apply the migration successfully. A script to do that is added to scripts/.

Fixes #51

ambv added 2 commits August 4, 2022 17:24
+suffixes are removed for all emails except for @users.noreply.github.com
special addresses, in which case prefixes+ are removed as the usernames are
used by the CLA bot.

Dot characters are removed from gmail.com addresses where they are irrelevant.

Fixes geldata#51
@ambv
Copy link
Contributor Author

ambv commented Aug 4, 2022

Tested on python/cpython#22694.

Copy link
Member

@elprans elprans left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@ambv ambv merged commit c098536 into geldata:master Aug 6, 2022
@ambv ambv deleted the better-normalize-email branch August 6, 2022 17:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Should normalized_email be stripped of dots and +parts, too?

2 participants