Skip to content

Conversation

@ZenithClown
Copy link
Member

@ZenithClown ZenithClown commented Aug 17, 2024

📜 Description

This PR is an initialization PR to merge the existing code into the NLPurify module. This is a part of the total required changes (#1), and the following changes and fixes are listed below.

This PR brings the following change(s):

  • ♻️ Legacy Code(s) are maintained under nlpurify/legacy unless all existing code are migrated.
    • 🌟 New Feature - text cleaning using nltk.stopwords, lemmatizations.
    • 🌟 New Feature - a collection function to easily switch between different types of fuzzy text matching using fuzzywuzzy.
  • 🛠️ Breaking Change - upon successful transition the existing GitHub gist will be deleted and will become unusable.

On successful merge, the PR Fixes #2 Fixes #4 Fixes #5

@ZenithClown ZenithClown self-assigned this Aug 17, 2024
- this adheres to the module initialization #4
- remove authors from nlp_utils.py
- fixes #4; create a temporary namespace under nlpurify
@ZenithClown
Copy link
Member Author

Some of my codes are dependent on the existing structure. This should be made available unless all codes are migrated to the latest changes. Check issue details #5 for more information.

@ZenithClown ZenithClown merged commit 29b636d into master Aug 17, 2024
@ZenithClown ZenithClown deleted the merge/existing-gist branch August 17, 2024 14:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

2 participants