Skip to content

lc-rezende/demo-thefuzz

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

CI

TheFuzz Lab - Fuzzy String Matching in Python

This repository contains a Jupyter Lab notebook designed as a lab environment to explore and experiment with the capabilities of the TheFuzz Python library (formerly known as fuzzywuzzy).

TheFuzz provides intuitive and flexible tools for fuzzy string matching, making it useful for approximate string comparisons, data deduplication, and search applications.


🧠 Why Use Fuzzy Matching?

Fuzzy string matching is useful when:

  • Strings may have typos or inconsistent formatting
  • You need to find "close enough" matches instead of exact ones
  • You're deduplicating records (e.g., names, addresses) from noisy data

πŸ“˜ What You'll Find Here

  • Basic usage examples of fuzz and process modules
  • Experiments with partial ratio, token sort ratio, and token set ratio
  • Examples of matching strings against a list of choices

πŸš€ Getting Started

  1. Clone this repo or open the notebook in Jupyter Lab.
  2. Explore and modify the examples to understand how fuzzy matching works.
  3. Try your own datasets or string inputs to see how TheFuzz behaves.

πŸ”— References

About

Jupyter Lab notebook to test and explore fuzzy string matching with TheFuzz.

Topics

Resources

License

Stars

Watchers

Forks