Rapid fuzzy string matching in Python using various string metrics
Updated Apr 12, 2025 - Python
Record Linkage ToolKit (Find and link entities)
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
Learning String Alignments for Entity Aliases
Learned string similarity for entity names using optimal transport.
A Python implementation of a variety of text/string distance and similarity metrics. No GPL!
String distance metrics based on Levenshtein and Qwerty Matrix Distance
PPJoin and P4Join Python 3 implementation
Python library for string processing
Fast(ish) string similarity for one vs many comparisons.
Common string similarity algorithm implementations.
CJKfuzz is a Python library that supports fuzzy matching of Chinese strings.
Python wrapper for Rust's strsim library
Text comparison
Detects duplicate publications
Text Classification: Fast, custom string similarity functions in Python mapping lambda functions to NumPy arrays, assigning issuers to one of 10,000 classes.
A Python GUI (Tkinter) that uses the fuzzywuzzy library to remove duplicate bullet points from a list. Includes an EXE file to run without Python.
Python project for a Bioinformatics course starting October '20, aimed at implementing the Needleman-Wunsch algorithm.
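For readers unfamiliar with the Needleman-Wunsch algorithm named in the entry above, here is a minimal sketch of its global alignment score in pure Python. It is not taken from any of the listed repositories. The function name `needleman_wunsch_score` and the scoring defaults (match +1, mismatch -1, gap -1) are assumptions chosen for illustration.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the optimal global alignment score of strings a and b.

    Hypothetical helper for illustration; the scoring values are assumed
    defaults, not those of any particular project.
    """
    # Row 0: aligning the empty prefix of `a` with b[:j] costs j gaps.
    prev = [j * gap for j in range(len(b) + 1)]
    for i, ca in enumerate(a, start=1):
        # Column 0: aligning a[:i] with the empty prefix of `b` costs i gaps.
        curr = [i * gap]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)  # align ca with cb
            up = prev[j] + gap        # ca aligned with a gap
            left = curr[j - 1] + gap  # cb aligned with a gap
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(needleman_wunsch_score("GATTACA", "GATTACA"))
```

Only two rows of the dynamic programming table are kept, so memory grows with the length of the second string. Recovering the alignment itself, rather than just its score, would need the full table and a traceback step.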