Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned Loading

  1. dedupe dedupe Public

    🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 4.2k 551

  2. csvdedupe csvdedupe Public

    🆔 Command line tool for deduplicating CSV files

    Python 413 81

  3. dedupe-examples dedupe-examples Public

    🆔 Examples for using the dedupe library

    Python 406 214

  4. affinegap affinegap Public

    📐 A Cython implementation of the affine gap string distance

    Cython 58 9

  5. pyhacrf pyhacrf Public

    Forked from dirko/pyhacrf

    📐 Hidden alignment conditional random field for classifying string pairs.

    Python 25 12

  6. doublemetaphone doublemetaphone Public

    🔉 Python wrapper for a C++ Double Metaphone

    C++ 15 7

Repositories

Showing 10 of 32 repositories
  • dedupe Public

    🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    dedupeio/dedupe’s past year of commit activity
    Python 4,152 MIT 551 68 7 Updated Nov 18, 2024
  • pylbfgs Public Forked from larsmans/pylbfgs

    🚠 Python/Cython wrapper for liblbfgs

    dedupeio/pylbfgs’s past year of commit activity
    C 26 MIT 38 1 2 Updated Sep 23, 2024
  • pyhacrf Public Forked from dirko/pyhacrf

    📐 Hidden alignment conditional random field for classifying string pairs.

    dedupeio/pyhacrf’s past year of commit activity
    Python 25 BSD-3-Clause 21 5 3 Updated Sep 23, 2024
  • dedupe-variable-embedding Public

    Use embeddings for semantic comparisons

    dedupeio/dedupe-variable-embedding’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Sep 18, 2024
  • dedupe-examples Public

    🆔 Examples for using the dedupe library

    dedupeio/dedupe-examples’s past year of commit activity
    Python 406 MIT 214 10 6 Updated Aug 10, 2024
  • dedupe-variable-datetime Public

    DateTime variable for dedupe

    dedupeio/dedupe-variable-datetime’s past year of commit activity
    Python 4 MIT 3 2 0 Updated Jul 10, 2024
  • dedupe-variable-address Public

    Address Variable Type for dedupe

    dedupeio/dedupe-variable-address’s past year of commit activity
    Python 9 3 6 0 Updated Jun 27, 2024
  • parseratorvariable Public

    Base class for dedupe variables for parsed fields

    dedupeio/parseratorvariable’s past year of commit activity
    Python 3 3 2 0 Updated Jun 27, 2024
  • dedupe-variable-name Public

    name variable type for dedupe

    dedupeio/dedupe-variable-name’s past year of commit activity
    Python 8 8 1 2 Updated Jun 26, 2024
  • affinegap Public

    📐 A Cython implementation of the affine gap string distance

    dedupeio/affinegap’s past year of commit activity
    Cython 58 MIT 9 4 2 Updated Jan 23, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…