398 commits
022026a
adjust tests
fgregg May 24, 2020
25a3134
fix type signature
fgregg May 24, 2020
965b7ab
fix type signature
fgregg May 24, 2020
9b2cecc
move labeled mmap to connected components
fgregg May 25, 2020
e2f5161
restore core
fgregg May 25, 2020
2a07e4e
Getting windows to close tempfile for connected components
fgregg May 25, 2020
57934e1
comments on what all this slicing is about
fgregg May 25, 2020
c401124
Merge pull request #822 from dedupeio/preserve_memmap
fgregg May 26, 2020
dd14467
bump version, add more proj urls
fgregg May 26, 2020
662b6db
Merge branch 'preserve_memmap'
fgregg May 26, 2020
70c3082
attempt to restore coveralls
fgregg May 26, 2020
fbbd1a3
improve job name
fgregg May 26, 2020
1bd5039
Merge pull request #824 from dedupeio/restore_coveralls
fgregg May 26, 2020
54d08ec
update changelog
fgregg May 26, 2020
52b3ffc
Merge branch 'restore_coveralls'
fgregg May 26, 2020
47892f3
fixed links for some outdated and updated examples
derekeder Jun 8, 2020
275ac37
adds consulting line to readme
derekeder Jun 8, 2020
8e07ece
Merge pull request #828 from dedupeio:example-links-fix
fgregg Jul 1, 2020
d798868
adding a mailing list and gitignore
fgregg Jul 1, 2020
ed172e7
ignore type checker failing to deal with zips
fgregg Jul 1, 2020
54106c9
typing casts
fgregg Jul 1, 2020
824807f
flake8
fgregg Jul 1, 2020
5092f25
Merge pull request #829 from dedupeio/readme-consulting
fgregg Jul 1, 2020
0625abd
Closes #836: escape colon at CompoundPredicate
fjsj Jul 13, 2020
2506efb
Remove trailing whitespace
fjsj Jul 13, 2020
9862fef
Add test_escapes_escaped_colon
fjsj Jul 20, 2020
6c85d53
Merge pull request #837 from vintasoftware/fix-836-escape-colon
fgregg Jul 20, 2020
054d30e
improve typing coverage
fgregg Aug 13, 2020
20ee47c
Sample should be sample w/replacement not w/o replacement
fgregg Aug 13, 2020
79ac951
add pyproject.toml to manifest
fgregg Aug 13, 2020
5f1ca0a
v2.0.4, comparison count now using a sample w/ replacement
fgregg Aug 13, 2020
dd0c2e0
ensure that gazetteMatching does not return empty blocks
fgregg Sep 1, 2020
28c2205
v2.0.5
fgregg Sep 2, 2020
dbc4f72
Raise error if all the field definitions are of type 'Custom', update…
fgregg Sep 2, 2020
cb1f496
flake8
fgregg Sep 2, 2020
290ab07
Prepare for predicate candidate generation
fgregg Sep 4, 2020
4e6cbc8
remove Cover as class, basically a dict now
fgregg Sep 4, 2020
2a48f5f
implement Bilenko's DNF candidate generation scheme
fgregg Sep 4, 2020
be4de65
rough in random-forest inspired blocking predicate candidate generator
fgregg Sep 5, 2020
467bed7
ignore type complaint
fgregg Sep 5, 2020
f156c3d
restore index predicates to dedupe models
fgregg Sep 5, 2020
6187552
consider index predicate
fgregg Sep 5, 2020
1c3fa34
indexing shoudl be incremental
fgregg Sep 5, 2020
c38ab35
Merge branch 'master' into predicate_candidate_gen
fgregg Sep 5, 2020
0b0427d
bust the cache:
fgregg Sep 5, 2020
93f2c7d
Merge branch 'master' into predicate_candidate_gen
fgregg Sep 5, 2020
327b2ca
remove compounds_with methods and attributes
fgregg Sep 5, 2020
c65e2e3
clean up implementation of random forest candidate gen
fgregg Sep 6, 2020
3f36272
staged learning is not worth it
fgregg Sep 6, 2020
c586cdc
allow singleton preds
fgregg Sep 6, 2020
f28cea7
remove bilenko mode candidate_gen
fgregg Sep 6, 2020
8c5e9aa
remove unused import
fgregg Sep 6, 2020
c250911
v2.0.6
fgregg Sep 6, 2020
fa2696d
add timeout for github actions
fgregg Sep 6, 2020
bc9bcfa
increase timeout to 10 minute
fgregg Sep 6, 2020
b08df76
raise not implemented error based on problem in levenshein search
fgregg Sep 6, 2020
f45f41d
alternate unindexing strategegy for levenshtein index
fgregg Sep 7, 2020
97773f1
Create dependabot.yml
fgregg Sep 29, 2020
6d10c07
Update README.md
fgregg Sep 29, 2020
e6f00a4
don't add all predicates
fgregg Oct 27, 2020
891f43c
Don't raise AssertionError when out of candidates
alextsits Nov 5, 2020
57f5d84
only use random forest for last learning
fgregg Nov 9, 2020
66d47d9
Merge pull request #864 from alextsits/feature-fix-assertions
fgregg Nov 9, 2020
b38b471
remove dead code
fgregg Nov 9, 2020
8afb661
Merge branch 'master' into predicate_candidate_gen
fgregg Nov 9, 2020
02e3352
by default try to cover all the dupes
fgregg Nov 9, 2020
add3e9c
Merge pull request #854 from dedupeio/predicate_candidate_gen
fgregg Nov 9, 2020
cd26caa
docs: fix simple typo, probabilites -> probabilities
timgates42 Dec 24, 2020
b2f2578
Revert "Update README.md"
fgregg Jan 15, 2021
5926046
spike for parallel blocking
fgregg Feb 28, 2021
2bc48bd
remove parallel blocking spike
fgregg Feb 28, 2021
79d6ba4
flake8, remove debug statement
fgregg Feb 28, 2021
69f3bcd
Merge pull request #878 from dedupeio/parallel_blocking
fgregg Feb 28, 2021
8b6c1e0
static checking
fgregg Feb 28, 2021
3fc263a
Merge branch 'master' of github.com:dedupeio/dedupe
fgregg Feb 28, 2021
0483205
3.9 builds
fgregg Feb 28, 2021
2167855
bump version
fgregg Feb 28, 2021
1a2711e
adds blogposts and youtube videos to README.md, fixes #865
Mar 14, 2021
52d1808
implements firstTwoTokensPredicate
Mar 18, 2021
6570135
Merge pull request #884 from fritshermans/add_blogposts_readme
fgregg Mar 19, 2021
1940757
run github tests on pull request
fgregg Mar 19, 2021
5009e94
changes `firstTwoTokensPredicate` such that nothing is returned in ca…
Mar 24, 2021
fd7e875
Merge pull request #886 from fritshermans/first_two_words_predicate
fgregg Mar 24, 2021
50bb7c6
full commit for manylinux builds
fgregg Apr 16, 2021
88012b9
full manylinux commit
fgregg Apr 16, 2021
476b3cc
Merge branch 'master' of github.com:dedupeio/dedupe
fgregg Apr 16, 2021
af6a206
don't raise value error in scoreGazette if no blocks
fgregg Apr 17, 2021
96a58fc
futz with random seed in tests
fgregg Apr 17, 2021
091be4c
bump version
fgregg Apr 17, 2021
11e965c
remove code used to calculate the total comparisons of predicate
fgregg May 13, 2021
1186766
remove unusused import
fgregg May 13, 2021
cc4aacb
remove poor test
fgregg May 13, 2021
b98ce1d
remove print statement
fgregg May 13, 2021
cb6bb2f
pair enumerator
fgregg Jun 8, 2021
ff24160
flake8
fgregg Jun 8, 2021
7c71d81
integer division
fgregg Jun 9, 2021
b8f48c7
Merge pull request #896 from dedupeio/remove_coverage_estimator
fgregg Jun 9, 2021
b2e6b5b
slight tweaks to pair enumerators
fgregg Jun 9, 2021
e22c269
Merge branch 'remove_coverage_estimator'
fgregg Jun 9, 2021
b15f029
The link is not working
sachinaraballi Jul 4, 2021
545433b
pre-compute squared_distances
Aug 20, 2021
6c01ef8
add in-memory option
Aug 20, 2021
1d0cd43
Merge pull request #902 from sachinaraballi/patch-1
fgregg Aug 21, 2021
6fb5298
newly required annotations
fgregg Aug 21, 2021
c2af2b1
Move in_memory to Matching class attribute
Aug 23, 2021
6c21e74
Fix formatting for flake8
Aug 23, 2021
476536b
Move squared_distances closer to where it's used
Aug 23, 2021
602aedf
Merge branch 'master' into simplify-confidences
Aug 23, 2021
fb079e5
Merge branch 'master' into sqlite-in-memory
Aug 23, 2021
d70b0aa
Merge pull request #912 from jhendr/simplify-confidences
fgregg Aug 23, 2021
153ea90
Add docstrings for in_memory
Aug 24, 2021
a639918
Fix missing in_memory in Gazetteer
Aug 24, 2021
f69529b
Merge branch 'dedupeio:master' into sqlite-in-memory
jhendr Aug 27, 2021
e789b9a
Merge pull request #913 from jhendr/sqlite-in-memory
fgregg Sep 3, 2021
7d09164
Fix "between" typo
MetcalfeTom Sep 30, 2021
b1358e7
Merge pull request #917 from MetcalfeTom/patch-2
fgregg Oct 14, 2021
e010ba1
Merge pull request #869 from timgates42/bugfix_typo_probabilities
fgregg Oct 14, 2021
b2c0b64
Update pythonpackage.yml
fgregg Dec 17, 2021
de7d092
Update pythonpackage.yml
fgregg Dec 17, 2021
dc954f2
Update pythonpackage.yml
fgregg Dec 17, 2021
a4cf468
Update pythonpackage.yml
fgregg Dec 17, 2021
5f20758
Update pythonpackage.yml
fgregg Dec 17, 2021
2197c4f
Update pythonpackage.yml
fgregg Dec 17, 2021
fcf99fd
Update pythonpackage.yml
fgregg Dec 17, 2021
7696019
Update pythonpackage.yml
fgregg Dec 18, 2021
e3c741b
Update pythonpackage.yml
fgregg Dec 18, 2021
e7e4d02
Update setup.py
fgregg Dec 19, 2021
9b90efa
Update dependabot.yml
fgregg Dec 20, 2021
72a51fa
Bump pypa/cibuildwheel from 2.2.2 to 2.3.1
dependabot[bot] Dec 20, 2021
5156e7c
Merge pull request #929 from dedupeio/dependabot/github_actions/pypa/…
fgregg Dec 20, 2021
702c6ba
call analyze after creating index
fgregg Jan 8, 2022
59a89f1
mypy
fgregg Jan 9, 2022
5ad932c
flake8
fgregg Jan 9, 2022
932d33e
remove type ignores and casts
fgregg Jan 19, 2022
c054308
remove type ignores and casts
fgregg Jan 19, 2022
b39d6f5
Merge branch 'master' of github.com:dedupeio/dedupe
fgregg Jan 19, 2022
6062838
add back a few necessary type ignores
fgregg Jan 19, 2022
6c59172
random pairs, closes #830
fgregg Jan 19, 2022
7c3ca80
flake8
fgregg Jan 19, 2022
01d6826
mypy
fgregg Jan 19, 2022
83562a6
handle recursion errors, closes #876
fgregg Jan 19, 2022
ea6c462
increase size of array in union_find, adaptively. closes #731
fgregg Jan 19, 2022
d3f75ec
roll up switch statement
fgregg Jan 19, 2022
be02c5d
handle border case
fgregg Jan 19, 2022
823b3ea
write data directly to mmap file (#936)
fgregg Jan 20, 2022
eb37ee8
remove unused loop conditions
fgregg Jan 20, 2022
30df5aa
initialize value on instantiation
fgregg Jan 20, 2022
3cec4ff
don't limit size of exception queue
fgregg Jan 20, 2022
8cc41b0
check that there are some matches, closes #565
fgregg Jan 20, 2022
ffd4373
better error message for no blocking error, closes #666
fgregg Jan 20, 2022
86cc727
bump version to 2.0.9
fgregg Jan 21, 2022
48bf079
Update pythonpackage.yml
fgregg Jan 21, 2022
387324d
Update pythonpackage.yml
fgregg Jan 21, 2022
3b287cd
Use "Index" and "Canopy" consistently in docs
NickCrews Jan 25, 2022
eb69b6d
Use camelcase in example in docs
NickCrews Jan 25, 2022
92b776a
Add links to optional variables in docs
NickCrews Jan 25, 2022
0c46395
Add and improve examples in variable definitions
NickCrews Jan 25, 2022
7bd8ab6
Merge pull request #939 from NickCrews/docs-improvements
fgregg Jan 26, 2022
9770ef4
open memmap file in correct way
fgregg Jan 26, 2022
c099630
recursively handle tuples w/i tuples, closes #943
fgregg Jan 26, 2022
c71ffdb
Merge branch 'master' of github.com:dedupeio/dedupe
fgregg Jan 26, 2022
9c26960
bump version
fgregg Jan 26, 2022
53ff2c1
don't set an extraneous flag
fgregg Jan 26, 2022
a26ac68
update copyright date
fgregg Jan 27, 2022
550a9ee
Update README.md
fgregg Jan 27, 2022
f349899
Merge pull request #946 from dedupeio/fgregg-patch-1
fgregg Jan 27, 2022
1241736
Make docs build fail on warnings
NickCrews Jan 27, 2022
c902001
Fix docs build errors
NickCrews Jan 26, 2022
ad1705c
Fix bad ref in docs
NickCrews Jan 26, 2022
4d4e9fb
Merge pull request #944 from NickCrews/docs-CI
fgregg Jan 27, 2022
7317798
back to sample w/o replacement
fgregg Jan 27, 2022
0d62b2e
consistently return iters in random pairs
fgregg Jan 27, 2022
9850fb2
bump version
fgregg Jan 27, 2022
35e79e8
bump version in docs
fgregg Jan 27, 2022
4962f3b
Create CITATION.cff
fgregg Jan 27, 2022
0d5b956
Fix docs on DateTime
NickCrews Feb 1, 2022
36d24a3
Add note on using None for empty strings
NickCrews Feb 1, 2022
29a0e2b
Expand on `index_predicates` arg in `train()`
NickCrews Feb 1, 2022
92869d1
bump up timout-time for github actions
fgregg Feb 2, 2022
a800831
Merge pull request #949 from NickCrews/docs-datetime
fgregg Feb 2, 2022
8bfcfb0
temporary fastcluster dependency (#951)
fgregg Feb 2, 2022
213d141
better error message when mark_pairs sees unseen records (#953)
fgregg Feb 3, 2022
ff2d280
bump version
fgregg Feb 3, 2022
3c6544e
Restore casting of int in randomPairs (#954)
fgregg Feb 3, 2022
5c9c550
bump
fgregg Feb 3, 2022
9ba44fa
clarify docs, align prepare_training, closes #955
fgregg Feb 4, 2022
0a2d785
when pickling variables and predicates remove unnecessary bits
fgregg Feb 4, 2022
96a222c
remove reference to default args when it's clear from method signature
fgregg Feb 4, 2022
a3aea02
consolidate sorts in filtering of connected components by size
fgregg Feb 6, 2022
2ca5919
typo
fgregg Feb 6, 2022
0e0ed0a
Create lock.yml
fgregg Feb 8, 2022
b07df0a
Update lock.yml
fgregg Feb 8, 2022
2689c63
Update lock.yml
fgregg Feb 8, 2022
4f44562
Update lock.yml
fgregg Feb 8, 2022
b4c24cd
Update lock.yml
fgregg Feb 8, 2022
69edbd3
Pin docs requirements
NickCrews Feb 10, 2022
5e1d75c
Merge pull request #961 from NickCrews/docs-bullets
fgregg Feb 10, 2022
6483258
Fix docs list indentation
NickCrews Feb 10, 2022
56d6266
run lock once a day
fgregg Feb 11, 2022
2405f30
Merge pull request #962 from NickCrews/tweaks
fgregg Feb 11, 2022
358ccc5
Remove junk links from docs index
NickCrews Feb 8, 2022
1ef2cca
Clarify when in_memory option applies.
NickCrews Feb 8, 2022
e46e1e1
Move "How it works" docs into subdirectory.
NickCrews Feb 8, 2022
2b712ef
Split examples into their own doc page
NickCrews Feb 8, 2022
92c0052
Add User Guide
NickCrews Feb 8, 2022
c9287d0
v2 of user guide
NickCrews Feb 21, 2022
fa576f9
Merge pull request #959 from NickCrews/user-guide
fgregg Feb 21, 2022
9f0e16b
Fix docs CSS (weird formatting on docs #968)
NickCrews Feb 21, 2022
4c2dc7f
Merge pull request #969 from NickCrews/docs-css
fgregg Feb 22, 2022
48a8b0b
unpin fastcluster, set python-requires > 3.6
fgregg Feb 27, 2022
ddab6c1
turn off wal
fgregg Mar 15, 2022
54b4f32
wal off
fgregg Mar 15, 2022
1f551d7
Merge branch 'master' of github.com:dedupeio/dedupe
fgregg Mar 15, 2022
8a901c8
start on better labeler
fgregg Mar 15, 2022
a007799
skip empty cover
fgregg Mar 15, 2022
e34aa49
simplify, and doc string
fgregg Mar 15, 2022
f6d9b23
sampling w/o replacement
fgregg Mar 15, 2022
3770697
record link sampling
fgregg Mar 25, 2022
060bfed
Create codeql-analysis.yml
fgregg Mar 29, 2022
b38b9bf
Bump pypa/cibuildwheel from 2.3.1 to 2.4.0
dependabot[bot] Apr 4, 2022
6988884
Merge pull request #985 from dedupeio/dependabot/github_actions/pypa/…
fgregg Apr 4, 2022
0b6032c
move random pairs to convenience
fgregg Apr 20, 2022
257db8b
remove more dead code
fgregg Apr 20, 2022
fc81d0a
remove more dead code
fgregg Apr 20, 2022
ccd983a
Merge pull request #982 from dedupeio/better_sampling
fgregg Apr 20, 2022
ee2dfd9
bump version, 2.0.14
fgregg Apr 20, 2022
22a99da
Update lock.yml
fgregg Apr 26, 2022
ba11285
Update pythonpackage.yml
fgregg Apr 26, 2022
d1ec3e1
Update pythonpackage.yml
fgregg Apr 26, 2022
6a80e15
Update pythonpackage.yml
fgregg Apr 26, 2022
c2ac559
Update pythonpackage.yml
fgregg Apr 26, 2022
249cde6
roll back attempting to build on 3.11
fgregg Apr 27, 2022
392144a
Refactor console_label
NickCrews Apr 27, 2022
3be99d1
Add a positive and negative sample for "unsure"
NickCrews Apr 27, 2022
5ce9644
Fixup signature of _print()
NickCrews Apr 27, 2022
94f7fc9
Fixup: add comment for treatment of "unsure"
NickCrews Apr 27, 2022
c9a3eba
Fixup: Keep queue always having one element per label
NickCrews Apr 28, 2022
5d24e42
Merge pull request #994 from NickCrews/unsure
fgregg Apr 28, 2022
6dd5a00
Remove use of "unsure" from TrainingData type
NickCrews Apr 29, 2022
c4a837b
Merge pull request #995 from NickCrews/unsure-followup
fgregg Apr 29, 2022
da1e5be
REF: remove unneeded ActiveMatching._sample()
NickCrews May 1, 2022
187b276
Fixup flake8
NickCrews May 1, 2022
c0141d9
Merge pull request #998 from NickCrews/_sample_refactor
fgregg May 1, 2022
1e0a8a7
Bump pypa/cibuildwheel from 2.4.0 to 2.5.0
dependabot[bot] May 2, 2022
e6a86db
Bump github/codeql-action from 1 to 2
dependabot[bot] May 2, 2022
c26b042
Merge pull request #999 from dedupeio/dependabot/github_actions/pypa/…
fgregg May 3, 2022
1275f5e
Merge pull request #1000 from dedupeio/dependabot/github_actions/gith…
fgregg May 3, 2022
c28cd13
Apply Black formatting
NickCrews May 3, 2022
748a861
Add black to CI
NickCrews May 3, 2022
9e01ccf
Merge pull request #1003 from NickCrews/black
fgregg May 4, 2022
19 changes: 19 additions & 0 deletions .github/dependabot.yml
@@ -0,0 +1,19 @@
# To get started with Dependabot version updates, you'll need to specify which
# package ecosystems to update and where the package manifests are located.
# Please see the documentation for all configuration options:
# https://help.github.com/github/administering-a-repository/configuration-options-for-dependency-updates

version: 2
updates:
  - package-ecosystem: "pip" # See documentation for possible values
    directory: "/" # Location of package manifests
    schedule:
      interval: "daily"
  - package-ecosystem: "github-actions"
    directory: "/"
    schedule:
      interval: "weekly"
    ignore:
      # Optional: Official actions have moving tags like v1;
      # if you use those, you don't need updates.
      - dependency-name: "actions/*"
70 changes: 70 additions & 0 deletions .github/workflows/codeql-analysis.yml
@@ -0,0 +1,70 @@
# For most projects, this workflow file will not need changing; you simply need
# to commit it to your repository.
#
# You may wish to alter this file to override the set of languages analyzed,
# or to provide custom queries or build logic.
#
# ******** NOTE ********
# We have attempted to detect the languages in your repository. Please check
# the `language` matrix defined below to confirm you have the correct set of
# supported CodeQL languages.
#
name: "CodeQL"

on:
  push:
    branches: [ master ]
  pull_request:
    # The branches below must be a subset of the branches above
    branches: [ master ]
  schedule:
    - cron: '17 21 * * 4'

jobs:
  analyze:
    name: Analyze
    runs-on: ubuntu-latest
    permissions:
      actions: read
      contents: read
      security-events: write

    strategy:
      fail-fast: false
      matrix:
        language: [ 'python' ]
        # CodeQL supports [ 'cpp', 'csharp', 'go', 'java', 'javascript', 'python', 'ruby' ]
        # Learn more about CodeQL language support at https://git.io/codeql-language-support

    steps:
    - name: Checkout repository
      uses: actions/checkout@v3

    # Initializes the CodeQL tools for scanning.
    - name: Initialize CodeQL
      uses: github/codeql-action/init@v2
      with:
        languages: ${{ matrix.language }}
        # If you wish to specify custom queries, you can do so here or in a config file.
        # By default, queries listed here will override any specified in a config file.
        # Prefix the list here with "+" to use these queries and those in the config file.
        # queries: ./path/to/local/query, your-org/your-repo/queries@main

    # Autobuild attempts to build any compiled languages (C/C++, C#, or Java).
    # If this step fails, then you should remove it and run the build manually (see below)
    - name: Autobuild
      uses: github/codeql-action/autobuild@v2

    # ℹ️ Command-line programs to run using the OS shell.
    # 📚 https://git.io/JvXDl

    # ✏️ If the Autobuild fails above, remove it and uncomment the following three lines
    #    and modify them (or add more) to build your code if your project
    #    uses a compiled language

    #- run: |
    #   make bootstrap
    #   make release

    - name: Perform CodeQL Analysis
      uses: github/codeql-action/analyze@v2
22 changes: 22 additions & 0 deletions .github/workflows/lock.yml
@@ -0,0 +1,22 @@
name: 'Lock Threads'

on:
  schedule:
    - cron: '0 0 1/7 * *'
  workflow_dispatch:

permissions:
  issues: write

concurrency:
  group: lock

jobs:
  action:
    runs-on: ubuntu-latest
    steps:
      - uses: dessant/lock-threads@v3
        with:
          process-only: 'issues'
          issue-inactive-days: '14'
          log-output: true
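
For readers unfamiliar with step syntax in the day-of-month field, the following is a minimal sketch of which days the `'0 0 1/7 * *'` schedule would fire on. It assumes the common interpretation of `1/7` as "start at day 1, then every 7th day"; the scheduler's actual handling is not confirmed by this diff, and `lock_run_days` is a hypothetical helper, not part of the workflow.

```python
import calendar


def lock_run_days(year, month, start=1, step=7):
    """Days of the month matched by a day-of-month field like `1/7`.

    Assumption: `start/step` means "from `start` to the end of the
    month, every `step` days", which is how many cron parsers read it.
    """
    last_day = calendar.monthrange(year, month)[1]
    return list(range(start, last_day + 1, step))


if __name__ == "__main__":
    # A 28-day month and a 31-day month give different run counts.
    print(lock_run_days(2022, 2))
    print(lock_run_days(2022, 3))
```

Under that assumption the job runs at midnight UTC four or five times a month, and the gap resets at each month boundary rather than staying a strict seven days.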
112 changes: 112 additions & 0 deletions .github/workflows/pythonpackage.yml
@@ -0,0 +1,112 @@
name: tests

on: [push, pull_request]

jobs:
  format:
    # We want all subsequent steps of CI to refer to the fresh version of code,
    # so run this before anything else.
    runs-on: ubuntu-latest
    steps:
    - uses: actions/checkout@v1
    - name: Set up Python 3.10
      uses: actions/setup-python@v2
      with:
        python-version: "3.10"
    - name: Install Black
      run: pip install black
    - name: Run black --check .
      run: black --check .
    - name: If needed, commit black changes to the pull request
      if: failure()
      run: |
        black .
        git config --global user.name 'autoblack'
        git config --global user.email 'cclauss@users.noreply.github.com'
        git remote set-url origin https://x-access-token:${{ secrets.GITHUB_TOKEN }}@github.com/$GITHUB_REPOSITORY
        git checkout $GITHUB_HEAD_REF
        git commit -am "fixup: Format Python code with Black"
        git push
  test:
    needs: format
    timeout-minutes: 30
    runs-on: ${{ matrix.os }}
    strategy:
      matrix:
        os: [windows-latest, macos-latest, ubuntu-latest]
        python-version: [3.7, 3.8, 3.9, "3.10"]

    steps:
    - uses: actions/checkout@v1
    - name: Set up Python ${{ matrix.python-version }}
      uses: actions/setup-python@v2
      with:
        python-version: ${{ matrix.python-version }}
    - name: Install dependencies
      run: |
        pip install --upgrade pip
        pip install -e .
    - name: Install test dependencies
      run: |
        pip install -r requirements.txt
    - name: pytest
      run: |
        pytest tests dedupe --cov dedupe
    - env:
        COVERALLS_PARALLEL: true
        COVERALLS_REPO_TOKEN: ${{ secrets.GITHUB_TOKEN }}
        COVERALLS_FLAG_NAME: job-${{ matrix.os}}-${{ matrix.python-version }}
      name: Submit to coveralls
      run: |
        pip install https://github.com/bboe/coveralls-python/archive/github_actions.zip
        coveralls
    - name: flake8
      run: |
        flake8 dedupe tests
    - name: Static Testing
      run: |
        mypy --ignore-missing-imports dedupe
    - name: Integration test, record link
      run: |
        python tests/canonical_gazetteer.py -vv
        python tests/canonical_matching.py -vv
    - name: Integration test, dedupe
      run: |
        python tests/canonical.py -vv
  finish:
    needs: test
    runs-on: ubuntu-latest
    steps:
    - name: Coveralls Finished
      uses: coverallsapp/github-action@57daa114ba54fd8e1c8563e8027325c0bf2f5e80
      with:
        github-token: ${{ secrets.GITHUB_TOKEN }}
        parallel-finished: true
  wheels:
    if: github.event_name == 'push' && startsWith(github.event.ref, 'refs/tags')
    needs: test
    name: Build wheels on ${{ matrix.os }}
    runs-on: ${{ matrix.os }}
    strategy:
      matrix:
        os: [windows-latest, macos-latest, ubuntu-latest]
    steps:
    - uses: actions/checkout@v1
    - uses: actions/setup-python@v2
    - name: Build wheels
      uses: pypa/cibuildwheel@2.5.0
      env:
        CIBW_ARCHS_MACOS: x86_64 arm64 universal2
    - name: Build sdist
      run: |
        pip install build
        python -m build --sdist
    - name: Publish wheels to PyPI
      env:
        TWINE_USERNAME: ${{ secrets.PYPI_USERNAME }}
        TWINE_PASSWORD: ${{ secrets.PYPI_PASSWORD }}
      run: |
        pip install twine
        twine upload --skip-existing wheelhouse/*.whl
        twine upload dist/*
      continue-on-error: true
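
As a rough illustration of what the `test` job's `strategy.matrix` expands to, the sketch below enumerates every (os, python-version) pair and builds the matching `COVERALLS_FLAG_NAME` value. The function names are hypothetical and only model the expansion; they are not part of the workflow or of any GitHub API.

```python
from itertools import product

# Values copied from the `test` job's matrix. Versions are strings here
# because an unquoted 3.10 in YAML would be read as the number 3.1,
# which is why the workflow quotes "3.10".
OS_IMAGES = ["windows-latest", "macos-latest", "ubuntu-latest"]
PYTHON_VERSIONS = ["3.7", "3.8", "3.9", "3.10"]


def expand_matrix(os_images, python_versions):
    """Return one (os, python-version) pair per job in the matrix."""
    return list(product(os_images, python_versions))


def coveralls_flag(os_image, python_version):
    """Mirror the template job-<matrix.os>-<matrix.python-version>."""
    return f"job-{os_image}-{python_version}"


jobs = expand_matrix(OS_IMAGES, PYTHON_VERSIONS)

if __name__ == "__main__":
    print(f"{len(jobs)} test jobs")
    for os_image, python_version in jobs:
        print(coveralls_flag(os_image, python_version))
```

Three operating systems times four Python versions gives twelve parallel coverage uploads, which is what the `finish` job's `parallel-finished: true` step then closes out.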
6 changes: 5 additions & 1 deletion .gitignore
@@ -1,3 +1,7 @@
.env
.coverage*
cpredicates.c
*.code-workspace
libdistance-0.2.1
build
_build
@@ -27,4 +31,4 @@ dedupe.egg-info/
ENV
dist
.coverage
src/*.c
src/*.c
22 changes: 0 additions & 22 deletions .manylinux-install.sh

This file was deleted.

23 changes: 23 additions & 0 deletions .readthedocs.yml
@@ -0,0 +1,23 @@
# See https://docs.readthedocs.io/en/stable/config-file/v2.html for details

# Required
version: 2

# Build documentation in the docs/ directory with Sphinx
sphinx:
  configuration: docs/conf.py

# Build documentation with MkDocs
#mkdocs:
# configuration: mkdocs.yml

# Optionally build your docs in additional formats such as PDF and ePub
formats: all

# Optionally set the version of Python and requirements required to build your docs
python:
  version: 3.7
  install:
    - requirements: docs/requirements.txt
    - method: pip
      path: .
96 changes: 0 additions & 96 deletions .travis.yml

This file was deleted.
