Skip to content

Refresh#2

Open
daherman wants to merge 398 commits intodaherman:masterfrom
dedupeio:master
Open

Refresh#2
daherman wants to merge 398 commits intodaherman:masterfrom
dedupeio:master

Conversation

@daherman
Copy link
Owner

No description provided.

fgregg and others added 30 commits May 24, 2020 08:25
attempt to restore coveralls, closes #812
fixed links for some outdated and updated examples
Closes #836: escape colon at CompoundPredicate
for bigger data (over 5K records) we draw a random sample for purpose
of estimating the comparison counts of predicates. If we sample w/ replacement
then we have a chance of pulling in exact duplicates, which can cause
us to overestimate the number of comparisons. this is especially
noticeable for precidates that have low comparison count
fgregg and others added 28 commits April 19, 2022 23:09
Better sampling, closes #845 and closes #986
- No content changes
- replace "examples_buffer" -> "labeled" and
  "uncertain_pairs" -> "unlabled" to be more
  consistent
- pull out a _print() function and a _mark_pair() function
  to be more dry. Also fixes a type annotation typo of
  "exmples: TrainingData"
- Use black style formatting in a place or two
- change "if" to "while" when popping from a too-full
  `labeled` (no behavior change at this point, but
  it makes more sense and heads off a future bug where
  `len(labeled)` could be much more than `buffer_len`)
sometimes it takes 0 params.
it is otherwise very not obvious why we do this.
Otherwise we if we ever go "previous" we have to worry
about maybe popping twice if
a past label was "unsure"
Treat "unsure" as both "yes" and "no"
The "unsure" key isn't used anywhere else
Remove use of "unsure" from TrainingData type
No behavior changes (I don't think)

As far as I could tell, this private method is only used in
prepare_training

- Had default arguments that didn't match prepare_training
- Had stale docstring
- Now verifies data before resetting active learner, as a nicety
- Fixes message from assert if you
don't call prepare_training()
REF: remove unneeded ActiveMatching._sample()
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel) from 2.4.0 to 2.5.0.
- [Release notes](https://github.com/pypa/cibuildwheel/releases)
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md)
- [Commits](pypa/cibuildwheel@v2.4.0...2.5.0)

---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [github/codeql-action](https://github.com/github/codeql-action) from 1 to 2.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](github/codeql-action@v1...v2)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
…cibuildwheel-2.5.0

Bump pypa/cibuildwheel from 2.4.0 to 2.5.0
…ub/codeql-action-2

Bump github/codeql-action from 1 to 2
We want all subsequent steps to use the reformatted version,
so do this before anything else
@fgregg fgregg deleted the branch daherman:master May 4, 2022 13:56
@fgregg fgregg deleted the master branch May 4, 2022 13:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.