We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
2 parents b200f76 + 416a059 commit c7e9111Copy full SHA for c7e9111
1 file changed
Duplicates/README.md
@@ -42,3 +42,16 @@ print(len(ds.assignments))
42
print(len(ds.pairs))
43
```
44
45
+### Origin
46
+
47
+The choice of the files was designed in the included [notebooks](notebooks).
48
49
+### Limitations
50
51
+There were ~4 active human reviewers who did the labeling, they were from
52
+the same company, and talked to each other. Hence there can be bias in the labels.
53
+Code duplication is subjective, anyway.
54
55
+### License
56
57
+Code: MIT. Labels: Open Data Commons Open Database License (ODbL). Actual file contents © their authors.
0 commit comments