update cleanlab-tlm package to support binary evals by aditya1503 · Pull Request #130 · cleanlab/cleanlab-tlm

aditya1503 · 2025-10-16T18:08:57Z

This PR introduces a mode argument for TLM RAG evals (binary or continuous).
Merge after the backend PR.

aditya1503 · 2025-10-28T17:13:08Z

        query_identifier: Optional[str] = None,
        context_identifier: Optional[str] = None,
        response_identifier: Optional[str] = None,
+        mode: Optional[str] = "numeric",
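
For readers following the hunk above, here is a minimal sketch of how the new argument could sit alongside the existing identifiers. The class name EvalSketch, the ALLOWED_MODES tuple, and the validation rule are assumptions for illustration; only the parameter names and the "numeric" default come from the diff.

```python
from typing import Optional

# Assumption: only these two mode names are accepted; the real package may allow more.
ALLOWED_MODES = ("numeric", "binary")


class EvalSketch:
    """Hypothetical stand-in for the Eval class touched by this diff."""

    def __init__(
        self,
        name: str,
        criteria: str,
        query_identifier: Optional[str] = None,
        context_identifier: Optional[str] = None,
        response_identifier: Optional[str] = None,
        mode: Optional[str] = "numeric",
    ) -> None:
        # Treat None like the default so callers can pass the argument through unchanged.
        resolved = "numeric" if mode is None else mode
        if resolved not in ALLOWED_MODES:
            raise ValueError(f"mode must be one of {ALLOWED_MODES}, got {mode!r}")
        self.name = name
        self.criteria = criteria
        self.query_identifier = query_identifier
        self.context_identifier = context_identifier
        self.response_identifier = response_identifier
        self.mode = resolved
```

Keeping "numeric" as the default means existing callers that never pass mode would see no change in behavior under this sketch.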


jwmueller · 2025-11-05T19:15:37Z

missing unit tests

aditya1503 · 2025-11-07T18:56:42Z

missing unit tests

Added here

elisno · 2025-12-04T17:27:52Z

What's going on with the formatting check in the CI?

elisno

Here's some initial feedback.

elisno · 2025-12-04T17:54:38Z

-    trustworthy_rag,  # noqa: F401
-    trustworthy_rag_api_key,  # noqa: F401


Why did you remove these? These fixtures are not defined in conftest.py, so they need to be imported.

Why is this file being updated at all?

This came from hatch's automatic format fix. I'll look into these.

elisno · 2025-12-04T17:58:50Z

+        # Compile and validate the eval
+        self.mode = self._compile_mode(mode, criteria, name)
+
+    def _compile_mode(self, mode: Optional[str], criteria: str, name: str) -> str:


This _compile_mode method needs to be written more carefully to avoid unintentionally breaking all our tests. Many of the UserWarnings it emits will be raised as errors during automated testing.
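
To illustrate the concern, here is a rough sketch using only the standard library. The function compile_mode_sketch and its heuristic (binary criteria should end in a question mark) are hypothetical and are not the PR's actual logic. It shows a UserWarning turning into an exception under an "error" filter, and a narrowly scoped filter that silences only that one message.

```python
import warnings
from typing import Optional


def compile_mode_sketch(mode: Optional[str], criteria: str, name: str) -> str:
    """Hypothetical validator: warns instead of failing when the mode looks mismatched."""
    resolved = "numeric" if mode is None else mode
    # Assumed heuristic: a binary eval's criteria should be phrased as a question.
    if resolved == "binary" and not criteria.strip().endswith("?"):
        warnings.warn(
            f"Eval '{name}' uses binary mode but its criteria is not a yes/no question.",
            UserWarning,
            stacklevel=2,
        )
    return resolved


# Simulate a test runner configured to treat warnings as errors.
with warnings.catch_warnings():
    warnings.simplefilter("error")
    try:
        compile_mode_sketch("binary", "Rate the helpfulness.", "helpfulness")
        raised = False
    except UserWarning:
        raised = True

# Narrowing the filter to one message keeps every other warning strict.
with warnings.catch_warnings():
    warnings.simplefilter("error")
    warnings.filterwarnings(
        "ignore", message=r"Eval '.*' uses binary mode", category=UserWarning
    )
    result = compile_mode_sketch("binary", "Rate the helpfulness.", "helpfulness")
```

Filtering by message pattern rather than by category is one way to avoid hiding unrelated UserWarnings in the test suite.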

aditya1503 · 2025-12-04T18:22:05Z

+        # Compile and validate the eval
+        self.mode = self._compile_mode(mode, criteria, name)
+
+    def _compile_mode(self, mode: Optional[str], criteria: str, name: str) -> str:


Add separate test cases for these
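
A possible shape for such test cases, sketched with unittest from the standard library. The function compile_mode_sketch is a hypothetical stand-in for the method under review, and the rules it enforces are assumptions; the project's real tests and fixtures may look quite different.

```python
import unittest
import warnings
from typing import Optional


def compile_mode_sketch(mode: Optional[str], criteria: str, name: str) -> str:
    """Hypothetical stand-in for the method under review; the real rules may differ."""
    resolved = "numeric" if mode is None else mode
    if resolved not in ("numeric", "binary"):
        raise ValueError(f"Eval '{name}': unsupported mode {mode!r}")
    if resolved == "binary" and not criteria.strip().endswith("?"):
        warnings.warn(
            f"Eval '{name}': binary criteria should be a yes/no question.", UserWarning
        )
    return resolved


class CompileModeSketchTest(unittest.TestCase):
    def test_none_falls_back_to_numeric(self) -> None:
        self.assertEqual(compile_mode_sketch(None, "Rate the answer.", "quality"), "numeric")

    def test_well_formed_binary_is_silent(self) -> None:
        with warnings.catch_warnings():
            warnings.simplefilter("error")  # any warning would fail this test
            mode = compile_mode_sketch("binary", "Is the answer correct?", "correct")
        self.assertEqual(mode, "binary")

    def test_mismatched_binary_warns(self) -> None:
        with self.assertWarns(UserWarning):
            compile_mode_sketch("binary", "Rate the answer.", "quality")

    def test_unknown_mode_raises(self) -> None:
        with self.assertRaises(ValueError):
            compile_mode_sketch("bogus", "Rate the answer.", "quality")


def run_sketch_tests() -> unittest.TestResult:
    """Load and run the four cases above, returning the aggregated result."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(CompileModeSketchTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Each case pins down one branch, so a warning that is expected is asserted explicitly rather than left to trip a global warnings-as-errors setting.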

update cleanlab-tlm package to support binary evals

1aae0be

aditya1503 commented Oct 28, 2025

aditya1503 added 2 commits November 4, 2025 18:01

update default evals

6e33a7a

format fix

3b6ede4

aditya1503 requested a review from elisno November 4, 2025 14:33

jwmueller reviewed Nov 6, 2025

Comment thread src/cleanlab_tlm/utils/rag.py Outdated

Update src/cleanlab_tlm/utils/rag.py

7f079ff

Co-authored-by: Jonas Mueller <1390638+jwmueller@users.noreply.github.com>

aditya1503 added the Do not merge label Nov 7, 2025

aditya1503 added 6 commits November 8, 2025 00:07

add test cases

edf67a6

Merge branch 'add_binary' of github.com:cleanlab/cleanlab-tlm into ad…

4401157

…d_binary

mypy fix

99f813b

hatch format

5063889

hatch fix

22dc55a

hatch dict

961726a

aditya1503 marked this pull request as ready for review November 7, 2025 18:56

jwmueller mentioned this pull request Nov 18, 2025

Added mode as an argument to Evals class to enable binary evals #119

Closed

mturk24 and others added 3 commits November 25, 2025 02:28

Added support for evals compilation checks and auto mode (#131)

c45e7cb

Co-authored-by: Aditya Thyagarajan <aditya1593@icloud.com> Co-authored-by: Jonas Mueller <1390638+jwmueller@users.noreply.github.com>

update tests

14599dc

binary switch

9463810

aditya1503 removed the Do not merge label Dec 3, 2025

aditya1503 added 4 commits December 3, 2025 23:46

Merge branch 'main' into add_binary

40c078f

fix tests to filter specific warnigns

ccafd2a

update tests

b028250

hatch format

e32eb69

warning fix

6adba18

elisno reviewed Dec 4, 2025

aditya1503 commented Dec 4, 2025

aditya1503 added 9 commits December 6, 2025 00:59

update binary evals

d1fe02c

hatch format

6399d18

noqa

55a36e4

update tests

645c1bc

default for context sufficienchy

3a75591

Merge branch 'main' into add_binary

4236c67

test fix

63a2386

sample eval

4a2253e

update custom evals

2225355

aditya1503 wants to merge 27 commits into main from add_binary


4 participants
