
Add semi-supervised losses and test #1551

Open
surajyadav-research wants to merge 4 commits into google-deepmind:main from surajyadav-research:semi-sl

Conversation

@surajyadav-research

#1550

fixmatch_loss — tests (see the sketch after this list)

  • Random batched matches reference: compares output to a float32 reference implementation (hard/soft supervised labels), across different B/U/C, confidence_threshold, lambda_u, and dtype (incl. bfloat16); also checks output is finite.
  • vmap correctness: jax.vmap(fixmatch_loss) matches lax.map (per-item loop) output; also checks finiteness.
  • Permutation invariance: shuffling labeled batch order and unlabeled batch order doesn’t change the loss (with threshold set so ordering shouldn’t matter); checks finiteness.
  • Numerical stability (extreme logits): loss stays finite for very large-magnitude logits; gradients w.r.t. labeled logits and strong unlabeled logits are finite.
  • lambda_u = 0 supervised-only: returns exactly supervised cross-entropy when unsupervised weight is zero.
  • Confidence threshold edges:
    • too high (e.g., >1) → no pseudo-labels used → supervised-only
    • 0.0 → unsupervised term included (loss ≥ supervised loss)
  • Empty unlabeled batch: if U=0, behaves as supervised-only; finite.
  • Gradient flows through strong logits: gradient w.r.t. strong logits (us) is non-zero and finite.
  • bfloat16 run: smoke test that it runs in bfloat16 and returns finite output.
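For orientation, here is a minimal sketch of the semantics the tests above assume. The name `fixmatch_loss_sketch` and the exact signature are illustrative guesses, not the PR's actual code; it assumes integer labels and mean reduction.

```python
import jax
import jax.numpy as jnp
import optax


def fixmatch_loss_sketch(
    labeled_logits,   # [B, C] logits for labeled examples
    labels,           # [B] integer labels (soft labels would use
                      # optax.softmax_cross_entropy instead)
    weak_logits,      # [U, C] logits for weakly augmented unlabeled examples
    strong_logits,    # [U, C] logits for strongly augmented unlabeled examples
    confidence_threshold=0.95,
    lambda_u=1.0,
):
  # Supervised term: standard cross-entropy on the labeled batch.
  sup = optax.softmax_cross_entropy_with_integer_labels(
      labeled_logits, labels
  ).mean()

  # Pseudo-labels come from the weak branch and are treated as constants.
  probs = jax.nn.softmax(jax.lax.stop_gradient(weak_logits), axis=-1)
  pseudo_labels = jnp.argmax(probs, axis=-1)
  # Only confident predictions contribute; a threshold > 1 masks everything
  # and a threshold of 0 keeps everything, matching the edge-case tests.
  mask = (jnp.max(probs, axis=-1) >= confidence_threshold).astype(sup.dtype)

  # Unsupervised term: cross-entropy of the strong branch against the
  # pseudo-labels. With U = 0 the sum is empty, so the term is 0 and the
  # loss reduces to the supervised term; gradients flow only through
  # strong_logits.
  per_example = optax.softmax_cross_entropy_with_integer_labels(
      strong_logits, pseudo_labels
  )
  unsup = (mask * per_example).sum() / max(weak_logits.shape[0], 1)

  return sup + lambda_u * unsup
```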

mixmatch_loss — tests (see the sketch after this list)

  • Random batched matches reference: compares output to a float32 reference implementation (hard/soft supervised labels), different B/U/C, lambda_u, and dtype (incl. bfloat16); checks finiteness.
  • vmap correctness: jax.vmap(mixmatch_loss) matches lax.map; checks finiteness.
  • Permutation invariance: shuffling labeled and unlabeled batches doesn’t change the loss; checks finiteness.
  • Numerical stability (extreme logits): loss finite for huge logits; gradients w.r.t. labeled logits and unlabeled logits are finite.
  • lambda_u = 0 supervised-only: returns supervised cross-entropy when unsupervised weight is zero.
  • Stop-gradient on unlabeled targets: gradient w.r.t. unlabeled_targets is zero (targets are treated as constants).
  • Unsupervised term zero when targets match probs: if unlabeled_targets == softmax(unlabeled_logits), unsupervised loss becomes ~0, so total ≈ supervised-only.
  • bfloat16 run: smoke test that it runs in bfloat16 and returns finite output.
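A similar sketch for the MixMatch semantics these tests imply. The squared-error unsupervised term is inferred from the "zero when targets match probs" test (a cross-entropy term would not vanish there); again, the name and signature are guesses.

```python
import jax
import jax.numpy as jnp
import optax


def mixmatch_loss_sketch(
    labeled_logits,     # [B, C] logits for labeled examples
    labels,             # [B, C] one-hot or soft supervised targets
    unlabeled_logits,   # [U, C] logits for unlabeled examples
    unlabeled_targets,  # [U, C] guessed label distributions
    lambda_u=1.0,
):
  # Supervised term: cross-entropy against (possibly soft) labels.
  sup = optax.softmax_cross_entropy(labeled_logits, labels).mean()

  # Unsupervised term: squared error between predicted probabilities and
  # the guessed targets. stop_gradient makes the targets constants, so the
  # gradient w.r.t. unlabeled_targets is zero, and the term vanishes when
  # unlabeled_targets == softmax(unlabeled_logits).
  targets = jax.lax.stop_gradient(unlabeled_targets)
  preds = jax.nn.softmax(unlabeled_logits, axis=-1)
  unsup = jnp.mean(jnp.sum(jnp.square(preds - targets), axis=-1))

  return sup + lambda_u * unsup
```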

@surajyadav-research
Author

Hi @rdyro,
Could you please review this implementation when you have time? I included a few tests to check the robustness of the losses; let me know if any of them seem unnecessary and I'll remove them.

Collaborator

@vroulet vroulet left a comment


Looks pretty good, thanks. Just add references and use the correct headers.

Comment thread optax/losses/_semi_supervised.py Outdated
@@ -0,0 +1,173 @@
# Copyright 2024 DeepMind Technologies Limited. All Rights Reserved.
Collaborator


2026

Comment thread optax/losses/_semi_supervised.py Outdated
lambda_u: Weight for unlabeled term.

Returns:
Scalar FixMatch loss.
Collaborator


Add reference to paper (be careful about formatting, see e.g. how it is done in the docstring of adam)
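For context, optax docstrings cite papers in an RST `References:` section, as in adam's docstring. For fixmatch_loss the entry would presumably be the FixMatch paper, formatted roughly like this (the surrounding docstring text is a placeholder):

```python
def fixmatch_loss(*args, **kwargs):
  r"""FixMatch semi-supervised loss (docstring formatting sketch only).

  References:
    Sohn et al., `FixMatch: Simplifying Semi-Supervised Learning with
    Consistency and Confidence <https://arxiv.org/abs/2001.07685>`_, 2020
  """
```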


Returns:
Scalar MixMatch loss.
"""
Collaborator


Same here, add reference
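The corresponding entry here would presumably be the MixMatch paper, in the same style:

```python
def mixmatch_loss(*args, **kwargs):
  r"""MixMatch semi-supervised loss (docstring formatting sketch only).

  References:
    Berthelot et al., `MixMatch: A Holistic Approach to Semi-Supervised
    Learning <https://arxiv.org/abs/1905.02249>`_, 2019
  """
```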

Comment thread optax/losses/_semi_supervised_test.py Outdated
lambda_u=lambda_u,
)

self._assert_allclose(got, expected, dtype)
Collaborator


Use the wrapper with self.subTest(...) (see tests in other files). This way, if one case fails, the others still run, so we get all the information at once.

Update all tests to use that pattern.
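A sketch of the requested pattern; the test body below is a trivial stand-in, not the PR's actual assertions:

```python
import jax.numpy as jnp
import numpy as np
from absl.testing import parameterized


class FixMatchLossTest(parameterized.TestCase):

  def test_matches_reference(self):
    for dtype in (jnp.float32, jnp.bfloat16):
      # Each subtest passes or fails independently, so one failing dtype
      # no longer hides the results for the remaining dtypes.
      with self.subTest(dtype=dtype):
        got = jnp.zeros((), dtype=dtype)  # stand-in for the loss call
        np.testing.assert_allclose(np.asarray(got, np.float32), 0.0)
```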

Comment thread optax/losses/_semi_supervised_test.py Outdated

class FixMatchLossTest(parameterized.TestCase):
@staticmethod
def _assert_allclose(got, expected, dtype):
Collaborator


Make these not class methods but private functions at the file level, since I believe they are used in both tests.
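A sketch of the suggested refactor, with hypothetical tolerances (the real helper's thresholds may differ):

```python
import jax.numpy as jnp
import numpy as np


# Module-level private helper, shared by FixMatchLossTest and
# MixMatchLossTest instead of being duplicated as a @staticmethod.
def _assert_allclose(got, expected, dtype):
  # Loosen tolerances for low-precision dtypes such as bfloat16.
  tol = 1e-2 if dtype == jnp.bfloat16 else 1e-5
  np.testing.assert_allclose(
      np.asarray(got, dtype=np.float32),
      np.asarray(expected, dtype=np.float32),
      rtol=tol,
      atol=tol,
  )
```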

@surajyadav-research
Author

@vroulet Thank you for reviewing the code. I’ll push the updated changes ASAP.

@surajyadav-research
Author

Hi @vroulet,
I've pushed all the requested changes. Whenever you have time, could you please review them?

