7 changes: 7 additions & 0 deletions docs/source/candidate_aggregators.md
@@ -0,0 +1,7 @@
# Candidate Aggregators for Multi-Task Learning

## GradNorm
GradNorm (Gradient Normalization) balances multi-task training by learning the weight coefficient of each task's loss during training. Alongside the usual training objective, it minimizes an auxiliary loss that penalizes the difference between each task's actual gradient norm (measured on the weighted loss) and a target norm. The target is the mean gradient norm across tasks, scaled by the task's relative inverse training rate raised to a hyperparameter power, so tasks that are learning more slowly are pushed toward larger gradients. The goal is to keep any single task from dominating the model updates, which is intended to reduce overfitting to that task and to improve learning across all tasks. GradNorm tunes gradient magnitudes only, through the loss weights.
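
The auxiliary loss described above can be sketched in a few lines of plain Python. This is a minimal illustration, not TorchJD code: the function name `gradnorm_aux_loss`, its arguments, and the default `alpha` are assumptions made for the example. It takes the per-task gradient norms as already computed, and it treats the targets as constants, as GradNorm does when it differentiates this loss with respect to the loss weights.

```python
def gradnorm_aux_loss(grad_norms, losses, initial_losses, alpha=1.5):
    """Return (targets, aux_loss) for one training step.

    grad_norms:     norm of each task's weighted-loss gradient (hypothetical input)
    losses:         current loss of each task
    initial_losses: loss of each task at the start of training
    alpha:          strength of the pull toward equal training rates (assumed default)
    """
    # Loss ratio L_i(t) / L_i(0): a smaller ratio means the task has trained faster.
    ratios = [cur / init for cur, init in zip(losses, initial_losses)]
    mean_ratio = sum(ratios) / len(ratios)

    # Relative inverse training rate: above 1 for tasks that lag behind the average.
    rel_rates = [r / mean_ratio for r in ratios]

    # Target norm: mean gradient norm, scaled up for slow tasks and down for fast ones.
    mean_norm = sum(grad_norms) / len(grad_norms)
    targets = [mean_norm * r ** alpha for r in rel_rates]

    # Auxiliary loss: L1 distance between actual and target gradient norms.
    aux_loss = sum(abs(g - t) for g, t in zip(grad_norms, targets))
    return targets, aux_loss
```

When all tasks have the same gradient norm and the same loss ratio, every target equals the actual norm and the auxiliary loss is zero, so the loss weights are left unchanged.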

## DB-MTL (Dual-Balancing Multi-Task Learning)
DB-MTL handles imbalance at both the loss level and the gradient level, hence "dual-balancing". For loss-scale balancing, it applies a parameter-free logarithm transformation to each task's loss, which brings the losses to a similar scale. For gradient-magnitude balancing, it uses a training-free maximum-norm normalization: every task gradient is rescaled to the maximum gradient norm among the tasks. Unlike GradNorm, which learns loss weights by minimizing an auxiliary loss, DB-MTL adds no learnable weights (it is "training-free"), which keeps its overhead low while balancing both loss scales and gradient magnitudes.
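
The two balancing steps can be sketched as follows. This is a simplified illustration in plain Python, not the reference implementation and not TorchJD's API: the function name `db_mtl_combine`, the `eps` constant, and the choice to sum the rescaled gradients are assumptions made for the example. In an actual training loop the task gradients would be taken from the log-transformed losses; here they are passed in directly to keep the sketch self-contained.

```python
import math


def db_mtl_combine(losses, grads, eps=1e-8):
    """Return (log_losses, combined_gradient).

    losses: positive loss value of each task
    grads:  one gradient per task, each a list of floats of equal length
    eps:    small constant for numerical safety (assumed value)
    """
    # Loss-scale balancing: the logarithm brings losses of different scales closer.
    log_losses = [math.log(loss + eps) for loss in losses]

    # Gradient-magnitude balancing: find the largest task gradient norm.
    norms = [math.sqrt(sum(x * x for x in g)) for g in grads]
    max_norm = max(norms)

    # Rescale each gradient to the maximum norm, then sum (summation is an assumption).
    combined = [0.0] * len(grads[0])
    for g, norm in zip(grads, norms):
        scale = max_norm / (norm + eps)
        for j, x in enumerate(g):
            combined[j] += scale * x
    return log_losses, combined
```

For example, with gradients `[3.0, 4.0]` (norm 5) and `[0.0, 1.0]` (norm 1), the second gradient is scaled up by roughly 5 so that both contribute with the same magnitude to the combined update.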
1 change: 1 addition & 0 deletions docs/source/index.rst
@@ -66,6 +66,7 @@ TorchJD is open-source, under MIT License. The source code is available on

installation.md
examples/index.rst
candidate_aggregators.md

.. toctree::
:caption: API Reference