Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 7 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,13 @@ changelog does not include internal changes that do not affect the user.

### Added

- Added `SDMGradWeighting` from
[Direction-oriented Multi-objective Learning: Simple and Provable Stochastic
Algorithms](https://arxiv.org/pdf/2305.18409)
(NeurIPS 2023). It is a stateful `Weighting` that solves for task weights via a simplex-projected
inner loop on a cross-batch matrix `A = J_1 @ J_2.T` (computed from two independent mini-batches
using `autojac.jac`), with a direction-oriented regularizer pulling the descent direction toward
a preference direction.
- Added `DWA` (Dynamic Weight Average) from [End-to-End Multi-Task Learning with
Attention](https://openaccess.thecvf.com/content_CVPR_2019/papers/Liu_End-To-End_Multi-Task_Learning_With_Attention_CVPR_2019_paper.pdf)
(CVPR 2019), a stateful `Scalarizer` that weights each value by the relative rate at which its
Expand All @@ -25,7 +32,6 @@ changelog does not include internal changes that do not affect the user.

### Added

- Added `SDMGradWeighting` from [Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms](https://arxiv.org/pdf/2305.18409) (NeurIPS 2023). It is a stateful `Weighting` that solves for task weights via a simplex-projected inner loop on a cross-batch matrix `A = J_1 @ J_2.T` (computed from two independent mini-batches using `autojac.jac`), with a direction-oriented regularizer pulling the descent direction toward a preference direction.
- Added `IMTL-L` (the loss-balancing variant of Impartial Multi-Task Learning) from [Towards
Impartial Multi-Task Learning](https://openreview.net/pdf?id=IMPnRXEWpvr) (ICLR 2021), a stateful
`Scalarizer` that learns a per-task scale `s_i` and combines the values as
Expand Down
Loading