Questions about soft-min-snr loss

Really nice work! When reading through the paper, I have some questions about the proposed soft-min-snr loss. Would appreciate your feedback on this.

1. In eq (5) of the hourglass diffusion transformers, it's mentioned that `c_out^{-2}(\sigma)` is incorporated, however, based on the definition of `c_out`, eq (5) should be

```
min(SNR, \gamma) * (\sigma_data^2 + \sigma^2) / (\sigma_data^2 * \sigma^2).
```

2. In the implementation:
https://github.com/crowsonkb/k-diffusion/blob/6ab5146d4a5ef63901326489f31f1d8e7dd36b48/k_diffusion/layers.py#L64-L65

The  `\gamma=4 or 5` proposed in the paper doesn't seem to be used. Am I missing anything here?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about soft-min-snr loss #98

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

	def _weighting_soft_min_snr(self, sigma):
	return (sigma * self.sigma_data) 2 / (sigma 2 + self.sigma_data 2) 2

Questions about soft-min-snr loss #98

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions