Question about the model conditioning design

Hi @suniique
Thanks for your great work and for open-sourcing this code!

I have a question regarding the model design of the RPF. I noticed that the input to the flow model consists of the point features of unposed parts concatenated with noise along the feature dimension.

I am curious why you chose to concatenate these features rather than injecting them as common conditional signals using AdaLN in the DiT block, which seems to be the standard design for conditional generation. Is there a specific intuition or advantage behind this design choice?

Any response is appreciated!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question about the model conditioning design #29

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about the model conditioning design #29

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions