Skip to content

Questions about paper #3

@stalkerrush

Description

@stalkerrush

Hi @B1ueber2y @pengsongyou , thanks for the great work! I have two questions regarding your paper:

  1. For video sequence supervised reconstruction, you mentioned that you did not use masks. Then why does the final predicted shape only contain foreground objects? I assume that the shape should be random at the beginning and the photometric loss is applied to each pixel of a pair of groundtruth images.
  2. It seems that you always optimize the latent code instead of the model. To my understanding, this means for every new object you would need a specific optimization. Is there any reason why you didn't optimize the model and use an image encoder to make the sdf network conditioned on an input?

Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions