Hi, thank you for releasing this great project!
I am currently trying to use the Egocentric Motion input, but I am a bit confused about the format of the 57D action representation.
Could you please provide a simple example of the Egocentric Motion input (e.g., a sequence of 57D actions) or explain how each dimension is defined?
In particular, I would like to understand:
What coordinate system is used (camera coordinate, world coordinate, or another convention)?
What is the scale/unit of the translation values (meters, centimeters, normalized values, etc.)?
Are the rotation values represented as Euler angles, axis-angle, quaternions, or another format?
Is any normalization applied to the 57D action before feeding it into the model?
Having a concrete example or a description of the preprocessing would be extremely helpful for reproducing the results.
Thank you very much for your help!
Hi, thank you for releasing this great project!
I am currently trying to use the Egocentric Motion input, but I am a bit confused about the format of the 57D action representation.
Could you please provide a simple example of the Egocentric Motion input (e.g., a sequence of 57D actions) or explain how each dimension is defined?
In particular, I would like to understand:
What coordinate system is used (camera coordinate, world coordinate, or another convention)?
What is the scale/unit of the translation values (meters, centimeters, normalized values, etc.)?
Are the rotation values represented as Euler angles, axis-angle, quaternions, or another format?
Is any normalization applied to the 57D action before feeding it into the model?
Having a concrete example or a description of the preprocessing would be extremely helpful for reproducing the results.
Thank you very much for your help!