Hi, thanks for the work and for releasing the code. Both are very helpful. However, I feel like something might be off, and I would really appreciate some clarification. Please correct me if I am misunderstanding the code. Thank you!
- In here, this is calculating the similarity score between the instruction and observations instead of actions, since `emb` is calculated from here. `'Actions'` in `memory` is the trajectory, which always starts with `Action: search[]`, since it is passed in here with `actions[2:-1]`. So `emb` basically covers `obs (page after first search), act, obs, act, ...`, and `np.argmax(cos_scores_act)*2]` always selects an even position, which is an `obs`. This issue also affects all the other places that deal with even and odd positions.
- In lines, what is the purpose of adding 2?
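
To make the even/odd concern from the first point concrete, here is a minimal sketch of how I understand the indexing. The trajectory contents and the similarity scores are made up for illustration, and `trajectory`, `pair`, `even_pick`, and `odd_pick` are hypothetical names, not taken from the repository. It only assumes the layout described above, where the embedded list alternates `obs, act, obs, act, ...`.

```python
import numpy as np

# Hypothetical trajectory in the layout described above: obs, act, obs, act, ...
trajectory = [
    "Observation: results page",
    "Action: click[item]",
    "Observation: item page",
    "Action: click[buy now]",
]

# Made-up similarity scores, one per (obs, act) pair; not real model output.
cos_scores_act = np.array([0.2, 0.9])

pair = int(np.argmax(cos_scores_act))

# Multiplying by 2 always yields an even index, which is an observation here.
even_pick = trajectory[pair * 2]

# An odd index would be needed to land on an action in this layout.
odd_pick = trajectory[pair * 2 + 1]

print(even_pick)
print(odd_pick)
```

If the list really does start with an observation, then an index of the form `k * 2` can never point at an action, which is why I suspect the comparison is not doing what the variable name suggests.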