Thank you for your great work!
I have a question regarding the top-k energy selection process. Since the energy is normalized per frame, the values across different frames are on different scales. Could you please explain the rationale behind performing a global top-k selection across all frames after this per-frame normalization?
Thank you for your great work!
I have a question regarding the top-k energy selection process. Since the energy is normalized per frame, the values across different frames are on different scales. Could you please explain the rationale behind performing a global top-k selection across all frames after this per-frame normalization?