Conversation

@magicheng0816
Collaborator

No description provided.

if (input_embeddings_vec_.size() > 0) {
torch::Tensor input_embeddings = torch::cat(input_embeddings_vec_);
raw_forward_input.embeddings = tensor_to_2d_float_vector(input_embeddings);
}
Collaborator

nit: it seems that input_embedding is unused now; maybe @wly-115 @yiming-l21 @RobbieLeung can help confirm this. If so, we can delete all input_embedding-related code.

Collaborator Author

input_embedding will be used in generative recommendation; please don't delete it for now.


const auto& input_embedding = sequence->get_input_embedding();
-if (input_embedding.defined())
+if (sequence->stage() == SequenceStage::PREFILL && input_embedding.defined())
Member

Will sequence be nullptr? If not, the signature should be "Sequence&";

Member

if yes, the nullptr case should be handled.

Collaborator Author

At the beginning of the function, 'CHECK(sequence != nullptr)' ensures that sequence cannot be nullptr here. Using a pointer instead of a reference is necessary so that the same sequence object can be kept in both the batch and the request.

return tensor;
};

inline std::vector<std::vector<float>> tensor_to_2d_float_vector(
Member

this will lead to a copy... please fix it.

Collaborator Author

  1. Passing 'const torch::Tensor& tensor' as a function parameter does not copy the tensor data; only a reference to the tensor object is passed. PyTorch's torch::Tensor is just a reference-counted smart-pointer wrapper, so even passing torch::Tensor by value would not copy the underlying data.
  2. Returning 'std::vector<std::vector<float>>' from the function does not copy data either. First, std::vector is a dynamic-array container, and its move constructor only transfers ownership of the metadata. Second, the compiler applies Named Return Value Optimization (NRVO), constructing the vector directly in the caller's memory and skipping the copy/move entirely.
  3. Inside the function, if the input tensor satisfies tensor.device().type() == torch::kCPU && tensor.scalar_type() == torch::kFloat32 && tensor.is_contiguous(), there is also no data copy; if any of these conditions is not met, the data is copied, which is expected.

@linkerzhang linkerzhang self-requested a review December 10, 2025 04:27