feat: support LongCat-Image-Edit on cuda device. by Dragonliu2018 · Pull Request #957 · jd-opensource/xllm

Dragonliu2018 · 2026-02-27T17:00:25Z

This PR supports LongCat-Image-Edit on cuda device.

The test program and the generated image are as follows.

OS: Ubuntu22.04
device: NVIDIA A800-SXM4-40GB
model: LongCat-Image-Edit

import requests
import json
import base64
from PIL import Image
from io import BytesIO

# Test prompt for image generation
url = "http://localhost:9977/v1/image/generation"

img = Image.open("cat.png").convert("RGB")
buf = BytesIO()
img.save(buf, format="PNG")        # 和服务端 OpenCV 解码兼容
img_bytes = buf.getvalue()
image_base64 = base64.b64encode(img_bytes).decode("utf-8")

prompt = "将猫变成狗"
request_data = {
    "model": "LongCat-Image-Edit",
    "input": {
        "prompt": prompt,
        "negative_prompt": "",
        "image": image_base64
    },
    "parameters": {
        "guidance_scale": 1,
        "num_inference_steps": 8,
        "num_images_per_prompt": 1,
        "seed":43
    }
}

print("Testing LongCat-Image-Edit model...")
print(f"Request URL: {url}")
print(f"Request data: {json.dumps(request_data, indent=2, ensure_ascii=False)}")

response = requests.post(url, json=request_data)
if response.status_code != 200:
    print(f"Error: {response.status_code}")
    print(f"Response: {response.text}")
else:
    try:
        result = json.loads(response.text)
        print("Success! Response:")
        print(json.dumps(result, indent=2, ensure_ascii=False))
        
        # Handle image response
        if "output" in result and "results" in result["output"]:
            for i, image_data in enumerate(result["output"]["results"]):
                if "image" in image_data:
                    # Decode base64 image
                    image_bytes = base64.b64decode(image_data["image"])
                    image = Image.open(BytesIO(image_bytes))
                    
                    # Save image
                    filename = f"edited_image_{i+1}.png"
                    image.save(filename)
                    print(f"\nGenerated image saved as: {filename}")
                    print(f"Image size: {image_data.get('width', 'unknown')}x{image_data.get('height', 'unknown')}")
                    print(f"Seed: {image_data.get('seed', 'unknown')}")
    except json.JSONDecodeError as e:
        print(f"Failed to parse JSON response: {e}")
        print(f"Raw response: {response.text}")

Input image:

Output image:

gemini-code-assist

Code Review

This pull request adds support for the LongCat-Image-Edit model on CUDA devices. The changes include a new pipeline implementation, modifications to the model loader to handle preprocessor configurations, and several improvements and bug fixes in the attention and rotary embedding kernels. The implementation of the new pipeline is comprehensive. I've identified one issue in the model loader's error handling for the preprocessor configuration that could lead to silent failures.

…rom pad to pack in PR jd-opensource#957.

Dragonliu2018 requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners February 27, 2026 17:00

gemini-code-assist Bot reviewed Feb 27, 2026

View reviewed changes

Comment thread xllm/core/framework/dit_model_loader.cpp

XuZhang99 reviewed Feb 28, 2026

View reviewed changes

Comment thread xllm/core/kernels/ops_api.cpp Outdated

Comment thread xllm/core/platform/vmm_torch_allocator.h

Dragonliu2018 force-pushed the lzl/feat/support_longcat_image_edit_on_cuda branch 4 times, most recently from c0b7d0a to 94b01e8 Compare March 2, 2026 16:15

Dragonliu2018 requested a review from XuZhang99 March 3, 2026 01:52

XuZhang99 reviewed Mar 3, 2026

View reviewed changes

Comment thread xllm/core/layers/cuda/flashinfer_planinfo.cpp Outdated

feat: support LongCat-Image-Edit on cuda device.

d7dfba9

Dragonliu2018 force-pushed the lzl/feat/support_longcat_image_edit_on_cuda branch from 94b01e8 to d7dfba9 Compare March 3, 2026 05:20

XuZhang99 previously approved these changes Mar 3, 2026

View reviewed changes

Merge branch 'main' into lzl/feat/support_longcat_image_edit_on_cuda

934afc5

Dragonliu2018 dismissed XuZhang99’s stale review via 934afc5 March 5, 2026 02:21

Dragonliu2018 requested a review from XuZhang99 March 5, 2026 02:23

XuZhang99 approved these changes Mar 5, 2026

View reviewed changes

yiming-l21 approved these changes Mar 6, 2026

View reviewed changes

XuZhang99 merged commit dca05bb into jd-opensource:main Mar 7, 2026
20 of 29 checks passed

phantomlei3 pushed a commit to phantomlei3/xllm that referenced this pull request Mar 9, 2026

bugfix: fix qwen vl errors on mlu device caused by RoPE mode switch f…

38a7f3e

…rom pad to pack in PR jd-opensource#957.

phantomlei3 mentioned this pull request Mar 9, 2026

bugfix: fix qwen vl errors on mlu device caused by RoPE mode switch from pad to pack. #1023

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support LongCat-Image-Edit on cuda device.#957

feat: support LongCat-Image-Edit on cuda device.#957
XuZhang99 merged 2 commits intojd-opensource:mainfrom
Dragonliu2018:lzl/feat/support_longcat_image_edit_on_cuda

Dragonliu2018 commented Feb 27, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Dragonliu2018 commented Feb 27, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants