RepoStepFunStepFunpublished Apr 23, 2025seen 5d

stepfun-ai/Step1X-Edit

Python

Open original ↗

Captured source

source ↗
published Apr 23, 2025seen 5dcaptured 11hhttp 200method plain

stepfun-ai/Step1X-Edit

Description: A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Language: Python

License: Apache-2.0

Stars: 2224

Forks: 102

Open issues: 32

Created: 2025-04-23T09:53:08Z

Pushed: 2026-04-29T14:31:44Z

Default branch: main

Fork: no

Archived: no

README:

🔥🔥🔥 News!!

  • Apr 29, 2026: 🎉 Step Image Edit 2 is now live — a lightweight model designed for ultra-fast response and high-quality output, delivering a real-time interactive creation experience. It can complete image generation and editing tasks within 2 seconds. Feel free to try it out and share your feedback ✨✨✨

Try it here (StepFun Open Platform): https://platform.stepfun.com/docs/zh/guides/models/step-image-edit-2

API documentation: https://platform.stepfun.com/docs/zh/step-plan/integrations/image-api

  • Dec 29, 2025: 🎉 RegionE delivers a 2.5× speedup for Step1X-Edit inference with no accuracy degradation, achieved with just five lines of code.
  • Nov 26, 2025: 👋 We release Step1X-Edit-v1p2 (referred to as ReasonEdit-S in the paper), a native reasoning edit model with better performance on KRIS-Bench and GEdit-Bench. Technical report can be found here.

Models

G_SC⬆️ G_PQ⬆️ G_O⬆️ FK⬆️ CK⬆️ PK⬆️ Overall⬆️

Flux-Kontext-dev 7.16 7.37 6.51 53.28 50.36 42.53 49.54

Qwen-Image-Edit-2509 8.00 7.86 7.56 61.47 56.79 47.07 56.15

Step1X-Edit v1.1 7.66 7.35 6.97 53.05 54.34 44.66 51.59

Step1x-edit-v1p2-preview 8.14 7.55 7.42 60.49 58.81 41.77 52.51

Step1x-edit-v1p2 (base) 7.77 7.65 7.24 58.23 60.55 46.21 56.33

Step1x-edit-v1p2 (thinking) 8.02 7.64 7.36 59.79 62.76 49.78 58.64

Step1x-edit-v1p2 (thinking + reflection) 8.18 7.85 7.58 62.44 65.72 50.42 60.93

  • Sep 08, 2025: 👋 We release step1x-edit-v1p2-preview, a new version of Step1X-Edit with reasoning edit ability and better performance (report to be released soon), featuring:
  • Native Reasoning Edit Model: Combines instruction reasoning with reflective correction to handle complex edits more accurately. Performance on KRIS-Bench:

| Models | Factual Knowledge ⬆️ | Conceptual Knowledge ⬆️ | Procedural Knowledge ⬆️ | Overall ⬆️ | |:------------:|:------------:|:------------:| :------------:|:------------:| | Step1X-Edit v1.1 | 53.05 | 54.34 | 44.66 | 51.59 | | Step1x-edit-v1p2-preview | 60.49 | 58.81 | 41.77 | 52.51 | | Step1x-edit-v1p2-preview (thinking) | 62.24 | 62.25 | 44.43 | 55.21| | Step1x-edit-v1p2-preview (thinking + reflection) | 62.94 | 61.82 | 44.08 | 55.64 |

  • Improved image editing quality and better instruction-following performance. Performance on GEdit-Bench:

| Models | G_SC ⬆️ | G_PQ ⬆️ | G_O ⬆️ | Q_SC ⬆️ | Q_PQ ⬆️ | Q_O ⬆️ | |:------------:|:------------:|:------------:| :------------:|:------------:| :------------:|:------------:| | Step1X-Edit (v1.0) | 7.13 | 7.00 | 6.44 | 7.39 | 7.28 | 7.07 | | Step1X-Edit (v1.1) | 7.66 | 7.35 | 6.97 | 7.65 | 7.41 | 7.35 | | Step1x-edit-v1p2-preview | 8.14 | 7.55 | 7.42 | 7.90 | 7.34 | 7.40 |

  • Jul 09, 2025: 👋 We’ve updated the step1x-edit model and released it as step1x-edit-v1p1 (diffusers version see here), featuring:
  • Added support for text-to-image (T2I) generation tasks
  • Improved image editing quality and better instruction-following performance.

Quantitative evaluation on GEdit-Bench-EN (Full set). G_SC, G_PQ, and G_O refer to the metrics evaluated by GPT-4.1, while Q_SC, Q_PQ, and Q_O refer to the metrics evaluated by Qwen2.5-VL-72B. To facilitate reproducibility, we have released the intermediate results of our model evaluations. | Models | G_SC ⬆️ | G_PQ ⬆️ | G_O ⬆️ | Q_SC ⬆️ | Q_PQ ⬆️ | Q_O ⬆️ | |:------------:|:------------:|:------------:| :------------:|:------------:| :------------:|:------------:| | Step1X-Edit (v1.0) | 7.13 | 7.00 | 6.44 | 7.39 | 7.28 | 7.07 | | Step1X-Edit (v1.1) | 7.66 | 7.35 | 6.97 | 7.65 | 7.41 | 7.35 |

  • Jun 17, 2025: 👋 Support for Teacache and parallel inference has been added.
  • May 22, 2025: 👋 Step1X-Edit now supports Lora finetuning on a single 24GB GPU now! A hand-fixing Lora for anime characters has also been released. Download Lora
  • Apr 30, 2025: 🎉 Step1X-Edit ComfyUI Plugin is available now, thanks for the community contribution! quank123wip/ComfyUI-Step1X-Edit & raykindle/ComfyUI_Step1X-Edit.
  • Apr 27, 2025: 🎉 With community support, we update the inference code and model weights of Step1X-Edit-FP8. meimeilook/Step1X-Edit-FP8 & rkfg/Step1X-Edit-FP8.
  • Apr 26, 2025: 🎉 Step1X-Edit is now live — you can try editing images directly in the online demo! Online Demo
  • Apr 25, 2025: 👋 We release the evaluation code and benchmark data of Step1X-Edit. Download GEdit-Bench
  • Apr 25, 2025: 👋 We release the inference code and model weights of Step1X-Edit. ModelScope & HuggingFace models.
  • Apr 25, 2025: 👋 We have made our technical report available as open source. Read

📖 Introduction

We introduce a state-of-the-art image editing model, Step1X-Edit, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini2 Flash. More specifically, we adopt the Multimodal LLM to process the reference image and user's editing instruction. A latent embedding has been extracted and integrated with a diffusion image decoder to obtain the target image. To train the model, we build a data generation pipeline to produce a high-quality dataset. For evaluation, we develop the GEdit-Bench, a…

Excerpt shown — open the source for the full document.

Notability

notability 5.0/10

New repo with solid traction.