Moebius: 0.2B image inpainting model with 10B-level performance Hackernews Viewer

Moebius: 0.2B image inpainting model with 10B-level performance

(hustvl.github.io)

312 points by DSemba 22 June 2026 | 81 comments

Comments

simonw 22 hours ago

I got this working with ONNX (thanks, Claude Opus 4.8) and now I have an interactive demo of the model running entirely in the browser here (~1.3GB download): https://simonw.github.io/moebius-web/ - code here: https://github.com/simonw/moebius-web

(Claude Code transcript: https://gisthost.github.io/?58039ba5c1ca3ed177e8659168996ee4)

Wrote this up in more detail on my blog: https://simonwillison.net/2026/Jun/22/porting-moebius/

lifthrasiir 22 June 2026

Tried a bit, and while it is very impressive for 0.2B model it would be very hard to convince me that this matches with 10B models. It did work reasonably well with natural images but inpainted regions were visibly smoother than surroundings, and performed very badly on novel objects. It is also limited to 512x512 output, which limits its practical usefulness.

xrd 22 June 2026

I did an inpainting project for a client a few years ago. They were trying to inpaint banner ads for concert promoters, and find a way to make it easy to produce a bunch of different sized ads for a variety of placements. I was tasked with inpainting Xmas themed ad for a few major singers.

The weirdest thing was when the inpainting tool added strange people to an image. This singer was all decked out in tinsel and red, and the inpainting model added a grumpy old man in a top hat. I don't recall clicking the "Add creepy old man" button.

At the time this was Stable Diffusion on the backend, run by a variety of model hosting services, Amazon being one. They all had different requirements for the input image and that made things really complex. For some the aspect ratio was impossible to meet, and it would fail if the banner was 200x60. For others, you had to resize it before input, which meant you were adding an image with poor resolution to start. Garbage in, garbage out.

All of this to say, there is a lot of preproduction that went into it, and the client never ended up using my attempts.

james2doyle 22 June 2026

There are some demo spaces using this. This one seems the best (paint your own mask) but it failed on all the images I tried: https://huggingface.co/spaces/multimodalart/Moebius

nickandbro 15 hours ago

Here is a little app I made that allows you to experiment with all of the fine tuned models that runs entirely in your browser:

https://inpaintlab.com/

chatmasta 20 hours ago

What is inpainting? Everyone in the comments seems to be familiar with the term, and I don’t see it described in the linked page.

Zopieux 6 hours ago

Not great. The inpainted areas are, as usual, very smooth compared to the detailed, "high frequency" look of natural photos.

Barely useful enough to erase things in thumbnails.

pattilupone 22 June 2026

I want a version of this for manga (for translation). Right now I think the go-to lightweight inpainting model for anime and manga is LaMa which is several years old now and it feels like there is room for improvement.

delis-thumbs-7e 22 June 2026

This is the useful AI stuf. There’s so many usecases this makes possible.

NooneAtAll3 22 June 2026

I don't understand. Is it available somewhere to try or is it just an ad?

teroshan 22 June 2026

Unrelated but when I read inpainting and Moebius I was scared it was related and using the art of the great Jean Giraud [0] a.k.a. Moebius

https://characterdesignreferences.com/artist-of-the-week-3/m...

[0] https://en.wikipedia.org/wiki/Jean_Giraud

epolanski 22 June 2026

What is the current SOTA for impainting?

I have a potential project for my e-commerce where I want to allow users to upload images of their house exteriors and impaint awnings.

michaelfm1211 22 June 2026

> The core insight of Moebius can be summarized in a single equation: Synergy × (Architecture + Distillation) = Shattering the "Impossible Triangle" of Low Parameters, Fast Inference, and High Quality

Is it just me or is it weird seeing these clickbaity AI-generated taglines in an otherwise scientific work?

gspr 22 June 2026

Nitpick: in the showcase on that page, under Comparison of Natural Scenes, Moebius should definitely get a "structural confusion" tag for the back of the surfboard. If other models get deducted for truncating the surfboard, then surely the elongation that Moebius does should count too.

Also, what's going on behind the in-painted corner of the house? We'd need to see higher resolution pictures, but I'm not convinced that it too shouldn't get a flag. Likewise with the beach just behind the surfboard. Not terrible, but what gets flagged in the competitors is similar.

N_Lens 22 June 2026

The gallery of their samples is pretty impressive!

GL26 22 June 2026

Could this run locally on a smartphone ?

rasz 22 June 2026

It sure has a thing for chins, jaws and removing weight, looksmaxing build in.

hari1123 22 June 2026

lot of the photo editors on mobiles have this, maybe even some apps?

zb3 22 June 2026

1) What are RAM requirements?

2) If these are reasonable, a WebGPU demo would be great..