What is the best AI model for turning a flat-lay into an on-model clothing photo?

In my test, Seedream 4.5 was the clear winner and also the cheapest of the photoreal options. It put the garment on a realistic model and kept the chest print crisp and correctly laid out, where the others either garbled, restyled, or re-laid the text. GPT Image 2 kept the print legible but restyled it, FLUX.2 Pro re-laid it to one thin line, and Nano Banana 2 garbled it into the stripes. All four handled the stripe pattern and the on-model realism well, so the model choice comes down to whether your garment carries a print or logo that has to survive.

Why does my AI on-model photo look great but the logo or text on the garment is wrong?

Because the photo and the garment are two different problems, and most models solve the first while quietly failing the second. The research on virtual try-on documents why: diffusion models compress the image into a latent space that sacrifices high-frequency detail like printed text, logos, and fine patterns for overall realism. So you get a believable model, drape, and pose, with a chest print that has drifted into a plausible-looking but wrong version of your design. Judge the garment, not the model.

Can AI preserve a print, logo, or pattern when placing clothing on a model?

Sometimes, and it depends heavily on the model. In my test the stripe pattern survived on all four models, but the chest text only stayed faithful on one (Seedream 4.5). Simple geometric patterns like stripes transfer more reliably than fine printed text and intricate logos, which are the first thing to degrade. The safe workflow is to test your specific garment, lead your judgment with the print or logo, and proof it at full size before you publish.

How do I turn a flat-lay into an on-model shot with the Masonry CLI?

Pass your flat-lay as a reference image and prompt for an on-model scene. Use the --ref flag to send the garment's real pixels and ask the model to put it on a standing model, preserving the print and pattern. Run the same command across two or three models and compare, because the one that keeps your print faithful is not always the most expensive one. In this test the cheapest photoreal model preserved it best.

Is AI cheaper than a model photoshoot for clothing?

By a wide margin on cost. A model shoot means booking talent, a studio, a photographer, and styling, then editing, which runs into hundreds or thousands of dollars per shoot, versus cents per image with AI. The 2026 reality is that shoppers expect to see clothing on a model and largely do not mind whether the model was photographed or generated, as long as the garment is shown accurately. That accuracy, your real print and pattern, is the thing to protect.

Best AI for Clothing Product Photography in 2026 (One Flat-Lay, Four On-Model Shots)

The way clothing sells online changed. Shoppers now expect to see a garment on a real, varied model rather than a flat-lay or a ghost mannequin, and they largely do not care whether that model was photographed in a studio or generated by an algorithm, as long as the garment looks like the garment. That shift created a whole category of tools that promise the same trick: upload your flat-lay, get back an on-model shot, and they all lead with some version of "flawless fabric preservation."

That phrase is the whole game, so I tested it directly. I made one controlled flat-lay, a navy-and-white striped tee with "GOOD WAVES" printed across the chest, and asked four of the strongest image models to put that exact shirt on a model, with the same prompt and the same source image. This is the first image-to-image test in our product-photography series, and it is a different question from the others: not "can the model draw a nice shirt" but "is it the same shirt." This is the clothing entry alongside the skincare, jewelry, supplements, makeup, food and beverage, footwear, candles, furniture, electronics, handbags, sunglasses, glassware, flowers, watches, perfume, packaging, pet products, toys, textiles, cookware, stationery, drinkware, soap, ceramics, art prints, earbuds, houseplants, knives, and automotive wheels tests and the broader best AI image model for product photography roundup.

Quick answer

Best overall, and cheapest of the photoreal picks: Seedream 4.5. It put the tee on a realistic model and kept the chest print crisp and correctly laid out. The only model that preserved the garment's identity.
The on-model photo is solved: all four produced a believable model wearing the striped tee, with the stripe pattern intact. Realism was not the separator.
The print is where identity breaks: GPT Image 2 kept the text legible but restyled it, FLUX.2 Pro re-laid it to a single thin line, and Nano Banana 2 garbled it into the stripes.

If you only remember one thing: the model shot will look great no matter which model you use. Whether it is still your garment depends entirely on the model you pick, so judge the print, not the pose.

The test: one flat-lay, four on-model shots

I started from a single controlled source image so every model got the identical garment to preserve.

Source flat-lay: a folded navy-and-white striped t-shirt with GOOD WAVES printed across the chest — The source flat-lay (made with GPT Image 2): a navy-and-white Breton-striped tee with a crisp two-line GOOD WAVES serif print. This exact image was the reference fed to all four models. The stripes and the print are the two things that had to survive the transfer onto a model.

Then I passed that flat-lay as a reference to each model and asked for an on-model catalog shot, preserving the stripes and the chest print. Here is what came back.

On-model shot generated by Seedream 4.5 with the GOOD WAVES print kept crisp and faithful — Seedream 4.5 (~4.8 credits): the winner, and the cheapest photoreal option. The print is crisp, correctly laid out on two lines, and the right weight, the stripes are clean, and the on-model shot is premium and high resolution. This is the garment, on a model.

Seedream 4.5 was the surprise. It is the model that made the best hero in the skincare, jewelry, and supplements tests, and here it did something the virtual-try-on literature says is the hard part: it kept the printed text faithful. The two-line "GOOD WAVES" came through crisp and correctly placed, the stripes held, and the model, drape, and lighting read as a real catalog shot. Best result, lowest cost of the photoreal models. If your garment carries a print, this is where I would start.

On-model shot generated by GPT Image 2, print legible but restyled and merged into the stripes — GPT Image 2 (~20.4 credits): legible but restyled. Both words read, but the print is distressed and visually merged into the stripes rather than sitting cleanly on top, and it is the most expensive of the four. Close, but not your exact print.

GPT Image 2 is usually the strongest text model, and it kept "GOOD WAVES" readable, which most try-on pipelines cannot do. But readable is not the same as faithful: it restyled the print into a distressed, vintage-looking treatment that blends into the stripes, rather than the clean serif of the source. For a brand whose graphic has to be exact, that is a drift, just a subtler one than a garble. It was also the priciest model in the test.

On-model shot generated by FLUX.2 Pro, print legible but re-laid to a single thin line — FLUX.2 Pro (~3.6 credits): legible but re-laid. The cheapest model kept clean stripes and a readable print, but collapsed the two-line GOOD WAVES into a single thin outlined line, the wrong layout and weight. Fine if you will overlay the graphic yourself.

FLUX.2 Pro was the cheapest and, like elsewhere in this series, produced a clean main image. The stripes are good and the text is legible, but it re-laid the print: the two-line serif design became a single thin outlined line. The information survived, the design did not. This matches FLUX's pattern across the series, a strong overall image with the fine, exact details quietly altered.

On-model shot generated by Nano Banana 2, print garbled into the stripes — Nano Banana 2 (~9.3 credits): the photo is great, the print is gone. The stripes, model, and drape are convincing, but the GOOD WAVES print has smeared into illegible shapes against the stripes. The clearest case of a believable photo with a broken garment identity.

Nano Banana 2 made a genuinely convincing on-model photo, the stripes, the fit, the pose all read right, which is exactly what makes its failure instructive. The chest print smeared into illegible shapes against the stripes. If you only looked at the thumbnail you would ship it, and you would be shipping a garment that no longer says what yours says. This is the headline risk of the whole category in one image.

The comparison

Model	On-model realism	Stripe pattern	Chest print (the test)	Rough cost/image
Seedream 4.5	Premium, hi-res	Preserved	Faithful, crisp, correct layout	~4.8 credits
GPT Image 2	Good	Preserved	Legible but restyled	~20.4 credits
FLUX.2 Pro	Good	Preserved	Legible but re-laid (1 line)	~3.6 credits
Nano Banana 2	Good	Preserved	Garbled, illegible	~9.3 credits

Credit costs are first-hand from this test on Masonry; per-image rates move, so check current pricing before you budget.

Why the photo is easy and the print is hard

The split in this test is not random. It is exactly what the research on virtual try-on predicts, and understanding the why tells you what to watch for.

The photo and the garment are two different problems. Putting clothing on a believable model is about global structure, body, pose, drape, lighting, and modern image models are good at it. Preserving a printed graphic is about high-frequency detail, the precise edges of letters and logos. Diffusion models compress an image into a latent space to generate it, and that compression is documented to sacrifice exactly this high-frequency information first. The try-on literature is blunt about it: logos and printed text are often lost in the encoding, complex patterns can come back wavy, and embroidered logos can turn to blurs. So the default outcome is a great photo wrapped around a degraded graphic.

Which is why the model choice is the garment-fidelity decision. What this test shows is that the degradation is not uniform. The simple stripe pattern survived on all four models, because broad geometric patterns hold up better than fine type. But the printed text only stayed faithful on one model. The lesson is not "AI cannot preserve prints," it is "most models will not, and one will, so the model you choose is the decision that determines whether the garment in your catalog is actually yours."

The cheap-tool promise and the expensive-model assumption are both wrong. The specialized tools that promise "fabric preservation" are wrappers around these same models, so they inherit this exact behavior. And the most expensive model in the test was not the most faithful. The only way to know is to run your specific garment, with its specific print, and look.

How to shoot your clothing line without a model shoot

The workflow is the product photography roundup approach, applied to image-to-image. Start from a clean flat-lay of the real garment. Pass it as a reference and generate the on-model shot. Then judge the print first and the pose second, because the pose will almost always be fine and the print almost always is where the money is lost. Run two or three models, since the faithful one is not the obvious or the priciest one, and for an intricate logo or exact brand type, plan to overlay the real graphic rather than trusting any from-scratch render.

With the Masonry CLI you pass your flat-lay as a reference and fire the same prompt at every model from one command, which is exactly how the shots above were made:

Prompt

masonry image "put this exact striped tee on a standing model, ecommerce on-model shot, preserve the print and pattern" --ref ./flat-lay.png --model seedream-4-5 masonry image "put this exact striped tee on a standing model, ecommerce on-model shot, preserve the print and pattern" --ref ./flat-lay.png --model gpt-image-2

The bottom line

On-model is the format clothing now sells in, and AI makes it cheap. But cheap and accurate are different: every model in this test produced a believable model in a striped tee, and only one kept the print on that tee faithful, Seedream 4.5, at the lowest cost of the photoreal options. The pattern survived everywhere, the print survived in one place. So the rule for clothing is simple: never judge the on-model render by how real the model looks, judge it by whether the garment is still yours. Run your own flat-lay across two models from one place with the Masonry CLI, or see how the same fidelity-first logic plays out across every product type in our best AI image model for product photography roundup.