How to Generate Images and Video in Claude Code (Without Leaving the Terminal)

Q: How do I generate images in Claude Code from the terminal?

Install the Masonry CLI with npx @masonryai/cli, run masonry login, then either ask Claude Code in plain language to generate an asset and it runs masonry image for you, or run masonry skill install once so the agent reaches for generation on its own.

Q: Do I need a plugin or MCP server?

No. Because masonry is a normal shell command, any agent with shell access can call it. Running masonry skill install makes Claude Code reach for it automatically, but it is optional.

Yes, you can generate images and video directly in Claude Code. It cannot do it natively, it is a text coding agent, so when you ask it for a hero image, an OG card, or a placeholder asset, it can write the markup that references the file but it cannot create the file itself. The fix is to give it a command it can run, and from then on it generates the asset mid-session like any other shell step. Without that, the usual workaround is to stop, open a separate image tool, generate something, download it, drag it into your repo, and pick up where you left off. That context switch is small but it happens constantly, and it pulls you out of the flow that made the agent useful in the first place.

The fix is to give the agent a command it can run itself. The Masonry CLI generates images and video from the terminal across 50+ models, which means Claude Code (or any coding agent that can run shell commands) can produce a real asset mid-session and keep going. Here is the whole setup.

Quick answer

Prompt

# install npx @masonryai/cli # connect your account (opens a link) masonry login # generate an image masonry image "a minimalist mountain logo, flat vector" --output logo.png # generate a video masonry video "slow dolly over a misty forest at dawn" --aspect 16:9

That is the core of it. Two commands, masonry image and masonry video, plus flags for the model, aspect ratio, and output path. Everything below is detail and the agent workflow.

Why run image generation from the terminal

A web UI is great for browsing and exploring. A CLI is better the moment generation becomes part of a repeatable workflow, because a command can be copied, edited, scripted, and version controlled. For developers that shows up in obvious places: generating a cover image while you write the blog post, filling a UI with realistic placeholder assets instead of gray boxes, batching OG images for a set of pages, or producing a quick product video for a landing section.

The bigger shift is the agent angle. When the tool is a shell command, an agent that already has shell access can call it without any special integration. You ask Claude Code to build a feature, it scaffolds the page, and when it needs a hero image it runs masonry image and references the result, all in one pass. No tab switch, no copy paste, no breaking the agent's momentum.

Install and connect

Install and run it with npx, or install it globally:

npx @masonryai/cli
# or
npm install -g @masonryai/cli

Then connect your Masonry account. masonry login opens a link in your browser to authorize the CLI, and after that the credentials are stored locally so you do not pass keys around on the command line.

The two commands

Image generation takes a prompt and saves a file:

Prompt

masonry image "neon cyberpunk street at night" # → Using model: Nano Banana 2 (the default when you don't pass --model) # → Generating image... # ✓ Saved to cyberpunk-street.png

Video works the same way:

Prompt

masonry video "ocean waves at golden hour" --model veo-3.1-generate-preview # → Using model: veo-3.1-generate-preview # → Generating video... # ✓ Saved to ocean-waves.mp4

You can pin a specific model, set the aspect ratio and dimensions, and choose the output path:

Prompt

masonry image "studio product shot of a frosted glass bottle" --model flux-2-pro --aspect 1:1 --output bottle.png

And you can animate an existing image instead of starting from text, which is how you turn a static render into a short motion clip:

Prompt

masonry video --image ./bottle.png --model kling-v2-6-pro-i2v # → Input: ./bottle.png # → Using model: kling-v2-6-pro-i2v # ✓ Saved to bottle-animated.mp4

Run masonry --help for the full flag list.

Using it from Claude Code

This is the part that matters for an agent workflow. Because masonry is a normal command, you do not need a plugin. In a Claude Code session you can just ask for the asset in plain language, and the agent runs the command for you:

"Generate a 16:9 hero image of a dark control room with glowing dashboards and save it to public/hero.png, then reference it in the landing section."

Claude Code runs masonry image "..." --aspect 16:9 --output public/hero.png, the file lands in your repo, and it wires it into the component in the same turn. The asset and the code stay in sync because they were produced together.

For a more permanent setup, run masonry skill install. It installs a Masonry skill into Claude Code so the agent already knows the commands and reaches for image or video generation on its own whenever a task needs a visual, instead of you spelling out the syntax each time. (masonry skill list shows what's installed, masonry skill uninstall removes it.)

Why 50+ models instead of one

Most image CLIs are wired to a single model. Masonry exposes a catalog (Veo 3, FLUX, Imagen 4, GPT Image, Nano Banana, Kling, and more) behind the same two commands, and that matters because no single model is best at everything. One is stronger at legible text in a marketing mockup, another at photoreal product lighting, another at fast cheap iterations while you are still exploring. Swapping is a flag, not a new tool, so you can match the model to the task without rebuilding your workflow.

Honest notes

A few things worth knowing before you wire this into a pipeline:

It needs an account and credits. This is not an unlimited free local model, it is a hosted generation service, so generations draw down balance. For most developer use (a handful of assets per project) that is a non-issue, but if you plan to batch thousands of images, check the cost first.
Generation is a network call, so it is not instant and it needs connectivity. Video especially takes longer than image.
Treat generated marketing or product imagery the way you would any AI image: check it before you ship it, especially anything with text or a real product in it.

FAQ

Can Claude Code generate images? Not on its own. Claude Code is a text coding agent, so it can reference an image file but cannot create one. You add image generation by giving it a command it can run, like the Masonry CLI, which generates images and video from the terminal across 50+ models. Once installed, Claude Code runs it mid-session.

How do I generate images in Claude Code from the terminal? Install the Masonry CLI with npx @masonryai/cli, run masonry login, then either ask Claude Code in plain language to generate an asset and it runs masonry image for you, or run masonry skill install once so the agent reaches for generation on its own.

What image and video models does it support? More than 50 models behind two commands. Images include Nano Banana 2, GPT Image 2, FLUX.2, Seedream, Imagen, and Ideogram; video includes Veo 3.1, Kling, Seedance, Hailuo, and WAN. You switch models with the --model flag.

Is it free? No. Masonry is a hosted generation service that needs an account and credits, not an unlimited free local model. For typical developer use, a handful of assets per project, that is minor; for batching thousands, check the cost first.

Do I need a plugin or MCP server? No. Because masonry is a normal shell command, any agent with shell access can call it. Running masonry skill install makes Claude Code reach for it automatically, but it is optional.

The bottom line

Claude Code is a strong coding agent that simply lacks an image model. The Masonry CLI fills that gap with two commands and no integration work, so the agent can generate images and video the same way it runs tests or installs a package, inside the terminal, in the same session, across whichever model fits the job. If you have ever broken your flow to go make an image by hand, this is the part you can stop doing.

Install it with npx @masonryai/cli and try one generation. See the full command reference for everything else, how it stacks up against other CLI image tools, and our guide to prompting AI image models once you are generating.

How to Generate Images and Video in Claude Code (Without Leaving the Terminal)

Quick answer

Why run image generation from the terminal

Install and connect

The two commands

Using it from Claude Code

Why 50+ models instead of one

Honest notes

FAQ

The bottom line

Best Tools to Generate Images in Claude Code (2026): CLIs, Skills, and MCP Servers Compared

The Best getimg.ai Alternative in 2026 (Honest Comparison)