- Up to 2K
Resolution
- 5 ratios
Aspect ratios
- Up to 25 chars
Text per phrase
- Built-in
SynthID watermark
About Imagen 4
Imagen 4 is Google DeepMind's fourth-generation text-to-image model, announced at Google I/O 2025 and positioned as the most capable general-purpose image model in the Imagen line. The headline improvements over Imagen 3 are native 2K resolution output (up to 2048×2048 pixels), significantly better photorealism in fine-detail areas (fabric textures, water reflections, animal fur, skin), and cleaner in-image text rendering up to 25 characters per phrase. For commercial creative teams, those three improvements map directly to the most common failure modes of AI image generation: low resolution that doesn't hold up in production, artificial-looking surfaces that undermine photorealistic intent, and garbled text that makes on-brand marketing imagery unusable. Imagen 4 addresses all three in one model at $0.04 per image (standard tier), with a fast variant at $0.02 per image for higher-volume exploration work.
The practical workflow position for Imagen 4 is studio-quality photography replacement for commercial briefs like product heroes, lifestyle shots, campaign imagery, and editorial visuals, where photorealism and compositional control matter more than multi-reference mixing or web-grounded accuracy. The model supports five aspect ratios: 1:1, 3:4, 4:3, 9:16, and 16:9, covering the full range of digital advertising placements from square social to widescreen display. Every output carries an invisible SynthID watermark, which gives brand teams a traceable provenance record for AI-generated assets without any visible mark on the image. All Imagen 4 outputs include commercial licensing rights under Vertex AI's paid terms. Inside Masonry, Imagen 4 sits alongside Nano Banana 2, Imagen 4 Ultra, and 50+ other models, so teams can use it as the photorealism workhorse and switch to other models when the brief requires reference mixing, video, or maximum-fidelity print output.
Why teams choose Imagen 4
Imagen 4 is the right model when photorealism is the primary brief requirement, covering product heroes, lifestyle photography, fashion imagery, and campaign visuals where the output needs to look like it came from a studio rather than a generator. It outperforms Nano Banana 2 on raw photorealistic fidelity for standalone imagery, though Nano Banana 2 is stronger when you need multi-reference mixing, web grounding, or high-volume text overlay content. Choose Imagen 4 Ultra instead of standard Imagen 4 when the brief demands the absolute quality ceiling, such as final campaign heroes, print, and imagery with highly specific or complex compositional requirements where prompt fidelity matters as much as photorealism. The standard tier at $0.04 per image is the practical workhorse; use Imagen 4 Fast at $0.02 for concept exploration and ideation.
What Imagen 4 can do
The capabilities that set Imagen 4 apart and earn its place in a brief
Native 2K Resolution Output
Generates at up to 2048×2048 pixels natively, with no upscaling. Detail and sharpness hold up at print and large-format sizes without a separate super-resolution step.
Photorealism in Fine Detail
Substantially improved rendering of fabric textures, water reflections, animal fur, and skin compared to Imagen 3. Outputs pass for studio photography in the surface areas where earlier models fell flat.
In-Image Text Rendering
Renders legible in-image text up to 25 characters per phrase. Headlines, product labels, and short copy elements come through cleanly, reducing the manual cleanup that commercial creative previously required.
Five Aspect Ratios for Every Placement
Supports 1:1, 3:4, 4:3, 9:16, and 16:9, covering square social, portrait mobile, standard print, vertical Reels, and widescreen display from a single model without reformatting.
Commercial Rights on Vertex AI
Paid Vertex AI usage includes commercial licensing, making Imagen 4 outputs usable for advertising, marketing materials, and client deliverables under Google's API terms.
SynthID Invisible Watermarking
Every output carries an invisible SynthID watermark using dual neural networks (one embeds it during generation, one detects it). The mark survives cropping, resizing, and most edits, providing a traceable provenance record without any visible artifact.
Where teams reach for Imagen 4
- Photoreal product photography for e-commerce, DTC, and retail, with shots that pass for studio work
- Brand and lifestyle imagery for campaign and social where photorealism is the primary requirement
- Print and digital advertising visuals at 2K resolution without an additional upscaling step
- Editorial and campaign hero shots where strong composition and lighting control are essential
- Packaging and label concepts with legible on-product text (up to 25 characters per phrase)
- Fashion, beauty, and luxury brand imagery requiring fine material texture fidelity
- High-volume marketing asset generation using Imagen 4 Fast ($0.02/image) for concept exploration
- Multi-format campaign production across all five aspect ratios from a single model
What sets Imagen 4 apart
The strengths teams reach for, shown on real renders.

Photorealism That Holds Up to Client Scrutiny
Imagen 4 renders lifelike textures, lighting, and depth that pass for studio photography, so product heroes, lifestyle shots, and campaign visuals look polished enough to publish without retouching.

Legible In-Image Text for Commercial Creative
Improved text rendering keeps headlines, labels, and packaging copy clean and readable (up to 25 characters per phrase), eliminating the manual fix-up that plagued earlier generation models on branded content.

Five Aspect Ratios Covering Every Placement
Square, portrait, widescreen, fullscreen, and vertical, all five ratios in one model. Generate the same creative adapted for Instagram, display, OOH, and YouTube without re-prompting from scratch.
Prompts behind these Imagen 4 images
Actual prompts from Imagen 4 renders by the Masonry community. Copy any prompt, then remix it into your own creation.

A cinematic portrait of a woman astronaut standing on the rocky surface of a distant planet, beneath a pale sky tinted with faint atmospheric haze. The landscape stretches endlessly behind her — rugged terrain of sand, dust, and eroded stone, echoing the quiet desolation of alien worlds. She wears a futuristic evolution of a NASA-style spacesuit, sleek and advanced with matte white composite armor panels, metallic seams, and soft gold luminescent trims integrated into the fabric. The helmet’s clear visor reflects the horizon’s subtle glow, while translucent HUD graphics flicker faintly across its surface. Her expression is calm, determined, and introspective, capturing the weight of isolation and discovery. Fine dust drifts in the air around her boots, illuminated by soft, golden sunlight breaking through the thin atmosphere. The lighting is cinematic and naturalistic, blending warm highlights with cool blue shadow tones that create depth and emotional realism. In the distance, the outline of a futuristic landing craft gleams faintly, its lights shimmering through the dust — a silent reminder of humanity’s reach across the stars. The scene conveys a Gravity-meets-Interstellar tone: grounded realism infused with advanced design and quiet grandeur. Style: Futuristic cinematic realism, high-detail sci-fi portrait Lighting: Natural sunlight with soft atmospheric haze and rim glow Mood: Solitary, awe-inspiring, elegant, emotionally resonant Camera angle: Mid-frame eye-level portrait, shallow depth of field for dramatic focus

Dynamic low-angle fashion portrait of a confident young man standing outdoors under a bright blue sky with scattered clouds. He wears pastel peach cargo pants paired with an open short-sleeve shirt in shades of pink and orange, revealing his toned torso and tattoos. The upward perspective emphasizes power and presence, while the vibrant colors contrast dramatically with the sky, creating a bold, modern editorial look. Perfect for streetwear campaigns, youth fashion branding, and contemporary lifestyle imagery.

A close-up portrait of a young woman on an outdoor basketball court, bathed in warm natural sunlight. She has dewy, glowing skin with soft freckles, light makeup, and slightly parted lips. Her hair is tied back neatly, with a few loose strands framing her face. A basketball is positioned close to her cheek, dominating part of the foreground and adding texture contrast to her smooth skin. The lighting creates gentle shadows and highlights across her face, emphasizing her natural features and sporty confidence. Shot with a shallow depth of field and cinematic color grading for a modern, lifestyle aesthetic.

a stylish person posing confidently against a muted green studio backdrop. They are dressed in a sharp, tailored gray pinstripe suit with a double-breasted blazer and matching wide-leg trousers. Underneath, they wear a light blue dress shirt with the collar slightly open and a colorful striped tie worn loosely for a relaxed, modern look. The person accessorizes with dark sunglasses, black leather loafers, and a large textured black leather duffel bag held in one hand. Their wavy, shoulder-length dark hair and neutral expression add to the composed, fashion-forward aesthetic. The lighting is soft and even, emphasizing the outfit’s structure and texture, giving the overall scene a refined, editorial feel that blends classic menswear-inspired tailoring with contemporary minimalism.

a man outdoors in a cold, mountainous environment, dressed in technical winter gear. He is wearing a dark gray waterproof jacket with taped seams, multiple zippers, and orange accent details, giving it a sleek, high-performance look. A dark knit beanie covers his head, and he has a neatly groomed beard. The jacket includes reinforced sleeves and pockets, and he is carrying a backpack with visible shoulder straps, suggesting he’s prepared for outdoor activity such as hiking or mountaineering. The background features blurred rocky terrain with patches of snow, emphasizing the cold and rugged setting. The lighting is soft and natural, highlighting the texture of his clothing and the moisture on its surface. The overall atmosphere conveys resilience, focus, and adventure in a harsh winter landscape.

a woman posing against a plain light background, styled in an elegant and minimal fashion. She is wearing a black, deep V-neck top with slightly puffed shoulders, giving her outfit a sophisticated silhouette. Her dark, straight hair is parted in the middle and tucked neatly behind her ears, complementing her polished appearance. The main focus is on her accessories with layered gold necklaces featuring small pendants and hoops earrings that match in tone. Her makeup is natural and refined, emphasising a smooth complexion, neutral lips, and well-groomed brows. The lighting is soft and even, highlighting her features with a gentle glow. The overall mood is poised, confident, and modern — fitting for a beauty, fashion, or jewellery editorial portrait.

a visually striking arrangement of luxury makeup products with brand name Kylie Cosmetics displayed against a warm, monochromatic background in shades of peach, coral, and rose gold. The setup includes lipsticks, foundation or serum bottles, a compact blush, and cream containers — all housed in sleek black and metallic coral packaging. The products are artfully placed among geometric blocks and curved shapes in matching tones, creating a modern, sculptural composition. Soft, diffused lighting enhances the smooth textures and rich colors, producing a polished, high-end aesthetic. The overall mood is elegant and contemporary, evoking themes of beauty, sophistication, and design harmony

A soft, intimate close-up portrait of a young woman wearing a plush white faux-fur winter hood and a thick cream-colored knit sweater. Her face fills the frame, captured at a slight angle as she gazes gently toward the camera with calm, expressive eyes. Her skin appears natural and luminous, with minimal makeup, subtle freckles, and softly defined lips in a muted rose tone. The lighting is diffused and warm, creating smooth highlights and gentle shadows that enhance skin texture and facial contours. The background is neutral and softly blurred, keeping full focus on the subject. The overall mood is cozy, elegant, and serene, evoking winter warmth, comfort, and understated beauty—ideal for fashion, skincare, or seasonal lifestyle branding.

A surreal cinematic scene set inside a vast, abandoned gothic cathedral with towering stone arches, cracked walls, and tall stained-glass windows diffusing cold, misty daylight. Suspended in mid-air at the center of the nave is a bright orange sports car, lifted by heavy chains and a metal hoist frame attached to the vaulted ceiling. The car hangs perfectly horizontal, emphasizing tension and impossibility. Moss, vines, and creeping vegetation reclaim the ancient stone interior, while puddles on the cracked stone floor reflect the cathedral’s architecture and the floating vehicle above. The contrast between the modern, glossy supercar and the decaying medieval environment creates a striking visual juxtaposition. Lighting is soft but dramatic, with volumetric light rays cutting through dust and fog, evoking a mood of mystery, surrealism, and post-apocalyptic elegance. Highly detailed, cinematic realism with a dark, contemplative atmosphere.

A striking close-up portrait of a vividly colored lizard perched on a textured rock, captured at eye level with shallow depth of field. The reptile features an extraordinary gradient of saturated hues—electric blue scales along the body, fiery orange and red tones on the head and limbs, and subtle purple transitions across the neck and torso. Fine scale textures are sharply detailed, with glossy highlights catching the light. One large, reflective eye is in crisp focus, conveying alertness and curiosity. The background fades into a dark, softly blurred gradient, isolating the subject and enhancing contrast. Lighting is dramatic yet controlled, emphasizing color vibrancy and surface detail. The overall mood is bold, exotic, and visually arresting, ideal for wildlife art, fantasy realism, or high-end digital illustration.

a realistic Vogue magazine cover–style fashion portrait using the uploaded face as the original face reference (100% face identity preservation). A young elegant woman posing confidently, maintaining her original facial features and natural beauty. She is winking with her left eye and making a playful duck-face expression. Both hands are raised, forming a love/heart gesture near her face. She is surrounded by multiple DSLR cameras and smartphones held around her, as if paparazzi and photographers are capturing her from all directions. Some phones show her live image on their screens. Appearance & styling: flawless glowing skin, natural makeup with glossy pink lips, soft blush, subtle highlights. Light brown hair styled in a low, neat updo with a few loose strands. Outfit & accessories: elegant minimalist beige-white strapless evening dress, Louis Vuitton necklace, diamond ring, luxury fashion jewelry. Photography style: close-up to half-body fashion portrait, Vogue editorial aesthetic, cinematic professional studio lighting, soft HDR background, shallow depth of field, realistic skin texture, ultra-detailed, 8K quality. Camera & lens look: professional DSLR look, 85mm lens feel, f/1.8 aperture, crisp focus with smooth background bokeh. Composition: Vogue magazine layout with large bold logo at the top, editorial fashion cover framing, clean and elegant design. Mood & vibe: playful yet luxurious, high-fashion beauty editorial, realistic, not AI-looking, photographed by a professional fashion photographer.

A close-up wildlife portrait of a majestic stag standing still in a snowy forest during active snowfall. The deer faces the camera directly, with large symmetrical antlers coated in fresh snow, creating a striking and powerful composition. Its fur is a rich warm brown, lightly dusted with snowflakes, contrasting against the soft white foreground and cool gray-blue background. The background features blurred winter trees with shallow depth of field, producing a dreamy, atmospheric bokeh effect. Lighting is soft and diffused, typical of overcast winter conditions, emphasizing fine details in the fur, antlers, and snow texture. The mood is calm, serene, and slightly dramatic, evoking wilderness, nature, and seasonal beauty. Shot at eye level with a centered composition, high realism, no text, ideal for commercial wildlife, winter, or holiday-themed stock imagery.

Create a surreal, cinematic advertisement for Diet Coke, captured as a high-end DSLR photograph. Use symbolic visual metaphors that reflect the brand’s identity — through imaginative scenes, natural landscapes, or abstract environments. Center the product or logo in the scene with emotionally resonant lighting and realistic textures. Include a short slogan (max 6 words) that aligns with the brand’s voice. Do not repeat the brand name. Style: surreal realism, DSLR depth of field, ad campaign aesthetic.

a close-up portrait of a young woman with fair skin, light blue eyes, and dark brown hair styled in a short, wet-look bob. Her face is natural and glowing, with visible freckles across her cheeks and nose. She is holding a round, pale yellow container—sunscreen cream —near her face, suggesting the image may be part of a skincare or beauty campaign. The background is bright and neutral, highlighting her fresh, radiant complexion. The overall mood is clean, natural, and minimalistic, emphasizing healthy skin.

Design a hyperrealistic cinematic poster showcasing the Bose headphones the centerpiece. The device on a sleek obsidian pedestal at the edge of a tranquil reflective lake. Surrounding it are glowing bioluminescent plants and tall reeds swaying gently in the breeze, giving the scene an ethereal, futuristic feel. In the background, jagged cliffs rise dramatically, their edges kissed by a glowing aurora in the night sky. Use moody, atmospheric lighting with cool blues and purples contrasted by warm golden highlights reflecting off the device. Capture the scene from a slightly low angle to emphasize grandeur, with a shallow depth of field to sharpen the Bose headphones while softening the surrounding mist. Add subtle lens flares, crisp polished textures, and a cinematic vignette for a premium editorial aesthetic. Write the brand name and a tagline as well.
Explore related categories
Browse adjacent categories and creative directions teams are exploring
Frequently asked questions
What teams need to know about creating with Imagen 4 in Masonry
What resolution does Imagen 4 generate at?
Imagen 4 standard generates at up to native 2K resolution (2048×2048 pixels) across five aspect ratios (1:1, 3:4, 4:3, 9:16, 16:9). This is sufficient for most digital advertising placements and many print applications without a separate upscaling step. Imagen 4 Fast tops out at approximately 1408×768 pixels, suitable for digital-only use cases.
How does Imagen 4 compare to Imagen 4 Ultra?
Both models generate at up to 2K resolution with the same five aspect ratios and SynthID watermarking. The difference is in prompt fidelity and nuance. Imagen 4 is literal and fast. It renders what you describe accurately and reliably. Imagen 4 Ultra reads prompts with more sensitivity, handling abstract language, layered storytelling, and complex multi-element compositions with better accuracy. Ultra costs $0.06 per image vs $0.04 for standard, so use Ultra for finals where compositional precision is paramount.
How does Imagen 4 compare to Nano Banana 2 for marketing creative?
They serve different strengths. Imagen 4 is optimized for standalone photorealistic imagery. It excels at product photography, lifestyle shots, and campaign visuals where fidelity is the priority. Nano Banana 2 is stronger for multi-reference brand mixing, web-grounded generation, and high-volume text-overlay content. Many teams use Imagen 4 for hero imagery and Nano Banana 2 for social and campaign variation work, choosing based on the brief.
Can Imagen 4 outputs be used commercially?
Yes. Commercial use is included under paid Vertex AI terms, covering advertising, marketing materials, product imagery, and client deliverables. All outputs carry an invisible SynthID watermark but no visible watermark or restrictive licensing on commercial application. Review Google's current API terms of service for the latest usage rights.
Does Imagen 4 apply a watermark to images?
No visible watermark. All Imagen 4 outputs carry Google's invisible SynthID watermark, implemented via two neural networks (one that embeds the watermark during generation and one that detects it). The mark persists through common editing operations including cropping, resizing, and color adjustment. It's a provenance record, not a visual brand mark.
What is the text rendering limit in Imagen 4?
Imagen 4 supports in-image text rendering up to 25 characters per phrase. For short headlines, product label copy, and one-line callouts, this works well. For longer text strings or dense typographic layouts, you'll still need post-production. The rendering quality is significantly improved over Imagen 3, which was notoriously unreliable with any on-image text.
What are the rate limits for Imagen 4?
Via Vertex AI, Imagen 4 standard supports 75 requests per minute. Imagen 4 Fast allows 150 requests per minute. Imagen 4 Ultra is capped at 30 requests per minute. For high-volume campaign generation, the standard and fast tiers provide adequate throughput for most commercial workflows.
What is the pricing for Imagen 4 and Imagen 4 Fast?
Imagen 4 standard is priced at $0.04 per image via Vertex AI. Imagen 4 Fast is $0.02 per image, half the cost at lower resolution and faster generation speed. Imagen 4 Ultra is $0.06 per image. The typical workflow for budget-efficient production is to use Fast for concept exploration, standard for approved directions, and Ultra for final client deliverables that require maximum fidelity.
Does Imagen 4 support aspect ratios for vertical social content?
Yes. Imagen 4 supports 9:16 portrait aspect ratio natively, matching the format for TikTok, Instagram Reels, and YouTube Shorts. Combined with 16:9 landscape and 1:1 square, a single creative direction can be adapted across all major social placements without reformatting or re-prompting from scratch.
Does Imagen 4 accept reference images or image inputs?
Imagen 4 is primarily a text-to-image model. Unlike Nano Banana 2 which accepts up to 14 reference images for mixing and style transfer, Imagen 4 standard does not offer the same multi-reference input workflow. For reference-guided generation and brand-asset mixing, Nano Banana 2 is the better choice.
How does Imagen 4 handle fine material textures?
Fine material rendering is one of Imagen 4's most notable improvements over Imagen 3. Fabric weave, water surface detail, animal fur, and skin all render with substantially more believable texture. For fashion, beauty, and luxury brand imagery where material authenticity is part of the brand promise, this improvement is commercially meaningful.
Can I use Imagen 4 for fashion and beauty advertising?
Yes, and it is well suited for it. Imagen 4's improvements in fine surface texture, skin fidelity, and fabric detail make it a strong fit for fashion, beauty, and luxury product imagery. The combination of photorealism and improved composition handles editorial-style shots, product close-ups, and lifestyle contexts that brands commonly need for advertising and e-commerce.
What is Imagen 4?
Imagen 4 is an AI image generation model from Google, available inside Masonry, the AI creative agent teams use to produce marketing, product, and brand images.
How does my team use Imagen 4 in Masonry?
Open a Masonry canvas, pick Imagen 4 from the model selector, and describe the image you need: a product shot, an ad creative, a social post. Masonry generates it, then you refine, edit, and combine Imagen 4 with other models in one workspace.
Is Imagen 4 free to try?
Yes, you can start generating images with Imagen 4 on Masonry's free tier, then scale up with higher limits and priority processing as your team grows.
How do I write good prompts for Imagen 4?
Direct Imagen 4 like a photographer. Name the lens, the lighting, and the time of day. The more you specify the shoot, the more polished and intentional the result. See the prompt gallery on this page for real Imagen 4 prompts you can copy and adapt.
Who makes Imagen 4?
Imagen 4 is built by Google. Inside Masonry it runs alongside 50+ image and video models, so your team can pick the right one for each brief without switching tools.
Can I see examples made with Imagen 4?
Yes, the prompt gallery on this page shows real images teams have generated with Imagen 4 in Masonry, each paired with the exact prompt you can copy and adapt for your own brand.
Start creating with Imagen 4
Generate, edit, and compare across 50+ models in one workspace.
Guides for Imagen 4
Prompt walkthroughs and examples from the Masonry blog
Explore more AI models
Compare Imagen 4 with other models teams run in Masonry


