Model Guide

Modified on Mon, 14 Jul at 12:10 AM

Model Types

Invoke uses two broad types of models—Interpretive and Rendering—each with their own strengths and weaknesses.



Interpretive ModelsRendering Models
How to think about themLow learning curve, low to medium creative controlHigh learning curve, high creative control
What they doParse natural-language prompts, follow reference images.Focus on pixel-level synthesis with fine-grained control.
When to useYou’d rather “say” what you want than draw or composite what you want.You need surgical control over style, composition, detail, etc.
Typical outputGood directional drafts that match your prompts or reference images.Highly controllable work tuned via prompt tags, control layers, and custom-trained models.
Model families

ChatGPT-4o

FLUX Kontext

Imagen 3 

Imagen 4

Any Stable Diffusion XL (SDXL) model (eg. JuggernautXL)

Any Stable Diffusion 1.5 (SD 1.5) model

Any FLUX.1 (dev) model


ModelTypeControl LevelLearning CurveBest ForPrompt StyleKey StrengthsLimitations
ChatGPT-4o
(API)
InterpretiveLow-MediumLowDirectional drafts, natural language instructions, textInstruction + Long-formIntuitive prompting, follows complex instructionsLimited pixel-level control
FLUX Kontext
(API) 
InterpretiveLow-MediumLowQuick iterations, prompt-based transformationInstruction Fast, good prompt adherenceLess creative control
Imagen 3
(API) 
InterpretiveMediumLow-MediumHigh-quality photorealistic outputsLong-formGood image quality, natural language understandingLimited editing capabilities
Imagen 4
(API) 
InterpretiveMediumLow-MediumLatest generation photorealismLong-formCutting-edge quality, improved prompt handlingLimited editing capabilities
JuggernautXL (SDXL)RenderingHighHighDetailed creative work, style controlPrompt tagsFine-grained control, extensive customization, established ecosystemFull control involves a learning curve
SD 1.5 ModelsRenderingHighHighExtremely efficient styling and rendering work, customization, specialized tasksPrompt tagsExtensive customization, established ecosystemRequires technical knowledge
FLUX.1 (dev)RenderingHighHighProfessional quality, prompt adherence, high quality customizationLong-form Developer-focused, high precisionTechnical complexity

How to write prompts for different models


Prompt styleWhat it looks likeBest for
Instruction

Generate a neon-lit logo

Replace the cobblestone street with flat stones

Add a misty fog to the scene

ChatGPT-4o, FLUX Kontext
Long-formA cinematic wide-angle shot of a misty rainforest at dawn with soft volumetric light. The rainforest is filled with vibrant diverse flora and faunaImagen, FLUX Dev, ChatGPT-4o, FLUX Kontext
Prompt tagsultra-wide, 32 mm, concept art, vaporwave palette, award-winningSD 1.5, SDXL (eg. JuggernautXL)


Pro tip: Negative prompts (“deformed, blurry, watermark”) work best on Rendering models.



Recommendations for getting started

  • Brand-new to AI? Start with Interpretive Models like Imagen or ChatGPT-4o before learning Rendering Models like FLUX or SDXL. 
  • Looking to change something small about an image with text guidance? Choose FLUX Kontext, upload an image as a Global Reference Image, and add a short instruction to your prompt field.
  • Looking to master what the pros use? Watch our YouTube series and explore control layers and inpainting techniques with FLUX Dev and SDXL models.

Troubleshooting and tips

SymptomLikely CauseQuick Fix
I put in a text prompt but the image comes out kind of janky (weird faces, weird hands, etc).Rendering models are weaker at generating “error-free” images with just text prompts.If you are just using text prompts, try using an interpretive model.
I’m giving directions to change one part, but it’s changing the whole image.Certain models do not have the capability of using instructive prompts + targeted guidance.

Either:

  1. Use FLUX Kontext
  2. Advanced: Use an “Inpaint Mask” layer with FLUX (dev) or SDXL. 
I can’t get it to generate in the exact style that I want.The models don’t perfectly understand your style.Try using a reference image with an interpretive model like FLUX Kontext. If that isn’t sufficient, you can explore training your own LoRA model in our Model Training app.


More resources

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article