Using FLUX Models on Invoke

Modified on Wed, 30 Oct at 2:28 PM

Supported Flux.1 base models

  1. Pro - Not supported
    1. Accessed via API only
  2. Dev - Open-weight, guidance-distilled model requiring license for commercial use
    1. Professional Edition: Supported for users with a commercial license
    2. Community Edition: Supported for all users
  3. Schnell - Fastest Flux model designed for local development and personal use
    1. Supported in both Professional and Community Editions


Accessing Flux models

Professional Edition:

  • Flux Dev - We are working to allow users to self-upgrade directly in the Invoke application. For now, users can request commercial access to the Dev model through this form
  • Flux Schnell - Users can use the ‘FLUX Schnell’ model via the ‘Add account models to project’ dropdown on the Model Management tab within Project Settings, or upload their own version of the Schnell model for use
    • Note: Enterprise users may need to have an Account Admin add Flux models to the Enterprise account


Community Edition:

  • Flux Dev - Users can use the ‘FLUX Dev (Quantized)’ model found on the ‘Starter Models’ tab, or upload their own version of the Dev model for use
  • Flux Schnell - Users can use the ‘FLUX Schnell’ or ‘FLUX Schnell (Quantized)’ models found on the ‘Starter Models’ tab within Model Manager, or upload their own version of the Schnell model for use


Troubleshooting errors when uploading Flux models

Right now, there’s a wide range of formats in use by model trainers and fine-tuners across the ecosystem, and unfortunately there isn’t any clear standardization. Trainers often don’t specify the format they use, which can cause issues when uploading models.


We’ve chosen to support the most commonly used format variants, but these are not well labeled on the sites that host Flux LoRAs, so it’s hard to give definitive guidance on which ones work and which don’t yet. In general, you can understand current model support through the following rules:

  • Models with full (non-quantized) model weights (float8, float16, bfloat16, float32) should work
  • bitsandbytes NF4 quantized models should work
  • GGUF quantized models should work
  • Most LoRA models trained with diffusers or kohya should work. Please report variants to [email protected] and we'll work on adding support
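As a rough illustration of how the LoRA format families differ, the sketch below guesses a checkpoint’s format from the prefixes of its tensor key names. The prefix conventions shown (‘lora_unet_’/‘lora_te’ for kohya-style, ‘transformer.’ with ‘.lora_A’/‘.lora_B’ for diffusers-style) are common community conventions, not an official specification, and real checkpoints vary:

```python
def guess_lora_format(keys):
    """Heuristic guess at a FLUX LoRA checkpoint's format from its tensor key
    names. The prefix conventions are assumptions based on common community
    checkpoints, not an official spec."""
    if any(k.startswith(("lora_unet_", "lora_te_", "lora_te1_")) for k in keys):
        return "kohya"
    if any(k.startswith("transformer.") and (".lora_A." in k or ".lora_B." in k)
           for k in keys):
        return "diffusers"
    return "unknown"

# Example key names in each style (illustrative, not taken from a real file):
kohya_keys = ["lora_unet_double_blocks_0_img_attn_proj.lora_down.weight"]
diffusers_keys = ["transformer.single_transformer_blocks.0.attn.to_q.lora_A.weight"]

print(guess_lora_format(kohya_keys))      # kohya
print(guess_lora_format(diffusers_keys))  # diffusers
```

If neither pattern matches, the checkpoint likely uses one of the less common variants mentioned above, and reporting it helps us add support.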


We’re working on driving that standardization through the Open Model Initiative, but for now, we’re focused on optimizing for the most widely adopted formats. If your model isn’t working, it could be due to the format it’s been trained in. Feel free to reach out if you need more help!
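One way to check what you actually downloaded: most Flux checkpoints ship as .safetensors files, whose JSON header declares each tensor’s dtype and can be read without loading any weights. A minimal sketch (not part of Invoke) that lists the dtypes in a file:

```python
import json
import struct

def safetensors_dtypes(path):
    """List the dtypes declared in a .safetensors file's JSON header without
    loading tensor data. The layout (an 8-byte little-endian header length
    followed by a JSON dict) is the published safetensors format."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # Skip the optional "__metadata__" entry; the rest map tensor names
    # to {"dtype": ..., "shape": ..., "data_offsets": ...}.
    return {v["dtype"] for k, v in header.items() if k != "__metadata__"}
```

For example, a result of {'F8_E4M3'} indicates fp8-cast weights, while {'BF16'} indicates full bf16 weights.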


Troubleshooting slow generation speeds for Flux

A common cause of slowness is unnecessary offloading of large models from VRAM / RAM. To avoid unnecessary model offloads, make sure that the ram and vram settings in ${INVOKEAI_ROOT}/invokeai.yaml are properly configured for your system.


Example configuration:

# In ${INVOKEAI_ROOT}/invokeai.yaml

# ...

# ram is the number of GBs of RAM used to keep models warm in memory.
# Set ram to a value slightly below your system RAM capacity. Make sure to
# leave room for other processes and non-model Invoke memory. 24GB could be a
# reasonable starting point on a system with 32GB of RAM.
# If you hit RAM out-of-memory errors, or find that your system RAM is full
# and causing slowness, adjust this value downward.
ram: 24

# vram is the number of GBs of VRAM used to keep models warm on the GPU.
# Set vram to a value slightly below your system VRAM capacity. Leave room for
# non-model VRAM overhead. 20GB is a reasonable starting point on a 24GB GPU.
# If you hit VRAM out-of-memory errors, adjust this value downward.
vram: 20
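The starting values in the comments above amount to subtracting a fixed headroom from total capacity. The helper below is just illustrative arithmetic, not part of Invoke; the default headroom figures are assumptions that mirror the example config (8GB of system RAM and 4GB of VRAM left free):

```python
def suggest_cache_sizes(total_ram_gb, total_vram_gb,
                        ram_headroom_gb=8, vram_headroom_gb=4):
    """Suggest starting values for the ram and vram settings by leaving a
    fixed headroom below total capacity. The headroom defaults are assumptions
    mirroring the example config (32GB RAM -> ram: 24, 24GB VRAM -> vram: 20);
    tune downward if you still hit out-of-memory errors."""
    ram = max(total_ram_gb - ram_headroom_gb, 0)
    vram = max(total_vram_gb - vram_headroom_gb, 0)
    return ram, vram

# A 32GB RAM / 24GB VRAM system reproduces the example config:
print(suggest_cache_sizes(32, 24))  # (24, 20)
```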


Flux model support

Transformer Models

| Model Format | Support Level | Notes |
| --- | --- | --- |
| Official BFL-format weights (fp16, bf16, or fp32) | Supported | |
| BFL-format weights (cast to fp8) | Supported | We cast to fp16, so fp8 offers no memory savings. fp8 formats are not recommended, as they typically perform worse than quantized models of the same size. |
| Quantized bitsandbytes NF4 | Supported | Install the base model via the ‘Starter Models’ list in Invoke |
| Diffusers-format weights (fp16 or fp32) | Not Supported | |
| BFL-format weights with GGUF quantization | Supported | |



T5 Text Encoder Models

| Model Format | Support Level | Notes |
| --- | --- | --- |
| Standard weights (fp16, bf16, or fp32) | Supported | |
| bitsandbytes LLM.int8() | Supported | Install the base model via the ‘Starter Models’ list in Invoke |
| Standard weights (cast to fp8) | Not Supported | |
| Standard weights with GGUF quantization | Supported | |




CLIP Text Encoder Models

| Model Format | Support Level | Notes |
| --- | --- | --- |
| Standard huggingface/transformers-format (fp16, bf16, or fp32) | Supported | |




LoRA Models

| Model Format | Support Level | Notes |
| --- | --- | --- |
| Diffusers LoRA | Supported | |
| Kohya LoRA | Supported | Kohya LoRA transformer models, including text encoder layers, are now fully supported. LyCORIS variants (LoHA, LoKr, etc.) are supported with standard models but remain limited when applied on top of quantized models. Support for text encoder layers in T5 models is not yet available. |
| OneTrainer LoRA | Not Supported | |




