InvokeAI Configuration#
Intro#
Runtime settings, including the location of files and
directories, memory usage, and performance, are managed via the
invokeai.yaml
config file or environment variables. A subset
of settings may be set via commandline arguments.
Settings sources are used in this order:
- CLI args
- Environment variables
invokeai.yaml
settings- Fallback: defaults
InvokeAI Root Directory#
On startup, InvokeAI searches for its "root" directory. This is the directory
that contains models, images, the database, and so on. It also contains
a configuration file called invokeai.yaml
.
InvokeAI searches for the root directory in this order:
- The
--root <path>
CLI arg. - The environment variable INVOKEAI_ROOT.
- The directory containing the currently active virtual environment.
- Fallback: a directory in the current user's home directory named
invokeai
.
InvokeAI Configuration File#
Inside the root directory, we read settings from the invokeai.yaml
file.
It has two sections - one for internal use and one for user settings:
# Internal metadata - do not edit:
schema_version: 4.0.2
# Put user settings here - see https://invoke-ai.github.io/InvokeAI/features/CONFIGURATION/:
host: 0.0.0.0 # serve the app on your local network
models_dir: D:\invokeai\models # store models on an external drive
precision: float16 # always use fp16 precision
The settings in this file will override the defaults. You only need to change this file if the default for a particular setting doesn't work for you.
You'll find an example file next to invokeai.yaml
that shows the default values.
Some settings, like Model Marketplace API Keys, require the YAML to be formatted correctly. Here is a basic guide to YAML files.
Custom Config File Location#
You can use any config file with the --config
CLI arg. Pass in the path to the invokeai.yaml
file you want to use.
Note that environment variables will trump any settings in the config file.
Environment Variables#
All settings may be set via environment variables by prefixing INVOKEAI_
to the variable name. For example, INVOKEAI_HOST
would set the host
setting.
For non-primitive values, pass a JSON-encoded string:
We suggest using invokeai.yaml
, as it is more user-friendly.
CLI Args#
A subset of settings may be specified using CLI args:
--root
: specify the root directory--config
: override the defaultinvokeai.yaml
file location
Low-VRAM Mode#
See the Low-VRAM mode docs for details on enabling this feature.
All Settings#
Following the table are additional explanations for certain settings.
Attributes:
Name | Type | Description |
---|---|---|
host |
str
|
IP address to bind to. Use |
port |
int
|
Port to bind to. |
allow_origins |
list[str]
|
Allowed CORS origins. |
allow_credentials |
bool
|
Allow CORS credentials. |
allow_methods |
list[str]
|
Methods allowed for CORS. |
allow_headers |
list[str]
|
Headers allowed for CORS. |
ssl_certfile |
Optional[Path]
|
SSL certificate file for HTTPS. See https://www.uvicorn.org/settings/#https. |
ssl_keyfile |
Optional[Path]
|
SSL key file for HTTPS. See https://www.uvicorn.org/settings/#https. |
log_tokenization |
bool
|
Enable logging of parsed prompt tokens. |
patchmatch |
bool
|
Enable patchmatch inpaint code. |
models_dir |
Path
|
Path to the models directory. |
convert_cache_dir |
Path
|
Path to the converted models cache directory (DEPRECATED, but do not delete because it is needed for migration from previous versions). |
download_cache_dir |
Path
|
Path to the directory that contains dynamically downloaded models. |
legacy_conf_dir |
Path
|
Path to directory of legacy checkpoint config files. |
db_dir |
Path
|
Path to InvokeAI databases directory. |
outputs_dir |
Path
|
Path to directory for outputs. |
custom_nodes_dir |
Path
|
Path to directory for custom nodes. |
style_presets_dir |
Path
|
Path to directory for style presets. |
log_handlers |
list[str]
|
Log handler. Valid options are "console", "file= |
log_format |
LOG_FORMAT
|
Log format. Use "plain" for text-only, "color" for colorized output, "legacy" for 2.3-style logging and "syslog" for syslog-style. |
log_level |
LOG_LEVEL
|
Emit logging messages at this level or higher. |
log_sql |
bool
|
Log SQL queries. |
log_level_network |
LOG_LEVEL
|
Log level for network-related messages. 'info' and 'debug' are very verbose. |
use_memory_db |
bool
|
Use in-memory database. Useful for development. |
dev_reload |
bool
|
Automatically reload when Python sources are changed. Does not reload node definitions. |
profile_graphs |
bool
|
Enable graph profiling using |
profile_prefix |
Optional[str]
|
An optional prefix for profile output files. |
profiles_dir |
Path
|
Path to profiles output directory. |
max_cache_ram_gb |
Optional[float]
|
The maximum amount of CPU RAM to use for model caching in GB. If unset, the limit will be configured based on the available RAM. In most cases, it is recommended to leave this unset. |
max_cache_vram_gb |
Optional[float]
|
The amount of VRAM to use for model caching in GB. If unset, the limit will be configured based on the available VRAM and the device_working_mem_gb. In most cases, it is recommended to leave this unset. |
log_memory_usage |
bool
|
If True, a memory snapshot will be captured before and after every model cache operation, and the result will be logged (at debug level). There is a time cost to capturing the memory snapshots, so it is recommended to only enable this feature if you are actively inspecting the model cache's behaviour. |
device_working_mem_gb |
float
|
The amount of working memory to keep available on the compute device (in GB). Has no effect if running on CPU. If you are experiencing OOM errors, try increasing this value. |
enable_partial_loading |
bool
|
Enable partial loading of models. This enables models to run with reduced VRAM requirements (at the cost of slower speed) by streaming the model from RAM to VRAM as its used. In some edge cases, partial loading can cause models to run more slowly if they were previously being fully loaded into VRAM. |
keep_ram_copy_of_weights |
bool
|
Whether to keep a full RAM copy of a model's weights when the model is loaded in VRAM. Keeping a RAM copy increases average RAM usage, but speeds up model switching and LoRA patching (assuming there is sufficient RAM). Set this to False if RAM pressure is consistently high. |
ram |
Optional[float]
|
DEPRECATED: This setting is no longer used. It has been replaced by |
vram |
Optional[float]
|
DEPRECATED: This setting is no longer used. It has been replaced by |
lazy_offload |
bool
|
DEPRECATED: This setting is no longer used. Lazy-offloading is enabled by default. This config setting will be removed once the new model cache behavior is stable. |
device |
DEVICE
|
Preferred execution device. |
precision |
PRECISION
|
Floating point precision. |
sequential_guidance |
bool
|
Whether to calculate guidance in serial instead of in parallel, lowering memory requirements. |
attention_type |
ATTENTION_TYPE
|
Attention type. |
attention_slice_size |
ATTENTION_SLICE_SIZE
|
Slice size, valid when attention_type=="sliced". |
force_tiled_decode |
bool
|
Whether to enable tiled VAE decode (reduces memory consumption with some performance penalty). |
pil_compress_level |
int
|
The compress_level setting of PIL.Image.save(), used for PNG encoding. All settings are lossless. 0 = no compression, 1 = fastest with slightly larger filesize, 9 = slowest with smallest filesize. 1 is typically the best setting. |
max_queue_size |
int
|
Maximum number of items in the session queue. |
clear_queue_on_startup |
bool
|
Empties session queue on startup. |
allow_nodes |
Optional[list[str]]
|
List of nodes to allow. Omit to allow all. |
deny_nodes |
Optional[list[str]]
|
List of nodes to deny. Omit to deny none. |
node_cache_size |
int
|
How many cached nodes to keep in memory. |
hashing_algorithm |
HASHING_ALGORITHMS
|
Model hashing algorthim for model installs. 'blake3_multi' is best for SSDs. 'blake3_single' is best for spinning disk HDDs. 'random' disables hashing, instead assigning a UUID to models. Useful when using a memory db to reduce model installation time, or if you don't care about storing stable hashes for models. Alternatively, any other hashlib algorithm is accepted, though these are not nearly as performant as blake3. |
remote_api_tokens |
Optional[list[URLRegexTokenPair]]
|
List of regular expression and token pairs used when downloading models from URLs. The download URL is tested against the regex, and if it matches, the token is provided in as a Bearer token. |
scan_models_on_startup |
bool
|
Scan the models directory on startup, registering orphaned models. This is typically only used in conjunction with |
Model Marketplace API Keys#
Some model marketplaces require an API key to download models. You can provide a URL pattern and appropriate token in your invokeai.yaml
file to provide that API key.
The pattern can be any valid regex (you may need to surround the pattern with quotes):
remote_api_tokens:
# Any URL containing `models.com` will automatically use `your_models_com_token`
- url_regex: models.com
token: your_models_com_token
# Any URL matching this contrived regex will use `some_other_token`
- url_regex: '^[a-z]{3}whatever.*\.com$'
token: some_other_token
The provided token will be added as a Bearer
token to the network requests to download the model files. As far as we know, this works for all model marketplaces that require authorization.
HuggingFace Models
If you get an error when installing a HF model using a URL instead of repo id, you may need to set up a HF API token and add an entry for it under remote_api_tokens
. Use huggingface.co
for url_regex
.
Model Hashing#
Models are hashed during installation, providing a stable identifier for models across all platforms. Hashing is a one-time operation.
You might want to change this setting, depending on your system:
blake3_single
(default): Single-threaded - best for spinning HDDs, still OK for SSDsblake3_multi
: Parallelized, memory-mapped implementation - best for SSDs, terrible for spinning disksrandom
: Skip hashing entirely - fastest but of course no hash
During the first startup after upgrading to v4, all of your models will be hashed. This can take a few minutes.
Most common algorithms are supported, like md5
, sha256
, and sha512
. These are typically much, much slower than either of the BLAKE3 variants.
Path Settings#
These options set the paths of various directories and files used by InvokeAI. Any user-defined paths should be absolute paths.
Logging#
Several different log handler destinations are available, and multiple destinations are supported by providing a list:
-
console
is the default. It prints log messages to the command-line window from which InvokeAI was launched. -
syslog
is only available on Linux and Macintosh systems. It uses the operating system's "syslog" facility to write log file entries locally or to a remote logging machine.syslog
offers a variety of configuration options:
syslog=/dev/log` - log to the /dev/log device
syslog=localhost` - log to the network logger running on the local machine
syslog=localhost:512` - same as above, but using a non-standard port
syslog=fredserver,facility=LOG_USER,socktype=SOCK_DRAM`
- Log to LAN-connected server "fredserver" using the facility LOG_USER and datagram packets.
http
can be used to log to a remote web server. The server must be properly configured to receive and act on log messages. The option accepts the URL to the web server, and amethod
argument indicating whether the message should be submitted using the GET or POST method.
The log_format
option provides several alternative formats:
color
- default format providing time, date and a message, using text colors to distinguish different log severitiesplain
- same as above, but monochrome text onlysyslog
- the log level and error message only, allowing the syslog system to attach the time and datelegacy
- a format similar to the one used by the legacy 2.3 InvokeAI releases.