► KerasHub: Pretrained Models / API documentation / Model Architectures / Flux / FluxBackbone model

FluxBackbone model

`FluxBackbone` class

keras_hub.models.FluxBackbone(
    input_channels,
    hidden_size,
    mlp_ratio,
    num_heads,
    depth,
    depth_single_blocks,
    axes_dim,
    theta,
    use_bias,
    guidance_embed=False,
    image_shape=(None, 768, 3072),
    text_shape=(None, 768, 3072),
    image_ids_shape=(None, 768, 3072),
    text_ids_shape=(None, 768, 3072),
    y_shape=(None, 128),
    **kwargs
)

Transformer model for flow matching on sequences.

The model processes image and text data with associated positional and timestep embeddings, and optionally applies guidance embedding. Double-stream blocks handle separate image and text streams, while single-stream blocks combine these streams. Ported from: https://github.com/black-forest-labs/flux

Arguments

input_channels: int. The number of input channels.
hidden_size: int. The hidden size of the transformer, must be divisible by num_heads.
mlp_ratio: float. The ratio of the MLP dimension to the hidden size.
num_heads: int. The number of attention heads.
depth: int. The number of double-stream blocks.
depth_single_blocks: int. The number of single-stream blocks.
axes_dim: list[int]. A list of dimensions for the positional embedding axes.
theta: int. The base frequency for positional embeddings.
use_bias: bool. Whether to apply bias to the query, key, and value projections.
guidance_embed: bool. If True, applies guidance embedding in the model.

Call arguments

image: KerasTensor. Image input tensor of shape (N, L, D) where N is the batch size, L is the sequence length, and D is the feature dimension.
image_ids: KerasTensor. Image ID input tensor of shape (N, L, D) corresponding to the image sequences.
text: KerasTensor. Text input tensor of shape (N, L, D).
text_ids: KerasTensor. Text ID input tensor of shape (N, L, D) corresponding to the text sequences.
timesteps: KerasTensor. Timestep tensor used to compute positional embeddings.
y: KerasTensor. Additional vector input, such as target values.
guidance: KerasTensor, optional. Guidance input tensor used in guidance-embedded models.

Raises

ValueError: If hidden_size is not divisible by num_heads, or if sum(axes_dim) is not equal to the positional embedding dimension.

[source]

`from_preset` method

FluxBackbone.from_preset(preset, load_weights=True, **kwargs)

Instantiate a keras_hub.models.Backbone from a model preset.

A preset is a directory of configs, weights and other file assets used to save and load a pre-trained model. The preset can be passed as a one of:

a built-in preset identifier like 'bert_base_en'
a Kaggle Models handle like 'kaggle://user/bert/keras/bert_base_en'
a Hugging Face handle like 'hf://user/bert_base_en'
a path to a local preset directory like './bert_base_en'

This constructor can be called in one of two ways. Either from the base class like keras_hub.models.Backbone.from_preset(), or from a model class like keras_hub.models.GemmaBackbone.from_preset(). If calling from the base class, the subclass of the returning object will be inferred from the config in the preset directory.

For any Backbone subclass, you can run cls.presets.keys() to list all built-in presets available on the class.

Arguments

preset: string. A built-in preset identifier, a Kaggle Models handle, a Hugging Face handle, or a path to a local directory.
load_weights: bool. If True, the weights will be loaded into the model architecture. If False, the weights will be randomly initialized.

Examples

# Load a Gemma backbone with pre-trained weights.
model = keras_hub.models.Backbone.from_preset(
    "gemma_2b_en",
)

# Load a Bert backbone with a pre-trained config and random weights.
model = keras_hub.models.Backbone.from_preset(
    "bert_base_en",
    load_weights=False,
)

FluxBackbone model

FluxBackbone class

from_preset method

FluxBackbone model

FluxBackbone class

from_preset method

`FluxBackbone` class

`from_preset` method