Middleware

Genkit allows you to use middleware to modify the behavior of generate() calls. Middleware can be used for various purposes, such as retrying failed requests, falling back to different models, or injecting tools and context.

You can use pre-packaged middleware or build your own custom middleware.

The official Genkit middleware for JavaScript is available in the @genkit-ai/middleware package.

Installation

npm install @genkit-ai/middleware
# or
yarn add @genkit-ai/middleware
# or
pnpm add @genkit-ai/middleware

Available middleware

The @genkit-ai/middleware package provides several useful middleware options out of the box. This list represents the middleware built and maintained by the Genkit team, but there may also be community-built middleware available.

1. FileSystem middleware (`filesystem`)

Grants the model access to the local filesystem by injecting standard file manipulation tools (list_files, read_file, write_file, search_and_replace). All operations are safely restricted to a specified root directory.

import { genkit } from 'genkit';
import { filesystem } from '@genkit-ai/middleware';

const ai = genkit({ ... });

const response = await ai.generate({
  model: googleAI.model('gemini-flash-latest'),
  prompt: 'Create a hello world node app in the workspace',
  use: [
    filesystem({ rootDirectory: './workspace' })
  ]
});

Configuration options:

rootDirectory (required): The root directory to which all filesystem operations are restricted.
allowWriteAccess (optional): If true, allows write access to the filesystem (defaults to false).
toolNamePrefix (optional): Prefix to add to the name of the injected tools.

2. Skills middleware (`skills`)

Automatically scans a directory for SKILL.md files (and their YAML frontmatter) and injects them into the system prompt. It also provides a use_skill tool the model can use to retrieve more specific skills on demand.

import { genkit } from 'genkit';
import { skills } from '@genkit-ai/middleware';

const ai = genkit({ ... });

const response = await ai.generate({
  prompt: 'How do I run tests in this repo?',
  use: [
    skills({ skillPaths: ['./skills'] })
  ]
});

3. Tool approval middleware (`toolApproval`)

Restricts execution of tools to an approved list. If the model attempts to call an unapproved tool, it throws a ToolInterruptError allowing you to prompt the user for manual confirmation before resuming.

import { genkit, restartTool } from 'genkit';
import { toolApproval } from '@genkit-ai/middleware';

const ai = genkit({ ... });

// 1. Initial attempt
const response = await ai.generate({
  prompt: 'write a file',
  tools: [writeFileTool],
  use: [
    toolApproval({ approved: [] }) // Empty list means call triggers interrupt
  ]
});

if (response.finishReason === 'interrupted') {
  const interrupt = response.interrupts[0];

  // 2. Ask user for approval, then recreate the tool request with approval
  const approvedPart = restartTool(interrupt, { toolApproved: true });

  // 3. Resume execution
  const resumedResponse = await ai.generate({
    messages: response.messages,
    resume: { restart: [approvedPart] },
    use: [
      toolApproval({ approved: [] })
    ]
  });
}

4. Retry middleware (`retry`)

Automatically retries failed model generations on transient error codes (like RESOURCE_EXHAUSTED, UNAVAILABLE) using exponential backoff with jitter.

import { genkit } from 'genkit';
import { retry } from '@genkit-ai/middleware';

const ai = genkit({ ... });

const response = await ai.generate({
  model: googleAI.model('gemini-pro-latest'),
  prompt: 'Heavy reasoning task...',
  use: [
    retry({
      maxRetries: 3,
      initialDelayMs: 1000,
      backoffFactor: 2
    })
  ]
});

Configuration options:

maxRetries (optional): The maximum number of times to retry a failed request (default: 3).
statuses (optional): An array of StatusName values that should trigger a retry (default: ['UNAVAILABLE', 'DEADLINE_EXCEEDED', 'RESOURCE_EXHAUSTED', 'ABORTED', 'INTERNAL']).
initialDelayMs (optional): The initial delay between retries in milliseconds (default: 1000).
maxDelayMs (optional): The maximum delay between retries in milliseconds (default: 60000).
backoffFactor (optional): The factor by which the delay increases after each retry (exponential backoff, default: 2).
noJitter (optional): Whether to disable jitter on the delay (default: false).

5. Fallback middleware (`fallback`)

Automatically switches to a different model if the primary model fails on a specific set of error codes. Useful for falling back to a smaller/faster model when a large model exceeds quota limits.

import { genkit } from 'genkit';
import { fallback } from '@genkit-ai/middleware';

const ai = genkit({ ... });

const response = await ai.generate({
  model: googleAI.model('gemini-pro-latest'),
  prompt: 'Try the pro model first...',
  use: [
    fallback({
      models: [googleAI.model('gemini-flash-latest')], // try flash if pro fails
      statuses: ['RESOURCE_EXHAUSTED']
    })
  ]
});

Configuration options:

models (required): An array of model references to try in order.
statuses (optional): An array of StatusName values that should trigger a fallback (default: ['UNAVAILABLE', 'DEADLINE_EXCEEDED', 'RESOURCE_EXHAUSTED', 'ABORTED', 'INTERNAL', 'NOT_FOUND', 'UNIMPLEMENTED']).
isolateConfig (optional): If true, the fallback model will not inherit the original request’s configuration (default: false).

Building your own custom middleware

You can implement your own custom middleware to extend Genkit’s functionality. Genkit provides a generateMiddleware helper to create structured middleware with configuration schemas.

Middleware can intercept different phases of execution by providing hooks:

model: Intercepts the call to the model.
tool: Intercepts tool execution.
generate: Intercepts the high-level generation loop.

Here is an example of a custom middleware that logs requests and responses:

import { generateMiddleware, z } from 'genkit';

export const loggerMiddleware = generateMiddleware(
  {
    name: 'loggerMiddleware',
    description: 'Logs requests and responses',
    configSchema: z.object({
      verbose: z.boolean().optional(),
    }),
  },
  ({ config, ai }) => {
    return {
      model: async (req, ctx, next) => {
        if (config?.verbose) {
          console.log('Request:', JSON.stringify(req));
        }
        const resp = await next(req, ctx);
        if (config?.verbose) {
          console.log('Response:', JSON.stringify(resp));
        }
        return resp;
      },
    };
  }
);

To use it:

const response = await ai.generate({
  model: googleAI.model('gemini-flash-latest'),
  prompt: 'Hello',
  use: [loggerMiddleware({ verbose: true })],
});

For more complex examples of building custom middleware, you can refer to the source code of the built-in middleware in the Genkit GitHub repository.

Installation

The middleware framework is part of the core ai package, and the pre-packaged middleware ships in plugins/middleware. Both come with the core Genkit module:

go get github.com/firebase/genkit/go@latest

Register the Middleware plugin during genkit.Init to expose the built-ins to the Dev UI and to other-runtime callers:

import (
    "context"

    "github.com/genkit-ai/genkit/go/ai"
    "github.com/genkit-ai/genkit/go/core"
    "github.com/genkit-ai/genkit/go/genkit"
    "github.com/genkit-ai/genkit/go/plugins/googlegenai"
    "github.com/genkit-ai/genkit/go/plugins/middleware"
)

ctx := context.Background()
g := genkit.Init(ctx, genkit.WithPlugins(
    &googlegenai.GoogleAI{},
    &middleware.Middleware{},
))

For pure Go programs that just attach middleware to a genkit.Generate() call, plugin registration is optional. Passing a middleware value directly to ai.WithUse invokes its New method on the local fast path without consulting the registry.

Available middleware

The plugins/middleware package provides several useful middleware options out of the box. This list represents the middleware built and maintained by the Genkit team, but there may also be community-built middleware available.

1. Retry middleware (`Retry`)

Automatically retries failed model API calls on transient error codes (such as RESOURCE_EXHAUSTED and UNAVAILABLE) using exponential backoff with jitter. Only the model API call is retried; the surrounding tool loop is not replayed.

resp, err := genkit.Generate(ctx, g,
    ai.WithModelName("googleai/gemini-flash-latest"),
    ai.WithPrompt("Heavy reasoning task..."),
    ai.WithUse(&middleware.Retry{
        MaxRetries:     3,
        InitialDelayMs: 1000,
        BackoffFactor:  2,
    }),
)

Configuration options:

MaxRetries (optional): The maximum number of times to retry a failed request (default: 3).
Statuses (optional): A list of core.StatusName values that should trigger a retry (default: UNAVAILABLE, DEADLINE_EXCEEDED, RESOURCE_EXHAUSTED, ABORTED, INTERNAL). Non-GenkitError errors such as network failures are always retried regardless of this list.
InitialDelayMs (optional): The initial delay between retries in milliseconds (default: 1000).
MaxDelayMs (optional): The upper bound on retry delay in milliseconds (default: 60000).
BackoffFactor (optional): The factor by which the delay increases after each retry (default: 2).
NoJitter (optional): If true, disables random jitter on the delay (default: false).

2. Fallback middleware (`Fallback`)

Automatically switches to a different model if the primary model fails on a fallback-eligible status. Useful for falling back to a smaller or faster model when a large model exceeds quota limits.

resp, err := genkit.Generate(ctx, g,
    ai.WithModelName("googleai/gemini-pro-latest"),
    ai.WithPrompt("Try the pro model first..."),
    ai.WithUse(&middleware.Fallback{
        Models: []ai.ModelRef{
            googlegenai.ModelRef("googleai/gemini-flash-latest", nil),
        },
        Statuses: []core.StatusName{core.RESOURCE_EXHAUSTED},
    }),
)

Configuration options:

Models (required): An ordered list of ai.ModelRef values to try after the primary fails. Each ref’s Config is used verbatim for that model; the original request’s config is not inherited. Use googlegenai.ModelRef (or the equivalent helper for your provider) to attach configuration.
Statuses (optional): A list of core.StatusName values that should trigger a fallback (default: UNAVAILABLE, DEADLINE_EXCEEDED, RESOURCE_EXHAUSTED, ABORTED, INTERNAL, NOT_FOUND, UNIMPLEMENTED).

3. Tool approval middleware (`ToolApproval`)

Restricts tool execution to an allow list. Tools not in the list trigger a tool interrupt that you can resolve by prompting the user and then resuming with an explicit approval flag.

// 1. Initial attempt: any tool not in AllowedTools interrupts the call.
resp, err := genkit.Generate(ctx, g,
    ai.WithPrompt("write a file"),
    ai.WithTools(writeFileTool),
    ai.WithUse(&middleware.ToolApproval{
        AllowedTools: []string{}, // Empty list interrupts every tool call.
    }),
)
if err != nil {
    log.Fatal(err)
}

if resp.FinishReason == ai.FinishReasonInterrupted {
    interrupt := resp.Interrupts()[0]

    // 2. Ask the user for approval, then re-create the tool request with the approval flag.
    approved, err := writeFileTool.RestartWith(interrupt,
        ai.WithResumedMetadata[WriteFileInput](map[string]any{"toolApproved": true}),
    )
    if err != nil {
        log.Fatal(err)
    }

    // 3. Resume execution.
    resumed, err := genkit.Generate(ctx, g,
        ai.WithMessages(resp.History()...),
        ai.WithTools(writeFileTool),
        ai.WithToolRestarts(approved),
        ai.WithUse(&middleware.ToolApproval{}),
    )
    _ = resumed
}

A bare resume without the toolApproved flag is not treated as approval, so unrelated resume flows can’t bypass approval gating.

Configuration options:

AllowedTools (optional): The list of tool names pre-approved to run without interruption. Tools not in this list trigger an interrupt. An empty list interrupts every tool.

4. Skills middleware (`Skills`)

Scans a directory for SKILL.md files (and their YAML frontmatter) and injects them into the system prompt. It also provides a use_skill tool the model can call to load a specific skill’s full body on demand.

resp, err := genkit.Generate(ctx, g,
    ai.WithPrompt("How do I run tests in this repo?"),
    ai.WithUse(&middleware.Skills{SkillPaths: []string{"./skills"}}),
)

Configuration options:

SkillPaths (optional): A list of directories to scan for skills. Each direct subdirectory containing a SKILL.md file is exposed as a skill (default: ["skills"]).

5. Filesystem middleware (`Filesystem`)

Grants the model access to a single root directory by injecting standard file manipulation tools (list_files, read_file, plus write_file and edit_file when writes are enabled). Path safety is enforced by os.Root (Go 1.24+), which rejects any path that resolves outside the root, including via .., absolute paths, or symbolic links.

resp, err := genkit.Generate(ctx, g,
    ai.WithPrompt("Create a hello world program in the workspace"),
    ai.WithUse(&middleware.Filesystem{
        RootDir:          "./workspace",
        AllowWriteAccess: true,
    }),
)

Configuration options:

RootDir (required): The root directory all filesystem operations are confined to.
AllowWriteAccess (optional): If true, additionally registers write_file and edit_file (default: false).
ToolNamePrefix (optional): A prefix prepended to each tool name. Use distinct prefixes when attaching multiple Filesystem middlewares to one call so their tool names don’t collide.

Building your own custom middleware

A middleware in Go is any value that satisfies the ai.Middleware interface:

type Middleware interface {
    Name() string                               // stable, registered identifier
    New(ctx context.Context) (*ai.Hooks, error) // builds a per-call hook bundle
}

New is invoked once per genkit.Generate() call. The returned *ai.Hooks bundle is reused across every iteration of the tool loop within that call:

type Hooks struct {
    // Tools are extra tools to register for this Generate call alongside any user-supplied tools.
    Tools []ai.Tool

    // WrapGenerate wraps each iteration of the tool loop.
    WrapGenerate func(ctx context.Context, params *ai.GenerateParams, next ai.GenerateNext) (*ai.ModelResponse, error)

    // WrapModel wraps each model API call.
    WrapModel func(ctx context.Context, params *ai.ModelParams, next ai.ModelNext) (*ai.ModelResponse, error)

    // WrapTool wraps each tool execution. May run concurrently for parallel tool calls.
    WrapTool func(ctx context.Context, params *ai.ToolParams, next ai.ToolNext) (*ai.MultipartToolResponse, error)
}

Implement only the hooks your middleware needs. A nil hook field is treated as a pass-through.

When each hook fires

A Generate call runs a tool loop: the model produces output, any tool calls execute, results feed back into a new model call, and so on until the model stops. The hooks attach at three different layers of this loop:

Hook	Fires	Use for
`WrapGenerate`	Once per tool-loop iteration. N tool turns means N+1 invocations.	Logic that needs to see the whole conversation: rewrites, system-prompt injection, message accumulation.
`WrapModel`	Once per model API call, inside an iteration.	Logic about the model call itself: retry, fallback, caching.
`WrapTool`	Once per tool execution. May run concurrently for parallel tool calls in the same iteration.	Logic about a single tool execution: approval, sandboxing, logging.

WrapGenerate and WrapModel are not called concurrently within a single Generate call. WrapTool may be, since multiple tools can execute in parallel.

A simple example

Here is a custom middleware that logs how long each model call takes:

type Logger struct {
    Prefix string `json:"prefix,omitempty"`
}

func (Logger) Name() string { return "mine/logger" }

func (l Logger) New(ctx context.Context) (*ai.Hooks, error) {
    return &ai.Hooks{
        WrapModel: func(ctx context.Context, p *ai.ModelParams, next ai.ModelNext) (*ai.ModelResponse, error) {
            start := time.Now()
            resp, err := next(ctx, p)
            log.Printf("%s model call took %s", l.Prefix, time.Since(start))
            return resp, err
        },
    }, nil
}

To use it:

resp, err := genkit.Generate(ctx, g,
    ai.WithPrompt("Hello"),
    ai.WithUse(Logger{Prefix: "[trace]"}),
)

State that should be shared across the hooks of a single Generate call lives in closures captured by New. Each call gets a fresh Hooks bundle, so nothing leaks between calls:

type Counter struct{}

func (Counter) Name() string { return "mine/counter" }

func (Counter) New(ctx context.Context) (*ai.Hooks, error) {
    var modelCalls int
    return &ai.Hooks{
        WrapModel: func(ctx context.Context, p *ai.ModelParams, next ai.ModelNext) (*ai.ModelResponse, error) {
            modelCalls++
            return next(ctx, p)
        },
        WrapGenerate: func(ctx context.Context, p *ai.GenerateParams, next ai.GenerateNext) (*ai.ModelResponse, error) {
            // The same `modelCalls` is visible here: both closures capture it from `New`.
            resp, err := next(ctx, p)
            log.Printf("iteration %d: %d model calls so far", p.Iteration, modelCalls)
            return resp, err
        },
    }, nil
}

WrapTool may run concurrently for parallel tool calls in the same iteration, so any state it touches must be guarded with sync primitives:

func (Counter) New(ctx context.Context) (*ai.Hooks, error) {
    var (
        mu        sync.Mutex
        toolCalls int
    )
    return &ai.Hooks{
        WrapTool: func(ctx context.Context, p *ai.ToolParams, next ai.ToolNext) (*ai.MultipartToolResponse, error) {
            mu.Lock()
            toolCalls++
            mu.Unlock()
            return next(ctx, p)
        },
    }, nil
}

The built-in Filesystem middleware uses this pattern: New allocates a per-call file-state cache and a path-lock map, then the read, write, and edit tool implementations close over both.

Plugin-provided middleware and plugin-level state

Middleware shipped as part of a plugin needs two things the simple cases above don’t:

A way to be registered automatically when the plugin is added to genkit.Init, so the Dev UI and cross-runtime callers can address it by name.
A way to keep plugin-level state (an HTTP client, a logger, a database handle) that isn’t part of the JSON-serializable config.

Both are handled by implementing ai.MiddlewarePlugin on the plugin struct and putting plugin-level state on unexported fields of the config struct. The plugin’s Middlewares method passes a prototype with those fields populated to ai.NewMiddleware, which captures the prototype in a build closure. JSON-dispatched calls (Dev UI or cross-runtime) recreate the config by value-copying that prototype, which preserves the unexported fields and overlays only the JSON config:

import (
    "context"
    "fmt"
    "io"
    "time"

    "github.com/genkit-ai/genkit/go/ai"
    "github.com/genkit-ai/genkit/go/core/api"
)

type Logger struct {
    Prefix string    `json:"prefix,omitempty"`
    out    io.Writer // unexported; preserved across JSON dispatch by value-copy
}

func (Logger) Name() string { return "mine/logger" }

func (l Logger) New(ctx context.Context) (*ai.Hooks, error) {
    return &ai.Hooks{
        WrapModel: func(ctx context.Context, p *ai.ModelParams, next ai.ModelNext) (*ai.ModelResponse, error) {
            start := time.Now()
            resp, err := next(ctx, p)
            fmt.Fprintf(l.out, "%s model call took %s\n", l.Prefix, time.Since(start))
            return resp, err
        },
    }, nil
}

type LoggerPlugin struct{ Out io.Writer }

func (p *LoggerPlugin) Name() string                          { return "mine/logger" }
func (p *LoggerPlugin) Init(ctx context.Context) []api.Action { return nil }

func (p *LoggerPlugin) Middlewares(ctx context.Context) ([]*ai.MiddlewareDesc, error) {
    return []*ai.MiddlewareDesc{
        ai.NewMiddleware("logs model call latency", Logger{out: p.Out}),
    }, nil
}

Application code then registers the plugin once during Init, which makes the middleware available everywhere by name:

g := genkit.Init(ctx, genkit.WithPlugins(
    &googlegenai.GoogleAI{},
    &LoggerPlugin{Out: os.Stderr},
))

resp, err := genkit.Generate(ctx, g,
    ai.WithPrompt("Hello"),
    ai.WithUse(Logger{Prefix: "[trace]"}),
)

When the Dev UI dispatches the same middleware with JSON like {"prefix": "[debug]"}, Genkit value-copies the prototype to recreate the config: out (which isn’t in JSON) is preserved from the plugin’s prototype, while the unmarshaled JSON overrides Prefix.

The built-in plugins/middleware package follows exactly this pattern. See plugin.go for a minimal real-world example.

Application-owned middleware

When your application code defines a middleware directly rather than wrapping it in a plugin, use genkit.DefineMiddleware to register it with the Genkit instance:

genkit.DefineMiddleware(g, "logs model call latency", Logger{out: os.Stderr})

Registration surfaces the middleware in the Dev UI and lets cross-runtime callers reference it by name. For pure Go use, registration is not required: passing a middleware value directly to ai.WithUse invokes its New method on the local fast path. Registration is what makes the middleware visible to the Dev UI.

Inline middleware

For ad-hoc middleware that doesn’t need a named type or Dev UI visibility, use ai.MiddlewareFunc:

ai.WithUse(ai.MiddlewareFunc(func(ctx context.Context) (*ai.Hooks, error) {
    return &ai.Hooks{
        WrapModel: func(ctx context.Context, p *ai.ModelParams, next ai.ModelNext) (*ai.ModelResponse, error) {
            log.Printf("model call: %d messages", len(p.Request.Messages))
            return next(ctx, p)
        },
    }, nil
}))

The adapter satisfies Middleware with a placeholder name. Inline middleware is resolved on the local fast path and never touches the registry, so the placeholder is fine.

Composition order

ai.WithUse(A, B, C) composes left to right with the first listed middleware as the outermost wrapper, like HTTP middleware: at call time the chain expands to A { B { C { actual } } }. Each layer’s next continuation runs the next inner layer:

ai.WithUse(
    &middleware.Retry{MaxRetries: 3},                  // outer: retries the whole inner stack
    &middleware.Fallback{Models: fallbackModels},      // inner: tries fallback models on failure
)
// effective chain: Retry { Fallback { model } }

Order matters. Retry outside Fallback retries the entire fallback cascade as a unit. Swap them and you’d retry the primary first and fall back only after exhausting retries.

For more complex examples of building custom middleware, you can refer to the source code of the built-in middleware in the Genkit GitHub repository.

Middleware in Dart

Genkit Dart uses a registry-based system for middleware, similar to plugins. This allows middleware to be resolved by name and configured via schemas, enabling support for the Genkit Developer UI.

Available middleware

The following middleware is available in Genkit Dart:

1. FileSystem middleware (`filesystem`)

import 'package:genkit/genkit.dart';

final response = await ai.generate(
  model: googleAI.gemini('gemini-flash-latest'),
  prompt: 'Create a hello world node app in the workspace',
  use: [
    filesystem(rootDirectory: './workspace'),
  ],
);

Configuration options:

rootDirectory (required): The root directory to which all filesystem operations are restricted.

Note: Unlike the JavaScript version, the Dart FileSystem middleware does not currently support allowWriteAccess or toolNamePrefix options and always enables write operations.

2. Skills middleware (`skills`)

Automatically scans a directory for SKILL.md files and injects them into the system prompt. It also provides a use_skill tool the model can use to retrieve more specific skills on demand.

import 'package:genkit/genkit.dart';

final response = await ai.generate(
  prompt: 'How do I run tests in this repo?',
  use: [
    skills(skillPaths: ['./skills']),
  ],
);

Configuration options:

skillPaths (optional): Paths to directories containing skills (defaults to ['skills']).

3. Tool approval middleware (`toolApproval`)

Restricts execution of tools to an approved list. If the model attempts to call an unapproved tool, it throws a ToolInterruptException allowing you to prompt the user for manual confirmation before resuming.

import 'package:genkit/genkit.dart';

final response = await ai.generate(
  prompt: 'write a file',
  tools: [writeFileTool],
  use: [
    toolApproval(approved: []), // Empty list means call triggers interrupt
  ],
);

if (response.finishReason == FinishReason.interrupted) {
  final part = response.interrupts.first;

  // Ask user for approval...
  final approved = true; // Assume user approved

  // Resume execution
  final response2 = await ai.generate(
    messages: response.messages,
    resume: [
      InterruptResponse(part, approved),
    ],
    use: [
      toolApproval(approved: []),
    ],
  );
}

Configuration options:

approved (optional): List of approved tool names.

4. Retry middleware (`retry`)

Automatically retries failed model generations on transient error codes (like RESOURCE_EXHAUSTED, UNAVAILABLE) using exponential backoff with jitter.

import 'package:genkit/genkit.dart';

final response = await ai.generate(
  model: googleAI.gemini('gemini-flash-latest'),
  prompt: 'Reliable request',
  use: [
    retry(maxRetries: 3),
  ],
);

Configuration options:

maxRetries (optional): Maximum number of retry attempts (default: 3).
statuses (optional): A list of StatusCodes constants that should trigger a retry. Defaults to UNAVAILABLE, DEADLINE_EXCEEDED, RESOURCE_EXHAUSTED, ABORTED, INTERNAL.
initialDelayMs (optional): The initial delay before the first retry (default: 1000).
maxDelayMs (optional): The maximum capped delay between retries (default: 60000).
backoffFactor (optional): Exponential backoff multiplier (default: 2.0).
noJitter (optional): If false, adds a random factor (Full Jitter) to the delay (default: false).
retryModel (optional): Whether to retry model calls (default: true).
retryTools (optional): Whether to retry tool calls (default: false).

Note: The onError callback is available when instantiating RetryMiddleware directly, but cannot be passed via the retry() helper.

To use the retry() helper, you must register the RetryPlugin when initializing Genkit:

final ai = Genkit(
  plugins: [
    RetryPlugin(),
  ],
);

Building your own custom middleware

To build a production-ready middleware in Dart that integrates with the Genkit UI, you should follow the registered middleware pattern, which consists of four parts:

1. Define the configuration schema

Use the @Schema() annotation to define the configuration options for your middleware.

import 'package:schemantic/schemantic.dart';

part 'logger.g.dart';

@Schema()
abstract class LoggerOptions {
  bool? get enableColor;
  int? get maxLogLength;
}

2. Implement the middleware logic

Create a class that extends GenerateMiddleware and override the hooks you need (model, tool, or generate).

class LoggerMiddleware extends GenerateMiddleware {
  final bool enableColor;
  final int maxLogLength;

  LoggerMiddleware({
    this.enableColor = false,
    this.maxLogLength = 1000,
  });

  @override
  Future model(
    ModelRequest request,
    dynamic ctx,
    dynamic next,
  ) async {
    // Custom interception logic here...
    return next(request, ctx);
  }
}

3. Define the middleware and plugin

Use defineMiddleware to link your schema and implementation, and expose it via a GenkitPlugin.

class LoggerPlugin extends GenkitPlugin {
  @override
  String get name => 'logger';

  @override
  List middleware() => [
    defineMiddleware(
      name: 'logger',
      configSchema: LoggerOptions.schema,
      create: ([LoggerOptions? config]) => LoggerMiddleware(
        enableColor: config?.enableColor ?? false,
        maxLogLength: config?.maxLogLength ?? 1000,
      ),
    ),
  ];
}

4. Create the DX helper function

Create a factory function that returns a GenerateMiddlewareRef for ergonomic use.

GenerateMiddlewareRef logger({
  bool? enableColor,
  int? maxLogLength,
}) {
  return middlewareRef(
    name: 'logger',
    config: LoggerOptions(
      enableColor: enableColor,
      maxLogLength: maxLogLength,
    ),
  );
}

To use your custom middleware:

final ai = Genkit(
  plugins: [LoggerPlugin()],
);

final response = await ai.generate(
  model: customModel,
  prompt: 'Hello world',
  use: [
    logger(enableColor: true, maxLogLength: 500),
  ],
);

For more complex examples of building custom middleware, you can refer to the source code of the built-in middleware in the Genkit Dart GitHub repository.

Middleware

Installation

Available middleware

1. FileSystem middleware (filesystem)

2. Skills middleware (skills)

3. Tool approval middleware (toolApproval)

4. Retry middleware (retry)

5. Fallback middleware (fallback)

Building your own custom middleware

Installation

Available middleware

1. Retry middleware (Retry)

2. Fallback middleware (Fallback)

3. Tool approval middleware (ToolApproval)

4. Skills middleware (Skills)

5. Filesystem middleware (Filesystem)

Building your own custom middleware

When each hook fires

A simple example

Sharing state across hooks

Plugin-provided middleware and plugin-level state

Application-owned middleware

Inline middleware

Composition order

Middleware in Dart

Available middleware

1. FileSystem middleware (filesystem)

2. Skills middleware (skills)

3. Tool approval middleware (toolApproval)

4. Retry middleware (retry)

Building your own custom middleware

1. Define the configuration schema

2. Implement the middleware logic

3. Define the middleware and plugin

4. Create the DX helper function

1. FileSystem middleware (`filesystem`)

2. Skills middleware (`skills`)

3. Tool approval middleware (`toolApproval`)

4. Retry middleware (`retry`)

5. Fallback middleware (`fallback`)

1. Retry middleware (`Retry`)

2. Fallback middleware (`Fallback`)

3. Tool approval middleware (`ToolApproval`)

4. Skills middleware (`Skills`)

5. Filesystem middleware (`Filesystem`)

1. FileSystem middleware (`filesystem`)

2. Skills middleware (`skills`)

3. Tool approval middleware (`toolApproval`)

4. Retry middleware (`retry`)