Flemma uses registry patterns throughout – tools, approval resolvers, sandbox backends, credential resolvers, and personalities are all pluggable. This guide covers the extension points that don't have a dedicated document, and links to those that do.
Flemma emits User autocmds at lifecycle boundaries. External plugins listen with standard Neovim autocommand APIs – no Flemma-specific setup required.
| Event name | Autocmd pattern | Payload fields | When it fires |
|---|---|---|---|
| request:sending | FlemmaRequestSending | bufnr | Just before an API request is sent |
| request:finished | FlemmaRequestFinished | bufnr, status ("completed", "cancelled", or "errored"), request? (flemma.session.Request — present on completed status with pricing info) | After an API request completes (any outcome) |
| tool:executing | FlemmaToolExecuting | bufnr, tool_name, tool_id | When a tool invocation starts execution |
| tool:completed | FlemmaToolCompleted | bufnr, tool_name, tool_id, status ("success" or "error") | When a tool invocation completes |
| usage:estimated | FlemmaUsageEstimated | bufnr | When a buffer's token estimate cache changes |
| config:updated | FlemmaConfigUpdated | (none) | After runtime configuration changes (see note below) |
| boot:complete | FlemmaBootComplete | (none) | After all async tool sources finish loading |
| sink:created | FlemmaSinkCreated | bufnr, name | When a new output buffer (sink) is created |
| sink:destroyed | FlemmaSinkDestroyed | bufnr, name | When an output buffer (sink) is destroyed |
| conversation:idle | FlemmaConversationIdle | bufnr | When the conversation reaches idle after a response |
| job:submitted | FlemmaJobSubmitted | bufnr, job_id, tool_id, tool_name, active_count | When a tool is moved to background execution |
| job:completed | FlemmaJobCompleted | bufnr, job_id, tool_id, tool_name, success, active_count | When a background job result is delivered |
| autopilot:resume-scheduled | FlemmaAutopilotResumeScheduled | bufnr, delay_ms | When autopilot schedules a debounced auto-continue |
| autopilot:resume-cancelled | FlemmaAutopilotResumeCancelled | bufnr | When a scheduled auto-continue is cancelled |
| autopilot:resumed | FlemmaAutopilotResumed | bufnr | When autopilot fires after the resume delay |
> **Warning**
>
> The config:updated event is not yet dispatched consistently across all config mutation paths. Today it only fires from :Flemma switch (provider switching). Other mutations — frontmatter changes, programmatic config.apply() calls — do not emit it yet. Treat it as a best-effort signal for now.
```lua
vim.api.nvim_create_autocmd("User", {
  pattern = "FlemmaRequestFinished",
  callback = function(ev)
    if ev.data.status == "completed" then
      vim.notify("Request finished for buffer " .. ev.data.bufnr)
    end
  end,
})
```

Payload fields are available on ev.data. The data table is always present (never nil), even for events with no payload fields like boot:complete. Errors in consumer callbacks are caught and logged – they never crash the request pipeline.
Internal hook names use domain:action format (e.g., request:sending). The autocmd pattern is derived by TitleCasing each segment and prepending Flemma:
- request:sending → FlemmaRequestSending
- tool:completed → FlemmaToolCompleted
- Hyphenated words are split: tool-use:completed → FlemmaToolUseCompleted
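The derivation rule can be sketched as a small Lua function. This is an illustrative reimplementation of the mapping described above, not Flemma's actual internal helper:

```lua
-- Sketch: derive the autocmd pattern from a domain:action hook name.
local function autocmd_pattern(hook_name)
  local parts = {}
  -- Split on ":" and "-", TitleCase each word, then concatenate.
  for word in hook_name:gmatch("[^:%-]+") do
    parts[#parts + 1] = word:sub(1, 1):upper() .. word:sub(2)
  end
  return "Flemma" .. table.concat(parts)
end

print(autocmd_pattern("request:sending"))   -- FlemmaRequestSending
print(autocmd_pattern("tool-use:completed")) -- FlemmaToolUseCompleted
```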
The built-in bufferline integration uses hooks to track busy state:
```lua
vim.api.nvim_create_autocmd("User", {
  pattern = "FlemmaRequestSending",
  callback = function(ev) increment_busy(ev.data.bufnr) end,
})
vim.api.nvim_create_autocmd("User", {
  pattern = "FlemmaRequestFinished",
  callback = function(ev) decrement_busy(ev.data.bufnr) end,
})
```

In addition to User autocmds, hooks support direct Lua callbacks via hooks.on(). Internal subscribers fire synchronously before the autocmd, in registration order, with per-subscriber error isolation.
```lua
local hooks = require("flemma.hooks")

local handle = hooks.on("job:completed", function(data)
  if data.active_count == 0 then
    vim.notify("All background jobs finished")
  end
end)

-- Later: unsubscribe
handle:off()
```

hooks.on(name, callback) returns a handle with an :off() method. Prefer autocmds for external plugins; prefer hooks.on() when you need guaranteed ordering relative to other subscribers or want to avoid the ev.data unwrapping overhead.
Flemma resolves API keys and tokens through a priority-based resolver chain. The first resolver that finds a credential wins. Results are cached with TTL awareness to avoid repeated lookups.
| Priority | Resolver | Platform | What it checks |
|---|---|---|---|
| 100 | Environment | All | {SERVICE}_{KIND} env var (e.g., ANTHROPIC_API_KEY), then aliases |
| 50 | Secret Service | Linux | secret-tool lookup service {service} key {kind} (GNOME Keyring) |
| 50 | Keychain | macOS | security find-generic-password -s {service} -a {kind} -w |
| 25 | Gcloud | All (w/ CLI) | gcloud auth print-access-token (only for access_token kind) |
1. Cache check – if a cached credential exists and its TTL hasn't expired, return it immediately.
2. Resolver iteration – try each resolver in priority order (highest first). Each resolver's supports() method filters by platform, credential kind, or service.
3. First match wins – the first resolver that returns a value is used. The result is cached.
4. Fallback – if no resolver succeeds, a warning notification lists every resolver that was tried and why it couldn't help (e.g., "ANTHROPIC_API_KEY not set", "secret-tool not found on PATH", "gcloud: executable not found"). This diagnostic output helps pinpoint which resolver to configure.
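The flow above amounts to something like the following simplified sketch. It is not Flemma's actual code (the real chain also threads a context object through each call), but it captures the cache-then-priority-then-first-match logic:

```lua
-- Simplified sketch of the credential resolution chain (illustrative only).
local function resolve(credential, resolvers, cache)
  local key = credential.kind .. ":" .. credential.service
  local cached = cache[key]
  if cached and (not cached.expires_at or cached.expires_at > os.time()) then
    return cached.value -- cache hit: skip the resolver chain entirely
  end
  -- Highest priority first.
  table.sort(resolvers, function(a, b) return a.priority > b.priority end)
  for _, resolver in ipairs(resolvers) do
    if resolver:supports(credential) then
      local result = resolver:resolve(credential)
      if result then -- first match wins
        cache[key] = {
          value = result.value,
          expires_at = result.ttl and (os.time() + result.ttl) or nil,
        }
        return result.value
      end
    end
  end
  return nil -- all resolvers failed; caller surfaces the diagnostics
end
```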
The environment resolver constructs the variable name from the credential's service and kind:
| Service | Kind | Variable checked |
|---|---|---|
| anthropic | api_key | ANTHROPIC_API_KEY |
| openai | api_key | OPENAI_API_KEY |
| vertex | access_token | VERTEX_AI_ACCESS_TOKEN |
| vertex | service_account | VERTEX_SERVICE_ACCOUNT |
| moonshot | api_key | MOONSHOT_API_KEY |
Credentials can also define aliases – alternative variable names checked in order after the convention.
Credentials are cached per kind:service key (e.g., api_key:anthropic). The effective TTL is:
effective_ttl = base_ttl * ttl_scale
Where base_ttl comes from the resolver result or credential config, and ttl_scale (default 1.0) allows adjusting the refresh window (e.g., 0.8 to refresh at 80% of token lifetime). When no TTL is set, the credential caches indefinitely until manually invalidated.
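As a worked example of the formula, assume a resolver reports a one-hour token lifetime and ttl_scale has been set to 0.8 (the numbers here are illustrative):

```lua
-- Worked example of the effective-TTL computation.
local base_ttl = 3600  -- one-hour access token, as reported by the resolver
local ttl_scale = 0.8  -- refresh at 80% of the token lifetime
local effective_ttl = base_ttl * ttl_scale
-- effective_ttl is 2880 seconds: the cached credential is re-resolved
-- after 48 minutes, comfortably before the token actually expires.
```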
Built-in resolvers can be configured through the secrets config namespace. Currently only the gcloud resolver has configurable options:
```lua
require("flemma").setup({
  secrets = {
    gcloud = {
      path = "/usr/local/bin/gcloud", -- override the gcloud binary path (default: "gcloud")
    },
  },
})
```

This is useful on NixOS, Guix, or systems where the gcloud CLI is not on $PATH.
```lua
local secrets = require("flemma.secrets")

secrets.register("my_vault", {
  name = "my_vault",
  priority = 60, -- between environment (100) and keyring (50)
  supports = function(self, credential)
    return credential.service == "my-service"
  end,
  resolve = function(self, credential)
    local value = vim.fn.system("vault read -field=value secret/" .. credential.kind)
    if vim.v.shell_error == 0 then
      return { value = vim.trim(value), ttl = 300 }
    end
    return nil -- pass to next resolver
  end,
})
```

The resolver contract:
- `supports(self, credential, ctx)` → `boolean` – whether this resolver can attempt this credential. `ctx` is a `SecretsContext` (see below).
- `resolve(self, credential, ctx)` → `{ value: string, ttl?: integer } | nil` – the credential value, or `nil` to pass. `ctx` is a `SecretsContext`.
Resolvers receive a SecretsContext that provides:
- `ctx:get_config()` → `table|nil` – returns the resolver's config subtree from `secrets.<resolver_name>` (e.g., `secrets.gcloud` for the gcloud resolver). Returns a deep copy; modifications don't affect global config.
- `ctx:diagnostic(message)` – record a diagnostic explaining why this resolver couldn't help. These are surfaced in the failure notification when all resolvers fail.
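Putting the context API together with the resolver contract, a resolver can read its own config subtree and record diagnostics for the failure notification. The my_vault resolver and its path config key below are hypothetical, used only to illustrate the two context methods:

```lua
local secrets = require("flemma.secrets")

secrets.register("my_vault", {
  name = "my_vault",
  priority = 60,
  supports = function(self, credential, ctx)
    return credential.service == "my-service"
  end,
  resolve = function(self, credential, ctx)
    -- Reads secrets.my_vault from user config (a deep copy, safe to inspect).
    local cfg = ctx:get_config() or {}
    local binary = cfg.path or "vault"
    if vim.fn.executable(binary) == 0 then
      ctx:diagnostic(binary .. " not found on PATH")
      return nil -- pass to the next resolver; diagnostic shown if all fail
    end
    local value = vim.fn.system(binary .. " read -field=value secret/" .. credential.kind)
    if vim.v.shell_error ~= 0 then
      ctx:diagnostic("vault read failed for " .. credential.kind)
      return nil
    end
    return { value = vim.trim(value), ttl = 300 }
  end,
})
```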
```lua
local secrets = require("flemma.secrets")

secrets.invalidate("api_key", "anthropic") -- invalidate a specific credential
secrets.invalidate_all()                   -- clear the entire cache
```

These extension points have full documentation in their respective pages:
| Extension point | What it does | Documentation |
|---|---|---|
| Custom tools | Register tools the model can call | docs/tools.md – Registering custom tools |
| Async tool sources | Resolve tool definitions from external processes/APIs | docs/tools.md – Async tool definitions |
| Approval resolvers | Priority-based chain for tool approval decisions | docs/tools.md – Approval resolvers |
| Sandbox backends | Platform-specific sandbox enforcement | docs/sandbox.md – Custom backends |
| Personalities | Dynamic system prompt generators | docs/personalities.md |
| Template populators | Custom globals for {{ }} and {% %} expressions | docs/templates.md – Extending the Environment |
| Frontmatter parsers | Custom frontmatter languages (e.g., YAML) | docs/templates.md – Custom frontmatter parsers |
| Frontmatter operators | JSON config operators ($set, $append, etc.) | docs/templates.md – JSON frontmatter operators |
| Preview formatters | Custom tool preview rendering in pending placeholders | docs/tools.md – Custom preview formatters |