feat: added the llama-server backend by PeterStaar-IBM · Pull Request #28 · docling-project/docling-agent

PeterStaar-IBM · 2026-05-21T10:41:14Z

Summary

Adds a new LlamaServerBackend for llama.cpp's llama-server OpenAI-compatible HTTP API (default
http://localhost:8080/v1).
Subclasses OpenAICompatibleBackend for consistency with LMStudioBackend and LiteLLMBackend.
Wires the new backend into the registry, exports, factory, and the BackendConfig.type literal so it can be
selected from task YAML via type: llama-server.
Updates the CLI task template, README, and the editor.yaml/writer.yaml/enrich.yaml task configs to list the
new option.

Usage

backend:
type: llama-server # mellea | ollama | lmstudio | litellm | llama-server
# base_url: http://localhost:8080/v1 # default
timeout: 120
models:
reasoning: gpt-oss-20b
writing: gpt-oss-20b

Test plan

uv run pytest tests/test_backend_factory.py tests/test_direct_backends.py
tests/test_task_model_backend_config.py — all 11 tests pass, including:
- test_create_llama_server_backend_from_config — verifies factory + default base URL.
- test_create_llama_server_backend_with_custom_base_url — verifies custom base_url is respected.
- test_llama_server_session_tracks_history — verifies session posts to /chat/completions and maintains chat
  history.
Smoke test against a locally running llama-server instance with a GGUF model.

Signed-off-by: Peter Staar <taa@zurich.ibm.com>

github-actions · 2026-05-21T10:41:24Z

✅ DCO Check Passed

Thanks @PeterStaar-IBM, all your commits are properly signed off. 🎉

mergify · 2026-05-21T10:41:34Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

🟢 Require two reviewer for test updates

Wonderful, this rule succeeded.

When test data is updated, we require two reviewers

#approved-reviews-by >= 1

codecov · 2026-05-21T10:43:41Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

ceberam

LGTM

feat: added the llama-server backend

8ab437b

Signed-off-by: Peter Staar <taa@zurich.ibm.com>

PeterStaar-IBM requested a review from ceberam May 21, 2026 10:41

ceberam approved these changes May 21, 2026

View reviewed changes

ceberam merged commit cb2ad4c into main May 21, 2026
11 checks passed

ceberam deleted the dev/add-llama-server-backend branch May 21, 2026 12:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: added the llama-server backend#28

feat: added the llama-server backend#28
ceberam merged 1 commit into
mainfrom
dev/add-llama-server-backend

PeterStaar-IBM commented May 21, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 21, 2026

Uh oh!

mergify Bot commented May 21, 2026 •

edited

Loading

Uh oh!

codecov Bot commented May 21, 2026

Uh oh!

ceberam left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

PeterStaar-IBM commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Usage

Test plan

Uh oh!

github-actions Bot commented May 21, 2026

Uh oh!

mergify Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge Protections

🟢 Enforce conventional commit

🟢 Require two reviewer for test updates

Uh oh!

codecov Bot commented May 21, 2026

Codecov Report

Uh oh!

ceberam left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

PeterStaar-IBM commented May 21, 2026 •

edited

Loading

mergify Bot commented May 21, 2026 •

edited

Loading