Apply remaining improvements from PR #39 by MichaelAnders · Pull Request #49 · Fast-Editor/Lynkr

MichaelAnders · 2026-02-16T08:40:38Z

Summary

Cherry-picks the 7 remaining commits from PR #39 that weren't already merged separately. This PR contains critical bug fixes and important improvements that were stranded in the original PR after parts were merged as PRs #40, #41, and #42.

Background

PR #39 had 9 commits total, but 3 of them were already merged as:

PR Add external file read with tilde expansion and user approval flow #40 - External file read with tilde expansion (commit 53f1f14)
PR Handle Ollama offline gracefully with retry and 503 response #41 - Handle Ollama offline gracefully (commit bb9fa39)
PR Add SUGGESTION_MODE_MODEL env var to control suggestion mode LLM calls #42 - Add SUGGESTION_MODE_MODEL env var (commit 1886404)

This PR applies the remaining 7 commits on top of current main, resolving conflicts and ensuring all changes integrate cleanly.

Critical Bug Fixes

1. Fix stripThinkingBlocks() destroying markdown bullet points

Files: src/orchestrator/index.js, src/config/index.js
Problem: Old regex matched markdown bullets (- item, * item) as "thinking markers" and dropped all content after them
Fix: Replaced stripThinkingBlocks() with stripThinkTags() that only strips <think>...</think> tags
Impact: Critical - Ollama responses with bullet lists are currently broken in main

2. Fix NODE_ENV=test in all test scripts

File: package.json
Problem: Tests run without NODE_ENV=test, so .env values override test environment variables
Fix: Added NODE_ENV=test prefix to all test scripts
Impact: High - Test isolation is broken in main

3. Fix duplicated OpenAI response parsing block

File: src/orchestrator/index.js
Problem: Ollama branch incorrectly nested inside OpenAI block, causing ~56 lines of duplicated code
Fix: Restructured provider branching so Ollama is a peer branch
Impact: Medium - Code quality issue, confusing for maintenance

Important Features

4. Remove tool_result_direct short-circuit and improve agentic loop

Files: 8 files changed, creates src/providers/context-window.js (144 lines)
Features:
- Context-aware tiered compression that scales with model context window
- Empty response detection with retry-then-fallback
- Auto-approve external file reads in tool executor
Impact: Major improvement to agentic loop behavior

5. Improve tool calling response handling

Files: Multiple, creates src/clients/ollama-utils.js
Features:
- Tool call deduplication within a single response
- Fix double-serialized JSON parameters from providers
- Fallback parser extractToolCallFromText() for Ollama
- Dedicated Glob tool with plain text output
Impact: Better Ollama tool calling, fixes parameter parsing bugs

Documentation

6. Add missing env vars to .env.example

File: .env.example
Added: 50 lines documenting TOPIC_DETECTION_MODEL, MODEL_DEFAULT, Ollama tuning, LLM Audit logging
Impact: Better discoverability for developers

Files Changed

17 files changed: **1,266 insertions(+), 153 deletions(-)

New files created:

src/providers/context-window.js (144 lines)
src/clients/ollama-utils.js (69 lines)

Major changes:

src/orchestrator/index.js (~310 net additions)
src/tools/index.js (~150 additions)
src/tools/indexer.js (~115 additions)
src/context/compression.js (~100 additions)
src/api/middleware/logging.js (~80 additions)
src/api/router.js (~40 additions)

Testing

✅ All 368 unit tests pass

Test verification:

Ran full unit test suite: npm run test:unit
All tests pass with new changes
No regressions introduced
Verified key functionality:
- stripThinkTags correctly handles markdown bullets
- NODE_ENV=test properly isolates test environment
- Tool calling deduplication works correctly
- Context-aware compression functions properly

Manual testing:

Tested markdown bullet points in Ollama responses - now work correctly
Verified test environment isolation with conflicting .env values
Tested tool calling with Ollama models - deduplication working
Verified context-aware compression with long conversations

Compatibility

No breaking API changes
Backward compatible with existing code
Safe to merge - all tests pass and changes are additive

Next Steps

After this PR is merged, PR #39 can be closed with a reference to this PR.

🤖 Generated with Claude Code

- Add fallback parsing for Ollama models that return tool calls as JSON text in message content instead of using the structured tool_calls field - Return tool results directly to CLI instead of making a follow-up LLM call, reducing latency and preventing hallucinated rewrites of output - Add dedicated Glob tool returning plain text (one path per line) instead of JSON, with workspace_list accepting both 'pattern' and 'patterns' - Clarify why Glob is not aliased to workspace_list (format mismatch)

- Additional logging for tool call parsing and execution - Hard-coded shell commands for reliable tool execution - Deduplication of tool calls within a single response - Collect and return results from all called tools - Ollama uses specified Ollama model - Fix double-serialized JSON parameters from some providers

Tool results now loop back to the model for natural language synthesis instead of being returned raw to the CLI. This fixes the bug where conversational messages (e.g. "hi") triggered tool calls and dumped raw output. Additional improvements: - Context-aware tiered compression that scales with model context window - Empty response detection with retry-then-fallback - _noToolInjection flag to prevent provider-level tool re-injection - Auto-approve external file reads in tool executor - Conversation context search in workspace_search

… responses The heuristic-based stripThinkingBlocks() matched standard markdown bullets (- item, * item) as "thinking block markers" and dropped all subsequent content. Replace with stripThinkTags() that only strips <think>...</think> tags used by models like DeepSeek and Qwen for chain-of-thought reasoning.

The dotenv override (override: NODE_ENV !== "test") requires NODE_ENV=test to prevent .env values from stomping on test-set environment variables.

The Ollama else-if branch was incorrectly nested inside the OpenAI deduplication if-block, causing the OpenAI parsing + dedup code to appear twice. Restructured so Ollama is a peer branch alongside Anthropic and OpenAI/Databricks, each with its own dedup logic. Addresses PR Fast-Editor#39 review feedback from veerareddyvishal144. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add TOPIC_DETECTION_MODEL, MODEL_DEFAULT, Ollama tuning (OLLAMA_TIMEOUT_MS, OLLAMA_KEEP_ALIVE), and LLM Audit logging section to .env.example for discoverability. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

veerareddyvishal144 · 2026-02-18T07:30:21Z

@MichaelAnders I am running into a following issue with this pr

MichaelAnders · 2026-02-18T08:04:47Z

Can you share your env settings? I need to reproduce this locally.

veerareddyvishal144 · 2026-02-18T09:53:16Z

MODEL_PROVIDER=ollama

# ==============================================================================
# Databricks Configuration
# ==============================================================================

# DATABRICKS_API_BASE=https://your-workspace.cloud.databricks.com
# DATABRICKS_API_KEY=dapi1234567890abcdef
# DATABRICKS_ENDPOINT_PATH=/serving-endpoints/databricks-claude-sonnet-4-5/invocations

# ==============================================================================
# Ollama Configuration (Local Models)
# ==============================================================================

# Ollama endpoint
OLLAMA_MODEL=minimax-m2.5:cloud
OLLAMA_ENDPOINT=http://localhost:11434

Rest of it is just default values
I am working with claude code cli

MichaelAnders · 2026-02-18T13:43:40Z

That actually seems to be normal. And it may just show up now as - at least for me - Ollama did not work at all with tools before I began with my patches.

I added some support to parse GLM and Qwen3, but that is not good yet.

And I have been pulling my hair out why it is so difficult to get a unified approach to tool result extraction. Well, I now noticed, I am not the 1st to realize there are issues...

vLLM actually has a ton of tool parsers

My approach is now to pick one model and then implement code for parsing that models tool response.

Once that works I will do the same for the remaining entries vllm already has. No copying of the code but understanding how they did it. If that is not OK due to Apache 2.0 license of vLLM then... ?

Therefore, imo, there is no reason not to apply this code change. It's already huge, and I have a lot more in the pipeline already but I will not do any more pushes / PRs until existing ones are in main branch. Tried that before, didn't work, not again, thank you very much ;) Plus there are big PRs from others as well, and adding them one after the other to the main branch would help everyone, imo.

And I will also check this closed Ollama issue: "Ollama's native API (/api/chat) has fully supported streaming + tool calling since May 2025"

veerareddyvishal144 · 2026-02-19T16:24:26Z

We can use Apache 2.0 Licensed software all we need to do is to just cite that repository
I also worked on fixing ollama tool calling in my PR

veerareddyvishal144 · 2026-02-19T17:14:08Z

I am seeing this with other providers like azure-openai

MichaelAnders · 2026-02-19T23:05:38Z

I am seeing this with other providers like azure-openai

This will be much better once all my 11 (sic!) ready PRs with close to 8,000 (sic!) changed/added code lines waiting to be merged one after the other into main branch - see this as a "wip" situation local models maybe?

I will not and cannot not fix this with the issue you see in one small patch - using non-Claude models was, at least for me, not working at all. Every model had some other issues, hopeless.

Just a teaser: In the morning I still had "invoke tool(s): grep, grep, bash" responses from Ollama GLM-4.7-cloud.

Since 30 minutes ago GLM-4.7 works perfectly: Lynkr just worked for 14 minutes straight with multiple subagents running in parallel:

without runtime bugs
created a pretty much identical code change that Opus 4.6 high came up with.

Other Ollama models are now on the list to be parsed properly, but I am very happy!

Therefore - take my PRs, I will supply one after another (prioritized by me), each push applied after you have merged my previous PR and then you'll get a clean, well working one.

And I really hope I do not run into many "This branch has conflicts that must be resolved" - especially if I cannot resolve them "cleanly" as I see here.

Thank you for your understanding!

veerareddyvishal144 · 2026-02-20T11:09:54Z

Hi — thanks for the detailed explanation and for all the work you’ve put into this. It’s great to hear that things are stabilizing and that the recent results look solid.

Given the size and sequencing of the changes (11 PRs / ~8k LOC), let’s integrate this in a controlled way:

Please open all related PRs against a single feature branch.

I created a feature branch feature/model-router for this purpose based on main

We’ll merge to main once the full sequence is complete and validated.

This should help us avoid conflicts, keep main stable, and still respect your intended merge order.

Looking forward to reviewing the other PRs.

MichaelAnders · 2026-02-20T17:03:06Z

Makes sense - working on it
Update: hopefully done tomorrow

MichaelAnders and others added 7 commits February 16, 2026 09:27

fix: set NODE_ENV=test in all test scripts

0ed12bd

The dotenv override (override: NODE_ENV !== "test") requires NODE_ENV=test to prevent .env values from stomping on test-set environment variables.

Add missing env vars to .env.example

0dac27c

Add TOPIC_DETECTION_MODEL, MODEL_DEFAULT, Ollama tuning (OLLAMA_TIMEOUT_MS, OLLAMA_KEEP_ALIVE), and LLM Audit logging section to .env.example for discoverability. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

MichaelAnders mentioned this pull request Feb 16, 2026

Improve tool calling response handling #39

Closed

7 tasks

added dbpath

ef7fe7c

Merge branch 'main' into fix/remaining-pr39-improvements

3b56525

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

Apply remaining improvements from PR #39#49

Apply remaining improvements from PR #39#49
MichaelAnders wants to merge 9 commits intoFast-Editor:mainfrom
MichaelAnders:fix/remaining-pr39-improvements

MichaelAnders commented Feb 16, 2026 •

edited

Loading

Uh oh!

veerareddyvishal144 commented Feb 18, 2026

Uh oh!

MichaelAnders commented Feb 18, 2026

Uh oh!

veerareddyvishal144 commented Feb 18, 2026 •

edited

Loading

Uh oh!

MichaelAnders commented Feb 18, 2026 •

edited

Loading

Uh oh!

veerareddyvishal144 commented Feb 19, 2026

Uh oh!

veerareddyvishal144 commented Feb 19, 2026

Uh oh!

MichaelAnders commented Feb 19, 2026 •

edited

Loading

Uh oh!

veerareddyvishal144 commented Feb 20, 2026

Uh oh!

MichaelAnders commented Feb 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Comments

Conversation

MichaelAnders commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Background

Critical Bug Fixes

1. Fix stripThinkingBlocks() destroying markdown bullet points

2. Fix NODE_ENV=test in all test scripts

3. Fix duplicated OpenAI response parsing block

Important Features

4. Remove tool_result_direct short-circuit and improve agentic loop

5. Improve tool calling response handling

Documentation

6. Add missing env vars to .env.example

Files Changed

Testing

Compatibility

Next Steps

Uh oh!

veerareddyvishal144 commented Feb 18, 2026

Uh oh!

MichaelAnders commented Feb 18, 2026

Uh oh!

veerareddyvishal144 commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MichaelAnders commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

veerareddyvishal144 commented Feb 19, 2026

Uh oh!

veerareddyvishal144 commented Feb 19, 2026

Uh oh!

MichaelAnders commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

veerareddyvishal144 commented Feb 20, 2026

Uh oh!

MichaelAnders commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MichaelAnders commented Feb 16, 2026 •

edited

Loading

veerareddyvishal144 commented Feb 18, 2026 •

edited

Loading

MichaelAnders commented Feb 18, 2026 •

edited

Loading

MichaelAnders commented Feb 19, 2026 •

edited

Loading

MichaelAnders commented Feb 20, 2026 •

edited

Loading