
Feat: Add tool call caching #53

Open
Mir47-47 wants to merge 6 commits into thisisthepy:develop from Mir47-47:develop


@Mir47-47

Server consolidation:
I modified server.py to retrieve the session's 'tool_call_cache' for both standard and streaming chat requests and to forward it to the model.
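For reviewers, the server-side change can be pictured roughly as below. This is only a sketch under stated assumptions: `SESSIONS`, `get_tool_call_cache`, `EchoModel`, `handle_chat`, and `handle_chat_stream` are hypothetical names for illustration, not the actual code in server.py. The point is that both request paths perform the same cache lookup before calling the model.

```python
from typing import Any, Dict, Iterator, List

# Hypothetical in-memory session store; the real server.py may keep sessions elsewhere.
SESSIONS: Dict[str, Dict[str, Any]] = {}


def get_tool_call_cache(session_id: str) -> List[Dict[str, Any]]:
    """Return the session's 'tool_call_cache', creating an empty one if needed."""
    session = SESSIONS.setdefault(session_id, {})
    return session.setdefault("tool_call_cache", [])


class EchoModel:
    """Stand-in model that only reports how many cached entries it received."""

    def chat(self, prompt: str, tool_call_cache: List[Dict[str, Any]]) -> str:
        return f"{prompt} (cache entries: {len(tool_call_cache)})"

    def stream_chat(
        self, prompt: str, tool_call_cache: List[Dict[str, Any]]
    ) -> Iterator[str]:
        # Emit the same reply one whitespace-separated token at a time.
        for token in self.chat(prompt, tool_call_cache).split():
            yield token


def handle_chat(model: EchoModel, session_id: str, prompt: str) -> str:
    # Standard request: look up the session cache and forward it to the model.
    return model.chat(prompt, tool_call_cache=get_tool_call_cache(session_id))


def handle_chat_stream(
    model: EchoModel, session_id: str, prompt: str
) -> Iterator[str]:
    # Streaming request: same lookup, so both paths see the same cache.
    yield from model.stream_chat(
        prompt, tool_call_cache=get_tool_call_cache(session_id)
    )
```

Routing both handlers through one lookup helper keeps the standard and streaming paths from drifting apart.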

Update Tool Features and Schema:
To retrieve cached tool call results, I added a new function 'get_cache_data' to 'cache.py' and registered it in the tool function schema and implementation.
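A minimal sketch of what such a function and its registration might look like follows. Only the name 'get_cache_data' comes from this PR; the signature, the `tool_name` filter, the entry layout, the OpenAI-style schema shape, and the `TOOL_SCHEMAS` / `TOOL_IMPLEMENTATIONS` registries are assumptions made for illustration.

```python
from typing import Any, Dict, List, Optional


def get_cache_data(
    tool_call_cache: List[Dict[str, Any]],
    tool_name: Optional[str] = None,
) -> List[Dict[str, Any]]:
    """Return cached tool call results, optionally filtered by tool name."""
    if tool_name is None:
        # Return a copy so callers cannot mutate the session cache by accident.
        return list(tool_call_cache)
    return [entry for entry in tool_call_cache if entry.get("name") == tool_name]


# Assumed schema shape (OpenAI-style function calling); the PR's schema may differ.
GET_CACHE_DATA_SCHEMA = {
    "type": "function",
    "function": {
        "name": "get_cache_data",
        "description": "Retrieve results of earlier tool calls from the session cache.",
        "parameters": {
            "type": "object",
            "properties": {
                "tool_name": {
                    "type": "string",
                    "description": "Only return entries produced by this tool.",
                }
            },
            "required": [],
        },
    },
}

# Hypothetical registries: one for the schemas shown to the model,
# one mapping tool names to their Python implementations.
TOOL_SCHEMAS = [GET_CACHE_DATA_SCHEMA]
TOOL_IMPLEMENTATIONS = {"get_cache_data": get_cache_data}
```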

I updated the system prompt in 'qwen3/model.py' to instruct the model to follow the cache priority principle when handling tool calls.
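In this PR the cache priority principle is enforced through the prompt, not through code. For readers who want the intended behavior spelled out, the same rule could be sketched programmatically as below. The helper `call_tool_cache_first`, the cache entry layout, and the JSON-based key are all assumptions for illustration, not part of the change.

```python
import json
from typing import Any, Callable, Dict, List


def call_tool_cache_first(
    name: str,
    arguments: Dict[str, Any],
    tools: Dict[str, Callable[..., Any]],
    tool_call_cache: List[Dict[str, Any]],
) -> Any:
    """Reuse a cached result when one matches; otherwise call the tool and cache it."""
    # Canonical key so that argument order does not cause a spurious miss.
    key = json.dumps({"name": name, "arguments": arguments}, sort_keys=True)

    # Step 1: check the cache before doing any new work.
    for entry in tool_call_cache:
        if entry["key"] == key:
            return entry["result"]

    # Step 2: no relevant entry, so invoke the tool and remember the result.
    result = tools[name](**arguments)
    tool_call_cache.append({"key": key, "result": result})
    return result
```

A prompt-based rule lets the model judge relevance more loosely than exact argument matching, at the cost of being less predictable than a lookup like this one.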

Mir47-47 and others added 6 commits August 12, 2025 20:37
# Conflicts:
#	api/src/main/models/base.py
#	api/src/main/models/llama3/model.py
#	api/src/main/utils/__init__.py
#	api/src/main/utils/cache.py
Updated the tool call rules to emphasize a cache-first principle, detailing a step-by-step process for checking and utilizing cached data before invoking new tool calls. This change aims to improve efficiency and ensure cached results are prioritized when relevant.
Deleted the api.zip binary file
