Skip to content

Output Language Inconsistency: Hardcoded English Preset Prompts Overriding User Input (Model Specific: Gemma-4-26B) #5

Description

@buddytian-jpg

Description
I am reporting a significant language-mixing issue when using the Gemma-4-26B Vision Language Model (VLM) within this suite. Despite explicitly setting both the custom_prompt and system_prompt to require Chinese-only output, the model continues to generate large amounts of English text.

Through code analysis of nodes.py, I have identified that the root cause is the hardcoded English strings within the preset_prompts dictionary. These English instructions are prepended to the user's prompt, creating a "language bias" that causes global models like Gemma-4 to default to English or use English for "Chain of Thought" reasoning.

Technical Evidence

  1. Source Code Analysis (from nodes.py)
    The plugin defines several presets in the nodes.py file that are entirely in English:

"Normal - Describe": "Describe this @."

"Prompt Style - Extreme Detailed": "Generate an extremely detailed and descriptive text-to-@ prompt from the @..."

In the process function, the plugin concatenates these English strings with the user's prompt:

Python
p = preset_prompts[preset_prompt].replace("#", custom_prompt.strip()).replace("@", "video" if video_input else "image")
user_content.append({"type": "text", "text": p})
This results in the model receiving a prompt like: "Describe this image. [User's Chinese Prompt]". For Gemma-4-26B, the initial English instruction sets a high-priority "language context," leading to English output.

  1. Visual Evidence (Verified from Screenshots)
    Gemma-4-26B (Failure Case): As seen in image_cd9f21.jpg and image_cdfa10.jpg, the model loader is explicitly using gemma-4-26B\gemma-4-26B-A4B-it-UD-IQ4_XS.gguf.

The Result: The output in the "Show Text" node contains an English <|channel|>thought block and English headers (e.g., "Main Focus", "Desk Items", "Background"), directly following the structure of the English preset instead of the Chinese instruction.

Steps to Reproduce
Load Gemma-4-26B (GGUF) using the Llama-cpp Model Loader.

Set preset_prompt to "Normal - Describe" or "Extreme Detailed".

Input a Chinese request in custom_prompt (e.g., "请详细描述图片内容").

Observe the output containing English headers, reasoning, or full English descriptions.

Suggested Fix
Localization of Presets: Allow the preset_prompts in nodes.py to be localized (Chinese version) or detect the user's input language.

User Control: When a custom_prompt is provided, the English preset_prompt should not be prepended unless the user specifically toggles a "Combine with Preset" option.

Prompt Weighting: Ensure that system instructions regarding language (e.g., "Respond strictly in Chinese") are placed after any preset templates to ensure they carry higher weight in the model's attention mechanism.

Attachments
image_cd9f21.jpg / image_cdfa10.jpg: Showing Gemma-4-26B generating English thought-chains and headers despite Chinese input.

Image

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions