[arXiv'26] Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study

by Zihao Zhao, Frederik Hauke, Juliana De Castilhos, Sven Nebelung, and Daniel Truhn

(Left) Despite highly overlapping visual patterns, some disease pairs can have totally different etiologies and managements, which makes imaging-only differentiation challenging and high-stakes (Right) The overview of our proposed Contrastive Agent REasoning (CARE). Two disease-specific agents generate opposing evidence from the same input image. A judge agent adjudicates the arguments, flags unsupported evidence, and outputs the final diagnosis in a training-free, zero-shot setting.

Prerequisite

For Gemini-based experiments:

conda env create -f environment.yml

For experiments based on open-source MLLMs:

uv venv vlm --python 3.10
source vlm/bin/activate
uv pip install -r uv_req.txt

Put the resized (512×512) version of mimic-cxr-jpg dataset and raw derm7pt dataset into ./data

Usage

For Geminis, specify your personal API Key in

client = genai.Client(api_key = "xxxxxxxxxxxxxxx")

and run

conda activate openai
python gemini_cxr_care.py

For open-source MLLMs, run the following command on slurm cluster

sbatch open-source-vlm.sh

or run the following command on your local computer as suggested in open-source-vlm.sh

python [derm/cxr]_agent_script_[/care].py --model_name medvlm --output_path OUTPUT_PATH

📎 Citation

If you find this repository useful for your work, please cite our arXiv paper:

@article{zhao2026can,
  title={Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study},
  author={Zhao, Zihao and Hauke, Frederik and De Castilhos, Juliana and Nebelung, Sven and Truhn, Daniel},
  journal={arXiv preprint arXiv:2602.22959},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
asset		asset
data		data
.DS_Store		.DS_Store
LICENSE		LICENSE
PROMPT.py		PROMPT.py
README.md		README.md
agent.py		agent.py
cxr_agent_script.py		cxr_agent_script.py
cxr_agent_script_care.py		cxr_agent_script_care.py
derm_agent_script.py		derm_agent_script.py
derm_agent_script_care.py		derm_agent_script_care.py
environment.yml		environment.yml
gemini_cxr_blind_care.py		gemini_cxr_blind_care.py
gemini_cxr_care.py		gemini_cxr_care.py
gemini_cxr_self_check_2x.py		gemini_cxr_self_check_2x.py
gemini_derm_blind_care.py		gemini_derm_blind_care.py
gemini_derm_care.py		gemini_derm_care.py
gemini_derm_self_check_2x.py		gemini_derm_self_check_2x.py
open-source-vlm.sh		open-source-vlm.sh
uv_req.txt		uv_req.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[arXiv'26] Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study

Prerequisite

Usage

📎 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

[arXiv'26] Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study

Prerequisite

Usage

📎 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages