auto-detect GPU arch and adapt XSched preemption level, remove hardco…#1

Open
kylin1019 wants to merge 1 commit into XpuOS:xsched from kylin1019:xsched
@kylin1019

This PR enables automatic GPU architecture detection for llama.cpp and adapts the XSched preemption level to the detected architecture at runtime. The hardcoded preemption levels are removed, so the level is chosen per GPU instead of being fixed, which supports compatibility across NVIDIA GPU platforms and eliminates runtime crashes.
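
The selection step described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the `PreemptLevel` enum, the `ArchRule` table, the `pick_preempt_level` helper, and above all the compute-capability thresholds are hypothetical placeholders. In a CUDA build the major version would typically be read from `cudaGetDeviceProperties` (the `major` field of `cudaDeviceProp`); here it is passed in as a plain integer so the sketch compiles without the CUDA toolkit.

```cpp
// Hypothetical level names for illustration; the PR's actual identifiers are not shown.
enum class PreemptLevel : int {
    Block = 1,       // least demanding level, used here as the safe fallback
    Deactivate = 2,
    Interrupt = 3,
};

// One rule: GPUs with compute-capability major >= min_major get `level`.
struct ArchRule {
    int min_major;
    PreemptLevel level;
};

// PLACEHOLDER thresholds, ordered from newest to oldest architecture.
// These values are assumptions made for the sketch, not a statement of what
// XSched supports on any given NVIDIA architecture.
static const ArchRule kRules[] = {
    {8, PreemptLevel::Interrupt},
    {7, PreemptLevel::Deactivate},
};

// Return the level of the first matching rule, or the lowest level when the
// architecture is older than every rule in the table.
inline PreemptLevel pick_preempt_level(int cc_major) {
    for (const ArchRule& rule : kRules) {
        if (cc_major >= rule.min_major) {
            return rule.level;
        }
    }
    return PreemptLevel::Block;
}
```

Falling back to the lowest level for an unrecognized architecture is one way to avoid requesting a preemption mode the device cannot provide, which appears to be the kind of crash the hardcoded levels could trigger.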
