[codex] refactor llm prediction flow and hf backend tooling by Felix3322 · Pull Request #10 · scukeqi/Wisdom-Weasel

Felix3322 · 2026-03-19T15:04:54Z

Summary

This refreshes PR #10 with the organized commit stack from the current codex/openai-extra-request-params work.

Main changes

refactor the LLM request pipeline around shared llm_request helpers, recent text context, and streaming partial callbacks
improve LLM scheduling / candidate presentation in RimeWithWeasel, including rerank snapshots, delayed triggers after traditional candidates, and better mixed Rime + LLM display behavior
keep the local HF backend/runtime/training tooling and tune the default config to quantized Qwen/Qwen3.5-0.8B
add include/afxres.h as a lightweight Windows resource compatibility header

Commit organization

support openai extra request params
feat(llm): personalize ranking and expose candidate sources
refactor llm rerank flow and provider plumbing
add local hf backend runtime and oai bridge
add hf backend data prep and training tooling
refactor(llm): stream partial results and improve candidate scheduling
chore(hf): tune default qwen3.5 0.8b quantized config
build(win): add afxres compatibility header

Notes

Force-pushed the PR head branch on March 20, 2026 to replace the older codex/all-local-uncommitted-pr history with the organized stack above.
Left local transient files out of git: build_digest.txt, build_raw.txt.
I have not rerun the full build after this history cleanup.

Felix3322 · 2026-03-20T05:36:42Z

还有一些代码没有提交（不太稳定解决后会提交）

能给个推送权限吗

scukeqi · 2026-03-20T06:12:35Z

非常感谢你的提交和付出！这次变更涉及内容较多，我需要相当时间来充分理解并评估其中部分思路的可行性和影响，感谢你的耐心等待。

Felix3322 · 2026-03-20T06:14:58Z

这个新的无提示候选仅支持ollama scukeqi ***@***.***>于2026年3月20日周五上午2:12写道：

…

*scukeqi* left a comment (scukeqi/Wisdom-Weasel#10) <#10 (comment)> 非常感谢你的提交和付出！这次变更涉及内容较多，我需要相当时间来充分理解并评估其中部分思路的可行性和影响，感谢你的耐心等待。 — Reply to this email directly, view it on GitHub <#10?email_source=notifications&email_token=A3T3RVM7UD22DJWGZH6GMN34RTOOTA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTIMBZGU4DOMJRGU32M4TFMFZW63VGMF2XI2DPOKSWK5TFNZ2KYZTPN52GK4S7MNWGSY3L#issuecomment-4095871157>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A3T3RVIXGMZS3TY5MV6R4QL4RTOOTAVCNFSM6AAAAACWYCCK3SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHM2DAOJVHA3TCMJVG4> . You are receiving this because you authored the thread.Message ID: ***@***.***>

scukeqi · 2026-03-20T16:53:32Z

我认为数据集下载、LLM 训练和评估代码不应该放在这个仓库中。
这些功能已经有像 LLaMA-Factory 这样的成熟框架可以很好地支持，在这里重复实现不仅超出项目范围，也会带来不必要的复杂度。

Felix3322 · 2026-03-20T16:55:20Z

我认为数据集下载、LLM 训练和评估代码不应该放在这个仓库中。这些功能已经有像 LLaMA-Factory 这样的成熟框架可以很好地支持，在这里重复实现不仅超出项目范围，也会带来不必要的复杂度。

忘记设忽略文件了这个你删掉就好了我自己用的

Felix3322 · 2026-03-20T17:17:20Z

笑死我了这都是啥训练数据啊

Felix3322 · 2026-03-20T18:06:41Z

更好的上下文处理。

Felix3322 added 2 commits March 19, 2026 08:35

support openai extra request params

e552ebf

feat(llm): personalize ranking and expose candidate sources

8258878

Felix3322 marked this pull request as ready for review March 19, 2026 15:18

This was referenced Mar 19, 2026

[codex] support openai extra request params #7

Closed

[codex] personalize ranking and expose candidate sources #9

Closed

LLM流式解码 #11

Open

Felix3322 added 3 commits March 19, 2026 16:53

refactor llm rerank flow and provider plumbing

8567ec3

add local hf backend runtime and oai bridge

40696d3

add hf backend data prep and training tooling

6860eba

Felix3322 added 3 commits March 20, 2026 12:04

refactor(llm): stream partial results and improve candidate scheduling

9e29ec3

chore(hf): tune default qwen3.5 0.8b quantized config

f552a4e

build(win): add afxres compatibility header

cd22bcb

Felix3322 force-pushed the codex/all-local-uncommitted-pr branch from d146ba1 to cd22bcb Compare March 20, 2026 16:20

Felix3322 changed the title ~~[codex] improve llm rerank flow and hf backend config~~ [codex] refactor llm prediction flow and hf backend tooling Mar 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[codex] refactor llm prediction flow and hf backend tooling#10

[codex] refactor llm prediction flow and hf backend tooling#10
Felix3322 wants to merge 8 commits intoscukeqi:mainfrom
Felix3322:codex/all-local-uncommitted-pr

Felix3322 commented Mar 19, 2026 •

edited

Loading

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

scukeqi commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026 via email

Uh oh!

scukeqi commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Felix3322 commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Main changes

Commit organization

Notes

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

scukeqi commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026 via email

Uh oh!

scukeqi commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

Felix3322 commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Felix3322 commented Mar 19, 2026 •

edited

Loading