Skip to content

[codex] refactor llm prediction flow and hf backend tooling#10

Open
Felix3322 wants to merge 8 commits intoscukeqi:mainfrom
Felix3322:codex/all-local-uncommitted-pr
Open

[codex] refactor llm prediction flow and hf backend tooling#10
Felix3322 wants to merge 8 commits intoscukeqi:mainfrom
Felix3322:codex/all-local-uncommitted-pr

Conversation

@Felix3322
Copy link
Copy Markdown

@Felix3322 Felix3322 commented Mar 19, 2026

Summary

This refreshes PR #10 with the organized commit stack from the current codex/openai-extra-request-params work.

Main changes

  • refactor the LLM request pipeline around shared llm_request helpers, recent text context, and streaming partial callbacks
  • improve LLM scheduling / candidate presentation in RimeWithWeasel, including rerank snapshots, delayed triggers after traditional candidates, and better mixed Rime + LLM display behavior
  • keep the local HF backend/runtime/training tooling and tune the default config to quantized Qwen/Qwen3.5-0.8B
  • add include/afxres.h as a lightweight Windows resource compatibility header

Commit organization

  1. support openai extra request params
  2. feat(llm): personalize ranking and expose candidate sources
  3. refactor llm rerank flow and provider plumbing
  4. add local hf backend runtime and oai bridge
  5. add hf backend data prep and training tooling
  6. refactor(llm): stream partial results and improve candidate scheduling
  7. chore(hf): tune default qwen3.5 0.8b quantized config
  8. build(win): add afxres compatibility header

Notes

  • Force-pushed the PR head branch on March 20, 2026 to replace the older codex/all-local-uncommitted-pr history with the organized stack above.
  • Left local transient files out of git: build_digest.txt, build_raw.txt.
  • I have not rerun the full build after this history cleanup.

@Felix3322
Copy link
Copy Markdown
Author

image还有一些代码没有提交(不太稳定 解决后会提交)

能给个推送权限吗

@scukeqi
Copy link
Copy Markdown
Owner

scukeqi commented Mar 20, 2026

非常感谢你的提交和付出!这次变更涉及内容较多,我需要相当时间来充分理解并评估其中部分思路的可行性和影响,感谢你的耐心等待。

@Felix3322
Copy link
Copy Markdown
Author

Felix3322 commented Mar 20, 2026 via email

@Felix3322 Felix3322 force-pushed the codex/all-local-uncommitted-pr branch from d146ba1 to cd22bcb Compare March 20, 2026 16:20
@Felix3322 Felix3322 changed the title [codex] improve llm rerank flow and hf backend config [codex] refactor llm prediction flow and hf backend tooling Mar 20, 2026
@scukeqi
Copy link
Copy Markdown
Owner

scukeqi commented Mar 20, 2026

我认为数据集下载、LLM 训练和评估代码不应该放在这个仓库中。
这些功能已经有像 LLaMA-Factory 这样的成熟框架可以很好地支持,在这里重复实现不仅超出项目范围,也会带来不必要的复杂度。

@Felix3322
Copy link
Copy Markdown
Author

我认为数据集下载、LLM 训练和评估代码不应该放在这个仓库中。 这些功能已经有像 LLaMA-Factory 这样的成熟框架可以很好地支持,在这里重复实现不仅超出项目范围,也会带来不必要的复杂度。

忘记设忽略文件了 这个你删掉就好了 我自己用的

@Felix3322
Copy link
Copy Markdown
Author

image 笑死我了这都是啥训练数据啊

@Felix3322
Copy link
Copy Markdown
Author

image 更好的上下文处理。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants