feat(daily): weekly digest 2026-W19#35
Open
yayajjiang wants to merge 1 commit into
Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Weekly Digest — 2026-W19 (May 4–10)
6 new papers added to
src/lib/daily.ts(3 editor's picks ⭐):2605.06241) — RL不教推理新技巧,仅在1-3%高熵决策点稀疏调整概率,从根本上重构了对RL后训练的认知。2605.06638) — ScaleLogic分离推理深度与逻辑表达力,指数由表达力决定(1.04→2.60),揭示长程推理泛化关键。2605.05750) — RVPO惩罚优势聚合中的跨奖励方差,修复多目标RLHF的均值聚合缺陷,17个并发奖励信号验证。2605.06375) — Soft/Hard Pair-GRPO统一隐式与显式偏好约束,解决GRPO类方法训练不稳定问题。2605.06206) — FoE将MoE模块重组为专家集群,切断跨节点token嵌入通信瓶颈。2605.00342) — EVICT无需训练实现专家感知草稿树剪枝,比AR快2.35倍、比EAGLE-3快1.21倍。https://claude.ai/code/session_015ootf94u65YKT9eXtiMwVz
Generated by Claude Code