Niu Ge's Picks (牛哥精选) · Past Three Months


🤖 AI · Large Models arXiv AI 2026-07-14

Depth-Entropy Guided Sampling for Training-Free LLM Reasoning

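A new training-free method for LLM reasoning: depth-entropy guided sampling, aimed at improving both reasoning efficiency and quality.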

arXiv:2607.09693v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become the dominant paradigm for improving the reasoning capabilitie…

Tags: depth-entropy guided sampling, training-free, LLM reasoning, sampling strategy, arXiv paper

📝 Deep Tech arXiv Machine Learning 2026-06-29

EntMTP: Accelerating LLM Inference with Entropy Guided Multi Token Prediction

An entropy-guided multi-token prediction method that accelerates LLM inference and improves generation quality.

arXiv:2606.27550v1 Announce Type: cross Abstract: Multi-token prediction has been shown to increase data density during training, improve downstream t…

Tags: LLM, inference acceleration, multi-token prediction, entropy guidance, generation quality

📝 Deep Tech arXiv AI 2026-05-19

Reinforcement Learning for Diffusion LLMs with Entropy-Guided Step Selection and Stepwise Advantages

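Diffusion language models meet reinforcement learning: entropy-guided step selection and stepwise advantages address the difficulty of post-training.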

arXiv:2603.12554v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) has been effective for post-training autoregressive (AR) languag…

Tags: diffusion language models, reinforcement learning, entropy-guided step selection, stepwise advantages
