Niu Ge Picks · All

📝 Deep Tech · arXiv · Machine Learning · 2026-06-09

Nonparametric LLM Evaluation from Preference Data

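A nonparametric approach to evaluating LLM performance that removes the constraints of parametric assumptions and provides reliable uncertainty quantification.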

arXiv:2601.21816v2. Abstract: Evaluating the performance of large language models (LLMs) from human preference data is crucial f…

nonparametric statistics, LLM evaluation, preference data, uncertainty quantification, machine learning

📝 Deep Tech · arXiv · Machine Learning · 2026-05-20

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

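This paper proposes Kernelized Advantage Estimation, a method that approaches LLM reasoning from the perspective of nonparametric statistics and offers a new angle on reinforcement learning.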

arXiv:2604.28005v2. Abstract: Recent advances in large language models (LLMs) have increasingly relied on reinforcement learning…

kernelized, nonparametric statistics, LLM reasoning, reinforcement learning, advantage function estimation
