牛哥精选 (Niu Ge's Picks) · Past six months

1

🤖 AI · Large Models arXiv AI 2026-07-07

RSPO: Reward-Swap Policy Optimization for Multi-Turn LLM Agents

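Proposes Reward-Swap Policy Optimization (RSPO) to address sparse rewards in multi-turn LLM agents, improving reasoning and interaction efficiency.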

arXiv:2607.04713v1 Announce Type: cross Abstract: Reinforcement learning holds significant potential for training large language models (LLMs) to hand…

multi-turn dialogue, LLM agents, reinforcement learning, policy optimization, RSPO

2

🤖 AI · Large Models arXiv Machine Learning 2026-07-07

How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation

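Proposes a dynamic budget allocation method for efficiently evaluating LLM jailbreak risk in multi-turn conversations, addressing the compute bottleneck and the difficulty of detecting rare events.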

arXiv:2605.06605v2 Announce Type: replace Abstract: Evaluating and predicting the performance of large language models (LLMs) in multi-turn conversati…

LLM evaluation, multi-turn dialogue, jailbreak attacks, dynamic budget allocation, safety

3

🤖 AI · Large Models arXiv AI 2026-06-24

Accuracy and Satisfaction in Multi-Turn LLM Dialogues for NFR Assessment

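The first study to focus on non-functional requirement (NFR) assessment in multi-turn LLM dialogues, revealing new challenges for accuracy and user satisfaction.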

arXiv:2606.24834v1 Announce Type: new Abstract: LLM-based dialogue assistants have become mainstream tools for software developers, yet current evalua…

LLM dialogue evaluation, non-functional requirements, NFR, software engineering, accuracy

4

🤖 AI Tools Hacker News AI 2026-06-06

How to Build an AI Agent for Slack with Chat SDK and AI SDK

Build a Slack AI agent with built-in streaming responses and tool calling using the Vercel AI SDK and Chat SDK.

Article URL: https://vercel.com/kb/guide/how-to-build-an-ai-agent-for-slack-with-chat-sdk-and-ai-sdk Comments URL: https://news.ycombinator.com/item?i…

AI agent, Slack, streaming responses, tool calling, multi-turn dialogue

5

🤖 AI · Large Models arXiv NLP 2026-06-03

PsychoPass: Geometric Profiling of Multi-Turn Adversarial LLM Conversations

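Uses geometric methods to profile adversarial attack patterns against LLMs in multi-turn conversations, offering a new perspective on AI safety.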

arXiv:2606.03136v1 Announce Type: cross Abstract: Multi-turn jailbreak attacks on large language models (LLMs) reveal a mismatch in current guardrails…

LLM safety, adversarial attacks, multi-turn dialogue, geometric analysis, AI safety

6

🤖 AI · Large Models arXiv Machine Learning 2026-05-28

Evolving and Detecting Multi-Turn Deception using Geometric Signatures

Uses geometric signatures to detect deception in multi-turn conversations, offering a new approach to AI safety.

arXiv:2605.27671v1 Announce Type: cross Abstract: Safety defenses for large language models (LLMs) are typically trained and evaluated on single-turn …

multi-turn dialogue, deception detection, geometric signatures, AI safety, language models

7

🤖 AI · Large Models arXiv Machine Learning 2026-05-28

SaFeR-Steer: Evolving Multi-Turn MLLMs via Synthetic Bootstrapping and Feedback Dynamics

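Proposes SaFeR-Steer, which evolves multi-turn multimodal LLMs (MLLMs) through synthetic bootstrapping and feedback dynamics, targeting multi-turn settings where attackers can escalate unsafe intent.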

arXiv:2604.16358v2 Announce Type: replace Abstract: MLLMs are increasingly deployed in multi-turn settings, where attackers can escalate unsafe intent…

multimodal LLMs, multi-turn dialogue, synthetic data, feedback learning, model evolution

8

🤖 AI · Large Models arXiv AI 2026-05-27

Stop Listening to Me! How Multi-turn Conversations Can Degrade LLM Reliability

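Do multi-turn conversations degrade LLM reliability? The study reveals a wide gap between static benchmarks and real-world use in high-stakes settings such as healthcare.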

arXiv:2603.11394v3 Announce Type: replace-cross Abstract: Large language models (LLMs) excel on static benchmarks, but their performance across multi-…

LLM reliability, multi-turn dialogue, healthcare settings, high-stakes applications, performance degradation

9

🤖 AI · Large Models arXiv NLP 2026-05-27

AIDG: A Formal Decomposition of Information Extraction and Containment Asymmetries in Multi-Turn LLM Dialogue

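A mathematical model of information asymmetry in multi-turn dialogue, offering a new theoretical framework for safe LLM interaction.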

arXiv:2602.17443v2 Announce Type: replace Abstract: Multi-turn LLM evaluation is typically reported as a single win-rate scalar, conflating distinct c…

multi-turn dialogue, information asymmetry, formal decomposition, information extraction, LLM safety

10

🤖 AI · Large Models arXiv NLP 2026-05-27

Memory Architectures for Multi-Turn Text-to-SQL: A Benchmark and Empirical Study

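How do memory architectures affect Text-to-SQL performance in multi-turn conversations? A benchmark and empirical study reveals key design differences.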

arXiv:2605.26394v1 Announce Type: new Abstract: Multi-turn Text-to-SQL is central to enterprise analytics yet remains predominantly evaluated in singl…

text-to-sq, multi-turn dialogue, memory architectures, benchmarking, empirical study

11

📝 Deep Tech arXiv AI 2026-05-26

Bilevel Optimization of Synthetic Trajectories for Multi-Turn LLM Fine-Tuning

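A new method uses bilevel optimization to generate synthetic trajectories, significantly improving fine-tuning of LLMs for multi-turn dialogue.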

arXiv:2605.24743v1 Announce Type: cross Abstract: While LLMs excel at single-turn generation, they struggle with long-horizon, multi-turn interactions…

bilevel op, synthetic trajectories, multi-turn LLM, fine-tuning, bilevel optimization

12

🤖 AI · Large Models arXiv NLP 2026-05-26

Found in Conversation: LLMs Teach Themselves to Close the Multi-Turn Gap

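The study finds that LLMs perform far worse in multi-turn conversations than in single-turn ones, but that models can close the gap through self-training. Worth a look for AI researchers.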

arXiv:2605.24432v1 Announce Type: new Abstract: Large Language Model (LLM) interactions are typically underspecified, with users clarifying all necess…

LLM, multi-turn dialogue, dialogue systems, self-learning, performance gap

13

📝 Deep Tech arXiv Machine Learning 2026-05-26

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

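LLM backdoors can be triggered by the structure of conversation turns alone, with no prompt trigger required, exposing a new class of security threat in multi-turn interaction.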

arXiv:2601.14340v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are widely integrated into interactive systems such as dialogue…

LLM safety, backdoor attacks, multi-turn dialogue, structural triggers, prompt-free

14

📝 Deep Tech arXiv AI 2026-05-21

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

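Must multi-turn agents be distilled with a one-size-fits-all approach? This paper proposes a selective strategy for deciding when to distill and what to distill.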

arXiv:2605.19447v1 Announce Type: new Abstract: Reinforcement learning can train LLM agents from sparse task rewards, but long-horizon credit assignme…

knowledge distillation, multi-turn dialogue, agent training, selective distillation, hindsight learning

15

📝 Deep Tech arXiv NLP 2026-05-20

Multilingual jailbreaking of LLMs using low-resource languages

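Multi-turn conversations in low-resource African languages bypass the safety guardrails of mainstream LLMs such as ChatGPT and Gemini, a newly reported safety vulnerability.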

arXiv:2605.18239v1 Announce Type: new Abstract: Large Language Models (LLMs) remain vulnerable to jailbreak attempts that circumvent safety guardrails…

multilingual jailbreaking, large language models, low-resource languages, African languages, safety mechanisms

16

📝 Deep Tech arXiv NLP 2026-05-20

SKG-Eval: Stateful Evaluation of Multi-Turn Dialogue via Incremental Semantic Knowledge Graphs

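A stateful evaluation method for multi-turn dialogue based on incremental semantic knowledge graphs, aimed at more coherent and in-depth assessment of dialogue systems.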

arXiv:2605.16650v1 Announce Type: new Abstract: Evaluating multi-turn dialogue systems remains challenging because response quality depends not only o…

SKG-Eval, multi-turn dialogue evaluation, semantic knowledge graphs, dialogue systems, stateful evaluation

17

📝 Deep Tech arXiv AI 2026-05-20

Moltbook Moderation: Uncovering Hidden Intent Through Multi-Turn Dialogue

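A new method uses multi-turn dialogue to uncover hidden malicious intent, addressing moderation blind spots in multi-agent systems.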

arXiv:2605.12856v2 Announce Type: replace Abstract: The emergence of multi-agent systems introduces novel moderation challenges that extend beyond con…

multi-turn dialogue, content moderation, hidden intent, multi-agent systems, safety

18

📝 Deep Tech arXiv Machine Learning 2026-05-20

Mitigating Conversational Inertia in Multi-Turn Agents

Do multi-turn agents tend to fall into repetitive patterns? This paper proposes a new method for mitigating "conversational inertia".

arXiv:2602.03664v3 Announce Type: replace-cross Abstract: Large language models excel as few-shot learners when provided with appropriate demonstratio…

multi-turn dialogue, conversational inertia, agents, large models, interaction optimization
