Niu Ge Picks · Three Months

1

🤖 AI · Large Models arXiv NLP 2026-05-19

Measuring Maximum Activations in Open Large Language Models

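New findings on the activation dynamic range of open-source LLMs challenge established assumptions about quantization, with implications for inference-efficiency optimization.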

arXiv:2605.15572v1 Announce Type: new Abstract: The dynamic range of activations is a first-order constraint for low-bit quantization, activation scal…

large language models, activations, low-bit quantization, inference optimization, open-source models

2

📝 Deep Tech arXiv NLP 2026-05-19

Toward LLMs Beyond English-Centric Development

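Exposes the English bias of large models and argues that continued pretraining offers no cost advantage, suggesting that language-specific investment may be unavoidable.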

arXiv:2605.15613v1 Announce Type: new Abstract: Through an analysis of sequences generated by open-weight large language models (LLMs), we demonstrate…

large language models, English bias, multilingual, continued pretraining, cultural understanding

3

🤖 AI · Large Models arXiv NLP 2026-05-19

Can Large Language Models Imitate Human Speech for Clinical Assessment? LLM-Driven Data Augmentation for Cognitive Score Prediction

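Uses LLMs to imitate human speech for clinical cognitive assessment, with data augmentation to address the shortage of samples; a novel approach.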

arXiv:2605.16077v1 Announce Type: new Abstract: Accurate assessment of cognitive decline from spontaneous speech remains challenging due to limited da…

LLM, data augmentation, cognitive assessment, speech simulation, clinical diagnosis

4

📝 Deep Tech arXiv NLP 2026-05-19

SGR: A Stepwise Reasoning Framework for LLMs with External Subgraph Generation

The new SGR framework guides LLM reasoning step by step with external subgraphs, improving complex logical reasoning.

arXiv:2605.16117v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated strong capabilities across diverse NLP applications, su…

LLM reasoning, stepwise reasoning, external subgraphs, knowledge graphs, logical reasoning

5

📝 Deep Tech arXiv NLP 2026-05-19

Optimized Three-Dimensional Photovoltaic Structures with LLM guided Tree Search

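Combines LLMs with tree search to automatically generate efficient 3D photovoltaic structures; an AI-for-science case study that aims to exceed the efficiency limit of planar designs.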

arXiv:2605.16191v1 Announce Type: new Abstract: We present a case study for how AI coding systems can be used to generate novel scientific hypotheses.…

LLM, tree search, photovoltaics, 3D structures, AI4Science

6

📝 Deep Tech arXiv NLP 2026-05-19

Improving Cross-Cultural Survey Simulation with Calibrated Value Personas

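Uses calibrated Value Personas to improve the accuracy of LLM cross-cultural survey simulation, moving beyond the traditional reliance on demographic attributes.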

arXiv:2605.16193v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to simulate human opinions and survey responses, bu…

large language models, cross-cultural simulation, value personas, survey simulation, AI calibration

7

🤖 AI · Large Models arXiv NLP 2026-05-19

Conversations in Space: Structuring Non-Linear LLM Interactions on a Canvas

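Turns linear LLM conversations into branching interactions on a spatial canvas, addressing the difficulty of exploring alternatives and managing long conversations.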

arXiv:2605.15848v1 Announce Type: cross Abstract: Conversational interfaces powered by large language models (LLMs) are widely used for ideation and a…

LLM, conversational interfaces, non-linear interaction, canvas, human-computer interaction

8

📝 Deep Tech arXiv NLP 2026-05-19

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

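Proposes dual-token constraints that stabilize knowledge while promoting reasoning, addressing the problem of optimizing all tokens uniformly in RLVR.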

arXiv:2507.15778v2 Announce Type: replace Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has become an effective post-training method…

RLVR, reinforcement learning, large language models, dual-token constraints, reasoning

9

📝 Deep Tech arXiv NLP 2026-05-19

DiscussLLM: Teaching Large Language Models When to Speak

From passive response to active participation: new research teaches LLMs when to speak up, going beyond conventional dialogue.

arXiv:2508.18167v2 Announce Type: replace Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in understanding and genera…

LLM, proactive dialogue, papers, collaboration, natural language processing

10

📝 Deep Tech arXiv NLP 2026-05-19

When Importance Sampling Misallocates Credit: Asymmetric Ratios for Outcome-Supervised RL

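New research identifies credit misallocation caused by importance sampling in outcome-supervised RL and proposes asymmetric ratios as a remedy, offering a new perspective on LLM post-training.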

arXiv:2510.06062v2 Announce Type: replace Abstract: Reinforcement learning (RL) has shown great promise in large language models (LLMs) post-training,…

importance sampling, outcome-supervised RL, GRPO, asymmetric ratios, LLM post-training

11

🤖 AI · Large Models arXiv NLP 2026-05-19

CryptoBench: A Dynamic Benchmark for Expert-Level Evaluation of LLM Agents in Cryptocurrency

The first expert-curated dynamic benchmark dedicated to evaluating LLM agents in the cryptocurrency domain.

arXiv:2512.00417v5 Announce Type: replace Abstract: This paper introduces CryptoBench, the first expert-curated, dynamic benchmark designed to rigorou…

CryptoBench, LLM agents, cryptocurrency, dynamic benchmark, evaluation

12

📝 Deep Tech arXiv NLP 2026-05-19

RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning

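Proposes influence-driven parameter reweighting for efficient LLM unlearning, targeting the difficulty of unlearning small or imbalanced data.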

arXiv:2512.04457v2 Announce Type: replace Abstract: Removing specific data influence from large language models (LLMs) remains challenging, as retrain…

large-model unlearning, parameter reweighting, influence-driven, LLM, efficient unlearning

13

📝 Deep Tech arXiv NLP 2026-05-19

The Company You Keep: How LLMs Respond to Dark Triad Traits

LLMs can unwittingly flatter users who show Dark Triad traits, revealing hidden ethical risks and safety gaps in AI alignment.

arXiv:2603.04299v4 Announce Type: replace Abstract: Large Language Models (LLMs) often exhibit highly agreeable and reinforcing conversational styles,…

large language models, Dark Triad, AI sycophancy, behavior analysis, AI safety

14

🤖 AI · Large Models arXiv NLP 2026-05-19

Wiki Dumps to Training Corpora: South Slavic Case

Builds high-quality South Slavic corpora from raw Wikimedia dumps, detailing the text extraction and cleaning pipeline for seven languages.

arXiv:2604.25384v2 Announce Type: replace Abstract: This paper presents a pipeline designed to transform raw Wikimedia dumps into quality textual corp…

Wikipedia dumps, corpus construction, South Slavic languages, text cleaning, multilingual NLP

15

📝 Deep Tech arXiv NLP 2026-05-19

HIVE: Hidden-Evidence Verification for Hallucination Detection in Diffusion Large Language Models

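A new hallucination detection method for diffusion LLMs: it verifies hidden evidence to capture hallucination signals along the generation trajectory, going beyond output-only detection.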

arXiv:2604.26139v2 Announce Type: replace Abstract: Diffusion large language models generate text through multi-step denoising, where hallucination si…

diffusion models, hallucination detection, large language models, verification methods, hidden evidence

16

📝 Deep Tech arXiv NLP 2026-05-19

LLM-based Detection of Manipulative Political Narratives

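New research: uses LLMs to identify manipulative political narratives on social media and to distinguish legitimate criticism from deliberate manipulation.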

arXiv:2605.14354v2 Announce Type: replace Abstract: We present a new computational framework for detecting and structuring manipulative political narr…

LLM, political narratives, manipulation detection, social media, computational linguistics

17

📝 Deep Tech arXiv Computer Vision 2026-05-19

Deep Pre-Alignment for VLMs

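VLMs map visual features through a lightweight projector, but poor alignment in the early layers wastes model depth; the paper proposes Deep Pre-Alignment to address this shortcoming.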

arXiv:2605.15300v1 Announce Type: new Abstract: Most Vision Language Models (VLMs) directly map outputs from ViT encoders to the LLM via a lightweight…

vision language models, VLM, deep pre-alignment, alignment challenges, ViT

18

🤖 AI · Large Models arXiv Computer Vision 2026-05-19

ELDOR: A Dataset and Benchmark for Illegal Gold Mining in the Amazon Rainforest

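A computer vision dataset and benchmark for monitoring illegal gold mining in the Amazon rainforest, in support of automated environmental protection.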

arXiv:2605.15397v1 Announce Type: new Abstract: Illegal gold mining in the Amazon rainforest causes deforestation, water contamination, and long-term …

illegal gold mining, Amazon rainforest, satellite imagery, dataset, benchmark

19

📝 Deep Tech arXiv Computer Vision 2026-05-19

Video Models Can Reason with Verifiable Rewards

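Video diffusion models move beyond realism alone: reinforcement learning enables verifiable reasoning under spatiotemporal and logical constraints, improving agent planning.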

arXiv:2605.15458v1 Announce Type: new Abstract: Video diffusion models have made rapid progress in perceptual realism and temporal coherence, but they…

video diffusion models, verifiable reasoning, reinforcement learning, spatiotemporal constraints, logical constraints

20

🤖 AI · Large Models arXiv Computer Vision 2026-05-19

EgoExo-WM: Unlocking Exo Video for Ego World Models

The new method EgoExo-WM uses large-scale third-person (exo) video to tackle the difficulty of training first-person (ego) world models.

arXiv:2605.15477v1 Announce Type: new Abstract: Egocentric world models present a promising direction for enabling agents to predict and plan, but the…

world models, first-person, third-person, video learning, action prediction
