牛哥精选 · 三个月

1

🤖 AI·大模型 MIT Technology Review 2026-07-16 NEW

Meet GPT-Red: an LLM super-hacker OpenAI built to make its models safer

OpenAI打造LLM超级黑客GPT-Red，与GPT-5.6对抗训练，让模型安全防御能力大幅提升。

OpenAI has built an LLM super-hacker called GPT-Red that it uses as a sparring partner to help its other models boost their defenses against cyberatta…

gpt-red openai llm安全对抗训练 gpt-5.6

2

🤖 AI·大模型 TechCrunch 2026-07-16 NEW

Hack suggests AI music generator Suno scraped YouTube for training data

AI音乐生成明星Suno被曝利用YouTube数据训练，还隐瞒了安全泄露，行业伦理再敲警钟。

The hacker used an employee's credentials to access source code, which revealed how Suno scraped decades of audio.

suno ai音乐生成 youtube数据训练数据数据泄露

3

📝 深度技术 arXiv 机器学习 2026-07-15

Are we Merging the Right Models? Impact of Expert Training Duration on Model Merging for LLMs

专家训练时长如何影响LLM模型合并效果？这篇ICML 2026 workshop论文揭示关键发现。

arXiv:2607.11997v1 Announce Type: new Abstract: Multi-task model merging combines separately trained expert models into a single model that handles al…

模型合并专家训练时长 llm 权重空间对称性 icml 2026

4

🤖 AI·大模型 IT 之家 2026-07-15

出版商与作者集体起诉谷歌，指控其盗用版权内容训练 Gemini AI

出版商和作家组团起诉谷歌，指控其未经授权用版权作品训练Gemini AI，类似案件曾引发天价赔偿，版权与AI的博弈再升级。

IT之家 7 月 15 日消息，一群出版商和作家已对谷歌提起集体诉讼，指控这家科技巨头未经授权使用他们的版权作品训练其人工智能平台 Gemini。原告方包括 Hachette Livre、Cengage Group、Elsevier、作家 Scott Turow 以及 S.C.R.I.B.E . …

出版商与作者集体起诉谷歌指控其盗用版权内容训练谷歌

5

💰 商业科技 TechCrunch 2026-07-15

Google faces another AI training lawsuit from major publishers

Google内部文件泄露：用版权书籍训练AI面临百亿美元罚款，大型出版商再掀诉讼风暴。

Hachette, Cengage, Elsevier, and other publishers allege that Google trained its AI on copyrighted works without the necessary permissions.

google ai训练版权诉讼内部文件罚款

6

🤖 AI·大模型少数派 2026-07-15

派早报：Meta 被诉借助 AI 违规裁员、Google 被诉使用版权内容训练 Gemini 模型等

Google 推出新版图片搜索功能，Spotify 开启语音交互功能内测等。查看全文

派早报被诉借助违规裁员被诉使用版权内容训练

7

📝 深度技术 arXiv 机器学习 2026-07-14

FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

通过合成指令数据扩展预训练规模，突破传统监督训练数据瓶颈的新方法

arXiv:2601.22146v2 Announce Type: replace-cross Abstract: Due to limited supervised training data, large language models (LLMs) are typically pre-trai…

合成数据指令微调预训练大语言模型自监督学习

8

🤖 AI·大模型 ByteByteGo 2026-07-14

How LLMs Learn to Be Helpful (RLHF vs DPO)

一文对比RLHF与DPO两种主流大模型训练方法的核心差异与适用场景

In this article, we will look at how that learning actually happens, starting with why instruction-following alone falls short, then walking through t…

rlhf dpo 大模型训练人类反馈强化学习

9

🤖 AI·大模型 arXiv 机器学习 2026-07-14

Serving the Long Tail: Training-Free LLM Candidate Generation for Vacation Rental Marketplaces

无需训练，LLM直接为度假租赁长尾房源生成推荐候选，破解协同过滤信号稀疏难题。

arXiv:2607.09877v1 Announce Type: new Abstract: Vacation rental marketplaces face a structural imbalance on the supply side: a small fraction of prope…

llm 推荐系统长尾效应候选生成度假租赁

10

📝 深度技术 arXiv AI 2026-07-14

ARMOR: Stabilizing On-Policy LLM RL with Off-Policy Anchor Samples

提出ARMOR方法，用离策锚点样本稳定在策强化学习训练大语言模型，解决训练震荡难题

arXiv:2607.10481v1 Announce Type: cross Abstract: Reinforcement learning (RL) has significantly enhanced the reasoning capabilities of large language …

armor 强化学习 llm 离策锚点在策学习

11

🚀 产品观察 TechCrunch 2026-07-14

Satya Nadella has issued a shocking warning to companies using AI

微软CEO警告企业AI使用风险：你的核心数据可能正被模型制造商学习利用。

In a surprising blog post on Monday, Microsoft CEO is warning enterprises of the dangers of using proprietary models like Anthropic's and OpenAI's.

satya nade ai数据安全企业风险模型训练隐私泄露

12

🤖 AI·大模型 arXiv AI 2026-07-14

Reinforcement Learning with Verifiable Physics: Post-training LLMs with Continuous Rewards

巧用物理规则为LLM提供连续奖励信号，让强化学习后训练更可解释、更高效

arXiv:2607.10474v1 Announce Type: cross Abstract: Partial differential equations (PDEs) are foundational to modeling in science and engineering, but c…

强化学习可验证物理连续奖励 llm后训练物理驱动ai

13

🤖 AI·大模型 arXiv AI 2026-07-14

Depth-Entropy Guided Sampling for Training-Free LLM Reasoning

无需训练的LLM推理新方法：深度熵引导采样，提升推理效率与质量。

arXiv:2607.09693v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become the dominant paradigm for improving the reasoning capabilitie…

深度熵引导采样无需训练 llm推理采样策略 arxiv论文

14

🤖 AI 平台 36氪 2026-07-14

高德发布通用世界模型工坊ABot-World Studio

高德发布ABot-World Studio，用通用世界模型工坊赋能智能体训练与模拟

36氪获悉，近日，阿里巴巴集团旗下高德正式发布通用世界模型工坊ABot-World Studio，并同步开放测试。该工坊将交互式视频生成与3DGS场景生成统一在同一产品中——用户只需输入一段文字或一张图片，即可生成一个可实时交互、任意分享的AI世界，输出结果可保存为视频与3DGS文件。

高德发布通用世界模型工坊高德世界模型 abot-world

15

🤖 AI·大模型 Dev.to 2026-07-14

Bootcamp Grad Explores Open-Source AI APIs: What I Learned

从训练营毕业生的视角，揭秘开源AI API的成本与体验，告诉你为何自建模型可能比API更贵。

Here's the thing: bootcamp Grad Explores Open-Source AI APIs: What I Learned I graduated from a coding bootcamp about six months ago, and honestly, I …

开源ai api 成本训练营 gpu

16

🤖 AI·大模型 arXiv NLP 2026-07-14

UMoE:Unlocking Every Expert in Domain-Specific Training

新论文提出UMoE方法，在领域特定训练中激活每个专家，突破传统MoE路径选择限制，提升模型适应性与效率。

arXiv:2607.11444v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models scale capacity without proportional compute cost and have become a key…

umoe mixture-of 领域特定训练专家激活模型优化

17

🤖 AI·大模型 IT 之家 2026-07-14

三星健康被指存“霸王条款”：拒绝 AI 训练授权恐失去备份能力

三星健康新增条款，用户拒绝AI训练授权将无法备份数据，健康数据与功能捆绑引发隐私争议。

IT之家 7 月 14 日消息，科技媒体 Neowin 昨日（7 月 13 日）发布博文，报道称三星健康（Samsung Health）新增选项，用户若不同意其健康数据用于训练 AI，可能无法备份数据且被威胁删除数据。在三星健康应用设置中，三星新增“同意将健康数据用于人工智能训练和建模”的选项，…

三星健康被指霸王条款拒绝训练授权恐失去备份能力

18

🔓 开源项目 Hacker News LLM 2026-07-13

I trained a 113M-parameter earthquake LLM from absolute scratch

从零训练113M参数地震大模型，开源全流程管线，自建数据集+分布式训练，值得参考复现。

Article URL: https://github.com/jiazhe868/nanogpt-seis Comments URL: https://news.ycombinator.com/item?id=48885236 Points: 9 # Comments: 2

地震llm 大语言模型开源训练管线数据爬取

19

🤖 AI·大模型 Hacker News LLM 2026-07-13

Show HN: Latent-free ternary LLM training

无需隐变量，三值量化LLM训练新方法，开源项目BitBop让你以极低成本体验高效训练。

Article URL: https://github.com/ValerioDolci/bitbop Comments URL: https://news.ycombinator.com/item?id=48892013 Points: 1 # Comments: 1

三值量化 llm训练开源高效训练模型量化

20

📝 深度技术 arXiv 机器学习 2026-07-13

Complexity-Guided Component-wise Initialization for Language Model Pretraining

巧妙利用预训练模型权重谱的结构化模式作为初始化信号，提升GPT-2风格语言模型预训练效率

arXiv:2607.09204v1 Announce Type: cross Abstract: Pretrained language models often exhibit structured weight spectra, suggesting that training may rep…

语言模型预训练初始化方法权重谱 gpt-2 结构模式

🐂 牛哥精选