牛哥精选 · 一年

1

🤖 AI·大模型 Hacker News LLM 2026-07-16 NEW

Agent runtime reduces LLM turns by 80% with a higher success rate in DeepSWE

AI Agent新方案Tura，将大模型调用次数锐减83%，成功率反升16.7个百分点，极限压缩token预算。

Article URL: https://github.com/Tura-AI/tura Comments URL: https://news.ycombinator.com/item?id=48922039 Points: 2 # Comments: 1

agent runt llm优化 deepswe tura 成功率提升

2

🚀 产品观察 Dev.to 2026-07-16 NEW

Your Docs Are Doing Your Marketing Now (Whether You Like It Or Not)

品牌改名可能让AI搜索“失忆”，旧品牌在AI回答中曝光度反超新品牌10倍，这是文档营销的新暗线。

TL;DR - Nobody has one AI visibility number. You have six (one per model), and they disagree — a cloud infra company we audited scored 33% on ChatGPT/…

ai可见性品牌重塑文档营销开发者工具语言模型

3

🤖 AI·大模型 arXiv 机器学习 2026-07-15

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

ICML 2026重磅论文：进化策略替代强化学习，开创大模型微调新范式。

arXiv:2509.24372v3 Announce Type: replace Abstract: Fine-tuning large language models (LLMs) for downstream tasks is an essential stage of modern AI d…

进化策略 llm微调强化学习 icml 2026 大规模模型

4

📝 深度技术 Dev.to 2026-07-15

Sync to Async: Migrating FastAPI Endpoints to arq/Redis

从同步到异步，用arq/Redis优化FastAPI端点避免请求循环阻塞，提升LLM提取性能。

The problem: a PDF ingest that blocks for minutes My ingest endpoint was synchronous. Upload a file, wait for OCR, wait for the LLM extraction pipelin…

fastapi arq redis 异步任务性能优化

5

🔓 开源项目 Hacker News Show 2026-07-15

Show HN: An open-source Claude skill that stops AI building the wrong app

开源Claude技能，专注帮你构建正确的应用，从产品经理经验提炼而成，三个入口避免重复工作。

Howdy friends. I've been a zero to one product manager for 12 plus years and along the way I kept borrowing pieces from different discovery frameworks…

claude 开源技能 ai产品管理构建正确产品验证

6

🤖 AI·大模型 arXiv NLP 2026-07-15

Optimization Is Not All You Need

颠覆传统认知：论文指出优化并非AI大模型成功的唯一关键，值得思考模型能力边界

arXiv:2607.11977v1 Announce Type: cross Abstract: In 2019, OpenAI released two million GPT-2 outputs-ungrammatical, half broken-to aid the detection o…

优化 openai 大模型技术讨论论文观点

7

📝 深度技术 arXiv AI 2026-07-14

Beyond Na\"ive Prompting: Strategies for Improved Context-aided Forecasting with LLMs

突破简单提示，提出改进上下文辅助预测的新策略，让LLM预测更精准

arXiv:2508.09904v3 Announce Type: replace-cross Abstract: Real-world forecasting requires models to integrate not only historical data but also releva…

llm 预测提示工程上下文学习策略优化

8

📝 深度技术 Ars Technica 2026-07-14

Hackers quickly prove that Neo Geo Doom ports are not "impossible"

黑科技巧突破硬件极限，经典游戏《毁灭战士》成功移植Neo Geo，证明技术没有"不可能

Clever coding and graphical compromises get a classic game on more classic hardware.

neo geo doom 游戏移植黑客技术复古硬件

9

🔓 开源项目 Hacker News LLM 2026-07-14

Oya – Keep tool outputs away from the LLM to cut tokens and stop injection

Oya项目革新工具调用方式，省10倍token、提速3.5倍，且天然防注入，两行代码即可迁移。

Article URL: https://github.com/OyaAIProd/oya Comments URL: https://news.ycombinator.com/item?id=48907336 Points: 1 # Comments: 0

oya 工具调用 token优化注入防护确定性

10

📝 深度技术 arXiv 机器学习 2026-07-14

FastTPS: An Optimized Method for LLM Token Phase for AI accelerators

针对LLM推理中token阶段低并行度瓶颈，FastTPS提出优化方法，提升AI加速器吞吐量。

arXiv:2607.11211v1 Announce Type: new Abstract: The popularity of large language models (LLMs) escalates an ongoing demand for effective inference. Ho…

llm推理 token阶段 ai加速器吞吐量优化低并行度

11

📝 深度技术 arXiv AI 2026-07-14

OS-Pruner: Pruning Chains-of-Thought of Reasoning Models via Optimal Stopping

针对大模型CoT推理中“过度思考”的冗余步骤，用最优停止策略动态剪枝，降本增效不丢精度。

arXiv:2607.11089v1 Announce Type: new Abstract: Large Language Models (LLMs) have achieved remarkable success in complex reasoning tasks through Chain…

大语言模型 chain-of-t 推理模型最优停止剪枝

12

📝 深度技术 arXiv AI 2026-07-14

YUKTI: From Natural-Language Situations to Robust, Verifiable Decisions An Uncertainty-Typed Proposition IR, Assumption-Robust Pareto Frontiers, and a Regret Certificate

语言模型做决策不可靠？YUKTI提出从自然语言到鲁棒可验证决策的新框架，挑战传统单目标优化的置信度陷阱。

arXiv:2607.09706v1 Announce Type: new Abstract: Language models turn a worded situation into a numeric plan, and the dominant pipelines (NL4Opt, OptiM…

自然语言处理决策鲁棒性语言模型优化可验证性

13

🔓 开源项目小众软件 2026-07-14

colibri – 在 25GB 内存电脑上运行 GLM-5.2 (744B MoE)

普通电脑也能跑744B参数大模型！colibri开源工具让25GB内存畅玩GLM-5.2。

colibri 是一个非常实用的开源项目，它能让普通电脑也能运行超大语言模型（GLM-5.2（744B），并且可以在无显卡的情况下，仅使用 CPU，但需要至少25G 内存。@Appinn 普通电脑跑 GLM-5.2（744B）模型 colibri 使用纯 C 语言，零依赖。可以按需从硬盘加载 Exp

内存电脑上运 colibri glm-5.2 moe架构开源项目

14

⚡ 效率工具 Simon Willison's Blog 2026-07-14

Using uvx in GitHub Actions in a cache-friendly way

用 uvx 在 GitHub Actions 里运行 Python 工具，通过 UV_EXCLUDE_NEWER 巧用缓存避免重复下载，提升 CI 速度。

TIL: Using uvx in GitHub Actions in a cache-friendly way I finally found a cache-friendly recipe for using uvx tool-name in GitHub Actions workflows t…

github act uvx 缓存 python ci/cd

15

📝 深度技术 Hacker News LLM 2026-07-14

Building Food Metadata with LLM Juries

DoorDash用LLM陪审团与上下文优化，多模态AI精准构建食品元数据，工程实践亮点。

Article URL: https://careersatdoordash.com/blog/building-food-metadata-with-llm-juries-context-optimization-multimodal-ai/ Comments URL: https://news.…

llm jury 食品元数据上下文优化多模态ai doordash

16

📝 深度技术 arXiv 计算机视觉 2026-07-14

TriP: A Triangle Puzzle Approach to Robust Translation Averaging

用三角拼图法破解平移平均的鲁棒性难题，为视觉SLAM和3D重建提供全新优化思路。

arXiv:2605.07143v2 Announce Type: replace Abstract: Translation averaging aims to recover camera locations from pairwise relative translation directio…

三角拼图平移平均鲁棒优化计算机视觉几何优化

17

💰 商业科技 IT 之家 2026-07-14

“eVTOL 第一股”亿航智能回应裁员传闻：基于 AI 提效背景优化低绩效岗位，核心骨干团队保持稳定

亿航智能回应裁员传闻：基于AI提效背景优化低绩效岗位，核心骨干保持稳定，eVTOL第一股面临组织调整。

IT之家 7 月 14 日消息，近日有消息称，“eVTOL 第一股”亿航智能正在裁员。据《每日经济新闻》报道，对此，亿航智能方面回应称，公司近期开展组织效能优化与人才结构焕新，属于正常管理动作。本次优化基于 AI（人工智能）提效背景，聚焦低绩效岗位，在优化部分岗位的同时，公司也积极引进专业领域人才…

第一股亿航智能回应裁员传闻基于提效背景优化

18

📄 文档手册 IT 之家 2026-07-14

微软优化 Win11 的 Linux 子系统：32MiB 内存余量保护 WSL 主机进程

Win11的WSL新增32MiB内存余量保护，防止主机进程被误杀，提升子系统稳定性。

IT之家 7 月 14 日消息，科技媒体 Neowin 今天（7 月 14 日）发布博文，报道称微软将重构适用于 Windows 11 的 WSL 资源管理架构，降低高负载下发生灾难性故障的可能性。 IT之家注：WSL 全称为 Windows Subsystem for Linux，是微软提供的 …

微软优化子系统内存余量保护主机进程 wsl

19

🤖 AI·大模型 arXiv NLP 2026-07-14

UMoE:Unlocking Every Expert in Domain-Specific Training

新论文提出UMoE方法，在领域特定训练中激活每个专家，突破传统MoE路径选择限制，提升模型适应性与效率。

arXiv:2607.11444v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models scale capacity without proportional compute cost and have become a key…

umoe mixture-of 领域特定训练专家激活模型优化

20

🎨 设计工具 IT 之家 2026-07-14

微软预告 PowerPoint 改进：新增 13 个 SmartArt，优化文件打开体验

PowerPoint新增13款SmartArt版式和主题，时间轴、列表、团队介绍一应俱全，演示设计更轻松高效

IT之家 7 月 14 日消息，微软昨日（7 月 13 日）发布博文，预告介绍了适用于 PowerPoint（微软演示文稿应用）的 4 项更新，包括扩展新增 13 个 SmartArt、新增 13 个主题等等，目前处于 Beta 测试状态，将于 7 月 ~8 月期间上线。 IT之家注： Smart…

微软预告改进新增优化文件打开体验

🐂 牛哥精选

Agent runtime reduces LLM turns by 80% with a higher success rate in DeepSWE

Your Docs Are Doing Your Marketing Now (Whether You Like It Or Not)

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Sync to Async: Migrating FastAPI Endpoints to arq/Redis

Show HN: An open-source Claude skill that stops AI building the wrong app

Optimization Is Not All You Need

Beyond Na\"ive Prompting: Strategies for Improved Context-aided Forecasting with LLMs

Hackers quickly prove that Neo Geo Doom ports are not "impossible"

Oya – Keep tool outputs away from the LLM to cut tokens and stop injection

FastTPS: An Optimized Method for LLM Token Phase for AI accelerators

OS-Pruner: Pruning Chains-of-Thought of Reasoning Models via Optimal Stopping

YUKTI: From Natural-Language Situations to Robust, Verifiable Decisions An Uncertainty-Typed Proposition IR, Assumption-Robust Pareto Frontiers, and a Regret Certificate

colibri – 在 25GB 内存电脑上运行 GLM-5.2 (744B MoE)

Using uvx in GitHub Actions in a cache-friendly way

Building Food Metadata with LLM Juries

TriP: A Triangle Puzzle Approach to Robust Translation Averaging

“eVTOL 第一股”亿航智能回应裁员传闻：基于 AI 提效背景优化低绩效岗位，核心骨干团队保持稳定

微软优化 Win11 的 Linux 子系统：32MiB 内存余量保护 WSL 主机进程

UMoE:Unlocking Every Expert in Domain-Specific Training

微软预告 PowerPoint 改进：新增 13 个 SmartArt，优化文件打开体验

📅 日期