牛哥精选 · 本周

📋 全部 🤖 AI·大模型 ⚡ 效率工具 📝 深度技术 🚀 产品观察 💰 商业科技 🔓 开源项目 🎨 设计创意 📖 阅读推荐 🏷 资源合集 🌱 成长效率

🤖 AI·大模型 arXiv 机器学习 2026-05-25

AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning

创新激活指导的零阶优化方法，大幅提升大模型微调效率。

arXiv:2601.17261v4 Announce Type: replace Abstract: Zeroth-Order (ZO) optimization has emerged as a promising solution for fine-tuning LLMs under stri…

agzo 大模型微调零阶优化激活指导效率提升

🤖 AI·大模型 arXiv 机器学习 2026-05-21

torchtune: PyTorch native post-training library

PyTorch官方推出的后训练微调库torchtune，原生集成LoRA、QLoRA等高效技术，简化大模型适配流程。

arXiv:2605.21442v1 Announce Type: new Abstract: Modern LLMs typically require multistage training pipelines to achieve strong downstream performance, …

torchtune pytorch 后训练大模型微调库

📝 深度技术 arXiv 机器学习 2026-05-20

Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning

通过调整学习率，简单LoRA即可媲美复杂微调方法，揭示被忽视的关键因素。

arXiv:2602.04998v2 Announce Type: replace Abstract: Low-Rank Adaptation (LoRA) is the prevailing approach for efficient large language model (LLM) fin…

lora 学习率大模型微调参数高效微调深度学习优化

📝 深度技术 Hacker News LLM 2026-05-19

Distribution Fine Tuning (DFT): A post training step that fixes LLM writing

一种新的后训练微调技术DFT，专门修复大语言模型的写作水平，值得关注

Article URL: https://twitter.com/rosmine/status/2056406211369541947 Comments URL: https://news.ycombinator.com/item?id=48184164 Points: 1 # Comments: …

dft 大模型微调 llm写作后训练技术

📅 日期

2026-05-20 2026-05-19

🐂 牛哥精选

AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning

torchtune: PyTorch native post-training library

Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning

Distribution Fine Tuning (DFT): A post training step that fixes LLM writing

📅 日期