AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning
创新激活指导的零阶优化方法,大幅提升大模型微调效率。
arXiv:2601.17261v4 Announce Type: replace Abstract: Zeroth-Order (ZO) optimization has emerged as a promising solution for fine-tuning LLMs under stri…
创新激活指导的零阶优化方法,大幅提升大模型微调效率。
arXiv:2601.17261v4 Announce Type: replace Abstract: Zeroth-Order (ZO) optimization has emerged as a promising solution for fine-tuning LLMs under stri…
PyTorch官方推出的后训练微调库torchtune,原生集成LoRA、QLoRA等高效技术,简化大模型适配流程。
arXiv:2605.21442v1 Announce Type: new Abstract: Modern LLMs typically require multistage training pipelines to achieve strong downstream performance, …
通过调整学习率,简单LoRA即可媲美复杂微调方法,揭示被忽视的关键因素。
arXiv:2602.04998v2 Announce Type: replace Abstract: Low-Rank Adaptation (LoRA) is the prevailing approach for efficient large language model (LLM) fin…
一种新的后训练微调技术DFT,专门修复大语言模型的写作水平,值得关注
Article URL: https://twitter.com/rosmine/status/2056406211369541947 Comments URL: https://news.ycombinator.com/item?id=48184164 Points: 1 # Comments: …