Sustainability via LLM Right-sizing
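Evaluates 11 proprietary models to show when smaller models are the better choice, balancing sustainability and cost-effectiveness.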
arXiv:2504.13217v3 Announce Type: replace Abstract: Large language models (LLMs) have become increasingly embedded in organizational workflows. This h…
Uses proxy metrics to predict downstream LLM performance in advance, giving model selection a reliable basis for decisions.
arXiv:2605.18607v1 Announce Type: cross Abstract: Progress in language model development is often driven by comparative decisions: which architecture …
Practical strategies for cutting LLM API costs by 73%, from the routing layer to caching, without lowering quality.
I Cut My LLM API Bill by 73% — Here's the Exact Optimization Playbook Running LLMs in production burns cash. Fast. When your app goes from "prototype"…
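How should the teacher model be chosen for fine-grained image recognition? This study offers a new approach to knowledge distillation for resource-constrained devices.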
arXiv:2605.15689v1 Announce Type: new Abstract: Fine-grained image recognition classifies subcategories such as bird species or car models. While stat…
A simple kNN method beats complex learned routers, a surprising finding that prompts a rethink of predictive modeling for LLM routing.
arXiv:2505.12601v2 Announce Type: replace Abstract: As large language models (LLMs) grow in scale and specialization, routing--selecting the best mode…
Automatically detects your hardware and ranks local LLMs by benchmark results, helping you find the model best suited to your device.
Article URL: https://github.com/Andyyyy64/whichllm Comments URL: https://news.ycombinator.com/item?id=48146369 Points: 282 # Comments: 65