牛哥精选 (Niuge's Picks) · Last three months


🤖 AI · Large Models Dev.to 2026-07-13

Token Economics: Why Your LLM Bill Is 3× What the Pricing Page Promised

Why LLM bills run 3× higher than advertised: output token costs and tokenizer differences are the hidden killers.

Every LLM provider publishes a pricing table. $2.50 per million input tokens. $10 per million output tokens. Clean. Transparent. Easy to spreadsheet.…

token economics · LLM cost · output tokens · tokenizati · pricing traps
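The list prices quoted in the excerpt can be turned into a quick back-of-the-envelope estimate. The sketch below assumes only those two figures ($2.50 per million input tokens, $10 per million output tokens). The function name `estimate_cost` and the sample token counts are hypothetical, and real bills depend on the provider's tokenizer and any additional charges.

```python
def estimate_cost(input_tokens, output_tokens,
                  input_price_per_m=2.50, output_price_per_m=10.00):
    """Return the estimated cost in dollars for one workload.

    Prices are dollars per million tokens, taken from the excerpt above.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


def output_share(input_tokens, output_tokens):
    """Fraction of the estimated cost that comes from output tokens."""
    total = estimate_cost(input_tokens, output_tokens)
    output_cost = estimate_cost(0, output_tokens)
    return output_cost / total if total else 0.0


# Hypothetical request: 1,000 input tokens and 500 output tokens.
print(round(estimate_cost(1_000, 500), 6))
print(round(output_share(1_000, 500), 3))
```

Even with half as many output tokens as input tokens, the output side accounts for most of the estimate at these prices, which is why output length matters so much for the bill.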

📝 Technical Deep Dives arXiv Machine Learning 2026-06-15

Generalizing GNNs with Tokenized Mixture of Experts

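A KDD 2026 paper proposes a Tokenized MoE framework that uses a mixture-of-experts mechanism to break through the GNN generalization bottleneck.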

arXiv:2602.09258v2 Announce Type: replace Abstract: Deployed graph neural networks (GNNs) are frozen at deployment yet must fit clean data, generalize…

gnn · mixture of · graph neural networks · generalization · tokenized

📝 Technical Deep Dives Dev.to 2026-06-13

Your LLM can't read. Here's the weird trick it uses instead

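LLMs cannot really read. Instead, a BPE tokenizer cuts text into chunks: frequent content stays whole, while rare content is split into fragments.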

Here's a fact that breaks people's mental model of large language models the first time they really sit with it: A language model never sees your word…

bpe · word segmentation · tokenizati · llm · algorithm principles
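The behavior described in the summary above (frequent content merged into whole tokens, rare content left in fragments) can be illustrated with a toy byte-pair-encoding trainer. This is a minimal character-level sketch, not the tokenizer of any particular model. The function names and the sample corpus are hypothetical, and production tokenizers work on bytes and use far larger merge tables.

```python
from collections import Counter


def most_frequent_pair(words):
    """Return the most frequent adjacent symbol pair, weighted by word count."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return max(pairs, key=pairs.get) if pairs else None


def merge_pair(words, pair):
    """Replace every occurrence of `pair` with a single merged symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


def train_bpe(corpus, num_merges):
    """Learn up to `num_merges` merges from a whitespace-split corpus."""
    words = dict(Counter(tuple(w) for w in corpus.split()))
    merges = []
    for _ in range(num_merges):
        pair = most_frequent_pair(words)
        if pair is None:
            break
        merges.append(pair)
        words = merge_pair(words, pair)
    return merges, words


# Hypothetical corpus: "the" is frequent, "zebra" is rare.
merges, words = train_bpe("the the the the zebra", num_merges=2)
print(merges)
print(words)
```

After two merges on this corpus, the frequent word has collapsed into a single symbol while the rare word is still stored one character at a time.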

🤖 AI · Large Models arXiv Machine Learning 2026-06-02

You Can Learn Tokenization End-to-End with Reinforcement Learning

Trains tokenization end-to-end with reinforcement learning, challenging the last hardcoded compression step in LLMs.

arXiv:2602.13940v2 Announce Type: replace Abstract: Tokenization is a hardcoded compression step which remains in the training pipeline of Large Langu…

tokenizati · reinforcement learning · end-to-end learning · large language models · compression

📝 Technical Deep Dives arXiv Machine Learning 2026-05-20

A More Word-like Image Tokenization for MLLMs

Proposes a new method that brings image tokenization closer to text semantics, improving fusion in multimodal large language models.

arXiv:2605.17954v1 Announce Type: cross Abstract: Modern multimodal large language models (MLLMs) typically keep the language model fixed and train a …

multimodal large language models · image tokenization · tokenizati · visual-semantic alignment · computer vision

📝 Technical Deep Dives arXiv AI 2026-05-20

Tokenizing Single-Channel EEG with Time-Frequency Motif Learning

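Proposes TFM-Tokenizer, which learns time-frequency motifs from single-channel EEG signals and encodes them as discrete tokens, offering a new direction for EEG foundation models.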

arXiv:2502.16060v5 Announce Type: replace-cross Abstract: Foundation models are reshaping EEG analysis, yet an important problem of EEG tokenization r…

eeg · tokenizati · time-frequ · motif lear · foundation
