牛哥精选 · 本月

📋 全部 🤖 AI·大模型 ⚡ 效率工具 📝 深度技术 🚀 产品观察 💰 商业科技 🔓 开源项目 🎨 设计创意 📖 阅读推荐 🏷 资源合集 🌱 成长效率

📝 深度技术 arXiv 机器学习 2026-05-20

A More Word-like Image Tokenization for MLLMs

让图像分词更接近文本语义，提出新方法优化多模态大语言模型的融合效果。

arXiv:2605.17954v1 Announce Type: cross Abstract: Modern multimodal large language models (MLLMs) typically keep the language model fixed and train a …

多模态大语言模型图像分词 tokenizati 视觉语义对齐计算机视觉

📝 深度技术 arXiv AI 2026-05-19

Beyond Binary: Reframing GUI Critique as Continuous Semantic Alignment

将GUI批评从二元判断重构为连续语义对齐，提升智能体测试时扩展的排序能力

arXiv:2605.14311v1 Announce Type: cross Abstract: Test-Time Scaling (TTS), which samples multiple candidate actions and ranks them via a Critic Model,…

gui批评语义对齐 test-time ai agent 连续评估

📅 日期

2026-05-20 2026-05-19

🐂 牛哥精选

A More Word-like Image Tokenization for MLLMs

Beyond Binary: Reframing GUI Critique as Continuous Semantic Alignment

📅 日期