牛哥精选 · 所有

1

📝 深度技术 arXiv 机器学习 2026-05-19

SEED: Targeted Data Selection by Weighted Independent Set

由相似图构建加权独立集，平衡样本质量与多样性，为高效数据选择提供新框架。

arXiv:2605.15691v1 Announce Type: new Abstract: Data selection seeks to identify a compact yet informative subset from large-scale training corpora, b…

数据选择加权独立集机器学习训练数据相似图

2

📝 深度技术 arXiv 机器学习 2026-05-19

ITGPT: Generative Pretraining on Irregular Timeseries

不规则时间序列生成预训练新方法ITGPT，突破传统模型在多模态缺失数据上的局限。

arXiv:2605.16069v1 Announce Type: new Abstract: Timeseries regression models often struggle to leverage large volumes of labeled multimodal data, part…

不规则时间序列生成式预训练 transforme 自监督学习多模态数据

3

📝 深度技术 arXiv 机器学习 2026-05-19

Entropic Auto-Encoding via Implicit Free-Energy Minimization

提出熵自编码器框架，通过隐式自由能最小化解决VAE后验崩溃，理论创新显著。

arXiv:2605.16164v1 Announce Type: new Abstract: Despite their ubiquity, variational autoencoders (VAEs) inherently suffer from posterior collapse, a f…

变分自编码器后验崩溃熵自编码器隐自由能隐变量

4

📝 深度技术 arXiv 机器学习 2026-05-19

Hypothesis-driven construction of mesoscopic dynamics

基于广义Onsager原理，提出假设驱动的介观动力学建模新范式，摆脱传统方程依赖，开启复杂多尺度系统建模新思路。

arXiv:2605.16211v1 Announce Type: new Abstract: Traditional scientific modeling typically begins with fixed, instance-wise effective equations and the…

介观动力学 onsager原理多尺度系统机器学习科学建模

5

📝 深度技术 arXiv 机器学习 2026-05-19

Minerva-Ego: Spatiotemporal Hints for Egocentric Video Understanding

AI视频推理新突破：用时空提示增强第一人称视频理解，还附带中间步骤评估，填补了基准空白。

arXiv:2605.15342v1 Announce Type: cross Abstract: Video reasoning models are a core component of egocentric and embodied agents. However, standard ben…

视频理解第一人称视频时空提示视频推理计算机视觉

6

📝 深度技术 arXiv 机器学习 2026-05-19

Calibrating LLMs with Semantic-level Reward

用语义级奖励替代二元反馈，让LLM学会表达真实不确定性，提升高安全场景下的可靠性。

arXiv:2605.15588v1 Announce Type: cross Abstract: As large language models (LLMs) are deployed in consequential settings such as medical question answ…

大语言模型校准语义级奖励不确定性估计强化学习

7

📝 深度技术 arXiv 机器学习 2026-05-19

Syntax Without Semantics: Teaching Large Language Models to Code in an Unseen Language

研究揭示大模型能否掌握未见编程语言：仅凭语法而非语义，LLM在全新语言上表现如何？

arXiv:2605.15607v1 Announce Type: cross Abstract: Large language models (LLMs) achieve high pass rates on code generation benchmarks, yet whether they…

大语言模型代码生成语法语义迁移学习

8

📝 深度技术 arXiv 机器学习 2026-05-19

Pessimistic Risk-Aware Policy Learning in Contextual Bandits

离线数据下的风险感知策略学习新框架，用悲观原则优化高风险场景的决策效果

arXiv:2605.15620v1 Announce Type: cross Abstract: We study risk-aware offline policy learning, aiming to learn a decision rule from logged data that i…

离线策略学习风险感知上下文赌博机悲观原则机器学习

9

📝 深度技术 arXiv 机器学习 2026-05-19

Semi-MedRef: Semi-Supervised Medical Referring Image Segmentation with Cross-Modal Alignment

医学指代图像分割遇上半监督学习，跨模态对齐降低标注成本，精准融合文本与视觉。

arXiv:2605.15720v1 Announce Type: cross Abstract: Medical referring image segmentation (MRIS) requires pixel-level masks aligned with textual descript…

医学图像分割半监督学习跨模态对齐指代分割

10

📝 深度技术 arXiv 机器学习 2026-05-19

StippleDiffusion: Capacity-Constrained Stippling using Controlled Diffusion

用可控扩散模型生成容量约束点画，告别传统慢速迭代优化，实现高效图像条件点阵生成

arXiv:2605.15816v1 Announce Type: cross Abstract: Stipple patterns, point sets whose local density tracks a target image, are traditionally produced b…

stippledif 点画扩散模型图像条件生成容量约束

11

📝 深度技术 arXiv 机器学习 2026-05-19

Complexity of Non-Log-Concave Sampling in Fisher Information

首次给出非对数凹分布采样相对Fisher信息保证的查询复杂度，基于近端采样器与限制高斯oracle，是采样理论的重要进展

arXiv:2605.15859v1 Announce Type: cross Abstract: We study the query complexity of obtaining a relative Fisher information guarantee for sampling from…

非对数凹采样 fisher信息查询复杂度近端采样器限制高斯oracle

12

📝 深度技术 arXiv 机器学习 2026-05-19

Unsupervised Domain Shift Detection with Interpretable Subspace Attribution

机器学习新工具：无监督检测域迁移，还能定位异常特征子空间，实现可解释归因。

arXiv:2605.15920v1 Announce Type: cross Abstract: We developed a tool for detecting domain shifts, namely subtle differences in the probability distri…

无监督学习域迁移检测密度异常特征子空间可解释性

13

📝 深度技术 arXiv 机器学习 2026-05-19

Explainable AI Isn't Enough! Rethinking Algorithmic Contestability

探讨机器学习决策系统中可解释AI的不足，提出算法可争议性新视角，挑战传统XAI框架。

arXiv:2605.16041v1 Announce Type: cross Abstract: Machine learning systems increasingly make life-changing decisions about individuals, such as loan a…

explainabl algorithmi 机器学习 ai伦理可争议性

14

📝 深度技术 arXiv 机器学习 2026-05-19

Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip

无时钟异步电路驱动的可扩展神经形态计算架构，在FPGA上实现布尔脉冲神经元网络，突破传统时钟同步限制。

arXiv:2605.16114v1 Announce Type: cross Abstract: We propose a scalable neuromorphic architecture based on spiking dynamics emerging from the autonomo…

神经形态计算 fpga 异步电路脉冲神经网络可扩展架构

15

📝 深度技术 arXiv 机器学习 2026-05-19

Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

二阶优化方法加速LLM训练的瓶颈被Asteria运行时系统破解，大幅提升训练效率。

arXiv:2605.16184v1 Announce Type: cross Abstract: Second-order methods offer an attractive path toward more sample-efficient LLM training, but their p…

llm训练二阶优化运行时系统分布式训练系统优化

16

📝 深度技术 arXiv 机器学习 2026-05-19

Deep Learning Alternatives of the Kolmogorov Superposition Theorem

探索KST替代公式作为神经网络设计新范式，挑战经典数学定理的深度学习实践

arXiv:2410.01990v3 Announce Type: replace Abstract: This paper explores alternative formulations of the Kolmogorov Superposition Theorem (KST) as a fo…

深度学习 kolmogorov 神经网络设计数学基础

17

📝 深度技术 arXiv 机器学习 2026-05-19

Searching on a Budget: HW-NAS with 10 Latency Probes

低资源硬件感知NAS：仅需10次延迟探测就能高效搜索网络架构，降低对精确延迟模型的依赖

arXiv:2504.00663v2 Announce Type: replace Abstract: Existing hardware-aware NAS (HW-NAS) methods typically assume access to precise information circa …

hw-nas 神经架构搜索延迟预测预算有限机器学习

18

📝 深度技术 arXiv 机器学习 2026-05-19

Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers

简单kNN方法竟击败复杂学习路由器，重新思考LLM路由预测建模的惊人发现！

arXiv:2505.12601v2 Announce Type: replace Abstract: As large language models (LLMs) grow in scale and specialization, routing--selecting the best mode…

llm路由 knn 预测建模模型选择效率

19

📝 深度技术 arXiv 机器学习 2026-05-19

Beyond Sunk Costs: Boosting LLM Pre-training Efficiency via Orthogonal Growth of Mixture-of-Experts

全新方法利用MoE正交生长，大幅节省LLM预训练成本，突破沉没成本陷阱。

arXiv:2510.08008v2 Announce Type: replace Abstract: As the computational demands for pre-training Large Language Models (LLMs) continue to surge, the …

llm预训练 moe 正交生长效率提升计算成本

20

📝 深度技术 arXiv 机器学习 2026-05-19

Adaptive Conformal Prediction for Quantum Machine Learning

量子机器学习的不确定性量化难题，本文提出自适应共形预测框架解决可靠性问题。

arXiv:2511.18225v2 Announce Type: replace Abstract: Quantum machine learning seeks to leverage quantum computers to improve upon classical machine lea…

量子机器学习共形预测不确定性量化自适应方法量子计算

🐂 牛哥精选

SEED: Targeted Data Selection by Weighted Independent Set

ITGPT: Generative Pretraining on Irregular Timeseries

Entropic Auto-Encoding via Implicit Free-Energy Minimization

Hypothesis-driven construction of mesoscopic dynamics

Minerva-Ego: Spatiotemporal Hints for Egocentric Video Understanding

Calibrating LLMs with Semantic-level Reward

Syntax Without Semantics: Teaching Large Language Models to Code in an Unseen Language

Pessimistic Risk-Aware Policy Learning in Contextual Bandits

Semi-MedRef: Semi-Supervised Medical Referring Image Segmentation with Cross-Modal Alignment

StippleDiffusion: Capacity-Constrained Stippling using Controlled Diffusion

Complexity of Non-Log-Concave Sampling in Fisher Information

Unsupervised Domain Shift Detection with Interpretable Subspace Attribution

Explainable AI Isn't Enough! Rethinking Algorithmic Contestability

Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip

Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

Deep Learning Alternatives of the Kolmogorov Superposition Theorem

Searching on a Budget: HW-NAS with 10 Latency Probes

Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers

Beyond Sunk Costs: Boosting LLM Pre-training Efficiency via Orthogonal Growth of Mixture-of-Experts

Adaptive Conformal Prediction for Quantum Machine Learning

📅 日期