LLM Sparsity Prior for Robust Feature Selection
利用大模型蕴含的稀疏性先验,为高维数据特征选择提供鲁棒策略,理论贡献显著。
arXiv:2605.23102v1 Announce Type: cross Abstract: Large language models (LLMs) offer a scalable mechanism to elicit domain-informed prior information …
利用大模型蕴含的稀疏性先验,为高维数据特征选择提供鲁棒策略,理论贡献显著。
arXiv:2605.23102v1 Announce Type: cross Abstract: Large language models (LLMs) offer a scalable mechanism to elicit domain-informed prior information …
国家统计局发布2026年5月中旬流通领域重要生产资料市场价格变动情况,对全国流通领域9大类50种重要生产资料市场价格的监测显示,2026年5月中旬与5月上旬相比,20种产品价格上涨,23种下降,7种持平。其中,生猪(外三元)价格为9.5元/千克,环比下跌1%。
提出SMART框架,将预训练模型融入高维非参数变量选择,为微调提供理论基础。
arXiv:2604.12288v2 Announce Type: replace-cross Abstract: Fine-tuning is a widely used strategy for adapting pre-trained models to new tasks, yet its …
不同权威机构对2025年全球AI市场规模的预测从数千亿到万亿不等,为何差距如此巨大?一文拆解数字背后的定义边界与统计逻辑。
Article URL: https://philippdubach.com/posts/reconciling-enterprise-ai-revenue/ Comments URL: https://news.ycombinator.com/item?id=48208134 Points: 1 …
研究如何用LLM代理模拟退休态度,创新性结合人口统计与调查锚点,启发AI社会科学应用。
arXiv:2605.16303v1 Announce Type: cross Abstract: Large language models (LLM) agents may offer tools to predict human responses to surveys. A common t…
PCA中解释方差并非万能指标,本文通过实例警示其潜在陷阱,值得数据分析者关注。
arXiv:2605.13520v2 Announce Type: replace-cross Abstract: We address shortcomings of principal component analysis (PCA) for visualizing high-dimension…
稀疏主成分分析的随机新算法,基于SDP松弛的巧妙改进,理论与实验并重。
arXiv:2507.09148v2 Announce Type: replace-cross Abstract: Sparse Principal Component Analysis (SPCA) is a fundamental technique for dimensionality red…
自回归序列的矩阵解耦集中不等式,为稀疏长上下文奖励提供无维度保证,理论创新突破。
arXiv:2605.06017v2 Announce Type: replace Abstract: Sequence-level evaluations in autoregressive Large Language Models (LLMs) rely on highly dependent…
该论文提出Kernelized Advantage Estimation方法,从非参数统计视角优化LLM推理,为强化学习提供新思路。
arXiv:2604.28005v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have increasingly relied on reinforcement learning…
探讨非交换面板数据的在线共形预测新方法,突破传统假设,提升实时预测可靠性
arXiv:2605.17705v1 Announce Type: cross Abstract: Panel data, in which multiple units are repeatedly observed over time, arise throughout science and …
从统计物理视角分析遮蔽语言模型中Glauber动力学的混合时间,为理解MLM的采样行为提供理论依据。
arXiv:2605.16378v1 Announce Type: new Abstract: Masked language models (MLMs) define local conditional distributions over tokens but do not, in genera…
因果推断新方法:跨时间运输效应,助力时间序列分析更精准
arXiv:2603.07018v2 Announce Type: replace-cross Abstract: Treatment effects estimated from a randomized controlled trial are local not only to the stu…
探索因果关系与条件依赖的掩盖机制,为因果推断理论提供新见解
arXiv:2603.06984v2 Announce Type: replace-cross Abstract: Many regulatory and analytic problems require that a prohibited variable influence a decisio…
用假设检验框架实现分布级统计遗忘,为机器学习遗忘机制提供新理论视角。
arXiv:2605.16645v1 Announce Type: cross Abstract: Machine learning systems increasingly face requirements to forget not only individual data points, b…
开源短链接神器,漂亮后台+强大API+统计,支持自定义域名,安全可靠。
更新:感谢 @123 同学的提示,kuttit 域名丢失的原因是 .it 域名必须意大利人持有,并不是忘记续费。开发者持有很多年,被注册局收回了。 请勿使用 kuttit!!!它不属于 Kutt 了。 Kutt 是一款开源的短链接程序,5月14日发布了一条命为 Do not use ku
从遗憾视角统一在线多重检验的评估,揭示假阳性与假阴性非对称成本的优化新框架。
arXiv:2605.13916v1 Announce Type: cross Abstract: Online Multiple Testing (OMT), a fundamental pillar of sequential statistical inference, traditional…
新方法BMTI通过无箱多维积分实现非参数密度估计,数据高效且鲁棒。
arXiv:2407.08094v3 Announce Type: replace-cross Abstract: We introduce the Binless Multidimensional Thermodynamic Integration (BMTI) method for nonpar…
从xkcd经典谜题出发,揭示LLM训练语料中日期出现的神秘规律,数据控必看。
Article URL: https://venkatasg.net/blog/dates-2026-05-12.html Comments URL: https://news.ycombinator.com/item?id=48148071 Points: 1 # Comments: 0