Tell HN: Gemini 3.5 Flash breaks in stupid ways
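Google's lightweight AI model: fast responses and strong text generation and reasoning. When using it for scoring, avoid complex rubrics to prevent central-tendency bias.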
I thought I was going crazy, trying to use Gemini 3.5 Flash to rate some answers, but it kept giving 7 instead of 10 for correct answers. Apparently o…
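Alibaba's Qwen3.7-Max is billed as the strongest Chinese-developed AI model, with performance in the global top five. It can be called through the Alibaba Cloud Bailian API and handles complex reasoning and text generation tasks efficiently.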
Qwen3.7-Max will soon launch on Alibaba Cloud Bailian as a publicly available API service
AAAI 2025 competition results announced: how a reverse Turing test identifies AI-generated text, and the answers 17 researchers submitted.
arXiv:2605.20761v1 Announce Type: new Abstract: The rapid proliferation of AI-generated text has introduced significant challenges in maintaining the …
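TextSculptor, a new framework for scene text editing, delivers advances in both training and benchmarking, another step forward for AI text processing.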
arXiv:2605.21090v1 Announce Type: new Abstract: Recent advances in Multimodal Large Language Models (MLLMs) and diffusion-based generative models have…
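At its I/O conference, Google released Pics, an AI design tool that directly challenges Canva and Claude Design and generates social images and marketing materials with no barrier to entry.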
The tech giant says it's designed the app to be accessible to everyone, from teachers to small business owners.
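Using reinforcement learning to improve confidence expression in long-form text generation, targeting the hallucination problem in large language models.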
arXiv:2505.23912v2 Announce Type: replace-cross Abstract: Hallucination remains a major challenge for the safe and trustworthy deployment of large lan…
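A new diffusion-based text generation method that achieves high-quality discrete text generation through smooth denoising over token embeddings.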
arXiv:2505.18853v2 Announce Type: replace Abstract: Diffusion models have achieved state-of-the-art performance in generating images, audio, and video…
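Proposes a trajectory self-distillation method that lets diffusion language models generate text quickly and in parallel in just a few steps, breaking the inference-speed bottleneck.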
arXiv:2602.12262v3 Announce Type: replace-cross Abstract: Diffusion large language models (DLLMs) have emerged as powerful generative models with the …
Vercel v0 Platform API in public beta: text-to-web-app generation, automatic error fixing, and integration with workflows and automation scripts.
The v0 Platform API is now available in public beta. The v0 Platform API is a text-to-app API — it provides programmatic access to v0’s app generation…
AI is both a mirror for inspiration and a cage: the author candidly shares a conflicted experience.
A few more notes about (ugh) AI. I promise I’ll stop at some point but this is basically therapy now and since you aren’t legally obligated to read …