AI Daily Highlights - 2026-05-21

💻 GitHub Trending AI Projects

1 colbymchenry/codegraph

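Provides a pre-indexed code knowledge graph for AI coding tools such as Claude Code, Codex, and Cursor, reducing token consumption and tool calls; runs entirely locally

Directly addresses wasted tokens and excessive tool calls in AI coding assistants, and raises no privacy concerns because it runs locally

4 days running +2,123 today TypeScript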

2 Imbad0202/academic-research-skills

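A complete academic research workflow skill set for Claude Code: research → writing → review → revision → finalization

Packages the full academic writing process as Claude Code skills that researchers can reuse in one step

3 days running +1,667 today Python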

3 tinyhumansai/openhuman

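A personal AI super-agent that emphasizes privacy, simplicity, and capability

Focuses on local privacy and a minimalist experience, positioned as an alternative to relying on cloud AI

10 days running +3,394 today Rust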

4 multica-ai/andrej-karpathy-skills

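A single CLAUDE.md file that collects Karpathy's observations on LLM coding pitfalls to improve Claude Code's behavior

Backed by Karpathy's hands-on experience; aims to improve the quality of code Claude Code produces with just one file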

+2,679 today

5 rohitg00/ai-engineering-from-scratch

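Learn AI engineering from scratch: a tutorial covering the full path of learning, building, and shipping

A practice-oriented AI engineering learning path that covers the full loop from principles to deployment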

NEW +765 today Python

6 HKUDS/CLI-Anything

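A CLI framework that aims to make all software agent-native, with an accompanying CLI-Hub ecosystem

From the University of Hong Kong (HKU); attempts to connect arbitrary command-line tools to AI agents seamlessly, with ambitious ecosystem goals

4 days running +890 today Python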

7 can1357/oh-my-pi

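A terminal AI coding agent with support for hash-anchored edits, LSP, Python, a browser, and sub-agents

A fairly complete toolchain; hash-anchored editing is a novel design meant to keep the model from applying edits at the wrong location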

NEW +270 today TypeScript

8 anthropics/claude-plugins-official

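An official, Anthropic-maintained directory of high-quality Claude Code plugins

An officially endorsed plugin directory and a first stop for finding trustworthy Claude Code extensions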

+674 today Python

9 msitarzewski/agency-agents

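A complete AI agency toolkit with agents in multiple specialist personas, including frontend, community operations, and creative injection

Breaks agency workflows into a set of specialist agents with distinct personalities; suited to content and operations teams

4 days running +1,636 today Shell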

10 truelockmc/streambert

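A cross-platform Electron desktop app for streaming or downloading movies, TV series, and anime from around the world, with no ads or tracking

An all-platform streaming aggregation client with zero ads and zero tracking, positioned as a direct counterpart to commercial streaming services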

NEW +582 today JavaScript

11 rohitg00/agentmemory

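A persistent memory solution for AI coding agents, described as ranking first on real-world benchmarks

Sells itself on measured benchmark results and targets a core gap in AI coding tools: the lack of cross-session memory

7 days running +1,080 today TypeScript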

12 ggml-org/llama.cpp

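A high-performance local LLM inference framework implemented in pure C/C++, supporting multiple quantization formats and hardware backends

The de facto standard for local LLM inference; actively developed, with a very broad ecosystem, and an underlying dependency of many tools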

+309 today C++

🤗 HuggingFace Trending

Models

1 bytedance-research/Lance

A large language model released by ByteDance Research, optimized for reasoning and instruction-following tasks.

any-to-any 438 downloads 470 likes

2 SulphurAI/Sulphur-2-base

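An open-source video generation model based on LTX 2.3, supporting text-to-video and image-to-video, with a built-in prompt enhancer and no content moderation restrictions.

17 days running text-to-video 1,157,497 downloads 1,205 likes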

3 openbmb/MiniCPM-V-4.6

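A lightweight multimodal model from ModelBest (OpenBMB) for image-text understanding and question answering; small in parameter count, with performance described as comparable to larger models

10 days running image-text-to-text 166,049 downloads 827 likes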

4 Supertone/supertonic-3

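A lightweight multilingual TTS model from Supertone supporting 31 languages; at only 99M parameters it can run locally on a CPU, and it supports expression tags

9 days running text-to-speech 31,940 downloads 503 likes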

5 unsloth/Qwen3.6-27B-MTP-GGUF

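A GGUF-quantized build of the 27B-parameter Qwen3.6 model, optimized by Unsloth, with multi-token prediction (MTP) support; suited to local inference deployment.

7 days running image-text-to-text 411,598 downloads 356 likes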

6 circlestone-labs/Anima

6 days running 571,087 downloads 1,451 likes

7 unsloth/Qwen3.6-35B-A3B-MTP-GGUF

7 days running image-text-to-text 363,131 downloads 295 likes

8 sapientinc/HRM-Text-1B

NEW text-generation 23,532 downloads 183 likes

9 ResembleAI/Dramabox

4 days running text-to-speech 1,229 downloads 202 likes

10 froggeric/Qwen-Fixed-Chat-Templates

3 days running 0 downloads 336 likes

Datasets

1 PsiBotAI/SynData

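A large-scale synthetic egocentric (first-person) video dataset with 449,000 multimodal samples covering 107 task types, for training robot manipulation and action recognition

6 days running 41,523 downloads 153 likes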

2 AlienKevin/SWE-ZERO-12M-trajectories

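A software-engineering agent trajectory dataset with 12 million zero-shot code repair and task execution trajectories, for training SWE agents.

7 days running 8,670 downloads 93 likes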

3 TuringEnterprises/Open-MM-RL

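An open-source multimodal reinforcement learning dataset released by TuringEnterprises, for improving the reasoning and alignment of vision-language models

9 days running 9,340 downloads 189 likes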

4 angrygiraffe/claude-opus-4.6-4.7-reasoning-8.7k

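A fine-tuning dataset of roughly 8,700 Claude Opus 4.6/4.7 reasoning chains, for distilling or strengthening a model's chain-of-thought ability.

15 days running 3,507 downloads 157 likes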

5 5CD-AI/Viet-Handwriting-OCR-v2

Version 2 of a Vietnamese handwriting OCR dataset, designed for Vietnamese handwritten text recognition.

3 days running 268 downloads 44 likes

6 TeichAI/DeepSeek-v4-Pro-Agent

4 days running 2,623 downloads 41 likes

7 Jackrong/GLM-5.1-Reasoning-1M-Cleaned

25 days running 11,534 downloads 216 likes

8 Qwen/WebWorldData

6 days running 648 downloads 44 likes

9 Modotte/CodeX-2M-Thinking

4 days running 5,917 downloads 101 likes

10 openai/gsm8k

NEW 945,957 downloads 1,323 likes

Trending Papers

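1 Generative Recursive Reasoning

GRAM introduces probabilistic multi-trajectory computation into neural reasoning systems, using stochastic latent trajectories to infer over multiple hypotheses in parallel and improve complex reasoning.

NEW 3 votes Junyeob Baek, Mingyu Jo, Minsu Kim, Mengye Ren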

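2 UniT: Unified Geometry Learning with Group Autoregressive Transformer

UniT proposes a unified feed-forward geometry perception model that merges multiple perception paradigms and preserves metric-scale accuracy through a scale-adaptive loss and a queue-style KV cache.

NEW 2 votes Haotian Wang, Yusong Huang, Zhaonian Kuang, Hongliang Lu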

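3 Ethical Hyper-Velocity (EHV): A Provably Deterministic Governance-Aware JIT Compiler Architecture for Agentic Systems

The EHV architecture combines conflict-free replicated data types (CRDTs) with trusted execution environments (TEEs) to formally verify and enforce AI governance policies in real time at sub-millisecond latency.

NEW 0 votes Riddhi Mohan Sharma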

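4 SENSE: Satellite-based ENergy Synthesis for Sustainable Environment

SENSE uses a diffusion model to fuse satellite imagery with energy consumption data, building a generative framework for modeling urban building energy use that achieves high-fidelity results with less labeled data.

NEW 1 vote Kailai Sun, Mingyi He, Heye Huang, Can Rong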

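5 S-Bus: Automatic Read-Set Reconstruction for Multi-Agent LLM State Coordination

The S-Bus middleware reconstructs read sets through a DeliveryLog mechanism and enforces observable-read isolation consistency, eliminating structural race conditions among concurrent LLM agents.

NEW 0 votes Sajjad Khan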

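6 ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions

ThoughtTrace builds a large-scale dataset pairing human-AI conversations with users' self-reported thoughts, used to improve user behavior prediction and to train personalized assistants through thought-guided rewriting.

NEW 3 votes Chuanyang Jin, Binze Li, Haopeng Xie, Cathy Mengying Fang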

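7 Interactive Evaluation Requires a Design Science

Interactive evaluation is a principled paradigm shift that calls for new frameworks which assess system behavior through dynamic interaction trajectories rather than static responses.

NEW 11 votes Keyang Xuan, Peiyang Song, Pan Lu, Pengrui Han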

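8 Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR

POW3R is a policy-aware reinforcement learning framework that adaptively adjusts rubric weights during training, improving policy optimization while preserving the importance assigned by human-written rubrics.

NEW 1 vote Utkarsh Tyagi, Xingang Guo, MohammadHossein Rezaei, Daniel George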

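9 Base Models Look Human To AI Detectors

Text generated by instruction-tuned models is flagged as non-human by commercial detectors; in response, the authors develop a semantics-preserving paraphrasing pipeline that works across model scales and makes text appear more human-like.

NEW 1 vote Yixuan Even Xu, Ziqian Zhong, Aditi Raghunathan, Fei Fang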

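10 Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

Open-ended benign rewriting neutralizes malicious content by projecting it through benign prompts, defending LLMs against backdoor attacks; it outperforms existing defenses while maintaining computational efficiency and performance on natural language tasks.

NEW 0 votes John T. Halloran, Noopur S. Bhatt

📝 Latest AI Papers on arXiv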

1 Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance

arXiv:2605.18801v1 Abstract: Data is fundamental to large language models (LLMs). However, understanding of what makes certain data useful for different stages of an LLM workflow, i

NEW Shiqiang Wang, Herbert Woisetschläger, Hans Arno Jacobsen et al. · Wed, 20 Ma cs.AI

2 Operationalizing Document AI: A Microservice Architecture for OCR and LLM Pipelines in Production

arXiv:2605.18818v1 Abstract: Academic research tends to focus on new models for document understanding creating a wide gap in the literature between model definition and running mod

NEW Yao Fehlis, Benjamin Bengfort, Zhangzhang Si et al. · Wed, 20 Ma cs.AI

3 Evaluating the Utility of Personal Health Records in Personalized Health AI

arXiv:2605.18937v1 Abstract: Patient-managed Personal Health Records (PHRs) promises to empower patients to better understand their health; but information in the record is complex,

NEW Rory Sayres, Kejia Chen, Ayush Jain et al. · Wed, 20 Ma cs.AI

4 Learn-by-Wire Training Control Governance: Bounded Autonomous Training Under Stress for Stability and Efficiency

arXiv:2605.19008v1 Abstract: Modern language-model training is increasingly exposed to instability, degraded runs, and wasted compute, especially under aggressive learning-rate, sca

NEW Anis Radianis · Wed, 20 Ma cs.AI

5 AgentNLQ: A General-Purpose Agent for Natural Language to SQL

arXiv:2605.19010v1 Abstract: Natural language to SQL (NL2SQL) conversion is an important problem for researchers and enterprises due to the ubiquitous importance of relational datab

NEW Olena Bogdanov, Yeunji Jung, Chandra Dhir et al. · Wed, 20 Ma cs.AI

6 KAN-MLP-Mixer: A comprehensive investigation of the usage of Kolmogorov-Arnold Networks (KANs) for improving IMU-based Human Activity Recognition

arXiv:2605.19031v1 Abstract: Kolmogorov-Arnold Networks (KANs) have demonstrated an exceptional ability to learn complex functions on clean, low-dimensional data but struggle to mai

NEW Mengxi Liu, Sizhen Bian, Vitor Fortes et al. · Wed, 20 Ma cs.AI

7 Trustworthy Agent Network: Trust in Agent Networks Must Be Baked In, Not Bolted On

arXiv:2605.19035v1 Abstract: The rapid advancement of Large Language Models has given rise to autonomous LLM-based agents capable of complex reasoning and execution. As these agents

NEW Yixiang Yao, Yuhang Yao, Xinyi Fan et al. · Wed, 20 Ma cs.AI

8 Interference-Aware Multi-Task Unlearning

arXiv:2605.19042v1 Abstract: Machine unlearning aims to remove the contribution of designated training data from a trained model while preserving performance on the remaining data.

NEW Ying-Hua Huang, Rui Fang, Hsi-Wen Chen et al. · Wed, 20 Ma cs.AI

9 Embedding by Elicitation: Dynamic Representations for Bayesian Optimization of System Prompts

arXiv:2605.19093v1 Abstract: System prompts are a central control mechanism in modern AI systems, shaping behavior across conversations, tasks, and user populations. Yet they are di

NEW Zhiyuan Jerry Lin, Benjamin Letham, Samuel Dooley et al. · Wed, 20 Ma cs.AI

10 DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows

arXiv:2605.19099v1 Abstract: We introduce DecisionBench, a benchmark substrate for emergent delegation in long-horizon agentic workflows. The substrate fixes a task suite (GAIA, tau

NEW Yuxuan Gao, Megan Wang, Yi Ling Yu et al. · Wed, 20 Ma cs.AI

11 POLAR-Bench: A Diagnostic Benchmark for Privacy-Utility Trade-offs in LLM Agents

arXiv:2605.19127v1 Abstract: LLM agents increasingly have access to private user data and act on the user's behalf when interacting with third-party systems. The user defines what m

NEW Qiaoyuan Zheng, Yiqu Yang, Qi Gao et al. · Wed, 20 Ma cs.AI

12 Learning to Hand Off: Provably Convergent Workflow Learning under Interface Constraints

arXiv:2605.19140v1 Abstract: We study workflow learning in a setting where specialized agents hand off control through a shared artifact, each agent observes only a local function o

NEW Jiayu Li, Enpei Zhang, Dawei Zhou et al. · Wed, 20 Ma cs.AI

🔥 AI Community Discussions

1 [D] Self-Promotion Thread

9 days running Reddit r/MachineLearning

2 [D] Monthly Who's Hiring and Who wants to be Hired?

10 days running Reddit r/MachineLearning

3 How competitive are PhD admissions currently [D]

NEW Reddit r/MachineLearning

4 Machine Learning on Spherical Manifold [R]

NEW Reddit r/MachineLearning

5 Any tool to get accepted conference papers sorted by citation count? [D]

NEW Reddit r/MachineLearning