Project

Rethinking Workflow Optimization

Agent4Society

GAME Platform

Thinking as a Tool

Rubric as Reward, Self-Evolve, Continual Learning

Small Agents, Stronger Together

Coding

MiniMind

TRL Practice

Reading

Basics

LLM Fundamentals Survey

Large Language Model Fundamentals Textbook (Zhao Xin)

DeepSeek Tech Reports

Diffusion Probabilistic Models

PaLM-E: Google's Embodied Multimodal Language Model