-
Fudan University
- Shanghai
Pinned Loading
-
lsdefine/GenericAgent
lsdefine/GenericAgent PublicSelf-evolving agent: grows skill tree from 3.3K-line seed, achieving full system control with 6x less token consumption
-
JinyiHan99/Proactive-Self-Refine-in-LLMs
JinyiHan99/Proactive-Self-Refine-in-LLMs Public[ICLR26] An open-source project dedicated to training language models to proactively refine their outputs during generation via reinforcement learning.
Python 5
-
lsdefine/simple_GRPO
lsdefine/simple_GRPO PublicA very simple GRPO implement for reproducing r1-like LLM thinking.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

