Skip to content
@horizon-llm

Horizon Team

Towards Long-Horizon AI Agents

Pinned Loading

  1. OpenKimi OpenKimi Public

    Reproduce Kimi K1.5/K2 RL algorithm and rollout system

    Python 17 2

  2. Think-RM Think-RM Public

    [NeurIPS 2025] Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

    Python 16 1

  3. uncertainty-router uncertainty-router Public

    [NeurIPS 2025] Ask a Strong LLM Judge when Your Reward Model is Uncertain

    Python 9

  4. HeaPA HeaPA Public

    Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

    Python 5

  5. AlphaQuanter AlphaQuanter Public

    [ACL2026] AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading.

    Python 49 9

Repositories

Showing 7 of 7 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python

Most used topics

Loading…