contextual-bandit

Here are 18 public repositories matching this topic...

KKeishiro / Yahoo_recommendation

Yahoo! news article recommendation system using LinUCB

recommendation-system contextual-bandit bandit-algorithms linucb

Updated Feb 1, 2018
Python

HumphreySun98 / Smart-Study-Agent

🎓 Adaptive AI study agent with POMDP belief state — OPEAA loop, Q-learning + LinUCB bandit policies, SM-2 spaced repetition, concept DAG. Streamlit web app + Chrome extension (MV3). Claude & free HF backends.

chrome-extension reinforcement-learning q-learning spaced-repetition pomdp contextual-bandit huggingface streamlit agent-framework study-tool ai-agent claude-api

Updated Apr 28, 2026
JavaScript

niffler92 / Bandit

Bandit algorithms

simulation thompson-sampling multiarm-bandit contextual-bandit bandit-algorithms linucb

Updated Oct 12, 2017
Python

Digitalized-Energy-Systems / opfgym

A Gymnasium-compatible framework for creating reinforcement learning (RL) environments for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.

benchmark environment reinforcement-learning supervised-learning rl optimal-power-flow energy-system gymnasium opf pandapower contextual-bandit reward-design reward-shaping power-system environment-design action-shaping

Updated Mar 22, 2025
Python

Bilkent-CYBORG / ACC-UCB

Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.

reinforcement-learning contextual-bandit multiarmed-bandits combinatorial-bandit

Updated Feb 24, 2020
Python

Whatsonyourmind / oraclaw

Deterministic decision-intelligence MCP server for AI agents — 17 tools, 21 algorithms (LinUCB, HiGHS LP/MIP, PageRank, Monte Carlo, CMA-ES, conformal). Sub-25ms. Zero LLM cost. Listed on the MCP Registry, Glama & Smithery.

Updated Jun 3, 2026
TypeScript

doerlbh / BerlinUCB

Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".

reinforcement-learning paper semi-supervised-learning bandits bandit contextual-bandits contextual-bandit self-supervised-learning nonstationary-environments

Updated Sep 21, 2020
MATLAB

ej0cl6 / cbpr

Contextual Bandit with Piled Rewards

contextual-bandit piled-rewards

Updated Dec 18, 2017
Python

Hins-Hu / Bandit-Algorithms

An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms

multi-armed-bandit contextual-bandit bandit-algorithms

Updated Feb 3, 2021
Python

bsteenwi / ContextualBandit

Contextual bandit implementation using Keras

python keras contextual-bandit

Updated Apr 30, 2018
Python

abailey81 / implicit-interaction-intelligence

Adaptive AI companion that builds a model of each user from implicit interaction signals — keystroke dynamics, linguistic complexity, temporal patterns — and continuously adapts its responses. Custom TCN + transformer + contextual bandit, built from scratch in PyTorch.

Updated Apr 29, 2026
Python

Brahamanbtp / ORBIT

Block-level adaptive compression using LinUCB contextual bandit routing. Outperforms LZ4 by 20.4%, LZMA by 9.9%.

python compression research reproducible-research information-theory data-compression online-learning contextual-bandit linucb regret-minimization lossless-compression block-compression adaptive-compression codec-selection

Updated May 1, 2026
Python

gagan-53 / self-healing-ml-pipeline

Self-Healing ML Pipeline: autonomous fault detection, RCA, and recovery with a LinUCB contextual bandit, VS Code extension, and 17-prompt LLM library.

machine-learning vscode-extension self-healing anomaly-detection fault-detection contextual-bandit mlops root-cause-analysis drift-detection ml-pipeline

Updated Apr 27, 2026
Python

aipi590-ggn / aipi590-challenge-4

Contextual bandit (LinUCB) that re-tunes PID gains for a line-following robot as its chassis changes

reinforcement-learning robotics pid-control adaptive-control contextual-bandit linucb robotics-education

Updated Apr 21, 2026
Python

SC5 / bandits

machine-learning reinforcement-learning bandits contextual-bandit

Updated Nov 16, 2017
Python

victor-iyi / contextual-bandit

A Reinforcement Learning approach to a contextual bandit problem.

reinforcement-learning reinforcement-learning-algorithms bandit-learning markov-decision-processes contextual-bandit

Updated Dec 2, 2017
Jupyter Notebook

ElromEvedElElyon / rl-router

Contextual Thompson Sampling router for multi-provider LLM APIs. Zero config.

python reinforcement-learning cross-platform rate-limiting openai thompson-sampling multi-armed-bandit contextual-bandit zero-dependencies adaptive-routing llm anthropic cerebras llm-ops llm-gateway llm-routing

Updated Apr 19, 2026
Python

adityagangwani30 / credit-limit-bandit

Contextual Multi-Armed Bandit that automates credit limit decisions for 10,000 users using Thompson Sampling — beats static limits by 30%+ with an interactive Streamlit dashboard.

python data-science machine-learning reinforcement-learning fintech thompson-sampling multi-armed-bandit credit-risk contextual-bandit streamlit

Updated May 23, 2026
HTML
