Yahoo! news article recommendation system by linUCB
-
Updated
Feb 1, 2018 - Python
Yahoo! news article recommendation system by linUCB
🎓 Adaptive AI study agent with POMDP belief state — OPEAA loop, Q-learning + LinUCB bandit policies, SM-2 spaced repetition, concept DAG. Streamlit web app + Chrome extension (MV3). Claude & free HF backends.
Bandit algorithms
A gymnasium-compatible framework to create reinforcement learning (RL) environment for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
Deterministic decision-intelligence MCP server for AI agents — 17 tools, 21 algorithms (LinUCB, HiGHS LP/MIP, PageRank, Monte Carlo, CMA-ES, conformal). Sub-25ms. Zero LLM cost. Listed on the MCP Registry, Glama & Smithery.
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms
Contextual bandit implementation using Keras
Adaptive AI companion that builds a model of each user from implicit interaction signals — keystroke dynamics, linguistic complexity, temporal patterns — and continuously adapts its responses. Custom TCN + transformer + contextual bandit, built from scratch in PyTorch.
Block-level adaptive compression using LinUCB contextual bandit routing. Outperforms LZ4 by 20.4%, LZMA by 9.9%.
Self-Healing ML Pipeline: autonomous fault detection, RCA, and recovery with a LinUCB contextual bandit, VS Code extension, and 17-prompt LLM library.
Contextual bandit (LinUCB) that re-tunes PID gains for a line-following robot as its chassis changes
A Reinforcement Learning approach to a contextual bandit problem.
Contextual Thompson Sampling router for multi-provider LLM APIs. Zero config.
Contextual Multi-Armed Bandit that automates credit limit decisions for 10,000 users using Thompson Sampling — beats static limits by 30%+ with an interactive Streamlit dashboard.
Add a description, image, and links to the contextual-bandit topic page so that developers can more easily learn about it.
To associate your repository with the contextual-bandit topic, visit your repo's landing page and select "manage topics."