FU-max-boop

FU-max-boop

Popular repositories Loading

skills-introduction-to-github skills-introduction-to-github Public

Exercise: Introduction to GitHub
jj jj Public

Python
mini-llm-lab mini-llm-lab Public

Controlled mini-benchmark for context visibility, shortcut regimes, and composition in tiny causal transformers.

Python
FU-max-boop FU-max-boop Public

Profile README for AI research engineering direction.

Python
Spider2 Spider2 Public

Forked from xlang-ai/Spider2

[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

HTML
claw-eval claw-eval Public

Forked from claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python