Kai Wu KaiP-598

3 followers · 2 following

Pinned

grpo-from-scratch Public

From-scratch GRPO (DeepSeek-R1) implementation reaching 74.2% on MATH, with full ablation sweep

Python

filing-sense Public

RAG pipeline for SEC 10-K filing analysis: FinQA dataset, hybrid retrieval (BM25 + FAISS + reranker), LoRA and full SFT fine-tuning on Qwen2.5-3B

Python