Yonas Mekonnen Sanoy24

Yonas Mekonnen: AI Engineer | Forward Deployed Engineer | Backend Developer

I build the infrastructure that makes AI agents reliable in production.
Event-sourced memory · Multi-agent orchestration · LLM evaluation systems · Backend APIs

🟢 Open to work — Full-time · Contract · Freelance

What I do

3+ years building production backend systems and the infrastructure layer for AI agents: event-sourced memory, multi-agent orchestration, document intelligence pipelines, and LLM evaluation built from scratch. I publish everything I build on GitHub, Hugging Face, and Substack.

Skills

Languages — Python · TypeScript · Go · JavaScript

AI & Agents — LangGraph · LangChain · MCP (Model Context Protocol) · OpenAI · Anthropic · Langfuse · LangSmith

Backend — FastAPI · NestJS · Node.js · Express · Gin · Django

Data & Infra — PostgreSQL · Redis · MongoDB · ChromaDB · SQLite · Docker

ML — HuggingFace · LoRA fine-tuning (SimPO) · RAG pipelines · Pydantic

Projects

Five projects. Each one production-grade, each one ships real output.

⚖️ Automaton Auditor — LangGraph Governance Swarm

A multi-agent system that performs forensic code audits through a Digital Courtroom architecture: detective agents gather evidence in parallel, three independent judge agents score every rubric criterion, and a ChiefJustice resolves conflicts using deterministic Python rules — not LLM prompts. Outputs a structured JSON verdict and a human-readable Markdown report covering 10+ rubric dimensions.
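To illustrate what resolving judge conflicts with deterministic Python rules can look like, here is a minimal sketch. The rule itself (median when the judges roughly agree, the lowest score otherwise), the `resolve_criterion` name, and the `max_spread` threshold are assumptions made for illustration; they are not taken from the Automaton Auditor codebase.

```python
from statistics import median


def resolve_criterion(scores: list[int], max_spread: int = 1) -> dict:
    """Resolve three judge scores for one rubric criterion without an LLM call.

    Hypothetical rule: if the judges are within `max_spread` of each other,
    take the median; otherwise fall back to the most conservative score.
    """
    if len(scores) != 3:
        raise ValueError("expected exactly three judge scores")

    spread = max(scores) - min(scores)
    if spread <= max_spread:
        # Judges roughly agree: the median ignores a single outlier.
        return {"score": int(median(scores)), "rule": "median"}

    # Judges disagree: pick the lowest score so the verdict never overstates.
    return {"score": min(scores), "rule": "conservative_min"}


agreeing = resolve_criterion([4, 4, 5])
conflicting = resolve_criterion([1, 3, 5])
```

Because the function is pure, the same three scores always yield the same verdict, and the `rule` field records which branch produced it.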

Result: Replaces hours of manual code review with a single command. Conflict resolution is 100% deterministic — zero hallucinated scores regardless of model.

LangGraph LangSmith Python AST analysis RAG Docker Multi-LLM routing

🗺️ Brownfield Cartographer — Codebase Intelligence System

Ingests any GitHub repository and produces a living knowledge graph of its architecture, data flows, and semantic structure. Four specialized agents (Surveyor, Hydrologist, Semanticist, Archivist) feed into a LangGraph ReAct query interface. Outputs an interactive visualization, a module graph, a data lineage DAG, and a Day-One onboarding brief that answers five key architectural questions automatically.

Result: An engineer joining a new codebase can get full architectural context — what the system does, how data flows, which modules are critical — in under an hour instead of days.

LangGraph tree-sitter sqlglot NetworkX Python

🏦 TRP1 Ledger — Event Store for AI Agents

Append-only, ACID-backed event store for multi-agent AI systems — the memory and governance layer for a financial services loan processing platform. Includes optimistic concurrency control (verified by double-decision concurrency tests: one agent wins, one gets a clean error — no silent data corruption), causal tracing on every event, and an MCP server that exposes the store as agent-callable tools.
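The double-decision scenario can be sketched with a small in-memory stand-in. This is not the TRP1 Ledger implementation, which is backed by PostgreSQL; the `InMemoryEventStore` class, the `expected_version` parameter, and the event shapes are hypothetical names chosen to show the general optimistic-concurrency pattern.

```python
import threading


class ConcurrencyError(Exception):
    """Raised when a writer's expected stream version is stale."""


class InMemoryEventStore:
    """Hypothetical in-memory stand-in for an append-only event store."""

    def __init__(self) -> None:
        self._streams: dict[str, list[dict]] = {}
        self._lock = threading.Lock()

    def version(self, stream_id: str) -> int:
        with self._lock:
            return len(self._streams.get(stream_id, []))

    def append(self, stream_id: str, event: dict, expected_version: int) -> int:
        # The version check and the write happen under one lock, so they are atomic.
        with self._lock:
            stream = self._streams.setdefault(stream_id, [])
            if len(stream) != expected_version:
                raise ConcurrencyError(
                    f"{stream_id}: expected v{expected_version}, found v{len(stream)}"
                )
            stream.append(event)
            return len(stream)


store = InMemoryEventStore()

# Both agents read the stream at the same version before deciding.
seen_by_a = store.version("loan-42")
seen_by_b = store.version("loan-42")

store.append("loan-42", {"type": "LoanApproved", "agent": "a"}, seen_by_a)
try:
    store.append("loan-42", {"type": "LoanDeclined", "agent": "b"}, seen_by_b)
    outcome = "both written"
except ConcurrencyError:
    outcome = "second writer rejected"
```

In a database-backed store the same guarantee is commonly enforced with a unique constraint on the stream ID and version pair, so that the losing insert fails inside the transaction rather than relying on an application-level lock.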

Result: Zero conflicting agent decisions under concurrent load. Every AI decision is cryptographically traceable and packaged for regulatory examination — something most production AI systems can't provide at all.

Python PostgreSQL FastMCP Event Sourcing CQRS Pydantic

🤖 Conversion Engine — AI Sales Agent

Four-stage agentic pipeline that enriches prospects from public data, classifies them into ICP segments, drafts signal-grounded outreach with a tone-preservation check, and books discovery calls — all with a kill switch, strict channel hierarchy, and full trace observability via Langfuse.

Result: $0.01 cost-per-qualified-lead — 500× under the $5 industry target. Evaluated end-to-end against the τ²-Bench benchmark with a 37-probe adversarial suite.

FastAPI Langfuse HubSpot MCP Playwright Resend Cal.com Python

📊 Tenacious-Bench — LLM Benchmark & Fine-Tuned Judge

Built a 242-task evaluation benchmark for B2B sales AI agents from scratch — four generation modes, 14 failure dimensions, inter-rater agreement κ=0.91. Then fine-tuned a SimPO LoRA adapter on Qwen2.5-3B that lifted pairwise scoring accuracy from 97.7% to 100% on the sealed held-out set.
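For readers unfamiliar with the κ statistic, the sketch below computes Cohen's kappa for two raters. The text above does not say which kappa variant was used for Tenacious-Bench, so Cohen's two-rater form is an assumption, and the labels are made-up illustration data rather than benchmark annotations.

```python
from collections import Counter


def cohens_kappa(rater_a: list[str], rater_b: list[str]) -> float:
    """Cohen's kappa: agreement between two raters, corrected for chance."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("raters must label the same non-empty set of items")

    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: probability both raters pick the same label independently.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    labels = set(freq_a) | set(freq_b)
    expected = sum((freq_a[label] / n) * (freq_b[label] / n) for label in labels)

    if expected == 1.0:
        # Both raters used a single identical label; kappa is undefined, report 1.0.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Made-up labels for four tasks, purely for illustration.
kappa = cohens_kappa(
    ["pass", "pass", "fail", "fail"],
    ["pass", "pass", "fail", "pass"],
)
```

Here the raters agree on three of four items (0.75 observed) against 0.5 expected by chance, which gives a kappa of 0.5.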

Result: Published dataset and model adapter on Hugging Face. Total cost: $0.38.

HuggingFace SimPO LoRA Qwen2.5 Python Jupyter

Other work

Project	What it is
fastapi-ecommerce	Production e-commerce API — FastAPI, PostgreSQL, JWT, Docker
lyrics-rest-api	RESTful API in idiomatic Go
trp1-document-intelligence-refinery	5-stage document pipeline — ChromaDB, Docling, LangGraph ReAct
movie-streaming	Full-stack streaming app — Next.js, React, Tailwind
djinni-telegram-bot	Job alert bot — scraping, automation, Telegram API


Available for the right opportunity · myonas886@gmail.com · LinkedIn
