Skip to content
View s4um1l's full-sized avatar

Block or report s4um1l

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
s4um1l/README.md

👋 Hi, I'm Saumil Srivastava

Principal/Staff AI & ML Engineer

12+ years building production AI/ML systems at scale. 8 years at Intuit shipping tax intelligence, CTO of an internal venture (0-to-GA in 9 months), and identity migration across 100+ microservices. Open to Principal/Staff+ AI Engineering roles in Canada.

Stay Updated

Subscribe to my newsletter for insights on production AI systems, engineering leadership, and practical strategies for building reliable AI at scale.

Newsletter Signup

What I Do

  • Production-Scale ML Systems: governed execution layers, identity systems.
  • Agent Reliability Engineering: Multi-agent orchestration with ReAct loops, token budgeting, HITL gates, and failure taxonomies
  • RAG & Retrieval Systems: Permission-aware retrieval (pgvector + RLS), hybrid BM25 + dense with RRF, NLI entailment verification, RAPTOR synthesis
  • LLMOps & Observability: OTel GenAI conventions, LLM-as-judge evaluation harnesses, multi-tenant pipelines, A/B testing infrastructure
  • 0-to-1 Technical Leadership: Owned architecture, hiring, and delivery as CTO — comfortable driving ambiguous problems from whiteboard to production

Technical Expertise

  • Multi-Agent Orchestration & LangGraph
  • RAG Pipelines & Vector Databases (pgvector)
  • LLM Evaluation, Calibration & Fine-Tuning (LoRA)
  • Distributed Systems & Temporal
  • OpenTelemetry & LLMOps
  • Mechanistic Interpretability (logit lens, published research)
  • Python, Production ML at Scale

Let's Connect


Pinned Loading

  1. agent-evals-lab agent-evals-lab Public

    Python

  2. context-engineering-lab context-engineering-lab Public

    Production-style context engineering for LLM systems: budgets, retrieval, reranking, compaction, caching, rollback gates. Exercises + learning guide.

    Python

  3. saumil-ai-roi-engineering saumil-ai-roi-engineering Public

    Saumil AI ROI Engineering

    Jupyter Notebook

  4. saumil-ai-implementation-examples saumil-ai-implementation-examples Public

    ai implementation examples

    Jupyter Notebook 2

  5. aya-cross-lingual-probe aya-cross-lingual-probe Public

    Mechanistic interpretability of cross-lingual concept representations in Tiny Aya — rise, peak, collapse.

    Python 5

  6. citation-hallucination-detector citation-hallucination-detector Public

    Mechanistic analysis of confidence formation in LLMs to detect fabricated citations.

    Jupyter Notebook