Engineer Β· Architect Β· Founder
Building the infrastructure layer that should already exist
I lead teams by day and build ecosystems by night.
My stack runs from Android NDK to bare-metal Rust services β no abstraction layer is too high, no system too low. I've fine-tuned LLMs, written custom WebRTC SFU servers, shipped OS-level Android tooling, architected microservice migrations for Fortune 500 energy infrastructure, designed post-quantum cryptography platforms, and somewhere in between, started questioning whether transformers are the right architecture at all.
Primary languages: Kotlin Β· Rust. Secondary religion: making things fast.
Not side projects. A connected vision: own every layer of the AI development lifecycle β and secure it while you're at it.
π₯ Runlit Β· github.com/runlit-dev β The trust layer below your AI IDE
AI writes fast. Runlit keeps score. An eval layer that catches what code review misses β hallucinated APIs, intent mismatches, security vulnerabilities, and compliance violations β before they reach production.
$ runlit check --pr 2847
β scanning 147 lines (AI attribution: Cursor/claude-sonnet)
hallucination 0 phantom APIs found β 1.00
intent matches spec, minor drift β 0.91
security no critical issues β 0.97
compliance 1 medium finding (PCI 6.2.4) β 0.88
eval.score 94 / 100
β CLEARED β merge when ready.139 rules. 12 provider-specific packs (OpenAI, Anthropic, LangChain, Stripe...). 30-second GitHub App install. Merge-blocking. MIT-licensed rules.
1.7Γ more issues in AI-generated code. 45% of AI tasks introduce security vulnerabilities. 75% of enterprise engineers will use AI coding tools by 2028. Runlit exists because those three numbers are a collision course.
Rust GitHub App GitLab Azure DevOps Bitbucket CLI GitHub Actions
β‘ Routra Β· github.com/routra-dev β One API. Every GPU Cloud.
Intelligent LLM routing layer between your app and 10+ GPU providers. Scores every provider every 30s on price, latency, uptime, and queue depth. Routes to the cheapest option that meets your SLA β <8ms overhead, <200ms auto-failover.
# Before
client = openai.OpenAI(api_key="sk-...")
# After. That's it.
client = openai.OpenAI(api_key="rtr-...", base_url="https://api.routra.dev/v1")Typical savings: 60% of inference spend. Adaptive ML routing via LinUCB bandit β learns per model, region, and time-of-day.
Rust Axum OpenAI-compatible Python SDK TypeScript SDK Go SDK Rust SDK CLI
π₯οΈ Zylora Β· github.com/zylora-dev β Serverless GPU. Zero DevOps.
Decorate a function. Get a production HTTPS endpoint backed by GPU hardware. No Docker, no YAML, no DevOps hire. Not Python-first β full feature parity across Python, TypeScript, Go, and Rust. Same mental model, idiomatic syntax per language.
Serverless GPU Python TypeScript Go Rust Multi-Language SDK
π€ QCHO Β· github.com/qcho-ai β Ambient AI for your entire digital life
Not a chatbot. A background-capable multi-agent operating layer β voice-first, connector-driven, deeply personalised via persistent per-user memory (imprint.bit). Orchestrates Gmail, Calendar, Notion, Slack, GitHub, and custom MCP servers. Every external dependency behind a swappable adapter trait.
SENSA (intent) β ORACLE (task decompose + DAG) β Agent Mesh β Connectors β ECHO (response)
Rust Axum Tauri 2 Next.js 16 Claude (Anthropic) pgvector Whisper ElevenLabs
βοΈ Nuclyr Β· github.com/nuclyr β Multi-Cloud. Made for India.
Route storage, compute, and data workloads across AWS, GCP, and Azure β automatically. Built for Indian businesses: INR billing, UPI/Razorpay, GST invoices, DPDP compliance scanning. Phase 1 shipped: gRPC-first architecture, AES-256-GCM per-tenant encryption, Rust adapters, full SDK surface.
Rust gRPC Protobuf Next.js PostgreSQL 18 Redis Razorpay
π QShield β Quantum-safe by default.
The full-stack post-quantum cryptography migration platform. Secures all four layers β transport, authentication, secrets storage, and credentials β under one open core, before quantum computers make today's encryption obsolete.
1Password secures passwords. Cloudflare secures transport. Nobody secures all four layers under one brand with one open core. QShield is that platform.
Built in Rust. Aligned to NIST FIPS 203 (ML-KEM), FIPS 204 (ML-DSA), and FIPS 205 (SLH-DSA). India-first. Enterprise and government-ready. Harvest-now-decrypt-later attacks are already happening β the migration window is open and won't stay that way.
| Product | What it does |
|---|---|
| QShield Core | Open-source PQC library β ML-KEM, ML-DSA, hybrid classical+PQC. Rust-native with Python, Node, Go, Java bindings |
| QShield Proxy | Drop-in PQC-hybrid TLS termination. Zero application code changes |
| QShield Vault | PQC-first zero-knowledge secrets manager. Browser, desktop, mobile |
| QShield Auth | Drop-in Auth0/Clerk replacement issuing ML-DSA-signed JWTs |
| QShield Enterprise | Codebase scanning, PQC readiness scoring, CERT-In / RBI / SEBI compliance reports |
Rust ML-KEM ML-DSA SLH-DSA PyO3 NAPI-RS WASM Tauri 2 Flutter Apache 2.0
Neural Execution & Understanding System. Not an LLM variant. A new architecture class.
NEXUS addresses fundamental flaws in the transformer architecture. NOOR is the first model family built on it β scaling bottom-up with intention, not brute force:
| Model | Params | Context | Status |
|---|---|---|---|
| NOOR Nano | 29M | 512 | v0.4 training |
| NOOR Micro | 80β120M | 2048 | Next |
| NOOR Mini | ~350M | TBD | Planned |
| NOOR Small | ~1B | TBD | Planned |
Also running Grafts β NEXUS modules injected into existing open-source backbones (Qwen-NOOR, Gemma-NOOR) as experimental sidelines.
This is what happens when you spend enough time staring at transformer limitations that you decide to do something about it.
Systems & Backend
AI & Infra
Mobile & Multiplatform
- Application Lead @ Vendeep / NRG Energy (2022 β present) β Leading a 25-person team modernizing legacy enterprise portlets into microservices. 145+ APIs migrated across Spring Boot, FastAPI, and Salvo. Still running.
- Senior Android + Reverse Engineering @ Innobuzz (2020 β 2022) β NDK-level engineering, custom OS injection, Android security tooling at the system layer.
- Delivered 4 production phases for HCTRA (Harris County Toll Road Authority, Houston) β Kafka streaming, Oracle GoldenGate, millions of daily toll transactions.
- Built and fine-tuned a generative AI training platform for HCTRA β Meta Llama 3, Rust + Python, custom fine-tuning pipeline.


