Skip to content
View rehan243's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report rehan243

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
rehan243/README.md

waving hand animation    Typing SVG — AI Engineer, RAG, LLM, Agents

GitHub followers  LinkedIn — Rehan Malik AI Engineer  Kaggle — Machine Learning  profile visitors

animated divider

About Me — AI/ML Engineer | Generative AI | LLM Systems

i'm an AI/ML engineer based in the US, currently building production AI systems at Reallytics.ai and Verticiti. most of my work revolves around getting large language models to do useful things in production — not toy demos, actual systems handling real traffic.

before this, i spent years at Afiniti and Cloud Kinetics doing the grunt work of making ML models reliable at scale. fraud detection, voice analytics, enterprise search — the kind of stuff that breaks at 3am and you have to fix.

what keeps me going: that moment when an AI agent you built actually solves a problem you didn't explicitly program it for. still hits different every time.

right now i'm deep into:

  • multi-agent systems that coordinate without falling apart
  • RAG pipelines that actually find what you're looking for
  • writing daily about what i learn — AI Engineering Notes
developer coding animation — AI engineer at work

animated gradient divider

Featured Projects — AI Agents, RAG, LLM Fine-Tuning  rocket

Agentic AI Workflows — Production AI Agents
8 specialized AI agents with LangChain + OpenAI function calling. multi-agent orchestration with planning loops and guardrails. the project i'm most excited about.

RAG Enterprise Search — Retrieval-Augmented Generation
production retrieval pipeline over 2TB+ data. LangChain, FAISS, ChromaDB, cross-encoder re-ranking. deployed on AWS SageMaker.

Voice AI Platform — Real-Time Speech AI
real-time voice infrastructure handling 500+ concurrent calls. WebSockets, Apache Kafka, gRPC with CUDA. speech-to-text, sentiment analysis.

LLM Fine-Tuning (LoRA/QLoRA) — Parameter-Efficient Fine-Tuning
fine-tuning LLaMA-2 and Mistral with LoRA/QLoRA/PEFT. 40% cost reduction vs hosted APIs. vLLM serving on SageMaker.

RLHF LLM Optimization — Reinforcement Learning from Human Feedback
full RLHF pipeline — supervised fine-tuning, reward modeling, PPO with KL constraints. 68% win rate, 96% safety compliance.

Sentinel Fraud Detection — Explainable AI
ensemble XGBoost + Isolation Forest with 650+ engineered features. SHAP explainability, UMAP clustering, GenAI reports via Amazon Bedrock.

view all repositories

animated divider

Tech Stack — Python, PyTorch, LangChain, AWS, Docker

i'm not going to pretend i use everything equally. here's what i actually reach for day-to-day:

tech stack icons — Python PyTorch TensorFlow Docker AWS GCP FastAPI Kafka Redis PostgreSQL MongoDB Git Linux VS Code

the full picture (click to expand)
daily drivers Python, PyTorch, FastAPI, Docker, Git, VS Code
LLM & GenAI LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA
vector & data FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch
cloud & MLOps AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI
ML frameworks TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX
infrastructure Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases

animated gradient divider

GitHub Stats

i commit a lot. sometimes it's good code, sometimes it's "fix: typo in typo fix".

Rehan Malik GitHub stats GitHub contribution streak

top programming languages — Python


GitHub Trophies

GitHub profile trophies


Contribution Activity Graph

GitHub contribution activity graph


Contribution Snake Animation

GitHub contribution snake animation

animated divider

Latest AI Research Articles

i publish research notes daily — not polished papers, just honest writeups of what i'm learning and building. think of it as a public lab notebook for generative AI, LLM fine-tuning, RAG, and agentic systems.

new articles dropping soon — browse AI Engineering Notes


Recent Open-Source Activity

loading...


Currently Researching  globe animation

topics discovered daily by a multi-model AI research engine (GPT-4.1, Grok-3, DeepSeek R1, Llama-4)

research engine warming up...


Code Snippets & Gists

gists incoming...

animated divider

if you made it this far, you should probably just say hi

connect on LinkedIn   follow on GitHub

footer wave

Pinned Loading

  1. Voice-AI-Platform Voice-AI-Platform Public

    Real-time voice AI infrastructure — 500+ concurrent calls, WebSockets, Apache Kafka, gRPC/C++ with CUDA. Speech-to-text, sentiment analysis, sales insights.

    Python

  2. Agentic-AI-Workflows Agentic-AI-Workflows Public

    Production AI Agents for enterprise automation — 8+ specialized agents using LangChain, OpenAI function calling, and FastAPI. Multi-agent orchestration, tool use, planning loops, guardrails.

  3. BiiView-Object-Detection BiiView-Object-Detection Public

    Real-time object detection with Meta AI Segment Anything Model (SAM) — 90% accuracy across 11M+ images and 1.1B+ segmentation masks.

  4. Digital-People-Platform Digital-People-Platform Public

    Hyper-realistic talking avatars — SadTalker lip-sync + Microsoft SpeechT5 TTS + OpenAI conversational AI. 70% realism improvement.

  5. LLM-Fine-Tuning-LoRA LLM-Fine-Tuning-LoRA Public

    Fine-tuning LLaMA-2, Mistral with LoRA, QLoRA, PEFT — 40% cost reduction vs hosted APIs. VLLM serving with CUDA optimization on AWS SageMaker.

    Python

  6. RAG-Enterprise-Search RAG-Enterprise-Search Public

    Production RAG pipeline — enterprise knowledge retrieval across 2TB+ data using LangChain, FAISS, ChromaDB, PG-Vector with cross-encoder re-ranking. Deployed on AWS SageMaker.

    Python