pankrulez/README.md

👋 Hey there! I'm Pankaj Kapri


🚀 The Journey So Far

My path in tech has been a fun and evolving adventure! For years, I built strong foundations in backend logic and problem-solving as a Java trainer and PHP web developer, followed by a deep dive into system troubleshooting as a senior technical support engineer.

Today, I am fully focused on Data Science, Machine Learning, and Generative AI. Because of my developer and support background, I don't just build models in isolated Jupyter notebooks—I care about clean architecture, handling real-world messy data, and deploying end-to-end applications that actually work in production.


💻 Projects I'm Proud Of

(Click to expand the details of my recent builds!)

🕵️‍♂️ Fraud Detection Paysphere

An end-to-end fraud detection application designed to handle highly imbalanced datasets.

  • The Challenge: Real-world fraud data is inherently skewed.
  • The Solution: I built a pipeline around XGBoost and deployed it via Streamlit. The model was trained on 50,000 transaction records with a realistic ~10% fraud rate, with the goal of raising precision without sacrificing recall.
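
To make the imbalance concrete, here is a minimal, dependency-free sketch (not the project's actual code). It works out the negative-to-positive ratio for 50,000 records at a 10% fraud rate, which is the usual heuristic for XGBoost's `scale_pos_weight` parameter, and shows how precision and recall shift with the decision threshold. The helper names and the toy labels and scores are hypothetical.

```python
def scale_pos_weight(n_negative, n_positive):
    """Negative-to-positive ratio, a common starting value for XGBoost's scale_pos_weight."""
    return n_negative / n_positive


def precision_recall(y_true, y_score, threshold):
    """Precision and recall for the fraud class (label 1) at a given score threshold."""
    tp = fp = fn = 0
    for label, score in zip(y_true, y_score):
        flagged = score >= threshold
        if flagged and label == 1:
            tp += 1
        elif flagged and label == 0:
            fp += 1
        elif not flagged and label == 1:
            fn += 1
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Figures from the description above: 50,000 records, ~10% fraud.
n_total = 50_000
n_fraud = n_total // 10
weight = scale_pos_weight(n_total - n_fraud, n_fraud)

# Toy labels and scores (illustrative only, not project data).
y_true = [0, 0, 1, 1, 0, 1]
y_score = [0.10, 0.40, 0.35, 0.80, 0.70, 0.90]
strict = precision_recall(y_true, y_score, threshold=0.5)
loose = precision_recall(y_true, y_score, threshold=0.3)
```

Lowering the threshold catches more fraud (higher recall) at the cost of more false alarms (lower precision), which is the trade-off the pipeline has to manage.
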
⚖️ Legal Eagle (RAG Query Engine)

An intelligent document retrieval system for complex legal texts.

  • The Tech: Built using LangChain and HuggingFace embeddings.
  • The Solution: Implemented a Retrieval-Augmented Generation (RAG) architecture that allows users to seamlessly query, retrieve, and analyze dense legal documentation with high accuracy.
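
The retrieve-then-prompt flow behind RAG can be sketched in a few lines. This is an illustration under stated assumptions, not the project's code: a bag-of-words vector stands in for the HuggingFace embeddings, a plain list stands in for the vector store, and the function names, sample clauses, and prompt wording are all hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Stand-in embedding: lowercase word counts (a real system would use a learned model)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=2):
    """Return the k chunks most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(query_vec, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query, context_chunks):
    """Assemble the prompt that would be sent to the language model."""
    context = "\n".join(f"- {chunk}" for chunk in context_chunks)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


# Hypothetical contract clauses, for illustration only.
chunks = [
    "The tenant shall pay rent on the first day of each month.",
    "Either party may terminate this agreement with thirty days written notice.",
    "The landlord is responsible for structural repairs.",
]
question = "What notice is needed to terminate this agreement?"
top = retrieve(question, chunks, k=1)
prompt = build_prompt(question, top)
```

Only the retrieved clause reaches the model, which is what keeps answers grounded in the source document rather than in the model's general knowledge.
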
📈 FinSight Pro

An automated financial analysis platform and interactive dashboard.

  • The Tech: Streamlit, NewsAPI, and Python data libraries.
  • The Solution: Blends traditional technical stock analysis with live news sentiment analysis, giving a holistic view of market movements in one sleek web app.
⚾ "Moneyball" Scouting & Churn Analytics

Predictive pipelines solving specific business and sports analytics problems.

  • The Solution: Used machine learning classification and regression techniques to predict customer churn, and applied data visualizations (Seaborn, Matplotlib, Power BI) to sports scouting analysis.

🛠️ My Digital Toolbox

I love experimenting with new tech, but here is my daily driver stack:

| Category | Technologies |
|---|---|
| Languages | Python, SQL, Java, PHP |
| Data & ML | pandas, NumPy, scikit-learn |
| Deep Learning & Gen AI | TensorFlow, HuggingFace, LangChain |
| Big Data & Web | PySpark, Databricks, Streamlit |
| Dev Tools | Git, VS Code |

💡 Current Obsession: Building and experimenting with cutting-edge open-weight models, specifically meta-llama/llama-4-scout-17b-16e-instruct, for complex reasoning and development tasks!


📊 GitHub Analytics

[Images: Pankaj's GitHub stats, GitHub streak, and top languages cards]

📫 Let's Connect!

Whether you want to discuss machine learning, collaborate on an open-source project, or just chat about cool data visualizations, my inbox is open!

Constantly building, optimizing, and learning.

Pinned

  1. legal-eagle (Public)

    An AI-powered RAG application that lets users upload PDF contracts and ask legal questions in natural language, powered by Llama-3, LangChain, and ChromaDB.

    Python 1 1

  2. tvnet (Public)

    Cross‑platform Flutter app for internet provider customers — manage accounts, view plans, make payments, and access support with Razorpay integration. Runs on Android, iOS, Web, Windows, macOS, and…

    Dart 1

  3. WeatherForecast (Public)

    End-to-end machine learning project for predicting next-day rainfall in Australia, featuring data cleaning, model training, and a Streamlit web app for real-time predictions.

    Jupyter Notebook 1

  4. customer-churn-prediction (Public)

    End-to-end customer churn prediction project using the Telco dataset. Includes EDA, data preprocessing, Logistic Regression / Random Forest / XGBoost model comparison, SHAP explainability, and a pr…

    Jupyter Notebook 1

  5. dieselgate-causal-impact (Public)

    Quantifying the financial impact of the 2015 Volkswagen emissions scandal using Causal Inference and Synthetic Control methods. Models the 'Counterfactual VW' stock price using competitor data to m…

    Jupyter Notebook 1

  6. FinSight (Public)

    An autonomous multi-agent investment analyst powered by Llama-3, LangGraph, and XGBoost. Features a real-time dashboard for technical analysis, news sentiment (VADER), fundamental RAG research, and…

    Python 1