Skip to content
View harishaaram's full-sized avatar

Block or report harishaaram

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
harishaaram/README.md

👋 Hi, I'm Harish Gandhi

🚀 Data Engineer | ETL Pipelines | ML Integrations | Automation

LinkedIn  Website  Medium  Twitter  Email


👨‍💻 About Me

I specialize in designing and scaling ETL pipelines, data automation, and machine learning integrations — bridging the gap between business needs and technical execution.

I strongly believe in the quote: "You become what you believe." With a can-do attitude and a passion for continuous learning, I thrive on solving complex data challenges and building impactful solutions. Beyond my technical work, I'm passionate about building a global, data-driven community, mentoring peers, and sharing knowledge through talks, blogs, and open-source contributions.

🔍 Areas of Interest

  • 🛠️ ETL Development & Data Pipeline Optimization
  • 🤖 Machine Learning Deployment & Integrations
  • 📊 Data Modeling & Quality Assurance
  • ☁️ Cloud Data Infrastructure (AWS & Azure)
  • 🔄 Streaming & Real-time Data Processing

🛠️ Technical Stack

🗣️ Languages & Scripting

Python SQL Scala Shell Script

⚙️ Data Engineering & Orchestration

Apache Airflow dbt Azure Data Factory Databricks Apache Spark Kafka

☁️ Cloud & Big Data

Snowflake AWS Azure Docker Kubernetes Jenkins Git

🗄️ Databases & Visualization

SQL Server PostgreSQL MongoDB Tableau Redash JupyterHub


🏆 Recent Wins

  • 🥇 TextNow AI Hackathon 2025 — Winner
    • Built an AI-powered support assistant that classifies user questions against Slack history, suggests relevant past answers, and escalates unmatched queries via chatbot to Jira with contextual ticket creation.

💡 Featured Projects

AI-powered support assistant that classifies questions against Slack history, suggests relevant answers, and escalates to Jira with contextual ticket creation.

  • Stack: AI, NLP, Slack API, Jira, Chatbot

Analyzed the impact of mandatory calorie labeling on restaurant menu choices using causal inference and statistical modeling.

  • Stack: Python, Causal Inference, Statistics

Web analytics dashboard tracking recruiter engagement patterns on LinkedIn profiles using web scraping and data visualization pipelines.

  • Stack: Web Scraping, Analytics, Visualization

Applied supervised ML models (Random Forest, SVM, XGBoost) on Chicago crime dataset, achieving 85%+ classification accuracy.

  • Stack: Scikit-learn, XGBoost, Machine Learning

Amazon Alexa skill that recites couplets from the ancient Tamil classic Tirukkural, with contextual explanations and keyword search.

  • Stack: Alexa SDK, Node.js, NLP

NLP pipeline that crawls major news outlets, applies sentiment analysis and topic modeling, then surfaces a daily "optimism score."

  • Stack: NLP, Sentiment Analysis, NLTK

Computer vision system for monitoring patient activity and posture in clinical settings using image classification.

  • Stack: Computer Vision, TensorFlow, Healthcare

🎓 Education & Achievements

  • 🎓 Master of Science in Computer Science | Illinois Institute of Technology, Chicago (2016–2018)
    • GPA: 3.72 / 4.0 | Concentration in Data Science, Software Engineering & Database Systems
  • 🏆 TextNow AI Hackathon 2025 Winner
  • 🎤 PyData Chicago Speaker (June 2018) — Text mining & causal inference research
  • 🌐 NumFOCUS Guest Speaker — Open-source scientific computing community outreach

📜 Certifications

  • 🧠 ML with TensorFlow on GCP Specialization — Google Cloud / Coursera
  • 🔬 Art and Science of Machine Learning — Google Cloud / Coursera
  • ⛏️ Mining Massive Datasets — Stanford University
  • 📊 Statistical Learning — Stanford University
  • 🧬 Neural Networks and Deep Learning — deeplearning.ai / Coursera

"If you hang around the barbershop long enough… sooner or later you're gonna get a haircut." — Denzel Washington


Let's Connect on LinkedInView PortfolioRead My Blog

Pinned Loading

  1. NewsOptimism NewsOptimism Public

    Comparative Analysis of various news media and quantify its postiveness based on the vocabulary usage.

    Jupyter Notebook 2

  2. Global-Terrorism-Prediction Global-Terrorism-Prediction Public

    Predict the groups that carried out the attack based on GTD dataset

    Jupyter Notebook

  3. Alexa-Skill Alexa-Skill Public

    Implementation of Tirukkural using Alexa skill

    JavaScript 1 1

  4. Topic-Modeling Topic-Modeling Public

    An Unbiased Examination of Federal Reserve Meeting minutes

    Jupyter Notebook 5 3

  5. Talks Talks Public

    About my talks - slides, codes, demos

    Jupyter Notebook