📊 ML Portfolio Projects

Real-world machine learning projects — from exploratory data analysis to production-ready predictive models.

Each project tackles a genuine business or scientific problem using structured ML workflows: data cleaning, EDA, feature engineering, model selection, hyperparameter tuning, and performance evaluation.

🗂️ Projects

🏎️ Mercedes-Benz USA (2005–2025): Luxury Price Prediction

Goal: Predict used car prices across two decades of Mercedes-Benz listings using model, mileage, AMG/4MATIC flags, and trim level.
Highlights: 9 models compared · Overfitting analysis · Log transformation · Ridge vs XGBoost
Result: Ridge Regression wins with CV R² = 0.708 and near-zero overfit gap (0.03) — XGBoost overfitted severely despite high train score
Regression Ridge XGBoost Overfitting Detection Feature Engineering

🎓 Career Placement: 8-Model Predictive Analysis

Goal: Predict student employment outcomes using academic performance, coding skills, and internship experience.
Highlights: 8 algorithms benchmarked · 5-fold cross-validation · Overfitting detection · Feature importance
Result: XGBoost achieved 96.80% test accuracy (CV Mean: 0.9648) — Decision Tree dropped 14.2% from train to test
Classification XGBoost Model Benchmarking Cross-Validation

💧 Global Water Scarcity: 99.6% Accuracy with XGBoost

Goal: Classify regional water scarcity levels (Low / Moderate / High) across 200+ countries from 2000–2025.
Highlights: 8 models compared · Zero overfitting validated · Groundwater depletion as #1 predictor (56% importance)
Result: XGBoost achieved 99.62% test accuracy with only 0.003 train-test gap
Classification XGBoost LightGBM Environmental ML Multi-Class

🏙️ Boston Housing: EDA & Gradient Boosting Mastery

Goal: Predict median home values using socioeconomic, environmental, and structural features.
Highlights: 7 models compared · Log transformation · RobustScaler · Multicollinearity mitigation
Result: Tuned Gradient Boosting achieved R² = 0.88, RMSE = $2,940 — Lower income % and avg rooms as top drivers
Regression Gradient Boosting EDA Feature Engineering

🏠 House Price & Insurance Cost Prediction

Goal: Dual-dataset regression — King County residential pricing + medical insurance cost estimation.
Highlights: 4 models compared · GridSearchCV tuning · Polynomial features · SVR vs Decision Tree
Result: Tuned Decision Tree achieved R² = 0.7908 on 21,613 housing records
Regression Decision Tree Polynomial Regression SVR GridSearchCV

🩺 Medical Cost Analysis: Insurance Price Prediction

Goal: Predict individual annual medical insurance costs from health and demographic data.
Highlights: End-to-end pipeline · Custom prediction function · Feature importance via coefficients
Result: Linear Regression achieved R² = 0.78, RMSE = $5,796 — smoking status identified as dominant cost driver
Regression Linear Regression EDA Feature Importance

🎬 Netflix Global Content & Pricing Analysis

Goal: Explore Netflix's content library size and subscription pricing across countries.
Highlights: Multi-library visualisation (Plotly · Seaborn · Matplotlib) · Pandas Profiling · Regional pricing patterns
Result: Identified significant regional disparities in both content availability and pricing strategy
EDA Data Visualisation Plotly Pandas Profiling

🛠️ Tech Stack

Core ML      │ Scikit-learn · XGBoost · LightGBM · Gradient Boosting
Analysis     │ Pandas · NumPy · SciPy
Visualisation│ Matplotlib · Seaborn · Plotly
Environment  │ Jupyter Notebook · Kaggle Kernels

📈 Methodology

Every project follows a consistent, professional workflow:

Problem Definition — What business question are we answering?
Exploratory Data Analysis — Distributions, correlations, outliers
Feature Engineering — Encoding, scaling, new feature creation
Model Selection — Multiple algorithms compared objectively
Hyperparameter Tuning — Grid search / cross-validation
Evaluation — Metrics relevant to the problem type

🔗 All Notebooks on Kaggle

Full interactive notebooks with outputs, visualisations, and commentary are available on Kaggle:
👉 kaggle.com/brahimenesulusoy

💼 Need a Predictive Model Built?

I build custom ML solutions for businesses — sales forecasting, churn prediction, price estimation, and more.

🔗 LinkedIn: ibrahim-enes-ulusoy
🌐 Portfolio: enesulusoy-portfolio.netlify.app
📧 Email: c.enes.eng@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
boston-housing		boston-housing
house-price-insurance		house-price-insurance
medical-cost-prediction		medical-cost-prediction
mercedes-benz-price-prediction		mercedes-benz-price-prediction
netflix-data-analysis		netflix-data-analysis
placement-analysis		placement-analysis
water-scarcity-prediction		water-scarcity-prediction
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📊 ML Portfolio Projects

🗂️ Projects

🏎️ Mercedes-Benz USA (2005–2025): Luxury Price Prediction

🎓 Career Placement: 8-Model Predictive Analysis

💧 Global Water Scarcity: 99.6% Accuracy with XGBoost

🏙️ Boston Housing: EDA & Gradient Boosting Mastery

🏠 House Price & Insurance Cost Prediction

🩺 Medical Cost Analysis: Insurance Price Prediction

🎬 Netflix Global Content & Pricing Analysis

🛠️ Tech Stack

📈 Methodology

🔗 All Notebooks on Kaggle

💼 Need a Predictive Model Built?

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📊 ML Portfolio Projects

🗂️ Projects

🏎️ Mercedes-Benz USA (2005–2025): Luxury Price Prediction

🎓 Career Placement: 8-Model Predictive Analysis

💧 Global Water Scarcity: 99.6% Accuracy with XGBoost

🏙️ Boston Housing: EDA & Gradient Boosting Mastery

🏠 House Price & Insurance Cost Prediction

🩺 Medical Cost Analysis: Insurance Price Prediction

🎬 Netflix Global Content & Pricing Analysis

🛠️ Tech Stack

📈 Methodology

🔗 All Notebooks on Kaggle

💼 Need a Predictive Model Built?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages