An end-to-end churn analytics project that identifies churn drivers, predicts churn probability, and segments high-risk customers using the Telco Customer Churn dataset (adapted to OTT/SaaS PRD requirements).
- π Churn KPI Analysis: Calculates overall churn rate and group-level churn trends.
- π§Ή Data Cleaning Pipeline: Handles missing values and type corrections.
- π§ Feature Engineering: Builds engagement score, tenure groups, and churn-ready features.
- π Rich Visualizations: Tenure, usage, pricing, support behavior, and correlation heatmap.
- π€ ML Modeling: Logistic Regression and Random Forest model training + evaluation.
- π― Risk Segmentation: Generates churn probabilities and classifies users into Low/Medium/High risk.
- πΌ Business Insights: Produces actionable retention recommendations.
- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Scikit-learn
- Jupyter Notebook
Subscription_Churn_Analysis/
βββ Telco_Customer_Churn.csv
βββ README.md
βββ analysis_summary.md
βββ subscription_churn.csv
βββ subscription_churn.ipynb
βββ subscription_churn.py
βββ outputs/
β βββ churn_rate.png
β βββ tenure_churn.png
β βββ usage_churn.png
β βββ charges_churn.png
β βββ support_churn.png
β βββ correlation_heatmap.png
β βββ model_accuracy.png
β βββ feature_importance.png
βββ requirements.txt
- Clone the repository
git clone https://github.com/https://github.com/bikram73/Subscription_Churn_Analysis.git
cd Subscription_Churn_Analysis- Create and activate virtual environment (recommended)
python -m venv .venvWindows PowerShell:
.venv\Scripts\Activate.ps1- Install dependencies
pip install -r requirements.txtpython subscription_churn.pyThis generates:
subscription_churn.csv(normalized + scored dataset)analysis_summary.md(insights + model performance + recommendations)outputs/*.png(all required charts)
Open subscription_churn.ipynb and run all cells in order.
- Notebook visualizations are shown inline.
- Notebook cells are designed to analyze without saving charts to
outputs/.
- Calculate churn rate
- Identify major churn factors
- Segment high-risk users
- Build and evaluate prediction models
- Recommend retention strategies
Source dataset (Kaggle): https://www.kaggle.com/datasets/blastchar/telco-customer-churn
The Telco dataset does not directly contain OTT telemetry fields like exact app usage frequency, last login days, and customer support call counts.
This project is licensed under the MIT License.
- See LICENSE