Skip to content
View janmejoykar1807's full-sized avatar

Block or report janmejoykar1807

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
janmejoykar1807/README.md
Janmejoy Kar

 

     

 

About

Data Scientist specializing in Microsoft Fabric, PySpark, Power BI, and D365 F&O. MS Business Analytics, University of North Texas. I build forecasting models, anomaly detection systems, and analytics pipelines across Microsoft Fabric, AWS, and Databricks. Previously organized and spoke at UNT's Business Analytics Day on "Empowering Data-Driven Decisions with Microsoft Fabric."

 


forecast accuracy↑
PySpark ML models


revenue impact
A/B testing & propensity


faster refresh
Fabric pipeline tuning


fewer gaps
anomaly detection

 

Data & ML

Python · R · PySpark · SQL
Scikit-Learn · TensorFlow · OpenCV
A/B Testing · Bayesian Inference
NLP · Forecasting · Anomaly Detection

Cloud & Engineering

Microsoft Fabric · Azure Data Factory
Databricks · AWS (S3, EC2)
Hadoop · Hive · D365 ERP
ETL Pipelines · Data Lakehouse

Analytics & Databases

Power BI · Tableau · DAX · QlikView
SQL Server · MySQL · MongoDB
Git · Jupyter · Google Colab
Star Schema · Medallion Architecture

 

Data Scientist — Group O (2024 – Present, Roanoke, TX)
Building data pipelines and analytics solutions on Microsoft Fabric. Technologies: Microsoft Fabric, PySpark, Power BI, D365 F&O, Azure.

Graduate Teaching Assistant — University of North Texas (Aug–Dec 2023)
DSCI 5240: Data Mining & Machine Learning. Grading, lectures, and student mentorship.

Graduate Student Assistant — UNT Libraries (Jun 2022–Aug 2023)
Research assistance, data collection systems, and technology instruction.

ML Trainee — Indian Servers (Jun–Sep 2020)
Computer Vision with OpenCV + Caffe. MNIST, Fashion-MNIST, CIFAR-100, Titanic, Mobile Price Prediction.

Business Strategist — FilingRabbit (Nov 2020–Jan 2021)
Competitive analysis for copyright/patent pricing. Website development and digital marketing.

 

🩺 Hepatitis C Prediction
Hypothesis testing, z-tests & logistic regression on clinical biomarker data.
Python · R · Scikit-Learn · Statistics

⛏️ Python Data Mining
Spam detection, airfare regression, salary prediction & k-means clustering.
Python · Scikit-Learn · Pandas · NumPy

📊 R Data Mining
Time series, text mining, SVM classification & insurance regression.
R · R Markdown · ggplot2 · caret

🏭 Toxic Release Inventory
EPA environmental data analysis across industries and states.
Python · Pandas · Data Visualization

🗄️ SQL Masterclass
Complex joins, CTEs, window functions & query optimization.
SQL · SQL Server · Performance Tuning

🌐 Portfolio Website
Live personal portfolio with projects, skills & certifications.
React · Vercel · JavaScript · CSS

🤖 AI-powered ATS Resume Optimizer
ML pipeline with fine-tuned FLAN-T5 for ATS resume optimization, including a React web app and n8n workflow automation.
Python · FLAN-T5 · FastAPI · React · n8n

📊 Control Tower Dashboard
Executive KPI dashboard covering Sales, Purchases, and Inventory Management with Days of Supply, warehouse utilization, and replenishment pipelines.
Power BI · DAX · Microsoft Fabric · React · Recharts

 

Year Competition Focus
2023 HPE Case Study Competition — UNT Business strategy & analytics
2022 Humana May's Case Analytics Competition — UNT Predicting housing insecurity for Medicare/Medicaid customers
2020 IBM ICE Day Hackathon — 🏆 Winner Data science
2019 IBM ICE Day Hackathon — 🥈 1st Runner-up Data science

 

Certification Issuer Year Status
Power BI Data Analyst Associate (PL-300) Microsoft 2025 Active
Azure Data Scientist Associate (DP-100) Microsoft 2024 Active
Fabric Analytics Engineer Associate (DP-600) Microsoft 2024 Active
Azure Fundamentals (AZ-900) Microsoft 2023 Active
Microsoft Azure Microsoft 2022 Active
Python Level 1 Cambridge Certification Authority 2022 Active
AWS Cloud Practitioner Amazon Web Services 2020 Active
Architecting on AWS (Associate) Amazon Web Services 2020 Active
Business Analytics Graduate IBM 2021 Active

 

Title Journal Authors Year Link
Achieving Effective Batch-to-Batch Error Correction through Suppression Correction and Dual MSTUS Normalization Analytical Chemistry (ACS) Debasish Ghosh, Janmejoy Kar, Felice A. de Jong, Chris Beecher, Vladimir Shulaev 2025 📄 View
Detection of Traffic Signs by Convolutional Neural Network Using Sequential API International Journal of Creative Research Thoughts (IJCRT) Janmejoy Kar, Manish Kumar, Dipali Dhake, Gayatri Palde, Umakant Mandawkar 2021 📄 View

 

Role Details Years
🔬 Peer Reviewer — CODATA Reviewing manuscripts in data science, statistical modeling & computational methods 2026–Present
🎤 Speaker — 3rd UNT ITDS Business Analytics Day Presented "Empowering Data-Driven Decisions with Microsoft Fabric" 2024
🎯 Student Mentor — GradRight Inc. Helped students choose universities and plan their careers 2022–2024
👔 President — Business Analytics Club, UNT Organized hackathons, industry mixers, technical seminars & sporting events 2022–2023

 




Activity Graph


 


"The more I learn, the more I realize how much I don't know." — Albert Einstein

 

 

Pinned Loading

  1. janmejoykar1807 janmejoykar1807 Public

    ✨ GitHub profile README

    Python

  2. HepatitisC_Project HepatitisC_Project Public

    Python 1

  3. Internship_Data_Science Internship_Data_Science Public

    Python 1

  4. KagglePracticeProject KagglePracticeProject Public

    Kaggle Project

    Python 1

  5. Python_Data_Mining_Projects Python_Data_Mining_Projects Public

    Python 1

  6. R_Data_Mining_Projects R_Data_Mining_Projects Public

    1