Skip to content

hkv-31/Particle-Track-Reconstruction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Particle Track Reconstruction with Machine Learning

A personal project exploring the application of machine learning for reconstructing particle trajectories from the TrackML dataset.
This repository implements clustering algorithms to group 3D hit coordinates in particle detectors — bridging particle physics and data science.

🔍 Project Overview

In experimental particle physics, charged particles leave traces ("hits") as they move through layered detector systems.
The challenge is to group these hits into individual particle tracks, a process known as track reconstruction.

This project tackles that challenge using unsupervised learning algorithms to cluster 3D hit data into meaningful trajectories.

✨ Key Features

  • Exploratory Data Analysis (EDA) — Analyze hit distributions and detector geometry.
  • Machine Learning Models — Implement K-Means and DBSCAN clustering using Scikit-learn.
  • 3D Visualization — Render hit data and clustering outputs with Matplotlib.
  • Modular Codebase — Clean, extensible structure for quick experimentation and future upgrades.

📁 Dataset

This project uses the TrackML Particle Identification dataset from Kaggle,
simulating detector hits from a collider experiment.

Each event contains:

  • 3D hit coordinates
  • Detector layer information
  • Ground-truth particle IDs (for benchmarking)

🚀 Usage

Open the Jupyter Notebook to explore the full workflow:

trackml_clustering_analysis.ipynb

The notebook covers:

  1. Data loading and preprocessing
  2. Exploratory data analysis (EDA)
  3. Clustering with K-Means and DBSCAN
  4. 3D visualization of particle tracks

K-Means offers simplicity and speed but fails on complex geometries. DBSCAN, while slower, provides better alignment with the physics of track formation. Together, these insights lay the foundation for graph-based and ML-driven approaches in future iterations.

🔮 Future Work

Planned improvements include:

  1. Feature Engineering — Incorporate radial, angular, and detector-layer features.
  2. Graph-Based Methods — Apply Graph Neural Networks (GNNs) for track association.
  3. Physics-Inspired Algorithms — Explore Kalman filters and Hough transforms.
  4. Benchmarking — Evaluate performance using ground-truth particle IDs.

📜 License

This project is licensed under the MIT License — see the LICENSE.md file for details.

Acknowledgments

The TrackML collaboration for providing the dataset and research problem. CERN and global particle physics communities for inspiring this intersection of science and machine learning.

About

Particle track reconstruction with the TrackML dataset

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors