data-eng
Here are 14 public repositories matching this topic...
86% faster data lineage tracking for pandas DataFrames with zero infrastructure. Real-time monitoring, ML anomaly detection, and enterprise compliance features.
-
Updated
Sep 17, 2025 - Python
Production-grade market data pipeline: Alpaca (Daily & 1Min) → normalized schema → partitioned Parquet → DuckDB analytics + strict QA observability.
-
Updated
Feb 26, 2026 - Jupyter Notebook
This project is an End-to-End Video Captioning System designed to bridge the gap between Computer Vision and Natural Language Processing. It automatically generates descriptive text for video content, essentially teaching a computer to "watch" a video and describe what is happening in English
-
Updated
Feb 9, 2026 - Python
Apache Airflow Cheatsheet
-
Updated
Apr 1, 2023
Beowulf meets agentic AI. An MCP server that gives LLMs structured access to the complete Old English text, dictionary, and linguistic annotations.
-
Updated
Mar 5, 2026 - HTML
Create a fully functional and scalable RESTful API from scratch using modern tools available. Set up routes and pipelines, database storage, token generation using different encryption and hashing algorithms, handle asynchronous code, handle common errors and known exploits.
-
Updated
May 14, 2020 - TypeScript
End-to-end Implementation of the medium article leveraging services of AWS and Algolia.
-
Updated
Apr 30, 2023
An Exhaustive Analysis of WhatsApp Chat Data for Extracting Real-Time Insights, Identifying Usage Patterns, Detecting Spam, and Understanding User Sentiment at Scale
-
Updated
Jun 21, 2025 - Jupyter Notebook
Supermarket sale and stock data management project for ETL and Real-Time Analytics.
-
Updated
Nov 24, 2025 - Python
Portfolio of my projects in Data Science
-
Updated
Apr 14, 2022
-
Updated
Oct 3, 2024
Improve this page
Add a description, image, and links to the data-eng topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-eng topic, visit your repo's landing page and select "manage topics."