Data & Search Platform Products | AI Engineering | Enterprise Analytics
Boston, MA
I run data and search platform products at a publicly traded company (NASDAQ) in Boston. I have 10+ years of experience across enterprise analytics, warehouse design, and BI migrations, and more recently I've been building AI workflows on top of governed data.
My day-job stack is Snowflake, Oracle, dbt, Power BI, Python, and SQL. The repos here go in a different direction: practical AI tooling for companies that don't have a 50-person data team but still need to ship.
Most AI repos on GitHub are demos. Clean datasets, happy-path queries, no auth, no cost tracking, no governance. I've spent a decade inside messy enterprise systems, so I build for the mess.
88% of AI proofs-of-concept never make it to production. The failure point is almost never the model. It's data quality. Governance. Cost surprises. Operational gaps that enterprise data teams solved years ago but the AI community keeps rediscovering. I kept watching smart people hit the same walls, so I started building the missing pieces in the open.
Full stack (11 categories)
Data Engineering -- SQL (advanced), Snowflake, Databricks, Oracle, PostgreSQL, MySQL, DuckDB, dbt, Azure Data Factory, Microsoft Fabric, dimensional/semantic data modeling, ETL/ELT pipeline design, metadata and lineage, master data management
AI/ML Engineering -- Python, pandas, NumPy, scikit-learn, LLMs, RAG pipelines, vector search (HNSW, IVF-PQ), cross-encoder rerankers, text-to-SQL, AI agents, drift detection (PSI, KS), eval metrics (recall@k, MRR, NDCG)
Statistical & Experimental -- A/B testing, experiment design, statistical testing, cohort analysis, retention modeling, forecasting, funnel analysis
Product & Behavioral Analytics -- product analytics, user engagement, feature adoption, retention analytics, RFM analysis
Search -- Elasticsearch, vector databases/ANN, hybrid search (BM25 + dense)
APIs & Integration -- FastAPI, REST/OpenAPI design, OAuth2/OIDC/SAML, webhooks, SDK integration, JSON Schema/XSD
BI & Visualization -- Power BI, DAX, Power Query, Tableau, OBIEE, Streamlit, Matplotlib, Plotly
Cloud & Infrastructure -- Azure, AWS, Docker, Redis, OpenTelemetry, Splunk, Datadog
Identity & Security -- SSO, RBAC, least-privilege design, IdP integration (Okta, Ping, Auth0), entitlement reviews, audit trails
Product & Delivery -- product roadmaps, requirements definition, prioritization, dependency management, release planning, UAT, phase-gate delivery, product discovery, backlog management, sprint planning, Agile/Scrum, SDLC governance
Data Governance & Compliance -- Data Management Office design, policy development, data stewardship, reference data governance, data lineage documentation, KPI governance, SOX/GLBA/DPPA/PII compliance, internal audit partnership, control documentation, risk controls, dual-approval workflows, fraud controls
MS Operations & Project Management + MBA. BS Information Technology. 26 certifications. Currently leading a team of senior and associate PMs with a multimillion-dollar budget.


