Skip to content
View Gerardo1909's full-sized avatar
🧭
Looking for opportunities
🧭
Looking for opportunities

Highlights

  • Pro

Block or report Gerardo1909

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Gerardo1909/README.md

Hi, I'm Gerardo Toboso 👋

📍 Buenos aires ↔ Global | 🏗️ Data & Backend Systems Architect | 📊 Causal Inference Specialist

Python SQL FastAPI PyMC Git Docker Bash

Bridging the gap between robust data infrastructure and statistical rigor. I build systems that don't just move data, but ensure it is causal, reliable, and decision-ready.

𝟏𝟗 nineteen (Featured) – local personal assistant with a claude-like interface to run on pc-only machines.

Current Projects

  • 💸 priceinsight – bayesian causal inference system for price optimization and profit maximization in eCommerce.
  • 🧠 Causalito - RAG agent oriented in answering bayesian causal inference questions.
  • 🗣️​ HN Analytical Platform - Batch data lake used to ingest data from HackerNews public API, process it and gain some insights using NLP.
  • 🏦 Modern Banking Analytics Platform (dbt) - data transformation dbt project for handling banking data about transactions, users and investing.
  • ✴️ IoT Data ETL Pipeline - batch pipeline to process IoT sensor data using pyspark and airflow orchestration.
  • 🏩 Booking BA scraper - playwright web scraper used to extract hotel pricing data in Buenos Aires.
  • 🏪​ Ecommerce reporting etl - etl pipeline to generate quick business insights about ecommerce data.
  • 🔎​ Web server logs analysis - diagnosing the performance and availability of a web server using SQL analysis on 1,000 access logs.

GitHub Activity

GitHub Contribution Graph

What I'm Doing

  • Automating Engineering workflows - Developing tools that eliminate manual code reviews for statistical models.
  • Deep-diving into Causal Inference - Applying Bayesian frameworks to solve real-world business attribution problems.
  • Writing technical insights - Simplifying complex systems and data engineering concepts on Medium.
  • Teaching at Scale - Leading the Causal Inference track, translating math into scalable software.

Latest Technical Writing

Connect

LinkedIn Medium Email


Recognition & Stats

  • Technical Lead and Backend Developer at Bayes Plurinacional for Bayesian Causal Inference.
  • Developed internal tools used by 50+ engineers/students daily.
  • Built and shipped 10+ production-grade data pipelines.

Philosophy

"Data without causality is just noise." I focus on building tools that provide clarity, automate the mundane, and scale the expert's intuition.

Random Facts
  • I prefer a clean CLI over a complex GUI any day.
  • Can explain MCMC while making a decent espresso.
  • Fan of "Software Engineering at Google" practices.
  • Currently reading: "The Book of Why" by Judea Pearl.

Pinned Loading

  1. iot-etl-pipeline iot-etl-pipeline Public

    Pipeline batch ETL con PySpark y Apache Airflow para procesar millones de lecturas de sensores IoT, transformándolas en un Star Schema optimizado para analytics de manufactura inteligente.

    Python

  2. fintech-flow-dbt fintech-flow-dbt Public

    Pipeline de Analytics Engineering para un ecosistema Fintech. Basado en dbt y DuckDB para transformar datos bancarios crudos en marts analíticos confiables, testeados y documentados.

  3. booking-ba-scraper booking-ba-scraper Public

    Pipeline de extracción en formato CSV de hoteles en Buenos Aires desde Booking.com

    Python

  4. causalito causalito Public

    Causalito - Agente RAG orientado al campo de la inferencia bayesiana causal. Implementado usando Langchain, chromadb y modelos LLM avanzados.

    Python 1

  5. nineteen nineteen Public

    Un asistente personal de IA, ligero y pensado para ejecutarse únicamente en CPU en tu máquina local. Soporte de herramientas, modelo por defecto 1GB<. 𝟏𝟗

    Python

  6. priceinsight priceinsight Public

    Optimización de precios con inferencia causal bayesiana. Simula escenarios y maximiza beneficios con probabilidades reales.

    Python