LeadGenius

The Open-Source, Self-Hosted B2B Lead Scoring & Routing Engine

Live RevOps Dashboard

The analytics artifacts are automatically built via CI/CD and hosted on GitHub Pages: View Live Evidence BI Dashboard

The Problem

If you operate a B2B SaaS, your landing pages frequently receive hundreds of sign-ups a day. A small percentage are enterprise opportunities, but the majority are students, personal emails, or B2C traffic.

Enterprise Revenue Operations (RevOps) teams typically rely on tools like Clearbit Reveal, MadKudu, or ZoomInfo to enrich emails, score them, and route actionable leads to Sales.

The primary issue is that these tools often cost over $20,000 annually and operate as "black boxes" where scoring criteria are opaque.

LeadGenius offers a flexible, transparent, open-source alternative without the monthly SaaS overhead.

Architecture

LeadGenius solves B2B data routing across four modular layers: Ingestion, Waterfall Enrichment, Rules-as-Code Scoring, and Reverse ETL.

graph TD
    %% Ingestion %%
    subgraph Layer 1: Ingestion
    A[Frontend/Zapier] -->|POST JSON Payload| B(FastAPI Webhook)
    B -->|Save Raw Lead| C[(PostgreSQL)]
    end

    %% Enrichment %%
    subgraph Layer 2: Waterfall Enrichment
    B -.->|Background Task| D{Enrichment Manager}
    D -->|1. Try Fast APIs| E[Apollo / Hunter]
    E -.->|If No Data| F[LLM Scraper Fallback]
    F -->|Analyze Homepage B2B/B2C| G[OpenAI/Anthropic]
    G -.->|Fallback| H[TechStack / SEO Metrics]
    D -->|Update Appended Data| C
    end

    %% Scoring %%
    subgraph Layer 3: dbt Scoring
    I[rules.yml Configuration] -->|Parsed by Python| J(dbt: int_lead_scoring.sql)
    C --> K{dbt run}
    J --> K
    K -->|Create View| L[(mart_scored_leads)]
    end

    %% Activation %%
    subgraph Layer 4: Activation
    M((APScheduler 60s)) --> N[Activation Manager]
    N -->|Query Top Tiers| L
    N -->|Route 1| O[Slack Webhook ALERT]
    N -->|Route 2| P[HubSpot CRM PATCH]
    end

    %% Observability %%
    subgraph Layer 5: Observability
    L -->|Automated CSV Export| Q[Evidence BI]
    Q -->|GitHub Actions| R[Live Dashboards]
    end

    %% Styling %%
    classDef primary fill:#e3f2fd,stroke:#1565c0,stroke-width:2px;
    classDef secondary fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px;
    classDef db fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px;
    class A,B,D,K,N,I primary;
    class E,F,G,H,O,P secondary;
    class C,L,Q db;

Mock Mode Operations

To facilitate evaluation without requiring active API keys for Apollo, Hunter, or LLMs, this repository features a robust Mock Mode Generator.

Initialize the cluster with TEST_MODE="true" in your .env file.
The FastAPI Server spins up a background data seeder automatically.
It posts 20 highly-realistic mock leads (e.g., Stripe, Vercel, Anthropic) directly into the webhook.
The EnrichmentManager intercepts these domains and injects standard firmographics, preserving API quotas.
The downstream dbt pipeline parses your rules.yml file, generates the necessary SQL models, and applies scoring tiers (Hot, Warm, Cold), creating a fully functional dataset.

This procedure runs passively during GitHub Actions CI/CD to power the public-facing Evidence RevOps Dashboard.

CRM Integrations

LeadGenius is built for direct activation into existing CRM and alerting spaces:

HubSpot: Utilizes a native HubSpotDestination module. It searches the HubSpot Contacts API by email and executes a PATCH request to update custom properties.
Slack: Configurable webhooks to send immediate notifications to a designated channel.
Extendable Setup: The modular ActivationManager allows for simple integration via small Python classes (e.g., Salesforce, Pipedrive).

Transparent YAML Scoring

Define your criteria using a straightforward YAML structure (rules.yml). LeadGenius automatically transpiles this into SQL models.

scoring_rules:
  - description: "Sales or Exec gets +30"
    sql_condition: "lower(job_title) like '%ceo%' or lower(job_title) like '%vp%'"
    points: 30
  - description: "Confirmed B2B"
    sql_condition: "is_b2b_from_llm = true"
    points: 20

Quick Start

Prerequisites

Docker & Docker Compose
Python 3.11+

Local Environment Initialization

# 1. Clone the repository
git clone https://github.com/Astoriel/LeadGenius.git
cd LeadGenius

# 2. Duplicate and configure environment (Mock environment enabled dynamically by default)
cp .env.example .env

# 3. Spin up the cluster
docker-compose up -d --build

Monitor the application terminal logs (docker-compose logs -f api) to observe the execution of the Waterfall Enrichment pipeline, periodic dbt run calls, and corresponding lead routing behavior.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
backend		backend
dashboard		dashboard
dbt_project		dbt_project
scripts		scripts
tests		tests
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
logo.png		logo.png
requirements.txt		requirements.txt
rules.yml		rules.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LeadGenius

Live RevOps Dashboard

The Problem

Architecture

Mock Mode Operations

CRM Integrations

Transparent YAML Scoring

Quick Start

Prerequisites

Local Environment Initialization

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LeadGenius

Live RevOps Dashboard

The Problem

Architecture

Mock Mode Operations

CRM Integrations

Transparent YAML Scoring

Quick Start

Prerequisites

Local Environment Initialization

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages