personascout

Map your content against your ICPs. Find the gaps. Fill them.

personascout is a local-first CLI for B2B teams that want to understand how well their content serves their ideal customer profiles across the funnel. It ingests content from RSS feeds, websites, and CSV exports, classifies each piece against your personas, and highlights where your coverage is thin.

The long-term product direction is open core:

personascout, the CLI, stays open source under AGPL-3.0-only
a separate hosted product can exist on top, but improvements to the networked software covered by this repo stay in the commons

The Problem

Most B2B teams create content without a reliable way to check whether it actually covers their ICPs, buyer personas, and buyer funnel stages.

You can have dozens of blog posts, landing pages, or newsletter issues and still find that most of them implicitly target a single persona. That leaves hidden content gaps across the awareness, consideration, and decision stages.

PersonaScout answers a direct question: which ICPs is your content actually reaching, and which target personas are being ignored across the buyer funnel?

Setup Flow

The first-time setup flow is:

Initialize a local project with personascout init
Define the personas you care about
Define the content sources you want to analyze
Fetch content into the local project
Classify that content against your personas
Report on coverage gaps and generate new content ideas

The CLI commands for that first pass are:

personascout init
personascout status

# define personas
personascout persona templates
personascout persona use cfo
personascout persona generate
personascout persona add --interactive

# define sources
personascout source add
personascout source test

# ingest and analyze content
personascout fetch
personascout classify --dry-run
personascout classify
personascout report
personascout generate --all --format brief
personascout diff

In practice, most teams will define personas in one of three ways:

start from a built-in template with personascout persona use <template-id>
generate a persona from a short description with personascout persona generate
create one manually with personascout persona add --interactive

Most teams will define sources as a mix of:

company website pages
RSS feeds
CSV exports from blogs, newsletters, CMSs, or social/content tools

If your chosen provider needs an API key, set it in your shell before running AI-backed commands such as persona generate, classify, or generate.

Examples:

PowerShell:

$env:ANTHROPIC_API_KEY="your_key_here"

macOS/Linux (bash or zsh):

export ANTHROPIC_API_KEY=your_key_here

You can inspect the exact env var name for any provider with:

personascout providers --provider anthropic

Ongoing Workflow

Once the project is set up, the regular workflow is simpler. On a monthly or quarterly cadence, you usually:

Refresh the content you want to measure
Re-run classification
Review the latest coverage report
Compare against the previous run
Use the new gaps to plan or generate the next round of content

Typical recurring commands:

personascout status
personascout fetch
personascout classify
personascout report
personascout diff
personascout generate --all --format brief

Example Output

This is the kind of terminal coverage report PersonaScout produces after classification:

[Screenshot: terminal coverage report by persona and funnel stage]

Why This Exists

Most content audits break down at the moment a team asks a simple question:

"Do we actually have enough content for the personas we say we care about?"

Publishing calendars, analytics dashboards, and keyword tools do not answer that directly. PersonaScout is being built to answer it with a reproducible workflow:

Define the personas you care about
Add the content sources you publish into
Fetch and normalize the content into a local project
Classify each item against each persona and funnel stage
Generate a coverage report and use the gaps to plan new content

Current Status

personascout is now publicly released on npm and GitHub. What exists today:

personascout init
personascout status
personascout persona list
personascout persona templates
personascout persona add --interactive
personascout persona add --file <path>
personascout persona view <id>
personascout persona validate
personascout source list
personascout source add
personascout source remove <id>
personascout source test
personascout fetch for RSS, website, and CSV sources
personascout providers
personascout classify
personascout report
personascout generate
personascout diff
personascout persona use <template-id>
personascout persona edit <id>
personascout persona delete <id>
website crawling with same-domain cheerio fallback
optional Firecrawl-backed website crawling when FIRECRAWL_API_KEY is set

What is planned next:

source editing

Install

Install from npm:

npm install -g personascout
personascout --help

Current public release:

npm: https://www.npmjs.com/package/personascout
GitHub release: https://github.com/SparrowTechnology/personascout-cli/releases/tag/v0.1.0

If you want to run from source instead:

git clone https://github.com/SparrowTechnology/personascout-cli.git
cd personascout-cli
npm install
npm run build
node dist/index.js --help

If you want a local global-style install while developing from source:

npm link
personascout --help

Node.js 18+ is required.

Quick Start

Initialize a project in the directory you want to analyze:

personascout init

Add a persona from JSON:

personascout persona add --file ./persona.json

Start from a bundled persona template:

personascout persona templates
personascout persona use cfo
personascout persona view cfo

Or, if you already know the template id, copy it directly:

personascout persona use cfo

Add a source:

personascout source add

Fetch content into .personascout/content/:

personascout fetch

Estimate a classification run:

personascout classify --dry-run

Run classification:

personascout classify

View the latest coverage report:

personascout report

Generate briefs for the detected gaps, targeting a specific channel:

personascout generate --all --channel linkedin-article --format brief

Compare the latest run against the previous one:

personascout diff

Starter example assets are available in examples/README.md.

Example Project Layout

.personascout/
  config.json
  personas/
  sources/
  content/
  results/

This stays local to the directory where you run personascout init, unless you override it with PERSONASCOUT_HOME.

Commands

`personascout init`

Creates the local project structure, writes config.json, and updates .gitignore so fetched content and result files do not pollute the repo.

`personascout status`

Shows the current project state at a glance:

how many personas are defined
how many sources are configured and fetched
how many content items are stored
whether classification has been run yet
whether the latest classification is stale
the latest generation run, including whether output was terminal-only or written to a directory
the suggested next command to run

For integration use, personascout status --format json prints the same project-state payload as structured JSON.

`personascout persona`

Current subcommands:

personascout persona list
personascout persona list --templates
personascout persona templates
personascout persona add --interactive
personascout persona add --file <path>
personascout persona use <template-id>
personascout persona view <id>
personascout persona generate
personascout persona edit <id>
personascout persona delete <id>
personascout persona validate

Personas are stored as JSON files under .personascout/personas/. Built-in starter templates now cover common B2B buyer personas including cfo, cto, vp-sales, vp-marketing, product-manager, ma-analyst, and procurement-lead.

`personascout source`

Current subcommands:

personascout source list
personascout source add
personascout source remove <id>
personascout source test

Supported source definitions today:

rss
website
csv

`personascout fetch`

Fetches content from configured sources and stores normalized content items in .personascout/content/{source-id}/.

By default, new projects fetch without an item cap. You can still set a limit explicitly with personascout fetch --limit 25, or use --limit 0 for unlimited fetches. Fetch also deduplicates matching content across sources by canonical URL, so a page found on both your website crawl and your RSS feed will only be stored once.
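
To make the deduplication behavior concrete, here is a minimal TypeScript sketch. The `ContentItem` shape, the `canonicalUrl` rules (ignore query string, fragment, and trailing slashes), and the first-source-wins policy are all assumptions for illustration; PersonaScout's actual canonicalization may differ.

```typescript
// Hypothetical item shape; the real stored schema may differ.
interface ContentItem {
  sourceId: string;
  url: string;
  title: string;
}

// Assumed canonical form: scheme + host + path, without query string,
// fragment, or trailing slashes.
function canonicalUrl(raw: string): string {
  const parsed = new URL(raw); // the URL parser lowercases scheme and host
  const path = parsed.pathname.replace(/\/+$/, "");
  return `${parsed.protocol}//${parsed.host}${path}`;
}

// Keeps the first item seen for each canonical URL and drops later matches.
function dedupeByCanonicalUrl(items: ContentItem[]): ContentItem[] {
  const seen: Record<string, boolean> = {};
  const unique: ContentItem[] = [];
  for (const item of items) {
    const key = canonicalUrl(item.url);
    if (seen[key]) continue;
    seen[key] = true;
    unique.push(item);
  }
  return unique;
}
```

With this sketch, a page reached through both a website crawl and an RSS feed (for example with a tracking query string appended in the feed) collapses to a single stored item.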

Currently implemented:

RSS feeds via rss-parser
website crawling via Firecrawl when configured
website crawling via axios + cheerio fallback when Firecrawl is unavailable
CSV imports via csv-parse/sync with per-source column mappings
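
The per-source column mappings used for CSV imports can be pictured roughly as follows. This is a sketch under stated assumptions: `NormalizedItem`, `ColumnMapping`, and `mapRow` are hypothetical names, and the rows are assumed to be already parsed into objects keyed by header (which is what a CSV parser typically returns when a header row is used).

```typescript
// Hypothetical normalized shape; the real content schema may have more fields.
interface NormalizedItem {
  title: string;
  url: string;
  body: string;
}

// For each normalized field, the name of the CSV column that supplies it.
type ColumnMapping = { [K in keyof NormalizedItem]: string };

// Converts one parsed CSV row into a normalized item using the mapping.
function mapRow(row: Record<string, string>, mapping: ColumnMapping): NormalizedItem {
  const pick = (column: string): string => {
    const value = row[column];
    if (value === undefined) {
      throw new Error(`Missing CSV column: ${column}`);
    }
    return value.trim();
  };
  return {
    title: pick(mapping.title),
    url: pick(mapping.url),
    body: pick(mapping.body),
  };
}
```

Keeping the mapping per source means exports from different tools, each with its own header names, can feed the same normalized structure.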

`personascout classify`

Classifies fetched content against all configured personas and writes a run file under .personascout/results/.

Current flags:

personascout classify --dry-run
personascout classify --force
personascout classify --provider anthropic
personascout classify --model claude-sonnet-4-5
personascout classify --source acme-blog
personascout classify --since 2026-04-01
personascout classify --format json

Behavior today:

loads all personas in a single prompt per content item
skips items already present in the latest result file unless --force
supports Anthropic and OpenAI-compatible providers through the provider registry
gives rough token and cost estimates in dry-run mode
can emit machine-readable JSON for both dry-run and live run summaries
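
For a sense of how a rough dry-run estimate can be produced, the sketch below assumes about four characters per token and one request per content item, with all personas included in every prompt as described above. The ratio, the function names, and the price parameter are illustrative assumptions rather than PersonaScout's actual heuristics, and output tokens are ignored.

```typescript
// Rough rule of thumb for English text; an assumption, not a measured value.
const CHARS_PER_TOKEN = 4;

function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

interface DryRunEstimate {
  requests: number;
  inputTokens: number;
  estimatedCostUsd: number;
}

// One request per content item; every request carries all persona definitions.
function estimateRun(
  personas: string[],
  contentBodies: string[],
  usdPerMillionInputTokens: number // caller-supplied price, hypothetical
): DryRunEstimate {
  const personaTokens = personas.reduce((sum, p) => sum + estimateTokens(p), 0);
  let inputTokens = 0;
  for (const body of contentBodies) {
    inputTokens += personaTokens + estimateTokens(body);
  }
  return {
    requests: contentBodies.length,
    inputTokens,
    estimatedCostUsd: (inputTokens / 1e6) * usdPerMillionInputTokens,
  };
}
```

Because the persona definitions are repeated in every request, the estimate grows with both the number of content items and the total size of the persona set.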

`personascout report`

Loads the latest classification run, computes the coverage matrix, and renders a terminal summary by persona and funnel stage.

Current flags:

personascout report
personascout report --format json
personascout report --format csv
personascout report --format markdown
personascout report --result run-2026-04-11T12-10-00-000Z

Outputs available today:

terminal table with stage-by-stage coverage bars
JSON including the run payload, personas, computed coverage matrix, and detected gaps
CSV with persona_id,funnel_stage,count,status
Markdown for docs, GitHub, or Notion
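
The CSV shape above can be illustrated with a small sketch that counts classified items per persona and funnel stage. The column names come from the CSV header listed above; the `Classification` shape, the `covered` label, and the thresholds in `statusFor` are assumptions made for this example only.

```typescript
const STAGES = ["awareness", "consideration", "decision"] as const;
type Stage = (typeof STAGES)[number];
type Status = "critical" | "weak" | "covered";

// Hypothetical per-item classification result.
interface Classification {
  personaId: string;
  stage: Stage;
}

// Assumed thresholds: no content is critical, one or two items is weak.
function statusFor(count: number): Status {
  if (count === 0) return "critical";
  if (count < 3) return "weak";
  return "covered";
}

// Builds one CSV row per persona and stage, including cells with zero items.
function coverageCsv(personaIds: string[], rows: Classification[]): string {
  const counts: Record<string, number> = {};
  for (const row of rows) {
    const key = `${row.personaId}|${row.stage}`;
    counts[key] = (counts[key] ?? 0) + 1;
  }
  const lines = ["persona_id,funnel_stage,count,status"];
  for (const personaId of personaIds) {
    for (const stage of STAGES) {
      const count = counts[`${personaId}|${stage}`] ?? 0;
      lines.push(`${personaId},${stage},${count},${statusFor(count)}`);
    }
  }
  return lines.join("\n");
}
```

Iterating over every persona and stage, rather than only over the classified rows, is what makes empty cells show up as gaps instead of silently disappearing.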

`personascout generate`

Generates gap-targeted content briefs or drafts using the configured LLM provider.

Current flags:

personascout generate
personascout generate --all
personascout generate --persona cfo --stage consideration
personascout generate --channel linkedin-article --format brief
personascout generate --output ./generated
personascout generate --result-format json

Behavior today:

uses detected weak and critical gaps from the latest coverage report by default
supports targeted generation for any --persona and --stage pair
can generate either structured briefs or full markdown drafts
uses existing classified content as tone/context examples
can print to the terminal or write files to a directory
can emit machine-readable JSON including generated artifacts and generation run metadata

`personascout diff`

Compares two classification runs and shows how persona-stage coverage changed over time.

Current flags:

personascout diff
personascout diff --from run-2026-04-10T09-00-00-000Z --to run-2026-04-11T09-00-00-000Z
personascout diff --format json
personascout diff --format markdown

Behavior today:

compares the latest run to the previous run by default
shows improved, declined, unchanged, new-gap, and gap-closed cells
supports terminal, JSON, and Markdown output
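
One way to picture the five cell labels is as a comparison of per-cell counts between two runs. In the sketch below, the labels come from the list above, while the `GAP_THRESHOLD` value, the `persona|stage` key format, and the rule that gap transitions take precedence over plain increases or decreases are assumptions for illustration.

```typescript
type CellChange = "improved" | "declined" | "unchanged" | "new-gap" | "gap-closed";

// Hypothetical rule: a cell with fewer than 3 items counts as a gap.
const GAP_THRESHOLD = 3;

function classifyChange(before: number, after: number): CellChange {
  const wasGap = before < GAP_THRESHOLD;
  const isGap = after < GAP_THRESHOLD;
  if (!wasGap && isGap) return "new-gap";
  if (wasGap && !isGap) return "gap-closed";
  if (after > before) return "improved";
  if (after < before) return "declined";
  return "unchanged";
}

// Inputs map "persona|stage" keys to item counts; missing cells count as 0.
function diffRuns(
  from: Record<string, number>,
  to: Record<string, number>
): Record<string, CellChange> {
  const result: Record<string, CellChange> = {};
  const keys = Object.keys(from).concat(Object.keys(to));
  for (const key of keys) {
    result[key] = classifyChange(from[key] ?? 0, to[key] ?? 0);
  }
  return result;
}
```

Treating a missing cell as zero lets a persona added between runs appear in the diff without special handling.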

Provider Direction

The provider architecture keeps the model layer deliberately simple, with two client libraries:

@anthropic-ai/sdk for Anthropic
openai for OpenAI-compatible APIs including Groq, DeepSeek, Kimi, Mistral, Together, Perplexity, and Ollama

That gives the CLI broad provider support without baking provider-specific logic throughout the codebase.
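
As a rough illustration of that idea, a registry can describe each provider as either Anthropic or OpenAI-compatible plus a little configuration. Everything in this sketch is an assumption rather than the CLI's real registry: the `ProviderEntry` shape, the `resolveProvider` helper, and the `GROQ_API_KEY` variable name are hypothetical, and the base URLs are the publicly documented OpenAI-compatible endpoints for Groq and a default local Ollama install.

```typescript
type ProviderKind = "anthropic" | "openai-compatible";

interface ProviderEntry {
  kind: ProviderKind;
  baseUrl?: string; // only needed for OpenAI-compatible endpoints
  apiKeyEnv?: string; // omitted for local providers that need no key
}

interface ResolvedProvider {
  entry: ProviderEntry;
  apiKey?: string;
}

// Hypothetical registry entries for illustration only.
const PROVIDERS: Record<string, ProviderEntry> = {
  anthropic: { kind: "anthropic", apiKeyEnv: "ANTHROPIC_API_KEY" },
  groq: {
    kind: "openai-compatible",
    baseUrl: "https://api.groq.com/openai/v1",
    apiKeyEnv: "GROQ_API_KEY",
  },
  ollama: { kind: "openai-compatible", baseUrl: "http://localhost:11434/v1" },
};

// Looks up a provider and reads its API key from the given environment map.
function resolveProvider(
  id: string,
  env: Record<string, string | undefined>
): ResolvedProvider {
  if (!Object.prototype.hasOwnProperty.call(PROVIDERS, id)) {
    throw new Error(`Unknown provider: ${id}`);
  }
  const entry = PROVIDERS[id];
  if (!entry.apiKeyEnv) return { entry };
  const apiKey = env[entry.apiKeyEnv];
  if (!apiKey) {
    throw new Error(`Set ${entry.apiKeyEnv} before using provider "${id}"`);
  }
  return { entry, apiKey };
}
```

In a design like this, adding another OpenAI-compatible provider is a one-entry change to the registry rather than new client code.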

Provider discovery is now available via:

personascout providers
personascout providers --provider anthropic

Supported LLM Providers

| Provider | Notes |
| --- | --- |
| Anthropic (Claude) | Recommended. Best classification accuracy. |
| OpenAI (GPT) | GPT-4o Mini is a cost-effective default. |
| Groq | Extremely fast. Good fit for low-latency runs. |
| DeepSeek | Very low cost. Strong for structured output. |
| Kimi (Moonshot AI) | Useful when you want a Moonshot-compatible option. |
| Mistral | Good European provider option. |
| Together AI | Access to many open-weight models. |
| Perplexity | Available through the OpenAI-compatible provider path. |
| Ollama | Free, local, and no API key required. |
| Any OpenAI-compatible API | vLLM, LM Studio, LocalAI, and similar endpoints can be added through config. |

Bring your own API key, or run fully local with Ollama.

Open Core and Licensing

This repository is licensed under AGPL-3.0-only.

That means:

the CLI is genuinely open source
commercial use is allowed
if someone modifies and runs this software as a network service, the AGPL obligations apply to that covered software

That licensing choice is intentional. The goal is to build in public while preserving a strong copyleft boundary for the core tool.

Official license text:

GNU AGPL v3: https://www.gnu.org/licenses/agpl-3.0.en.html

Development Notes

The codebase is structured to stay boring and inspectable:

TypeScript
Commander for CLI wiring
Zod for schema validation
Vitest for tests
Local JSON storage under .personascout/

The current implementation lives mainly in:

src/commands
src/lib
src/types

Acknowledgements

PersonaScout leans on several libraries that materially shape the product experience:

chartscii and styl3 for the graphical terminal report and themed CLI output
Firecrawl for richer website extraction when an API key is configured
Commander for CLI structure
Zod for schema validation
Vitest for test coverage

Contributing

Issues, bug reports, and PRs are welcome.

If you contribute code, keep in mind the design bias of the project:

local-first over hosted dependencies
explicit JSON files over hidden state
simple CLI behavior over framework magic
provider abstraction without provider sprawl

Roadmap

Near-term milestones:

source editing

Longer-term polish:

starter persona library
richer export formats
sharper terminal UX
stronger public docs and examples

Release Notes

Release history is tracked in CHANGELOG.md. The current public release is v0.1.0.

Security

Do not commit private planning documents, credentials, or customer material into the repository.

Local-only notes and internal product briefs should stay outside version control or in ignored paths.

License

AGPL-3.0-only
