CallDraft

A real-time AI voice agent prototype for HVAC businesses. CallDraft lets users have a live voice conversation with "Sarah," an AI-powered virtual receptionist for Anderson Heating & Cooling (Dallas, TX). The entire call happens in the browser -- no phone line or backend WebSocket server required.

What the AI Agent Does

Sarah handles inbound calls for a fictional HVAC company. During a conversation she will:

Greet callers and ask how she can help
Answer questions about services (AC repair, heating repair, installation, maintenance, duct cleaning), business hours (Mon--Fri 8 AM--6 PM, 24/7 emergency), and pricing ($89 service call fee)
Collect the caller's name, phone number, and service address
Walk through booking an appointment (no real booking occurs)
Politely decline off-topic questions and stay focused on HVAC services

Conversations are fully speech-to-speech. A live transcript, call timer, and response-latency indicator are displayed alongside the call.

Prerequisites

Node.js 18 or later
An OpenAI API key with access to the Realtime API (gpt-realtime-mini)

Setup

Clone the repository
```
git clone <repo-url>
cd calldraft
```
Install dependencies
```
npm install
```
Configure environment variables

Copy the example file and add your key:
```
cp .env.example .env.local
```
Open .env.local and replace the placeholder with your real key:
```
OPENAI_API_KEY=sk-your-key-here
```
The key must have Realtime API access enabled. You can manage keys at https://platform.openai.com/api-keys.
Start the development server
```
npm run dev
```
Open http://localhost:3000 in your browser. Click "Start Call," grant microphone access, and begin talking.

Tech Stack

Layer	Choice	Why
Framework	Next.js (App Router)	Server-side API route keeps the OpenAI key hidden; static frontend deploys to Vercel with zero config
Language	TypeScript	Catches type errors at build time across event handlers and WebRTC payloads
Styling	Tailwind CSS v4 + HeroUI	Utility-first CSS for rapid prototyping; HeroUI provides accessible components out of the box
Voice	OpenAI Realtime API (WebRTC)	Direct browser-to-OpenAI audio stream -- no backend WebSocket server needed, reducing latency and infrastructure
AI Model	gpt-realtime-mini	Cheapest Realtime model; fast enough for scripted receptionist flows with low per-minute cost
Deployment	Vercel	One-click deploy, automatic HTTPS, serverless API routes -- matches the Next.js ecosystem

How It Works

The browser requests an ephemeral session token from the /api/session Next.js API route.
Using that token, the browser opens a direct WebRTC connection to the OpenAI Realtime API -- no persistent backend needed.
Audio streams both ways in real time: the user's microphone input goes to OpenAI, and the AI's spoken response plays back through the browser.
Transcription, latency measurement, and call state management all happen client-side.

Deployment

The project is designed for Vercel. No special configuration is needed beyond setting the environment variable.

Push the repository to GitHub (or your preferred Git host).
Import the project in Vercel.
Add OPENAI_API_KEY as an environment variable in the Vercel project settings.
Deploy. Vercel handles the serverless API route and static assets automatically.

You can also build and run manually:

npm run build
npm run start

Available Scripts

Command	Description
`npm run dev`	Start the development server
`npm run build`	Create a production build
`npm run start`	Run the production build locally
`npm run lint`	Run ESLint

Troubleshooting

Microphone access denied -- Check browser permissions. Some browsers require HTTPS in production. Try an incognito window if permissions are stuck.

Failed to create session -- Verify that OPENAI_API_KEY is set in .env.local and that the key has Realtime API access. Check your OpenAI account usage limits.

High latency -- Response time depends on network quality and OpenAI server load. Use a modern Chromium-based browser for the best WebRTC performance.

No audio playback -- Check system volume and browser audio permissions. The browser must allow autoplay for the AI's voice to come through.

License

This project is a prototype and is not licensed for production use.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
public		public
src		src
.gitignore		.gitignore
README.md		README.md
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CallDraft

What the AI Agent Does

Prerequisites

Setup

Tech Stack

How It Works

Deployment

Available Scripts

Troubleshooting

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CallDraft

What the AI Agent Does

Prerequisites

Setup

Tech Stack

How It Works

Deployment

Available Scripts

Troubleshooting

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages