Alan

Alan continuously captures screenshots and performs image recognition. The results of the image recognition are logged for further analysis.

👋 Hey! Watch me code it in this 📺recorded livestream.

Features

Automated screenshot capture
Image recognition using Google's Gemini 2.0 Flash model
Continuous results logging with timestamps
Built with TypeScript and Bun runtime

Prerequisites

Bun runtime installed
Gemini API key
TypeScript 5.x

Installation

Clone the repository:

git clone https://github.com/gavmor/alan.git
cd alan

Install dependencies:
```
bun install
```
Set up your Gemini API key:
- Visit Google AI Studio
- Create an API key
- Set the API key in your environment:
```
export GEMINI_API_KEY='your-api-key-here'
```

Usage

Run the application:

bun run index.ts

The application will:

Take screenshots of your desktop
Process the screenshots through Gemini Pro Vision for image recognition
Append the recognition results with timestamps to results.txt

Development

To run tests:

bun test

Dependencies

screenshot-desktop: For capturing desktop screenshots
- (Linux) image-magick
ollama: For interfacing with Ollama AI models
TypeScript: For type-safe development

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bun.lock		bun.lock
index.ts		index.ts
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Alan

Features

Prerequisites

Installation

Usage

Development

Dependencies

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Alan

Features

Prerequisites

Installation

Usage

Development

Dependencies

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages