euclid_rag

RAG-powered chatbot for querying Euclid mission documents...

Euclid RAG: A Local RAG System for Scientific Research

Euclid RAG is an open-source Retrieval-Augmented Generation (RAG) system designed to provide efficient document retrieval and knowledge augmentation for the Euclid scientific community. The project aims to integrate local Large Language Models (LLMs) with a vector database to retrieve, process, and generate relevant scientific information.

Origin & Development

This project was initially forked from the Rubin Observatory's Rubin RAG system. While we are working in consultation and knowledge-sharing with Rubin developers, Euclid RAG is evolving in a different direction to meet the specific needs of the Euclid collaboration. Key differences will include:

A focus on local deployment without API-based LLM dependencies.
Different document retrieval strategies tailored to Euclid's scientific workflows.
Potential agentic capabilities to enhance automated knowledge retrieval and processing.

Installation

Install euclid_rag in development mode:

   git clone https://github.com/yourusername/euclid_rag.git
   cd euclid_rag
   pip install -e .

euclid_rag is developed by Euclid Consortium Science Ground Segment members at https://github.com/jeipollack/euclid_rag.

Features

Developing euclid_rag

The best way to start contributing to rubin_rag is by cloning this repository, creating a virtual environment, and running the make init command:

git clone https://github.com/jeipollack/euclid_rag
cd euclid_rag

python3 -m venv .venv
source .venv/bin/activate

make init

Build the Vector Store

Before running the chatbot, you must ingest data and build the vector store. The location and type of the vector store(s) are defined in app_config.yaml, for example:

vector_store:
  type: "faiss"
  redmine_index_dir: "redmine_vector_store"
  public_data_index_dir: "public_data_vector_store"

type — currently only "faiss" is supported.
{prefix}_index_dir — path where the FAISS index files (index.faiss, index.pkl) will be stored.

This can be a relative path (within the repo) or an absolute path.

If the vector store is missing, the app will fail to start with:

RuntimeError: Vector store missing. Please run ingestion before launching the app.

Run ingestion:

python python/euclid/rag/ingestion/ingest_publications.py -c /path/to/config_file

By default, this uses the python/euclid/rag/app_config.yaml file in the repository.

To use a different config file, pass the -c / --config option:

python python/euclid/rag/ingestion/ingest_publications.py -c /path/to/config_file

Run the chatbot:

Once the vector store has been built, you can launch the chatbot:

cd python/euclid
streamlit run rag/app.py

You can run tests and build documentation with tox:

tox

To learn more about the individual environments:

tox -av

Name		Name	Last commit message	Last commit date
Latest commit History 389 Commits
.github		.github
.streamlit		.streamlit
_build		_build
changelog.d		changelog.d
docs		docs
python/euclid		python/euclid
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile_ollama		Dockerfile_ollama
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
compose.yaml		compose.yaml
pyproject.toml		pyproject.toml
ruff-shared.toml		ruff-shared.toml
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

euclid_rag

Euclid RAG: A Local RAG System for Scientific Research

Origin & Development

Installation

Features

Developing euclid_rag

Build the Vector Store

Run ingestion:

Run the chatbot:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

euclid_rag

Euclid RAG: A Local RAG System for Scientific Research

Origin & Development

Installation

Features

Developing euclid_rag

Build the Vector Store

Run ingestion:

Run the chatbot:

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages