TinyLLM: A Research Project for Small Language Models

This repository is a research project dedicated to the training and implementation of small, specialized language models. Our goal is to explore various architectures and train different types of models using publicly available datasets from sources like Hugging Face.

The ultimate ambition of this project is to incrementally build our expertise and capabilities to train a highly proficient coding model, with the aspirational goal of reaching the level of performance seen in models like Anthropic's Claude Opus.

Completed Models

We have successfully implemented and trained the following models:

mlx-gpt2: A character-level GPT-2 model implemented in MLX for Apple Silicon.
tinystories: A PyTorch-based GPT model trained on the TinyStories dataset, designed for GPU training and local MPS inference.
euromillions-transformer: a Transformer-based deep learning model to predict EuroMillions lottery numbers.

Future Models

Our roadmap includes the development of the following models:

How to Contribute

We welcome contributions to this research project! To maintain consistency and ensure that others can easily replicate the work, please follow these guidelines:

Environment Management: Use uv for creating virtual environments and managing dependencies. This ensures a consistent and fast setup process.
Development Workflow:
- GPU First: Begin by developing and validating your model training script in a GPU-accelerated environment. We primarily use Google Colab for this, as it provides free access to T4 GPUs.
- Structure for Reproducibility: Once the model is successfully trained, structure the code into a dedicated folder for your model. This folder must include:
  - A README.md with instructions on how to set up and run your code. Remember to include any training result images (like loss curves) in the README.
  - A requirements.txt file listing all necessary dependencies.
  - The core scripts: train.py, inference.py, and api.py.

Example Directory Structure: Each completed model directory should follow a structure similar to this:

Folder-name/
├── api.py
├── inference.py
├── main.py - #GPU Code
├── model.pth
├── output.txt
├── README.md
├── requirements.txt
├── train.py
├── training_loss_ma.png
└── training_loss.png

This structured approach ensures that every model in this repository can be easily trained, tested, and integrated by other users.

Getting Started

Clone the Repo:

git clone https://github.com/rodgersmag/tinyllm
cd TinyLLM

Set Up Environment:

uv venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
uv pip install -r requirements.txt

Check Out Models:

Explore mlx-gpt2 and tinystories folders for examples. Follow their README.md to train or run inference.

Build Something Cool:

Pick a model from the future list or suggest your own. Have fun experimenting and share your results!

Thanks

Big shoutout to Google Colab for the free GPU power that fuels this project, and to the open-source community (especially Hugging Face) for awesome datasets and tools.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
euromillions-transformer		euromillions-transformer
mlx-gpt2		mlx-gpt2
tinystories		tinystories
.gitattributes		.gitattributes
.gitignore		.gitignore
Readme.MD		Readme.MD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TinyLLM: A Research Project for Small Language Models

Completed Models

Future Models

How to Contribute

Getting Started

Clone the Repo:

Set Up Environment:

Check Out Models:

Build Something Cool:

Thanks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TinyLLM: A Research Project for Small Language Models

Completed Models

Future Models

How to Contribute

Getting Started

Clone the Repo:

Set Up Environment:

Check Out Models:

Build Something Cool:

Thanks

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages