This repository contains a Transformer-based Neural Machine Translation (NMT) pipeline where a pretrained MarianMT model was fine-tuned on the Tatoeba English–French parallel corpus.
- English → French
- MarianMT:
Helsinki-NLP/opus-mt-en-fr
- Tatoeba EN–FR parallel corpus:
Helsinki-NLP/tatoeba
| Model | BLEU Score |
|---|---|
| Base OPUS-MT (MarianMT) | 50.5 |
| Fine-tuned OPUS-MT | 55.44 |
finetuned-opus-en2fr.ipynb— training + evaluation pipelinedemo.ipynb— sample translationsdemo.py— streamlit demo