OSE

OSE PROJECT

Name: Biswajit Palit

Matriculation Number: 50071214

REPOSITORY STRUCTURE:

The repository consists of 4 files apart from the README.md files

OSE_Project_Essay.pdf : This outlines the project report and underlines my observations in detail
Biswajit_Palit_Final_Notebook_Completed.ipynb
OSE_Questions_and_Answers.pdf : This comprises answers to 25 questions.
environment.yml : This comprises of the packages used in the project.

PROJECT OUTLINE:

At the very outset I would like to specify, in order to get the best reproducibility, it is best if the notebook is run sequentially.

Dataset: In this project I have used the PolyAI/banking77 dataset from the HuggingFace library. It consists of customer queries and comments related to online banking and digital payments. The dataset consists of 10003 train data points and 3080 test data points. I have for the purpose of cross validation split the test set equally into validation_set and test_set.

Task: I have opted to perform a text classification task on the dataset. I wanted to check the performance of several sk-learn models and compare them with the performance of different transformer models. I have hypertuned each model to find the parameters giving the best accuracy score.

Structure: In the first half of the project I have run TF-IDF vectorization and run Logistic Regression, Decision Trees and Naive Bayes. In the second part of the project I have run the transformer models namely, BERT, ROBERTA, DISTILBERT, XLNET.

ENVIRONMENT NOTICE:

I have specified an environment in my code. However I did the project on torch version 1.9.1 and tensorflow 2.6.0 which is not a compatible version for jupyter notebook. Best reproducibility can be achieved when run with all the mentioned packages as well as these particular versions of torch and tensorflow. Reproducibility is mostly being achieved even without them.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OSE

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Biswajit_final_notebook_completed.ipynb		Biswajit_final_notebook_completed.ipynb
OSE_Project_Essay.pdf		OSE_Project_Essay.pdf
OSE_Questions_and_Answers.pdf		OSE_Questions_and_Answers.pdf
README.md		README.md
environment.yml		environment.yml

Folders and files

Latest commit

History

Repository files navigation

OSE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages