This repository is dedicated to the NLP part of the DEEP project. The code is tightly coupled with AWS Sagemaker.
You can access the full documentation here (if you are already browsing the documentation on the web, it links to the same page you are at now)
Contact Stefano to get the AWS credentials, install the AWS CLI
Clone the repo and pull the data
git clone <deep_experiments_repo>
cd deep-experimentsCreate a new conda environment:
conda create -n deepl python=3.9.1Install necessary libraries:
make dev-installPull the data:
dvc pullAsk Stefano for a AWS user account and a new Notebook instance on AWS.
The notebook instance will be created in the region us-east-1, switch to this
region if you don't find your instance.
The notebook instance comes with the repo already cloned.
Once it is ready, start the instance and click on Open Jupyter.
Open the Jupyter terminal and cd to the deep-experiments repo. It should be:
cd SageMaker/deep-experimentsRun:
make cloud-install(This must be run everytime the instance is activated)
Pull the data:
dvc pulldatacontains the datadeepcontains the codedockercontains the Dockerfile used to build the test environment.notebookscontains all the Jupyter Notebook, divided by category and person working on themscriptscontains the training scripts necessary for Sagemakerrequirementscontains all the Python requirements for the different configurations