Note: To protect the privacy of the company, the file containing the data has been removed. The notebooks now only include the code and the HTML output generated by the data.
Order in which the folders must be executed:
- cleaning
- ner
- training_dataset
- inference_dataset
- training
- inference
- plots
Within each folder, there is only one Jupyter notebook. You should only focus on the notebook and ignore other files (e.g., CSV and pickle ones). Those other files will be called within the notebooks in due time.
You can create a virtual environment with most of the needed packages to run the notebooks by using the yml file in this folder. To create such an enviroment, use the command below
conda env create -f s2sd_environment.yml
that will create a Conda environment named s2ds that can then be activated using the command below
conda activate s2ds
You still may need to install a few Python packages yourself as you execute the notebooks.