This is a HSE Computational Linguistics course final project.
The training/evaluation data is Reddit Comments Dataset for Text Style Transfer Tasks: https://zenodo.org/records/8051180 The dataset contains Reddit comments translated into a formal language. For the translation of Reddit comments into a formal language text-davinci-003 was used.
A CUDA-powered environment (Google Colab) is recommended for executing this Jupyter notebook.
Here is a Google Colab verison of the notebook: https://colab.research.google.com/drive/19IIXFCxeiAoOVGLO2Qdelbd_mYgpDILw?usp=sharing