This repository is a foundational template for PySpark projects, equipped with tooling for maintaining code quality and consistent documentation.
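Make sure you have the tools used in the steps below installed: VS Code with the Dev Containers extension, and a container engine such as Docker to build and run the dev container.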
First, clone the repository containing the code to your local machine:
git clone https://github.com/gvatsal60/PySparkTemplate.git

- Open the current directory in VS Code.
- Press F1 (or Ctrl+Shift+P on Windows/Linux, Cmd+Shift+P on macOS).
- Search for and select "Dev Containers: Reopen in Container".
- VS Code will build the dev container image (if required) and start the container.
- Once completed, you’ll be inside the dev container environment.
- Now you can develop in the isolated and pre-configured PySpark container environment.
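
Once the container is running, a quick sanity check can confirm that the environment is usable before you start writing jobs. The sketch below is illustrative and not part of the template: the function name `check_environment` is hypothetical, and it assumes (without confirmation from this repository) that the container is expected to provide an importable `pyspark` package and a `java` executable on the `PATH`. It uses only the Python standard library, so it runs even where PySpark is absent and simply reports what is missing.

```python
import importlib.util
import shutil


def check_environment():
    """Report whether the pieces a PySpark session usually needs are present.

    Assumption: the dev container is meant to ship the `pyspark` package
    and a `java` executable. Neither is confirmed by the template itself.
    """
    return {
        # find_spec returns None when a top-level package cannot be located.
        "pyspark_importable": importlib.util.find_spec("pyspark") is not None,
        # shutil.which returns None when the executable is not on PATH.
        "java_on_path": shutil.which("java") is not None,
    }


if __name__ == "__main__":
    for name, present in check_environment().items():
        print(f"{name}: {'ok' if present else 'missing'}")
```

Run it from the integrated terminal inside the container. If either entry reports `missing`, rebuilding the container is a reasonable first step before debugging further.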
Contributions are welcome! Please read our Contribution Guidelines before submitting pull requests.
This project is licensed under the Apache License 2.0. See the LICENSE file for details.