Welcome! This repository contains code that uses Selenium to automate the download of petroleum well files. It streamlines data collection from the REATE repository of Brazilian terrestrial sedimentary basins, which contains over 40 terabytes of data.
To work on this project, ensure you have the following installed:
- Python 3.10 or above (3.12 recommended)
- Git (for cloning the repository)
- ChromeDriver (a build matching your installed Chrome version)
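
The prerequisites above can be checked with a short script. This is only a sketch and not part of the repository: the function name `check_prerequisites` is hypothetical, and it assumes `git` and `chromedriver` are reachable through your `PATH`. ChromeDriver may well live elsewhere on your machine, since `app.py` takes an explicit path, so a "missing" result for it is not necessarily a problem.

```python
import shutil
import sys

# Minimum Python version stated in the prerequisites list.
MIN_PYTHON = (3, 10)


def check_prerequisites(tools=("git", "chromedriver")):
    """Return a dict mapping each check name to True (ok) or False (missing)."""
    results = {"python": sys.version_info >= MIN_PYTHON}
    for tool in tools:
        # shutil.which returns None when the executable is not found on PATH.
        results[tool] = shutil.which(tool) is not None
    return results


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```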
- Clone the repository:

      git clone https://github.com/rafaseto/reate-bot.git
      cd reate-bot

- Create a virtual environment in the project directory:

      python -m venv .venv

- Activate the virtual environment:
  - On Windows:

        .venv\Scripts\activate

  - On macOS/Linux:

        source .venv/bin/activate

- With the virtual environment activated, install the required packages:

      pip install -r requirements.txt

To get started right away, I set up a test_file.txt with the paths of some well files to download as an example.
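
For orientation, a list file like this could be read as sketched below. The layout of test_file.txt is not described here, so the sketch assumes one path per line with blank lines ignored. The helper name `read_well_paths` is hypothetical and may not match what `app.py` actually does.

```python
from pathlib import Path


def read_well_paths(list_file):
    """Return the non-empty lines of a text file, stripped of whitespace.

    Assumption: the file holds one well-file path per line.
    """
    text = Path(list_file).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]
```

With that assumption, `read_well_paths("test_file.txt")` would return the paths as a list of strings, ready to be passed to the download routine one by one.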
- Open the src/app.py file and insert your chromedriver.exe path on line 12.
- Change the directory to src:

      cd src

- Run the app:

      python app.py
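
If the run fails at startup, the ChromeDriver path is a likely culprit. The sketch below shows one way to validate the path before handing it to Selenium. It assumes Selenium 4, where the driver location is passed through a `Service` object. The names `validate_driver_path` and `build_driver` are hypothetical, and the code in `app.py` may set up the driver differently.

```python
from pathlib import Path


def validate_driver_path(path_str):
    """Return the ChromeDriver path as a Path, or raise if no file is there."""
    path = Path(path_str).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"ChromeDriver not found at: {path}")
    return path


def build_driver(path_str):
    """Start Chrome through the ChromeDriver binary at path_str (Selenium 4)."""
    # Imported here so the path check above works even without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    service = Service(executable_path=str(validate_driver_path(path_str)))
    return webdriver.Chrome(service=service)
```

Calling `build_driver` with the same path you inserted in `app.py` should open a Chrome window. A `FileNotFoundError` means the path does not point at an existing file.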