ScrapeGraphAI is a Python library that uses large language models (LLMs) and direct graph logic to create scraping pipelines for websites, documents, and XML files.
These examples show how to integrate Amazon Bedrock ⛰️ with ScrapeGraphAI to extract information from multiple sources using natural language prompts.
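
As a rough illustration of what such an integration can look like, here is a minimal sketch. It is not taken from `main.py`. The `bedrock/` model prefix, the config key names, the model ID, the prompt, and the URL are all assumptions to check against your installed ScrapeGraphAI version and the models enabled in your AWS account. Some releases may also expect an `embeddings` entry in the config. `build_bedrock_config` and `scrape` are hypothetical helper names.

```python
def build_bedrock_config(model_id, temperature=0.0):
    """Build a ScrapeGraphAI-style config dict that points at a Bedrock model.

    Assumption: the library selects Bedrock through a "bedrock/" prefix on the
    model name and reads settings from an "llm" key. Verify both against the
    documentation of the version you have installed.
    """
    return {
        "llm": {
            "model": f"bedrock/{model_id}",
            "temperature": temperature,
        }
    }


def scrape(prompt, source, config):
    """Run a single-page scraping graph and return its result."""
    # Imported lazily so this file can be loaded without the library installed.
    from scrapegraphai.graphs import SmartScraperGraph

    graph = SmartScraperGraph(prompt=prompt, source=source, config=config)
    return graph.run()


if __name__ == "__main__":
    # Placeholder model ID and URL; both are assumptions for illustration only.
    config = build_bedrock_config("anthropic.claude-3-sonnet-20240229-v1:0")
    print(scrape("List the article titles on this page.", "https://example.com", config))
```

Running the script needs working AWS credentials and the dependencies from the setup steps.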
- Set AWS credentials

      # Option 1: (recommended) AWS CLI
      aws configure

      # Option 2: environment variables
      export AWS_ACCESS_KEY_ID=...
      export AWS_SECRET_ACCESS_KEY=...
      export AWS_SESSION_TOKEN=...
      export AWS_DEFAULT_REGION=...
- Install dependencies

      # Install Python packages
      pip install -r requirements.txt

      # Install browsers
      # https://playwright.dev/python/docs/browsers#install-browsers
      playwright install

      # Install system dependencies
      # https://playwright.dev/python/docs/browsers#install-system-dependencies
      playwright install-deps
- Start the demo application

      streamlit run main.py
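
If the application fails to start, a small preflight check can help narrow down which setup step was missed. The sketch below is not part of the demo. It assumes the environment-variable option for credentials and guesses the import names of the dependencies (`scrapegraphai`, `playwright`, `streamlit`), since the contents of `requirements.txt` are not shown here. The helper names are hypothetical.

```python
import importlib.util
import os

# Variables from "Option 2" above. AWS_SESSION_TOKEN is only needed for
# temporary credentials, so it is treated as optional here.
REQUIRED_ENV = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "AWS_DEFAULT_REGION")

# Assumed import names for the demo's dependencies (not read from requirements.txt).
ASSUMED_MODULES = ("scrapegraphai", "playwright", "streamlit")


def missing_env_vars(env=None):
    """Return the required AWS variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV if not env.get(name)]


def missing_modules(names=ASSUMED_MODULES):
    """Return the top-level modules that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    for name in missing_env_vars():
        print(f"unset: {name} (fine if you configured credentials with `aws configure`)")
    for name in missing_modules():
        print(f"not importable: {name}")
```

Unset variables are not necessarily a problem: credentials stored by `aws configure` live in a local file rather than in the environment, so this check only covers the second option.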
