Skip to content

JGalego/ScrapeGraphAI-Bedrock

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Amazon Bedrock 💖 ScrapeGraphAI

ScrapeGraphAI is a python library that uses large language models(LLM) and direct graph logic to create scraping pipelines for websites, documents and XML files.

In these examples, we will show you how to integrate Amazon Bedrock ⛰️ with ScrapeGraphAI to extract information from multiple sources using natural language prompts.

Instructions

  1. Set AWS credentials

    # Option 1: (recommended) AWS CLI
    aws configure
    
    # Option 2: environment variables
    export AWS_ACCESS_KEY_ID=...
    export AWS_SECRET_ACCESS_KEY=...
    export AWS_SESSION_TOKEN=...
    export AWS_DEFAULT_REGION=...
  2. Install dependencies

    # Install Python packages
    pip install -r requirements.txt
    
    # Install browsers
    # https://playwright.dev/python/docs/browsers#install-browsers
    playwright install
    
    # Install system dependencies
    # https://playwright.dev/python/docs/browsers#install-system-dependencies
    playwright install-deps
  3. Start the demo application

    streamlit run main.py

About

Streamlit app that demonstrates how to integrate Amazon Bedrock ⛰️ and ScrapeGraphAI 🕷️

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages