Firecrawl Quickstarts is an independent and unofficial collection of projects designed to help developers quickly get started with building applications using the Firecrawl API. Each quickstart provides a foundation that you can easily build upon and customize for your specific needs. This repository is not affiliated with, endorsed by, or officially supported by Firecrawl.
To use these quickstarts, you'll need a Firecrawl API key. If you don't have one yet, you can sign up for free at firecrawl.dev.
This quickstart introduces how to integrate Firecrawl with OpenAI's Anthropic models to search and extract information based on specific user objectives. Learn to map a website, identify relevant pages, and retrieve content aligned with the objective. Ideal for targeted information gathering.
Go to Firecrawl Web Crawling with OpenAI and Anthropic
Explore how to enhance the Firecrawl web crawling process with OpenAI’s o1 reasoning models. This quickstart guides you in using these advanced models to generate search parameters, map sites, and validate extracted content, enhancing the precision and relevance of data extraction.
Go to Integrating OpenAI o1 Models with Firecrawl
Combine Grok-2’s AI-powered understanding with Firecrawl’s search to create an intelligent web crawler. This quickstart demonstrates building a targeted crawler that finds and processes structured data on web pages, with output in JSON format for seamless data handling.
Go to Building a Web Crawler with Grok-2 and Firecrawl
Learn how to use Firecrawl's Map endpoint to create comprehensive sitemaps from single URLs. This quickstart is perfect for efficiently gathering website structures, enabling tasks such as content mapping, SEO analysis, and scalable web data extraction.
Go to Firecrawl Map Endpoint Quickstart
Automate job listing extraction and analysis with Firecrawl and OpenAI’s Structured Outputs. This quickstart demonstrates scraping job boards, extracting structured job details, and matching listings to a user’s resume with schema-compliant outputs for reliable data processing.
Go to Job Board Scraping with Firecrawl and OpenAI
Learn how to use Firecrawl’s LLM-powered data extraction features. This quickstart covers extracting structured data from web pages, with options for schema-defined and prompt-only extraction, making it adaptable for diverse data formats and applications.
Go to Firecrawl LLM Extract Tutorial
Each quickstart project is a Jupyter notebook designed to be easily opened and run on Google Colab. To get started, follow these steps:
-
Open the Repository in Google Colab
Each notebook has a link to open directly in Google Colab. Click on the link for the quickstart you want to explore.
-
Set Up Your Firecrawl API Key
Each notebook requires a Firecrawl API key. Once you've created your key (available here), enter it in the notebook when prompted or set it as an environment variable as directed in the notebook.
-
Run Each Notebook Cell Sequentially
Follow the instructions within each notebook, running cells in order. The notebooks will guide you through each step, from setting up the environment to executing web scraping or extraction tasks.
-
View Results and Experiment
The notebooks are designed to be interactive. You can modify the code cells, adjust parameters, or try different objectives to explore Firecrawl’s capabilities further.
Each notebook includes explanations and usage examples to help you understand and customize your setup.
To deepen your understanding of working with Firecrawl and its API, check out these resources:
- Firecrawl Documentation - Comprehensive guides and API references
- Firecrawl SDKs - Explore our SDKs for Python, Node.js, Go, and Rust
- LLM Framework Integrations - Learn how to use Firecrawl with frameworks like LangChain and Llama Index
- Firecrawl API Reference - Detailed API endpoints and parameters
We welcome contributions to the Firecrawl Quickstarts repository! If you have ideas for new quickstart projects or improvements to existing ones, please open an issue or submit a pull request.
- Join our Firecrawl Discord community for discussions and support
- Follow us on Twitter and LinkedIn for updates
- Check out the Firecrawl Support Documentation for additional help
This project is licensed under the MIT License - see the LICENSE file for details.
It is the sole responsibility of the end users to respect websites' policies when scraping, searching, and crawling with Firecrawl. Users are advised to adhere to the applicable privacy policies and terms of use of the websites prior to initiating any scraping activities. By default, Firecrawl respects the directives specified in the websites' robots.txt files when crawling. By utilizing Firecrawl, you expressly agree to comply with these conditions.
Copyright (c) 2024-present, Alex Fazio