Skip to content

voltmaister/elastic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Haystack Search & Elasticsearch Index

This project focuses on scraping a website containing Q&A data and utilizing either Haystack or Elasticsearch index to search for answers and return the best possible responses.

Technologies Used

  • API
  • Python
  • Google Colab
  • Haystack
  • Elasticsearch

Introduction

The project involves web scraping a targeted website to extract valuable Q&A data. Subsequently, we employ either Haystack or Elasticsearch to index this data for efficient search operations.

Setup and Installation

  1. Clone the Repository:

    git clone <repository_url>
    cd <repository_directory>
    
  2. Install Dependencies:

    pip install -r requirements.txt
    
  3. Haystack Setup (or Elasticsearch Setup):

    • For Haystack setup, follow the instructions in the Haystack documentation.
    • For Elasticsearch setup, follow the Elasticsearch documentation.
  4. Run the Project:

    • Execute the main script to scrape the website and perform searches using the chosen technology. python main.py

Usage

  1. Scraping:

    • Run the web scraping script to extract Q&A data from the target website.
  2. Indexing:

    • Use Haystack or Elasticsearch to index the scraped Q&A data for efficient searching.
  3. Searching:

    • Utilize the search capabilities of Haystack or Elasticsearch to search for answers within the indexed data.

Contributing

If you wish to contribute to this project, please follow these steps:

  1. Fork the repository and create a new branch.
  2. Make your changes and thoroughly test them.
  3. Create a pull request, explaining the changes you have made.

License

This project is licensed under the MIT License.

Releases

No releases published

Packages

No packages published