This project scrapes a website containing Q&A data and uses either Haystack or an Elasticsearch index to search that data and return the best-matching answers.
- API
- Python
- Google Colab
- Haystack
- Elasticsearch
The project scrapes a target website to extract Q&A data, then indexes that data with either Haystack or Elasticsearch so it can be searched efficiently.
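As a rough illustration of the scraping step, the sketch below pulls question and answer pairs out of an HTML page using only the Python standard library. The class names `question` and `answer`, along with the `QAParser` and `parse_qa` names, are assumptions made for this example and are not taken from the project; the real markup of the target site will differ.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class QAParser(HTMLParser):
    """Collect text from elements whose class is "question" or "answer".

    Both class names are hypothetical; adjust them to the scraped site.
    """

    def __init__(self):
        super().__init__()
        self.pairs = []        # finished {"question": ..., "answer": ...} records
        self._field = None     # field currently being read, if any
        self._depth = 0        # nesting depth inside the current field
        self._buffer = []      # text fragments of the current field
        self._question = None  # last question still waiting for its answer

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            if self._field is not None:
                self._buffer.append(" ")  # treat <br> and friends as whitespace
            return
        if self._field is not None:
            self._depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        for name in ("question", "answer"):
            if name in classes:
                self._field, self._depth, self._buffer = name, 1, []
                break

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._field is None:
            return
        self._depth -= 1
        if self._depth == 0:
            # Collapse runs of whitespace into single spaces.
            text = " ".join("".join(self._buffer).split())
            if self._field == "question":
                self._question = text
            elif self._question is not None:
                # An answer with no preceding question is dropped.
                self.pairs.append({"question": self._question, "answer": text})
                self._question = None
            self._field = None

    def handle_data(self, data):
        if self._field is not None:
            self._buffer.append(data)


def parse_qa(html):
    """Return a list of question/answer dicts found in an HTML string."""
    parser = QAParser()
    parser.feed(html)
    parser.close()
    return parser.pairs
```

Fetching the page itself (for example with `urllib.request` or a third-party HTTP client) is left out so the sketch stays runnable offline.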
- Clone the Repository:
  git clone <repository_url>
  cd <repository_directory>
- Install Dependencies:
  pip install -r requirements.txt
- Haystack Setup (or Elasticsearch Setup):
  - For Haystack setup, follow the instructions in the Haystack documentation.
  - For Elasticsearch setup, follow the Elasticsearch documentation.
- Run the Project:
  - Execute the main script to scrape the website and perform searches using the chosen technology:
    python main.py
- Scraping:
  - Run the web scraping script to extract Q&A data from the target website.
- Indexing:
  - Use Haystack or Elasticsearch to index the scraped Q&A data for efficient searching.
- Searching:
  - Query the indexed data through Haystack or Elasticsearch to find answers.
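The indexing and searching steps above can be sketched as follows, assuming the scraped data is a list of dicts with `question` and `answer` keys. The index name `qa` and all function names are hypothetical. `to_bulk_actions` and `match_query` only build plain dicts in the shape Elasticsearch expects (bulk actions and a `match` query body); `search_local` is a simple in-memory TF-IDF stand-in so the example runs without a server, and it is not how Elasticsearch or Haystack actually score documents.

```python
import math
import re
from collections import Counter


def to_bulk_actions(pairs, index_name="qa"):
    """Shape records as action dicts of the kind elasticsearch.helpers.bulk accepts."""
    return [
        {"_index": index_name, "_id": i, "_source": pair}
        for i, pair in enumerate(pairs)
    ]


def match_query(text, size=3):
    """Build the body of a full-text `match` query on the question field."""
    return {"size": size, "query": {"match": {"question": text}}}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def search_local(pairs, text, size=3):
    """Rank records by a simple TF-IDF overlap on the question field.

    In-memory stand-in for a real search backend, for illustration only.
    """
    docs = [Counter(tokenize(pair["question"])) for pair in pairs]
    # Number of documents each term appears in.
    doc_freq = Counter(term for doc in docs for term in doc)
    query_terms = set(tokenize(text))
    scored = []
    for pair, doc in zip(pairs, docs):
        score = sum(
            doc[term] * math.log(1 + len(docs) / doc_freq[term])
            for term in query_terms
            if term in doc
        )
        if score > 0:
            scored.append((score, pair))
    # Highest score first; ties keep their original order.
    scored.sort(key=lambda item: item[0], reverse=True)
    return [pair for _, pair in scored[:size]]
```

With a running cluster and the official Python client installed, the actions could be passed to `elasticsearch.helpers.bulk` and the query body to the client's `search` method; the exact keyword arguments differ between client versions, so check the documentation for the version in `requirements.txt`.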
If you wish to contribute to this project, please follow these steps:
- Fork the repository and create a new branch.
- Make your changes and thoroughly test them.
- Create a pull request, explaining the changes you have made.
This project is licensed under the MIT License.