AI-based Question Answering System

This project aims to fine-tune some existing models from the Hugging Face Transformers library. AS a source of data i used some public articles (questions for interview) from GitHub

The jedi way of building a QA system

Collect data (as such as possible)
Preprocess data (clean, turn it into question-answer pairs or dialogue)
Augment data (add noise, add duplicates, add outliers)
Split data (train, validation, test)
Configure model (choose model architecture, hyperparameters)
Train model (fit model to data)
Evaluate model (check model performance on validation data)

Getting Started

Follow these steps to set up and run the project on your local machine.

Clone the repository:

git clone git@github.com:iashchak/ai-tools.git

Change to the project directory:
```
cd ai-tools
```
Install the required packages:

Init a new conda environment with environment.yml file (preffered)
Update current one with environment.yml file
```
conda env update --file environment.yml
```

Prerequisites

Python 3.8 or higher
PyTorch 1.9 or higher
Hugging Face Transformers library

Usage

To run the project, execute the Jupyter Notebook notebooks/process_interview_questions. This will download the dataset, create question-answer pairs, train the model, and test it with some example questions.

Roadmap

Data collection
Dataset creation (question-answer pairs)
Model training using Hugging Face Transformers
Model evaluation and testing
Improve dataset quality with better question generation
Increase the size and diversity of the dataset
Improve model performance with hyperparameter tuning
Implement a user-friendly interface for interacting with the model

Contributing

Please read CONTRIBUTING.md for details on how to contribute to the project.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github		.github
data		data
datasets		datasets
models		models
notebooks		notebooks
.black.toml		.black.toml
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
__init__.py		__init__.py
environment.yml		environment.yml
requirements_dev.txt		requirements_dev.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI-based Question Answering System

Table of Contents

The jedi way of building a QA system

Getting Started

Prerequisites

Usage

Roadmap

Contributing

License

About

Releases

Packages

Contributors 2

Languages

License

iashchak/ai-tools

Folders and files

Latest commit

History

Repository files navigation

AI-based Question Answering System

Table of Contents

The jedi way of building a QA system

Getting Started

Prerequisites

Usage

Roadmap

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages