Calls API

Calls API

Calls API

Project Description

This project was developed as part of the DevChallenge IT XXI Backend category. It processes and analyzes telephone conversations to extract structured datasets for analysis. The system extracts details such as names, locations, emotional tones, and categorizes conversations based on content. It operates without internet dependency and supports local file processing. Detailed task you can see in the task.md

Features

Submit audio files via a URL for processing.
Extract key information, including names and locations mentioned in conversations.
Determine the emotional tone of conversations.
Categorize conversations into relevant groups.
Support for multiple audio formats (e.g., WAV, MP3).
RESTful API accessible through a user-friendly documentation interface.
Local file handling and offline processing capabilities.

Technologies Used

Backend Framework

FastAPI: A modern, fast (high-performance) web framework for building APIs with Python 3.6+ based on standard Python type hints.

Database

PostgreSQL: A powerful, open-source object-relational database system.
SQLAlchemy: SQL toolkit and ORM for database interaction.
Alembic: A lightweight database migration tool for SQLAlchemy.

Audio Processing

Whisper: OpenAI’s automatic speech recognition model, used for transcribing audio to text.
ffmpeg: A multimedia framework for handling audio and video processing.

Natural Language Processing (NLP)

SpaCy: Used for extracting names and locations from transcriptions.
TextBlob: Provides tools for text analysis, such as sentiment analysis.

Containerization

Docker: To containerize the application for consistent deployment across environments.
Docker Compose: For defining and running multi-container Docker applications.

Asynchronous Programming

aiohttp: Used for asynchronous HTTP requests.

Dependency Management

Poetry: Python packaging and dependency management tool.

Installation and Usage

Running the App via Docker Compose

Prerequisites

Docker Desktop installed on your machine.

Run the following commands in the app directory:

docker-compose up -d --build

After the first run, the database and additional resources (e.g., whisper/base.pt) will be set up. This may take some time. To restart the server:

docker-compose down

docker-compose up -d

Access the API via the documentation interface at:

http://localhost:8080/docs#/

Running the App Manually for the First Time

See detailes in the readme.md

Access the API at:

http://localhost:8080/docs#/

Running the App Manually Next Time

From the app directory:

poetry shell
uvicorn main:app --port 8080 --reload

Plans for Future Enhancements

Add more detailed error handling and logging.
Add support for more audio file formats.
Improve the accuracy of name and location extraction.
Enhance the emotional tone detection algorithm.
Add more categories and improve category detection.
Implement a web-based user interface for easier interaction with the API.

Contributing

Fork the repository.
Create a new feature branch:
```
git checkout -b feature-name
```
Commit your changes:
```
git commit -m 'Add some feature'
```
Push to the branch:
```
git push origin feature-name
```
Open a pull request.

Links to the docs

License

This project is licensed under the MIT License. See the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
online_submission		online_submission
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
task.md		task.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Calls API

Project Description

Features

Technologies Used

Backend Framework

Database

Audio Processing

Natural Language Processing (NLP)

Containerization

Asynchronous Programming

Dependency Management

Installation and Usage

Running the App via Docker Compose

Prerequisites

Running the App Manually for the First Time

See detailes in the readme.md

Running the App Manually Next Time

Plans for Future Enhancements

Contributing

Links to the docs

License

About

Uh oh!

Languages

License

OleksiyM/calls-api

Folders and files

Latest commit

History

Repository files navigation

Calls API

Project Description

Features

Technologies Used

Backend Framework

Database

Audio Processing

Natural Language Processing (NLP)

Containerization

Asynchronous Programming

Dependency Management

Installation and Usage

Running the App via Docker Compose

Prerequisites

Running the App Manually for the First Time

See detailes in the readme.md

Running the App Manually Next Time

Plans for Future Enhancements

Contributing

Links to the docs

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages