This repository is a suite of tools and models for predicting antimicrobial resistance (AMR) with transformer-based architectures. The project uses state-of-the-art natural language processing (NLP) techniques to analyze genetic sequences and infer resistance profiles.
- Implementation of transformer and traditional architectures tailored for AMR prediction.
- Data preprocessing pipelines for genetic sequences.
- Automatic mixed precision (AMP) training for improved performance (see the sketch after this list).
- Support for various datasets and easy integration of new data sources.
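For reference, a typical PyTorch mixed-precision training step looks like the following. This is a generic sketch with a toy model and synthetic data (and assumes a CUDA device), not this repository's actual training loop:

```python
import torch
import torch.nn as nn

# Generic AMP sketch; the toy model and data are placeholders, not TARP's pipeline.
device = "cuda"
model = nn.Linear(128, 2).to(device)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
scaler = torch.amp.GradScaler(device)

inputs = torch.randn(32, 128, device=device)
targets = torch.randint(0, 2, (32,), device=device)

optimizer.zero_grad()
with torch.amp.autocast(device):                  # forward pass runs in mixed precision
    loss = nn.functional.cross_entropy(model(inputs), targets)
scaler.scale(loss).backward()                     # scale loss to avoid fp16 gradient underflow
scaler.step(optimizer)                            # unscales gradients, then steps the optimizer
scaler.update()                                   # adapts the loss-scale factor for the next step
```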
- Clone the repository:

  ```bash
  git clone https://github.com/debugst1ck/TARP.git
  ```

- Navigate to the project directory:

  ```bash
  cd TARP
  ```

- (Optional, Recommended) Create and activate a virtual environment:

  For Windows PowerShell:

  ```powershell
  Set-ExecutionPolicy Unrestricted -Scope Process
  python -m venv .venv
  .venv\Scripts\activate
  ```

  For Unix:

  ```bash
  python -m venv .venv
  source .venv/bin/activate
  ```

- Install the required dependencies (use the correct index URL for your CUDA version):

  ```bash
  pip install -e . --extra-index-url https://download.pytorch.org/whl/cu128  # For CUDA 12.8
  ```
- Prepare your dataset in the required format (FASTA files with corresponding labels); a loading sketch follows these steps.
- Run the training script with your dataset:

  ```bash
  tarp
  ```
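As a rough illustration of pairing FASTA sequences with labels, here is a minimal sketch. The file names, the CSV layout, and the `resistant` column are hypothetical placeholders, not TARP's documented schema:

```python
# Hypothetical illustration: pair FASTA sequences with a CSV of labels.
# "genes.fasta", "labels.csv", and the "resistant" column are assumptions,
# not this repository's required format.
import pandas as pd
from Bio import SeqIO  # pip install biopython

labels = pd.read_csv("labels.csv", index_col="sequence_id")

records = []
for record in SeqIO.parse("genes.fasta", "fasta"):
    if record.id in labels.index:
        records.append((record.id, str(record.seq), int(labels.loc[record.id, "resistant"])))

print(f"Loaded {len(records)} labelled sequences")
```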
The codebase is structured for easy experimentation with different transformer architectures and hyperparameters. The main components are data preprocessing, model training, evaluation, and visualization of results.
In an attention mask, a value of 1 or True indicates that the model should attend to that position, which holds actual input content. A value of 0 or False indicates that the model should not attend to that position, typically because it is padding.
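As a generic illustration (the pad token ID of 0 is an assumption, not necessarily what this repository's tokenizer uses), an attention mask can be derived directly from a padded batch:

```python
import torch

# Sketch: build an attention mask for a padded batch.
# pad_id = 0 is an assumption; the real preprocessing may use a different ID.
pad_id = 0
batch = torch.tensor([
    [5, 8, 2, 9, 0, 0],   # sequence of length 4, padded with two zeros
    [7, 3, 0, 0, 0, 0],   # sequence of length 2, padded with four zeros
])

attention_mask = (batch != pad_id).long()  # 1 = attend (real token), 0 = ignore (padding)
print(attention_mask)
# tensor([[1, 1, 1, 1, 0, 0],
#         [1, 1, 0, 0, 0, 0]])
```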
Class weights are calculated to address class imbalance in the dataset. The weights are inversely proportional to the frequency of each class, ensuring that the model pays more attention to minority classes during training.
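A minimal sketch of the general inverse-frequency technique (the labels are illustrative and the normalization shown is one common convention; the exact formula used here may differ):

```python
import numpy as np
import torch

# Sketch: inverse-frequency class weights for an imbalanced label set.
labels = np.array([0, 0, 0, 0, 0, 0, 0, 1, 1, 2])  # heavily imbalanced toy labels
counts = np.bincount(labels)                        # [7, 2, 1]

weights = len(labels) / (len(counts) * counts)      # "balanced" inverse-frequency weighting
print(weights)                                      # [0.476..., 1.666..., 3.333...]

# Pass to a weighted loss so minority classes contribute more per example.
criterion = torch.nn.CrossEntropyLoss(weight=torch.tensor(weights, dtype=torch.float32))
```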