This project explores the capability of Large Language Models (LLMs), particularly BERT-based models, to understand and perform arithmetic tasks involving numerals. We fine-tuned models on single-digit addition and subtraction and extended the experiments to a selection of double-digit numerals.
The datasets were constructed using two templates of arithmetic equations involving the operators + (plus) and - (minus), ensuring non-negative results. The training dataset contains 3600 samples, with additional datasets for testing single-digit (1860 samples) and double-digit (7200 samples) numerals.
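As a rough sketch of how such a dataset could be generated (the exact template strings and sampling strategy are assumptions; the repository's create_datasets.sh, described below, produces the actual datasets):

```python
import random

def make_sample(low: int = 0, high: int = 9) -> str:
    """Build one equation from a '+' or '-' template, keeping results non-negative."""
    a, b = random.randint(low, high), random.randint(low, high)
    if random.random() < 0.5:
        return f"{a} + {b} = {a + b}"
    a, b = max(a, b), min(a, b)  # order operands so the difference is non-negative
    return f"{a} - {b} = {a - b}"

train = [make_sample() for _ in range(3600)]  # matches the training set size
```

The models were fine-tuned with the following hyperparameters: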
- Epochs: 10
- Batch size: 32
- Learning rate: started at 2.0e-4, decaying exponentially to 2.0e-5 (see the sketch below)
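This schedule can be reproduced with PyTorch's ExponentialLR, shown here as a minimal sketch under the assumption that the decay is applied once per epoch (the model and optimizer are placeholders, not the project's actual training loop):

```python
import torch

model = torch.nn.Linear(10, 10)  # placeholder for the BERT model being fine-tuned
optimizer = torch.optim.AdamW(model.parameters(), lr=2.0e-4)

# Choose gamma so that 2.0e-4 * gamma**10 == 2.0e-5, i.e. gamma = 0.1 ** (1 / 10).
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.1 ** (1 / 10))

for epoch in range(10):
    # ... one training epoch ...
    scheduler.step()  # learning rate ends at ~2.0e-5 after epoch 10
```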
This preprocessing strategy aims to improve numeral understanding by focusing on positional values, breaking down numerals into individual digits (e.g., "23" becomes "2 3").
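A minimal sketch of this digit-level splitting, assuming it is applied to the raw equation strings before tokenization (the function name is illustrative, not the repository's):

```python
import re

def split_digits(text: str) -> str:
    """Rewrite every multi-digit numeral as space-separated digits, e.g. "23" -> "2 3"."""
    return re.sub(r"\d+", lambda m: " ".join(m.group()), text)

print(split_digits("12 + 34 = 46"))  # -> "1 2 + 3 4 = 4 6"
```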
A modified masking approach targets spans of consecutive digits, enhancing the model's ability to understand and predict entire numerals within arithmetic contexts.
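One way to realize such digit-span masking, sketched under the assumption that numerals have already been split into single-digit tokens as above and that one span is masked per example (the repository's selection strategy may differ):

```python
import random

def mask_digit_span(tokens, mask_token="[MASK]"):
    """Mask one randomly chosen run of consecutive digit tokens in its entirety."""
    spans, start = [], None
    for i, tok in enumerate(tokens + [""]):  # "" sentinel flushes a trailing run
        if tok.isdigit() and start is None:
            start = i
        elif not tok.isdigit() and start is not None:
            spans.append((start, i))
            start = None
    if not spans:
        return tokens
    s, e = random.choice(spans)
    return tokens[:s] + [mask_token] * (e - s) + tokens[e:]

print(mask_digit_span("1 2 + 3 4 = 4 6".split()))
```

Masking every digit of a numeral at once forces the model to predict the whole number, rather than filling in a single digit from its neighbors.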
To set up the project, follow these steps:
- Clone the repository:
```
git clone http://github.com/abbassix/arithBERT
```

- (Optional) Create a virtual environment and activate it:

```
python3.10 -m venv .venv
```

On Windows:

```
.venv\Scripts\activate
```

On macOS/Linux:

```
source .venv/bin/activate
```

- Install the required dependencies:
```
pip install -r requirements.txt
```

To run the project, you first need to create the training and test datasets. Before that, you need a directory named datasets in the parent directory of the project to store the datasets. You will also need two more directories: models, to load and store the models, and results, to store the resulting accuracies.
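Assuming the repository was cloned as above, the resulting layout would look like this (the folder name arithBERT comes from the clone URL; the directory roles are as described in the text):

```
parent_directory/
├── arithBERT/    # this repository
├── datasets/     # training and test datasets
├── models/       # models loaded from or saved to disk
└── results/      # accuracies and graphs
```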
To check whether the required directories exist, and to create them if they do not, run the following lines:
```
chmod +x create_dirs.sh
./create_dirs.sh
```

Then run the following lines to create the 6 different training and test datasets:
```
chmod +x create_datasets.sh
./create_datasets.sh
```

After creating the datasets, you have to fine-tune the models. If the models are already stored in the models directory, they will be loaded from there; otherwise, they will be downloaded from the Hugging Face Hub. The arguments used to fine-tune the models are stored in three different YAML files.
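As a purely hypothetical illustration of such a config, built from the hyperparameters listed earlier (the key names and the model identifier are assumptions, not the repository's actual schema):

```yaml
# Hypothetical config; consult the repository's YAML files for the real keys.
model_name: bert-base-uncased   # assumed checkpoint; any BERT-based model
epochs: 10
batch_size: 32
lr_start: 2.0e-4                # initial learning rate
lr_end: 2.0e-5                  # learning rate after exponential decay
```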
Run the following lines to fine-tune the three different scenarios and compare the results:

```
chmod +x finetune.sh
./finetune.sh
```

Finally, we compare the results with graphs. Run the following line to create the graphs in the results directory:
```
python graphs.py
```