Skip to content

Leolty/tablellm

Repository files navigation

TableLLM Repository

Official implementation of the paper "Rethinking Tabular Data Understanding with Large Language Models" (https://arxiv.org/abs/2312.16702).

🚀 Installation

🛠 Clone the Repository

Start by cloning the repository to your local machine:

git clone https://github.com/Leolty/tablellm.git
cd tablellm

📦 Install Dependencies

Create and activate a new environment, and install the required packages:

conda create -n tablellm python=3.10
conda activate tablellm
pip install -r requirements.txt

🗂 Data

Unzip the dataset provided in the repository:

unzip assets/data.zip

After unzipping, you should have the following files:

data
├── wtq.json
├── tabfact.json

🔬 Experimentation

🔄 Reproducing the Results

For replicating our study's findings, navigate to the scripts folder:

Detailed explanations of parameters can be found in run_cot.py and run_agent.py.

🤖 Using the Table Agent

For hands-on experience with the table agent, refer to the following notebook:

📚 Citation

If you find this research useful in your work, please consider citing:

@misc{liu2023rethinking,
      title={Rethinking Tabular Data Understanding with Large Language Models}, 
      author={Tianyang Liu and Fei Wang and Muhao Chen},
      year={2023},
      eprint={2312.16702},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}