Name	Name	Last commit message	Last commit date
Latest commit History 5 Commits
config	config
data	data
data_reader	data_reader
models	models
utils	utils
README.md	README.md
eval.sh	eval.sh
evaluate.py	evaluate.py
requirements.txt	requirements.txt
train.py	train.py
train.sh	train.sh

Name

Last commit message

Last commit date

5 Commits

INFO: Intellectual and Friendly Dialogue Agents grounding

Source codes for the paper "You Truly Understand What I Need: Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona", accepted at EMNLP 2022 Findings.

1. Setup

1.1 Environmental Setup

The code runs with python 3.6. All dependencies are listed in requirements.txt

pip install -r requirements.txt

1.2 Dataset

You can download FoCus Dataset (Persona-Knowledge Chat) in here

1.3 Create a knowledge index

Since we use RAG for dialogue generation, you need to create a knowledge index file for the generation.
Before creating a knowledge index, you need to move Focus dataset into the data/ folder.

|-- data
    |-- FoCus
        |-- train_focus.json
        `-- valid_focus.json

1) The preprocessing code for creating raw knowledge is in the knowledge_index folder

create_knowledge_index_for_github.ipynb

2) The code for creating a knowledge index file is as below

python use_own_knowledge_dataset --csv_path=your file --output_dir=your dir

or you can simply run sh file

sh create_knowldege_index.sh

we used the same file in the transformers Github but modified it a bit for preprocessing the raw knowledge

3) After creating a knowledge index for FoCus Dataset, you should change your path in the config/rag-tok-base-ct.json

"data_dir": 
"save_dirpath": 
"knowledge_dataset_path": 
"knowledge_index_path":

2. Training

Before you train the model, please modify the config file.

sh train.sh

3. Evaluate

sh evaluate.sh

About

Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which is accepted to EMNLP 2022 (Findings)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

INFO: Intellectual and Friendly Dialogue Agents grounding

1. Setup

1.1 Environmental Setup

1.2 Dataset

1.3 Create a knowledge index

2. Training

3. Evaluate

About

Releases

Packages

Contributors 2

Languages

dlawjddn803/INFO

Folders and files

Latest commit

History

Repository files navigation

INFO: Intellectual and Friendly Dialogue Agents grounding

1. Setup

1.1 Environmental Setup

1.2 Dataset

1.3 Create a knowledge index

2. Training

3. Evaluate

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages