This project trains Large Multimodal Models (LMMs) to detect and respond appropriately to unsolvable robotic tasks. Combining synthetic data generation with fine-tuning, we develop a model that identifies when a requested task is beyond a robot's capabilities and explains clearly why the task cannot be performed.
- Synthetic data generation for unsolvable robotic tasks
- Fine-tuned LLaVA model for task feasibility detection
- Clear explanation generation for unsolvable tasks
- Support for both Stable Diffusion (SD) and Habitat-generated scenarios
Download our synthetic dataset from: Box
- sd_images: the dataset generated with Stable Diffusion (SD)
  - generated_tasks.jsonl: robot responses explaining why each task cannot be performed
  - images: the images generated by SD
- habitat_images: the dataset generated with the Habitat Simulator
  - generated_tasks.jsonl: robot responses explaining why each task cannot be performed
  - images: the images rendered by the Habitat Simulator
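Each dataset split pairs an image directory with a `generated_tasks.jsonl` file, one JSON record per line. A minimal loading sketch (the field names `image`, `task`, and `response` are assumptions for illustration, not the dataset's documented schema):

```python
import json
from pathlib import Path

def load_tasks(jsonl_path):
    """Read one JSON record per line from a generated_tasks.jsonl file."""
    records = []
    with open(jsonl_path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records

# Example: write a tiny file in the assumed schema and read it back.
sample = {"image": "images/0001.png",
          "task": "Pick up the cup on the table.",
          "response": "The task is unsolvable: there is no cup in the scene."}
path = Path("generated_tasks.jsonl")
path.write_text(json.dumps(sample) + "\n", encoding="utf-8")
tasks = load_tasks(path)
print(tasks[0]["response"])
```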
Download our fine-tuned model weights from: Huggingface
Base model: LLaVA-1.5-7B
- Clone the repository:
```bash
git clone https://github.com/linyueqian/ME555_Final_Project
cd ME555_Final_Project
```
- Install dependencies:
```bash
pip install -r requirements.txt
```
To run inference with our fine-tuned model:
```bash
python inference.py
```
This will load the fine-tuned model, process the input, and generate the model's response.
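Under the hood, LLaVA-1.5 consumes a single-turn conversation prompt containing an `<image>` token. A hedged sketch of how a task-feasibility prompt might be assembled (the instruction wording and function name are illustrative assumptions, not the project's exact prompt):

```python
def build_feasibility_prompt(task: str) -> str:
    """Assemble a LLaVA-1.5 style prompt asking whether a task is solvable.

    The <image> token is replaced by image features at inference time.
    The instruction text below is illustrative, not the project's exact prompt.
    """
    instruction = (
        "Decide whether the robot can perform the requested task in this scene. "
        "If it cannot, explain clearly why the task is unsolvable."
    )
    return f"USER: <image>\n{instruction}\nTask: {task}\nASSISTANT:"

prompt = build_feasibility_prompt("Water the plant on the windowsill.")
print(prompt)
```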
To generate synthetic data:
```bash
python task_generation/generate.py
python image_generation/generate.py
```
These scripts generate the synthetic task text and images, respectively, and save them to their output files.
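The generation step ultimately serializes task/response records as JSON Lines. A minimal sketch of that output step (the output path and field names are assumptions based on the dataset layout described above):

```python
import json
from pathlib import Path

def save_records(records, out_path):
    """Append task/response records to a JSON Lines file, one object per line."""
    out = Path(out_path)
    out.parent.mkdir(parents=True, exist_ok=True)
    with out.open("a", encoding="utf-8") as f:
        for rec in records:
            f.write(json.dumps(rec, ensure_ascii=False) + "\n")

# Illustrative record in the assumed schema.
records = [
    {"image": "images/0001.png",
     "task": "Open the locked safe without a key.",
     "response": "Unsolvable: the safe is locked and no key is visible."},
]
save_records(records, "sd_images/generated_tasks.jsonl")
print(Path("sd_images/generated_tasks.jsonl").read_text(encoding="utf-8").strip())
```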
If you find this project useful, please cite our project:
@software{lin_yueqian_2024_ME555_Final_Project,
  author  = {Lin, Yueqian and Yang, Yixuan},
  title   = {{ME555_Final_Project}},
  year    = {2024},
  license = {Apache-2.0},
  url     = {https://github.com/linyueqian/ME555_Final_Project}
}