This is the official implementation of TeSMo, a method for text-controlled scene-aware motion generation, from the ECCV 2024 paper: "Generating Human Interaction Motions in Scenes with Text Control".
Our locomotion dataset is built from 3D-Front and HumanML3D, while our interaction data depends on SAMP. Download all three source datasets before building ours.
Run these scripts to process the datasets.
If you find this work useful for your research, please cite our paper:
@inproceedings{yi2024tesmo,
author={Yi, Hongwei and Thies, Justus and Black, Michael J. and Peng, Xue Bin and Rempe, Davis},
title={Generating Human Interaction Motions in Scenes with Text Control},
booktitle={European Conference on Computer Vision (ECCV)},
year={2024}
}