The goal of this project is to create a multi-modal implementation of the Transformer architecture in Swift. It's a learning exercise for me, so I've taken it slowly, starting from a simple image classifier and building it up.
It's also an attempt to answer the question of whether Swift for TensorFlow is ready for non-trivial work.
The use case is based on the paper "Learning a bidirectional mapping between human whole-body motion and natural language using deep recurrent neural networks" by Matthias Plappert. He created a nice dataset of a few thousand motions, "The KIT Motion-Language Dataset" (paper, website).
The Motion2Language Transformer, which kind of works, is already there. I'm working towards completing the language2motion solution.
I'm using a modified Swift Transformer implementation by Andre Carrera.
- something 2 label
  - image 2 label
    - build image2label dataset with images representing motions
    - assign 5 dummy(ish) classes with PCA and k-means on motion annotations
    - classify motion images (+in fastai, +in swift)
  - language 2 label
    - Transformer encoder on annotation + classifier (pattern sketched after this list)
    - batched prediction
    - Use BERT classifier to assign better labels - didn't work
    - manually assign better labels
  - motion 2 label
    - 1-channel ResNet on motion + classifier
    - ResNet feature extractor + Transformer encoder on motion features + classifier - didn't work
    - Transformer encoder on motion + classifier
    - image 2 label
- language 2 language
  - Transformer seq2seq from annotation to label text
  - Transformer seq2seq from annotation to (same) annotation
- motion 2 language
  - Transformer from motion to annotation (decoding loop sketched after this list)
- language 2 motion
  - Transformer encoder on annotation
  - Transformer decoder on motion (in progress)
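Several of the "X 2 label" steps above share one pattern: a feature extractor (a Transformer encoder over annotation tokens, or a ResNet over a motion rendered as a 2D array) followed by a small classification head. Below is a minimal Swift for TensorFlow sketch of that pattern, not the project's actual code; the `FeatureClassifier` type, the mean-pooling step and the commented-out training step are illustrative assumptions.

```swift
import TensorFlow

/// Illustrative sketch (not the project's code): a generic "feature extractor +
/// classifier head" model, as used in the language2label and motion2label steps.
/// The extractor is assumed to return per-timestep features [batch, time, featureSize].
struct FeatureClassifier<Extractor: Layer>: Layer
where Extractor.Input == Tensor<Float>, Extractor.Output == Tensor<Float> {
    var extractor: Extractor
    var head: Dense<Float>

    init(extractor: Extractor, featureSize: Int, classCount: Int) {
        self.extractor = extractor
        self.head = Dense<Float>(inputSize: featureSize, outputSize: classCount)
    }

    @differentiable
    func callAsFunction(_ input: Tensor<Float>) -> Tensor<Float> {
        let features = extractor(input)              // [batch, time, featureSize]
        let pooled = features.mean(squeezingAxes: 1) // mean-pool over the time axis
        return head(pooled)                          // class logits: [batch, classCount]
    }
}

// Hypothetical training step, assuming `model`, `optimizer` and a `batch` with
// `features` and integer `labels` are defined elsewhere:
//
//   let (loss, grad) = valueWithGradient(at: model) { model -> Tensor<Float> in
//       softmaxCrossEntropy(logits: model(batch.features), labels: batch.labels)
//   }
//   optimizer.update(&model, along: grad)
```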
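The seq2seq steps (language 2 language, motion 2 language, language 2 motion) all use an encoder-decoder Transformer, and at inference time the decoder generates the target sequence autoregressively. The toy loop below sketches greedy decoding only; the `step` closure stands in for a call into the real Transformer decoder and is an assumption, not the project's API.

```swift
import TensorFlow

/// Toy greedy decoder illustrating autoregressive generation in the seq2seq
/// steps (e.g. motion 2 language). `step` stands in for "run the Transformer
/// decoder over the target tokens generated so far (plus the encoded source)
/// and return logits over the vocabulary for the next token".
func greedyDecode(
    startToken: Int32,
    endToken: Int32,
    maxLength: Int,
    step: ([Int32]) -> Tensor<Float>
) -> [Int32] {
    var tokens: [Int32] = [startToken]
    for _ in 0..<maxLength {
        let logits = step(tokens)                                      // shape: [vocabularySize]
        let next: Int32 = logits.argmax(squeezingAxis: 0).scalarized() // most likely next token
        tokens.append(next)
        if next == endToken { break }                                  // stop at end-of-sentence
    }
    return tokens
}
```

For language 2 motion the loop would presumably emit continuous motion frames rather than vocabulary indices, which is the decoder part still marked as in progress above.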
Dataset files:

- original: 2017-06-22.zip
- processed:
- annotations and labels:
- vocabulary: