This is a tensorflow implementation of the 'Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation'
'python3 -m vfi.src.train'
'python3 -m vfi.src.eval'
EMA is trained on a proprietary dataset with one million internet video clips, each comprising 3 frames.
- We use the publcicly available Vimeo-90k dataset (Xue et al., 2019), which is commonly used dataset for video frame interpolation