BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for
4K Video Frame Interpolation

Junhuem Park
jhpark@mcl.korea.ac.kr
Korea University

Jintae Kim
jtkim@mcl.korea.ac.kr
Korea University

Chang-Su Kim
cskim@mcl.korea.ac.kr
Korea University

Abstract

A novel 4K video frame interpolator based on bilateral transformer (BiFormer) is proposed in this paper, which performs three steps: global motion estimation, local motion refinement, and frame synthesis. First, in global motion estimation, we predict symmetric bilateral motion fields at a coarse scale. To this end, we propose Bi- Former, the first transformer-based bilateral motion estimator. Second, we refine the global motion fields efficiently using blockwise bilateral cost volumes (BBCVs). Third, we warp the input frames using the refined motion fields and blend them to synthesize an intermediate frame. Extensive experiments demonstrate that the proposed Bi- Former algorithm achieves excellent interpolation performance on 4K datasets. The source codes are available at https://github.com/JunHeum/BiFormer.

Overview

Bilateral attention

Demo video

Publication

Junhuem Park, Jintae Kim, and Chang-Su Kim, "BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation," accepted to Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
[arxiv] [code]