Deakin University
Browse

File(s) under permanent embargo

Short-Term and Long-Term Context Aggregation Network for Video Inpainting

Version 2 2024-06-05, 05:42
Version 1 2021-01-07, 14:49
conference contribution
posted on 2024-06-05, 05:42 authored by A Li, S Zhao, X Ma, M Gong, J Qi, R Zhang, D Tao, R Kotagiri
Video inpainting aims to restore missing regions of a video and has many applications such as video editing and object removal. However, existing methods either suffer from inaccurate short-term context aggregation or rarely explore long-term frame information. In this work, we present a novel context aggregation network to effectively exploit both short-term and long-term frame information for video inpainting. In the encoding stage, we propose boundary-aware short-term context aggregation, which aligns and aggregates, from neighbor frames, local regions that are closely related to the boundary context of missing regions into the target frame (The target frame refers to the current input frame under inpainting.). Furthermore, we propose dynamic long-term context aggregation to globally refine the feature map generated in the encoding stage using long-term frame features, which are dynamically updated throughout the inpainting process. Experiments show that it outperforms state-of-the-art methods with better inpainting results and fast inpainting speed.

History

Pagination

728-743

Location

Glasgow, Scotland

Start date

2020-08-23

End date

2020-08-28

ISSN

0302-9743

eISSN

1611-3349

ISBN-13

9783030585471

Language

eng

Publication classification

E1 Full written paper - refereed

Title of proceedings

ECCV 2020 : Proceedings of the 2020 European Conference of Computer Vision

Event

Computer Vision. European Conference (2020 : Glasgow, Scotland)

Publisher

Springer

Place of publication

Berlin, Germany

Series

Lecture Notes in Computer Science; v.12349

Usage metrics

    Research Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC