Automated video editing based on learned styles using LSTM-GAN

Huang H.-I; CHI-SHENG SHIH; Yang Z.-L.; Huang H.-I;Shih C.-S;Yang Z.-L.

doi:10.1145/3477314.3507141

Automated video editing based on learned styles using LSTM-GAN

Journal

Proceedings of the ACM Symposium on Applied Computing

Pages

73-80

Date Issued

2022

Author(s)

Huang H.-I

CHI-SHENG SHIH

Yang Z.-L.

DOI

10.1145/3477314.3507141

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85130329395&doi=10.1145%2f3477314.3507141&partnerID=40&md5=9d4f3c372316709478132651a26e4e24

https://scholars.lib.ntu.edu.tw/handle/123456789/632567

Abstract

Experienced video editors use various editing techniques, including camera movement, types of shots, and shot compositions to create specific video semantics delivering messages to the viewers. In the video production process, the content of the video are essential, but so is the way to compose it. The goal of this work is to train a model learning how to edit the video that meets the videography requirements. This work proposes a deep generative model where both the generator and discriminator are unidirectional LSTM networks to generate the sequences of shot transitions for video editing. The proposed model learns different types of editing transitions from edited video clips. One is the stage performance of Korean music programs, and another is Chinese music programs. By combining different types of shots and camera movements, the proposed AI video editor brings various viewing experiences to the viewers. The quality of the generated shot sequences for video editing are evaluated by three metrics, which are creativity, inheritance, and diversity. The results show that the quality of the synthetic sequences generated by LSTM-GAN are better than those generated by the baseline model (Markov chain or LSTM). In summary, the quality of the sequence generated by LSTM-GAN is better than the quality generated by the Markov chain or LSTM while ensuring creativity, inheritance, and diversity at the same time. © 2022 ACM.

Subjects

GAN; LSTM; video editing

SDGs

[SDGs]SDG10

Other Subjects

Cameras; Long short-term memory; Semantics; User interfaces; Video recording; Automated video editing; Camera's movements; GAN; LSTM; Music program; Production process; Video editing; Video editor; Video production; Video semantics; Video signal processing

Type

conference paper

Automated video editing based on learned styles using LSTM-GAN

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)