DSpace at KOASAS: Talking Face Generation with Multilingual TTS

DSpace at KOASAS

RIMS Collection RIMS Conference Papers

Talking Face Generation with Multilingual TTS

Cited 0 time in webofscience

Cited 0 time in

Hit : 125
Download : 0

Export

Song, Hyoung-Kyu researcher / Woo, Sang Hoon / Lee, Junhyeok / Yang, Seungmin / Cho, Hyunjae / Lee, Youseong / Choi, Dongho / Kim, Kang-Wook

Recent studies in talking face generation have focused on building a model that can generalize from any source speech to any target identity. A number of works have already claimed this functionality and have added that their models will also generalize to any language. However, we show, using languages from different language families, that these models do not translate well when the training language and the testing language are sufficiently different. We reduce the scope of the problem to building a language-robust talking face generation system on seen identities, i.e., the target identity is the same as the training identity. In this work, we introduce a talking face generation system that generalizes to different languages. We evaluate the efficacy of our system using a multilingual text-to-speech system. We present the joint text-to-speech system and the talking face generation system as a neural dubber system. Our demo is available at https://bit.ly/ml-face-generation-cvpr22-demo. Also, our screencast is uploaded at https://youtu.be/F6h0s0M4vBI.

Publisher: IEEE Computer Society

Issue Date: 2022-06

Language: English

Citation: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, pp.21393 - 21398

ISSN: 1063-6919

DOI: 10.1109/CVPR52688.2022.02074

URI: http://hdl.handle.net/10203/312784

Appears in Collection: RIMS Conference Papers

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Talking Face Generation with Multilingual TTS

KOASAS

Communities & Collections