A Software Facial Expression Synthesizer with The Chinese Text-to-Speech Function
Author(s)
DOI
20060927122908476468
Abstract
The proposed system, Image Talk, is a real-time synthetic
talking head that works from a single image and has
Chinese text-to-speech capability. Image Talk uses that
single image to automatically create video-like talking
sequences in real time. The image can be acquired
from photographs, video clips, or hand-drawn characters.
This interactive system accepts Chinese text input and
talks back in Mandarin Chinese, generating facial
expressions in real time.
Image Talk analyzes Chinese text by converting it to a
standard Pinyin system used in Taiwan and dynamically
fetching the associated facial expressions from an
expression pool. The expressions are synchronized with
the synthetic speech and played back as a video-like
talking sequence in real time.
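
The lookup-and-synchronize step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the pool contents, the mouth-shape labels, the rule of choosing a shape from the last vowel of a syllable, and the names `EXPRESSION_POOL`, `pick_expression`, and `schedule` are all assumptions made for illustration. The sketch also assumes the text has already been converted to romanized syllables and that the speech synthesizer reports a duration for each one.

```python
from dataclasses import dataclass

# Hypothetical expression pool: vowel -> mouth-shape label (illustrative only).
EXPRESSION_POOL = {
    "a": "open_wide",
    "o": "rounded",
    "e": "half_open",
    "i": "spread",
    "u": "pursed",
}
NEUTRAL = "closed"


@dataclass
class Keyframe:
    start_ms: int
    duration_ms: int
    expression: str


def pick_expression(syllable: str) -> str:
    """Return the pool entry for the last vowel in the syllable, else neutral."""
    for ch in reversed(syllable.lower()):
        if ch in EXPRESSION_POOL:
            return EXPRESSION_POOL[ch]
    return NEUTRAL


def schedule(syllables, durations_ms):
    """Lay keyframes end to end so each expression spans its syllable's audio."""
    if len(syllables) != len(durations_ms):
        raise ValueError("need exactly one duration per syllable")
    frames, t = [], 0
    for syl, dur in zip(syllables, durations_ms):
        frames.append(Keyframe(t, dur, pick_expression(syl)))
        t += dur
    return frames


if __name__ == "__main__":
    for kf in schedule(["ni", "hao"], [200, 300]):
        print(kf)
```

Because each keyframe starts where the previous syllable's audio ends, the mouth shapes stay aligned with the speech as long as the reported durations are accurate.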
Image Talk also incorporates eye blinking and small-scale
head rotation and translation perturbations to make the
results more natural. The generic Talk Mask can also
easily be applied to any other facial or non-facial image,
such as a dog, for special effects. The result is quite
entertaining and can readily be used as a new human-machine
interface, as well as for lip synchronization of
computer-animated characters.
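
One way to picture the blinking and small head perturbations is as a per-frame stream of idle-motion parameters. The following is a sketch under assumptions, not the method used in the paper: the bounded random walk, the blink interval, and every parameter name and default value (`max_angle_deg`, `max_shift_px`, `blink_gap_s`, `blink_frames`) are hypothetical.

```python
import random


def clamp(value, low, high):
    """Keep value inside [low, high]."""
    return max(low, min(high, value))


def idle_motion(n_frames, fps=30, seed=0, max_angle_deg=2.0,
                max_shift_px=1.5, blink_gap_s=(2.0, 5.0), blink_frames=3):
    """Return one dict per frame with a small rotation, shift, and blink flag."""
    rng = random.Random(seed)
    angle = dx = dy = 0.0
    next_blink = int(rng.uniform(*blink_gap_s) * fps)
    frames = []
    for i in range(n_frames):
        # Bounded random walk: drift a little each frame, never past the limits.
        angle = clamp(angle + rng.uniform(-0.2, 0.2), -max_angle_deg, max_angle_deg)
        dx = clamp(dx + rng.uniform(-0.1, 0.1), -max_shift_px, max_shift_px)
        dy = clamp(dy + rng.uniform(-0.1, 0.1), -max_shift_px, max_shift_px)

        # Eyes are closed for blink_frames consecutive frames.
        blinking = next_blink <= i < next_blink + blink_frames
        if i >= next_blink + blink_frames:
            # Blink finished: schedule the next one after a random gap.
            next_blink = i + int(rng.uniform(*blink_gap_s) * fps)

        frames.append({"angle_deg": angle, "dx": dx, "dy": dy, "blink": blinking})
    return frames


if __name__ == "__main__":
    for frame in idle_motion(5):
        print(frame)
```

A renderer could apply each frame's rotation and shift to the face image before drawing the current mouth shape, so the head never sits perfectly still between syllables.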
Subjects
Facial Animation
Image Warping
Real-time
Lip Synchronization
Texture Mapping
Publisher
Taipei: Department of Computer Science and Information Engineering, National Taiwan University
Type
other
File(s)
Name
rams98.pdf
Size
228.82 KB
Format
Adobe PDF
Checksum
(MD5):18744d707b4419d189c120c530836c4c