How can MPEG-4 TTS interface operate with FA tools synchronously?
MPEG-4 TTS interface passes phoneme symbols with their duration and average pitch information to the phoneme-to-FAP (Facial Animation Parameter) converter. Then the phoneme-to-FAP converter generates FAPs with its duration for the corresponding phoneme and passes the information to FA tools. From this information FA tools can generate face images in synchronization with synthesized speech.
- Why does MPEG-4 TTS interface use IPA (International Phonetic Alphabet) as its standard representation for phoneme symbols?
- Will MPEG-4 standardise specific security tools or will it provide a generic interface to utilise such (private) tools?
- How can MPEG-4 TTS interface operate with FA tools synchronously?