Printed from www.flong.com
Contents © 2009 Golan Levin and Collaborators
Mouther
1995 | Golan Levin with Malcolm Slaney and Tom Ngo

Mouther is an experimental prototype I helped develop at Interval Research, in which an animated cartoon face is lip-synched in real time to the user's speech. I developed the prototype concept and artwork using Tom Ngo's in-house "Embedded Constraint Graphics" (ECG) animation engine, while Malcolm Slaney made Mouther possible by connecting these graphics to special speech-parsing technologies.
Mouther works by driving an embedded-constraint graphic with the output of a phoneme recognizer. The recognizer uses mel-frequency cepstral coefficients (MFCCs) as features and models each phoneme with a Gaussian mixture model (GMM); an incoming frame is assigned to the phoneme whose GMM gives it the maximum likelihood. Depending on the amount and diversity of the training data, either speaker-dependent or speaker-independent GMMs could be formed for each phoneme. To reduce the system's sensitivity to microphone and room acoustics, the MFCCs were filtered with RASTA (a widely accepted method for reducing the dependence of acoustic features on channel characteristics) prior to classification.
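As a rough illustration of the maximum-likelihood selection described above, the following Python sketch scores one feature vector against a GMM per phoneme and picks the best-scoring phoneme. It is not the original Interval Research code: the function names, the diagonal-covariance assumption, and the toy two-dimensional "features" standing in for RASTA-filtered MFCC vectors are all hypothetical and chosen only to keep the example short.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def log_gmm(x, components):
    """Log-likelihood of x under a GMM given as (weight, mean, var) tuples."""
    logs = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in components]
    peak = max(logs)
    # Log-sum-exp keeps the sum stable when the likelihoods are tiny.
    return peak + math.log(sum(math.exp(value - peak) for value in logs))


def classify_frame(x, phoneme_models):
    """Return the phoneme whose GMM assigns x the highest likelihood."""
    return max(phoneme_models, key=lambda p: log_gmm(x, phoneme_models[p]))


# Made-up models: one GMM per phoneme, each a list of
# (mixture weight, mean vector, per-dimension variance).
TOY_MODELS = {
    "aa": [(0.6, [0.0, 0.0], [1.0, 1.0]), (0.4, [1.0, 1.0], [1.0, 1.0])],
    "m": [(1.0, [5.0, 5.0], [1.0, 1.0])],
}

if __name__ == "__main__":
    print(classify_frame([0.2, 0.1], TOY_MODELS))
    print(classify_frame([4.8, 5.1], TOY_MODELS))
```

In a real recognizer the means, variances, and weights would be estimated from training speech, which is where the choice between speaker-dependent and speaker-independent models comes in.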
The result is a talking cartoon face whose animation is driven by the output of the classifier, and in which a different mouth position is displayed for each phoneme that is spoken. While further work would be needed to increase the reliability of the classifier and the realism of the transitions between different visemes, the current result is amusing and could be sufficiently responsive for the quality level needed in children's computer games.
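The step from recognized phonemes to displayed mouth positions can be pictured as a lookup table. The sketch below is purely illustrative: the phoneme-to-viseme table, the viseme names, and the majority-vote smoothing over recent frames are assumptions of mine, not a description of how Mouther actually handled classifier errors or transitions.

```python
from collections import Counter, deque

# Hypothetical phoneme-to-viseme table; the labels are illustrative only.
PHONEME_TO_VISEME = {
    "m": "closed", "b": "closed", "p": "closed",
    "aa": "open", "ae": "open",
    "uw": "rounded", "ow": "rounded",
    "f": "lip_teeth", "v": "lip_teeth",
}


def visemes_for(phonemes, window=3, default="rest"):
    """Map per-frame phoneme labels to visemes.

    Each output is the most common viseme among the last `window` frames,
    so a single misclassified frame does not make the mouth flicker.
    Phonemes missing from the table map to `default`.
    """
    recent = deque(maxlen=window)
    out = []
    for phoneme in phonemes:
        recent.append(PHONEME_TO_VISEME.get(phoneme, default))
        out.append(Counter(recent).most_common(1)[0][0])
    return out


if __name__ == "__main__":
    # The stray "m" in the middle is outvoted by its neighbours.
    print(visemes_for(["aa", "aa", "m", "aa", "aa"]))
```

A larger `window` suppresses more spurious frames but also delays genuine mouth changes, which trades off against the responsiveness mentioned above.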