MLMI04 snapshot
Recordings of the joint AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 04)
 
 
Welcome talk, Hervé Bourlard - IDIAP, CH
video in real media format (noisy audio track)

 

Theme: HCI and applications - Chairperson: Barbara Peskin, ICSI, Berkeley
08:30 Invited talk: Bill Buxton - Buxton Design, Mountains, Exploration, Education, Rich Media and Design
video in real media format; slides; all files
"Reading, TV and Internet" [29:05]
09:15 Simon Tucker, Steve Whittaker - Univ. of Sheffield, UK, Accessing Multimodal Meeting Data: Systems, Problems and Possibilities
video in real media format (noisy audio); slides; all files
"Summarize this hour long meeting in five minutes" [06:43]
09:35 Pierre Wellner, Mike Flynn, Mael Guillemot - IDIAP, CH, Browsing Recorded Meetings With Ferret
video in real media format; slides; all files; demo
"What are observations of interest?" [03:05]
09:55 Dennis Reidsma, Rutger Rienks, Natasa Jovanovic - Univ. of Twente, NL, Meeting Modelling
video in real media format (noisy audio); slides; all files
"Multimodal framework" [4:00]





 

Theme: Structuring and interaction - Chairperson: Pierre Wellner, IDIAP, CH
11:00 Invited talk: Rafah Hosn - IBM Research, A Programming Model for Next Generation Multimodal Applications
Hint for this session: play the audio-visual presentation while browsing the slides.
video in real media format (noisy audio); slides; all files
"Next generation multimodal applications" [16:00]
11:45 Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis - Univ. of Fribourg, CH, Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives
video in real media format; slides; all files; demo
"Document/Speech Alignment" [09:14]
12:05 Agnes Just, Sebastien Marcel, Olivier Bernier - IDIAP, CH, Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures using Discrete IOHMM
video in real media format (noisy audio); slides; all files
"Input Output Hidden Markov Models" [8:40]
12:25 Nicolas Moënne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, Eric Bruno - Univ. of Geneva, CH, An Integrated Framework for the Management of Video Collection
video in real media format (noisy audio); slides; all files
"Indexing structures" [5:50]





 

Theme: Speech Processing - Chairperson: Steve Renals, Univ. of Edinburgh, UK
14:00 Invited talk: Stephen Cox - University of East Anglia, Confidence Measures in Speech Recognition
video in real media format; slides; all files
"Why confidence measures?" [2:20]
14:45 Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, Tuomo Pirinen, Ivan Bulyko, Dave Gelbart, Martin Graciarena, Scott Otterson, Barbara Peskin, Mari Ostendorf - ICSI, Berkeley, The 2004 ICSI-SRI-UW Meeting Recognition System
video in real media format; slides; all files
"Acoustic model adaptation" [12:05]
15:05 Mathew Magimai Doss, Hervé Bourlard - IDIAP, CH, On the Adequacy of Baseform Pronunciations and Pronunciation Variants
video in real media format; slides; all files
"fully connected ergodic Markov model" [7:30]
15:25 Qifeng Zhu, Barry Chen, Nelson Morgan, Andreas Stolcke - ICSI, Berkeley, Tandem Connectionist Feature Extraction for Conversational Speech Recognition
video in real media format; slides; all files
"We use MLP outputs as features to HMM" [2:20]





 

Theme: Interaction and teleconferencing - Chairperson: Jean-Philippe Thiran, EPFL, CH
08:30 Invited talk: Jonathan Foote - FX Palo Alto Laboratory, Immersive Conferencing Directions at FX Palo Alto Laboratory
video in real media format; slides; all files
"If you could save people from having to watch a boring meeting ..." [08:40]
09:15 Invited talk: Mats Ljungqvist - EU, EU research initiatives in multimodal interaction
video in real media format; slides; all files
"IST in FP6: Work Programme 2005-2006" [18:00]
09:35 Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard - IDIAP, CH, Towards Computer Understanding of Human Interactions
video in real media format; slides; all files
"How can we model the underlying process that is generating the observations" [2:14]
09:55 Max Froumentin - W3C, FR, Zakim - A multimodal software system for large-scale teleconferencing
video in real media format; slides; all files
"Automatic publishing minutes with Zakim" [15:20]

 


All of the audio, video, and slide recordings have been released publicly under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 License.


These recordings are intended to disseminate a wide spectrum of views on research topics including machine learning, speech, vision, text analysis, and HCI.
DISCLAIMER
The views or opinions expressed by the guest speakers are solely their own and do not necessarily represent the views or opinions of IDIAP.

Support: email us
Questions, comments, bug reports, and feedback very welcome.
