Machine Learning for Multimodal Interaction: 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers / Edition 1 available in Paperback
- Pub. Date:
- Springer Berlin Heidelberg
This book constitutes the thoroughly refereed post-proceedings of the 4th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2007, held in Brno, Czech Republic, in June 2007.
The 25 revised full papers presented together with 1 invited paper were carefully selected during two rounds of reviewing and revision from 60 workshop presentations. The papers are organized in topical sections on multimodal processing, HCI, user studies and applications, image and video processing, discourse and dialogue processing, speech and audio processing, as well as the PASCAL speech separation challenge.
Table of ContentsInvited Paper.- Robust Real Time Face Tracking for the Analysis of Human Behaviour.- Multimodal Processing.- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion.- Meeting State Recognition from Visual and Aural Labels.- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers.- HCI, User Studies and Applications.- Automatic Annotation of Dialogue Structure from Simple User Interaction.- Interactive Pattern Recognition.- User Specific Training of a Music Search Engine.- An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing.- Integrating Semantics into Multimodal Interaction Patterns.- Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment.- Image and Video Processing.- Face Recognition in Smart Rooms.- Gaussian Process Latent Variable Models for Human Pose Estimation.- Discourse and Dialogue Processing.- Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech.- Term-Weighting for Summarization of Multi-party Spoken Dialogues.- Automatic Decision Detection in Meeting Speech.- Czech Text-to-Sign Speech Synthesizer.- Speech and Audio Processing.- Using Prosodic Features in Language Models for Meetings.- Posterior-Based Features and Distances in Template Matching for Speech Recognition.- A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems.- Transfer Learning for Tandem ASR Feature Extraction.- Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search.- Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding.- Modeling Vocal Interaction for Segmentation in Meeting Recognition.- Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation.- PASCAL Speech Separation Challenge II.- To Separate Speech.- Microphone Array Beamforming Approach to Blind Speech Separation.