Inferring the Structure of a Tennis Game Using Multimodal Information
Dr. Qiang Huang

This event took place on 13th June 2012 at 11:30am (10:30 GMT)
Knowledge Media Institute, Berrill Building, The Open University, Milton Keynes, United Kingdom, MK7 6AA

Our ambitious long-term goal is to understand multimodal interaction between humans, and we use a sports game, tennis, as a starting point. In tennis, the goals of the interaction are clearly defined and the interaction is subject to clear rules. As such, the game can be effectively analysed in terms of sequences of “events”. Our work focuses on the retrieval of these sequences from audio and visual information, and moves beyond low-level information classification or clustering of features to inferring the low-level structure of the game, a task which we believe could also be accomplished by an intelligent human who had no previous exposure to the game of tennis. The process of segmenting the stream of events present in the game is somewhat akin to a child learning how to segment a stream of speech into a sequence of words: the child notices that some phonetic sequences tend to recur, and that there are patterns of co-occurrence across different sequences. In this spirit, we will use a variable-length multigram model (VLMM) to search for regularly occurring patterns of match events that are detected and inferred using multimodal information and constitute the basic “units” in a tennis match.
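
The segmentation idea in the abstract can be illustrated with a minimal sketch. It is not the speaker's implementation: the event labels, the unit probabilities and the function name `best_segmentation` are all hypothetical, chosen only for illustration. The sketch assumes the probabilities of candidate units are already known and uses dynamic programming to find the most probable split of an event stream into variable-length units. A full multigram model would also have to estimate those probabilities from data, which is not shown here.

```python
import math

def best_segmentation(events, unit_probs, max_len=3):
    """Return the most probable split of `events` into known units.

    `unit_probs` maps tuples of event labels to probabilities
    (hypothetical values here; a real model would learn them).
    """
    n = len(events)
    # best[i] = (log-probability, start of last unit) for the prefix events[:i]
    best = [(-math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            p = unit_probs.get(tuple(events[start:end]))
            if p is None or best[start][0] == -math.inf:
                continue  # unknown unit, or prefix cannot be segmented
            score = best[start][0] + math.log(p)
            if score > best[end][0]:
                best[end] = (score, start)
    if best[n][0] == -math.inf:
        raise ValueError("no segmentation covers the whole sequence")
    # Walk the back-pointers to recover the chosen units in order.
    units, end = [], n
    while end > 0:
        start = best[end][1]
        units.append(tuple(events[start:end]))
        end = start
    units.reverse()
    return units, best[n][0]

# Illustrative (made-up) unit inventory and event stream.
unit_probs = {
    ("serve",): 0.1, ("bounce",): 0.1, ("hit",): 0.1,
    ("serve", "bounce"): 0.3, ("hit", "bounce"): 0.4,
}
events = ["serve", "bounce", "hit", "bounce", "hit", "bounce"]
units, score = best_segmentation(events, unit_probs)
print(units)
```

With these made-up probabilities, grouping each stroke with the bounce that follows it scores higher than treating every event as its own unit, much as a recurring phonetic sequence is better explained as a single word.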


The webcast was open to 100 users

The recording of the event runs for 54 minutes.
Apologies for some audio artefacts during the seminar.
