Put That There: 20 Years of Research on Multimodal Interaction

James L. Crowley 1
1 PERVASIVE - Interaction située avec les objets et environnements intelligents
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology, UGA - Université Grenoble Alpes
Abstract : Humans interact with the world using five major senses: sight, hearing, touch, smell, and taste. Almost all interaction with the environment is naturally multimodal, as audio, tactile or paralinguistic cues provide confirmation for physical actions and spoken language interaction. Multimodal interaction seeks to fully exploit these parallel channels for perception and action to provide robust, natural interaction. Richard Bolt's "Put That There" (1980) provided an early paradigm that demonstrated the power of multimodality and helped attract researchers from a variety of disciplines to study a new approach for post-WIMP computing that moves beyond desktop graphical user interfaces (GUI). In this talk, I will look back to the origins of the scientific community of multimodal interaction, and review some of the more salient results that have emerged over the last 20 years, including results in machine perception, system architectures, visualization, and computer to human communications. Recently, a number of game-changing technologies such as deep learning, cloud computing, and planetary scale data collection have emerged to provide robust solutions to historically hard problems. As a result, scientific understanding of multimodal interaction has taken on new relevance as construction of practical systems has become feasible. I will discuss the impact of these new technologies and the opportunities and challenges that they raise. I will conclude with a discussion of the importance of convergence with cognitive science and cognitive systems to provide foundations for intelligent, human-centered interactive systems that learn and fully understand humans and human-to-human social interaction, in order to provide services that surpass the abilities of the most intelligent human servants. Over the last 35 years, professor Crowley has made a number of fundamental contributions to computer vision, robotics and multi-modal interaction. These include early innovations in scale invariant computer vision, localization and mapping for mobile robots, appearance-based techniques for computer vision, and visual perception for human-computer interaction. Current research concerns context aware observation of human activity, Ambient Intelligence, and new forms of Human-Computer Interaction based on machine perception.
Document type :
Conference papers
Complete list of metadatas

Cited literature [3 references]  Display  Hide  Download

http://hal.univ-grenoble-alpes.fr/hal-02122093
Contributor : Sandrine Corvey-Biron <>
Submitted on : Tuesday, May 7, 2019 - 11:19:53 AM
Last modification on : Thursday, October 24, 2019 - 10:35:59 AM

Annex

Identifiers

Collections

Citation

James L. Crowley. Put That There: 20 Years of Research on Multimodal Interaction. ICMI 2018 - 20th ACM International Conference on Multimodal Interaction, Oct 2018, Boulder, CO, United States. pp.1, ⟨10.1145/3242969.3276309⟩. ⟨hal-02122093⟩

Share

Metrics

Record views

30

Files downloads

33