Put That There - Université Grenoble Alpes Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Put That There


Humans interact with the world using five major senses: sight, hearing, touch, smell, and taste. Almost all interaction with the environment is naturally multimodal, as audio, tactile or paralinguistic cues provide confirmation for physical actions and spoken language interaction. Multimodal interaction seeks to fully exploit these parallel channels for perception and action to provide robust, natural interaction. Richard Bolt's "Put That There" (1980) provided an early paradigm that demonstrated the power of multimodality and helped attract researchers from a variety of disciplines to study a new approach for post-WIMP computing that moves beyond desktop graphical user interfaces (GUI). In this talk, I will look back to the origins of the scientific community of multimodal interaction, and review some of the more salient results that have emerged over the last 20 years, including results in machine perception, system architectures, visualization, and computer to human communications. Recently, a number of game-changing technologies such as deep learning, cloud computing, and planetary scale data collection have emerged to provide robust solutions to historically hard problems. As a result, scientific understanding of multimodal interaction has taken on new relevance as construction of practical systems has become feasible. I will discuss the impact of these new technologies and the opportunities and challenges that they raise. I will conclude with a discussion of the importance of convergence with cognitive science and cognitive systems to provide foundations for intelligent, human-centered interactive systems that learn and fully understand humans and human-to-human social interaction, in order to provide services that surpass the abilities of the most intelligent human servants. Over the last 35 years, professor Crowley has made a number of fundamental contributions to computer vision, robotics and multi-modal interaction. These include early innovations in scale invariant computer vision, localization and mapping for mobile robots, appearance-based techniques for computer vision, and visual perception for human-computer interaction. Current research concerns context aware observation of human activity, Ambient Intelligence, and new forms of Human-Computer Interaction based on machine perception.
icmi1721c-crowleySC1.pdf (441.83 Ko) Télécharger le fichier
Format : Papier court
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02122093 , version 1 (07-05-2019)



James L. Crowley. Put That There: 20 Years of Research on Multimodal Interaction. ICMI 2018 - 20th ACM International Conference on Multimodal Interaction, Oct 2018, Boulder, CO, United States. pp.1, ⟨10.1145/3242969.3276309⟩. ⟨hal-02122093⟩
79 Consultations
212 Téléchargements



Gmail Facebook X LinkedIn More