Multi-modal Integration of Dynamic Audiovisual Patterns for an Interactive Reinforcement Learning Scenario

IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 759--766, doi:10.1109/IROS.2016.7759137 - Oct 2016.
Associated documents : Cruz_IROS_2016.pdf [945Ko]   http://dx.doi.org/10.1109/IROS.2016.7759137
Robots in domestic environments are receiving more attention, especially in scenarios where they should interact with parent-like trainers for dynamically acquiring and refining knowledge. A prominent paradigm for dynamically learning new tasks has been reinforcement learning. However, due to excessive time needed for the learning process, a promising extension has been made by incorporating an external parent-like trainer into the learning cycle in order to scaffold and speed up the apprenticeship using advice about what actions should be performed for achieving a goal. In interactive reinforcement learning, different uni-modal control interfaces have been proposed that are often quite limited and do not take into account multiple sensor modalities. In this paper, we propose the integration of audiovisual patterns to provide advice to the agent using multi-modal information. In our approach, advice can be given using either speech, gestures, or a combination of both. We introduce a neural network-based approach to integrate multi-modal information from uni-modal modules based on their confidence. Results show that multi-modal integration leads to a better performance of interactive reinforcement learning with the robot being able to learn faster with greater rewards compared to uni-modal scenarios.

 

@InProceedings{CPTW16,
  author       = "Cruz, Francisco and Parisi, German I. and Twiefel, Johannes and Wermter, Stefan",
  title        = "Multi-modal Integration of Dynamic Audiovisual Patterns for an Interactive Reinforcement Learning Scenario",
  booktitle    = "IEEE/RSJ International Conference on Intelligent Robots and Systems",
  pages        = "759--766",
  month        = "Oct",
  year         = "2016",
  organization = "IEEE",
  address      = "Daejeon, KR",
  doi          = "10.1109/IROS.2016.7759137",
  url          = "https://www2.informatik.uni-hamburg.de/wtm/publications/2016/CPTW16/Cruz_IROS_2016.pdf"
}

» Francisco Cruz
» German I. Parisi
» Johannes Twiefel
» Stefan Wermter