RAI: Human-Robot Interaction

You can utilize RAI Human-Robot Interaction (HRI) package to converse with your robots. This package allows you to simply chat with your robot, or to give it tasks and receive feedback and reports. You have the following options:

Voice communication using ASR and TTS models (OpenAI Whisper)
Text communication using Streamlit

If your environment is noisy, voice communication might be tricky. In noisy environments, it is better to use text channel.

How it works?

General Architecture

The general architecture follows the diagram above. Text is captured from the input source, transported to the HMI, processed according to the given tools and robot's rules, and then sent to the output source.

Voice Interface

In the voice interface, the input source is a microphone, while the output source is a speaker. The input is processed using the OpenAI Whisper model (cloud-based, paid) or with the local model, while the output can be produced using OpenTTS (Apache-2.0, depending on the model used) or ElevenLabs (cloud-based, paid).

Text Interface

The text interface is implemented directly in RAI_HMI using Streamlit. The GUI closely follows standard chat-like conversations, with built-in support for tool integration.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

human_robot_interface.md

human_robot_interface.md

RAI: Human-Robot Interaction

How it works?

General Architecture

Voice Interface

Text Interface

Files

human_robot_interface.md

Latest commit

History

human_robot_interface.md

File metadata and controls

RAI: Human-Robot Interaction

How it works?

General Architecture

Voice Interface

Text Interface