This is my final project for non-traditional interfaces class. I’m responsible for designing the entertainment system and coordinating the overall speech interface functions across three systems (entertainment, operation, and environment).
Relevant Skills: Speech Interface Design, Mind Map, User Need Analysis
Overview
We designed Esya, a Level Four automated car, which will utilize a variety of non-traditional interfaces, including impoverished, speech, gesture, and haptic interfaces to interact with its users.
I will elaborate on the entertainment system design, but you are welcome to read the full report or presentation.
Entertainment System
Need analysis
We define five important functions that should be achieved by entertainment system:
- Play music, podcasts, radios, and other soundtracks when requested by the user
- Show movies, TV shows and other videos when requested by the user
- Perform advanced search, such as find a relaxing music playlist or display movies that are good for families
- Operate on the ongoing task, such as adjust volume, pause, replay, show lyrics
- Display the information of surrounding areas such as nearby restaurants, tourist attractions or gas stations when requested by the user
Users can perform those functions through the following ways.
Operate on the phone
Based on operational system, Esya will automatically connects to a phone at the initial setup. The user can operate on the phone directly. The music will be played through car stereo, and videos will be projected by AR to side windows.
To disconnect the phone, the user can either turn off the bluetooth or ask Esya to do so.
Talk to Esya
Here is a flow chart of the speech interface for entertainment system.
Here is the flow chart for environmental system:
Here is the flow chart for navigation system:
Based on the keywords found in the user’s speech, Esya will lead to the following four scenarios:
Discussion
Why choose speech interface for entertainment system?
We use speech interface process the majority of tasks because:
- Speech is a natural form of interaction, and Esya’s speaking voice is socially relatable, so it will contribute to user experience
- Car is a private environment and noise level can be controlled
- Good for passengers whose abilities to use their hands or eyes are limited
- Relatively mature technology (e.g. Alexa, Google Home, Siri)
We add an additional simple gesture interface (touch pad)
- It won’t interrupt the ongoing music/ video/ conversations
- Lower user frustration
- Simple and intuitive ways for simple adjustments
Potential human factor issues for speech interface
Speech interface:
- Only one person can talk when Esya is on, otherwise it may bring confusions
- To turn on Esya again, the user needs to be as loud as the music playing
Simple gesture interface:
- Passengers may have different opinions on what video or music should be played or how high the volume should be
- The person on the driver’s seat has a special button to freeze touchpad operations