Voice User Interfaces - Prototyping with Voiceflow

Voice User Interfaces

Prototyping with Voiceflow

Marisa Tschopp
by Marisa Tschopp
time to read: 8 minutes

Keypoints

How to start your first Voice Project

  • There are different ways to develop voice user interfaces
  • People without programming skills can use Voiceflow to develop their first voice project
  • Voiceflow is suitable for simple and more complicated voice projects
  • It can be used for Alexa Skills and Google Actions

As illustrated in the articles Voice User Interface Strategy – Opportunities and Risks of Voice Control and Voice User Interface Design – An Overview of Brand Personas, the individual facets of voice user strategies are not as trivial as one might think at first glance. Designing a conversation is a complex process on a strategic level: Which process of the company can be transformed into a voice process? Which voice, language, accent should the voice application have? How to market it? What character should the voice application have and what effect should it have on the user? These are just a few questions that VUI project managers have to ask themselves. The central question is: Who will program such a voice interface? Our IT? An external voice architect? This can quickly become expensive. Voiceflow offers a gentle introduction to experimenting with voice. A clear user interface, with which anyone, even someone without programming knowledge, can develop a voice application as Alexa Skill or Google Action.

The most important question before the start of the first voice project is not so much what the brandpersona looks like, what gender or tone of voice it has. The most important, and at the same time the most difficult, thing about the first voice project is to find a good usecase. Where and how do you want to use Voice? Where does the customer prefer voice control? Where does voice have real added value? Where is voice more fun? These and other questions are at the beginning and can be explored with all kinds of management methods, SWOT analyses, scorecards and other matrices and tables, design thinking workshops, and so on. But it does not have to be the big project in customer service. A small, simple usecase to get into the voice groove is quite useful and not expensive. Once you have a feeling for possibilities and limits, you can then tackle the big ideas, or maybe not.

A vocabulary trainer, a food order, or a quiz to play, real or playful, there are no limits to your imagination. We decided to use a very simple Google Action, which is called scip flashnews. Every Thursday, scip AG publishes a new blog article, some of which require quite a bit of reading time. If you just want to have a quick overview of the current article, you can use the Google Assistant to get it, be it while making coffee, in the car or anywhere else where you don’t have the time or desire to read it. The Google Action tells what it’s about (title of the blog). If the current topic is of interest, the main points of the article are summarized. The whole article must then be read on the homepage. The goal is to inform the reader in a very short time, which topic is current without having to read if the eyes are busy otherwise. The application can be expanded over time, with other topics on news, knowledge and learning about technology, or current vulnerabilities (this application is already available as an Alexa Skill).

Overview Voiceflow

With a free account a limited number of projects can be created. When a project is opened there are three tabs: Design, Prototype, and Build.

Design

On this starting page the voiceflow can be created by drag & drop. On the left side you find the possible options, such as the Speak or the Choice block. These can be easily tried out or studied via tutorial.

Possible blocks in the voiceflow design

The application always starts with the first fixed homeblock. This is not editable. Afterwards the Voiceflow design follows. We start with the company welcome note as a Speak block and combine it with a Choice block. After the welcome greeting the user has the Choice possibility to say yes or no. If the user wants to know what the title of the current article is, the title is read out. If the user is not interested, he is directly forwarded to the Speak block No Interest and said goodbye. When the title is read out the user is given the option to choose if he wants to know more about the article. If the user is interested, he is forwarded to the next block and the most important points of the article are read out and said goodbye. If the user has heard the title, but does not find it interesting, the user is forwarded to the Speak block No Interest and said goodbye.

Simple example Voiceflow

The individual Utterances can then be entered in the blocks. These are the possibilities what a user could say, e.g. if the user for instance says Nope or Naw instead of No.

Prototype

On this page the prototype can be tested. The entire voiceflow with all branches can be tested, either only in writing or orally. When it comes to conversation design, there is never enough testing. On the one hand because of the logic and what can happen and whether it flows orally as you imagine it when you write. On the other hand, pronunciation and tonality also need to be tested. For example, the name of the company was deliberately misspelled so that it is pronounced correctly, e.g. skip AG instead of scip AG. Are the pauses well set? Is an emphasis strangely disturbing? All these little things are adjusted in countless tests.

Prototype of a simple voice project

Build

Here the voice application can be published as Alexa Skill or Google Action. In order to publish the Google Action and make it available on a Google Smart Speaker, for example, you have to follow several steps and open an account, which takes some time. On the Homepage you will find a step-by-step guide.

Conclusion

Voiceflow is a great way to literally get into the voiceflow, to experiment and get a feel for whether a voice application makes sense, how much effort it takes or which platform can be used. It is simple, easy to understand and can be increased in complexity.

It is possible to develop a skill for Alexa and Google. However, building an Alexaskill is not worthwhile for the Swiss market at the moment because Amazon is not really present. Google is appealing and user friendly, accounts and devices can be combined very easily. But not everyone has or wants a Google Account and whoever gets involved with it has to deal with the topics data security and privacy. Ultimately, the devices and features work best for those who disclose the most data and information, but this is a bitter pill for data/privacy advocates. So a decision must be well thought out.

Notes on the Voice User Interface Strategy series

This article was written in the course of the continuing education course Voice User Interface Strategy of the Lucerne University of Applied Sciences and Arts Contents reflect the opinion of the author.

About the Author

Marisa Tschopp

Marisa Tschopp (Dr. rer. nat., University of Tübingen) is actively engaged in research on Artificial Intelligence from a human perspective, focusing on psychological and ethical aspects. She has shared her expertise on TEDx stages, among others, and represents Switzerland on gender issues within the Women in AI Initiative. (ORCID 0000-0001-5221-5327)

Links

You want to evaluate or develop an AI?

Our experts will get in contact with you!

×
Human and AI

Human and AI

Marisa Tschopp

Human and AI Art

Human and AI Art

Marisa Tschopp

Conversational Commerce

Conversational Commerce

Marisa Tschopp

ChatGPT & Co.

ChatGPT & Co.

Marisa Tschopp

You want more?

Further articles available here

You need support in such a project?

Our experts will get in contact with you!

You want more?

Further articles available here