Size: 2582
Comment:
|
← Revision 18 as of 2011-07-31 23:53:54 ⇥
Size: 6787
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 2: | Line 2: |
Certain classes of users, or any user in certain contexts, must be able to consume content without reading the screen. | You must allow certain classes of users, or any user in certain contexts, to consume content without reading the screen. Practically all mobile devices have audio output of some sort, and it can be accessed by almost every application or website. There can be strict limits, such as devices which only output over headsets, or those which only send phone call audio over Bluetooth, that can limit the use of some tones. |
Line 6: | Line 8: |
Mobile devices must be able to read text displayed on the screen, so it can be accessed and understood by users who cannot use the screen. | Mobile devices must be able to read text displayed on the screen for the user, so it can be accessed and understood by those who cannot use the screen. |
Line 8: | Line 10: |
Due to mobiles being contextually employed, an inability to use the screen may be because of distraction, an inability to | Due to mobiles being contextually employed, there are numerous instances in which the user may not be able to, not be allowed to, or may choose not to read the screen. The user may well choose to use '''Voice Readback''' to allow themselves to use their hands and eyes for other purposes. While working, or performing hobbies which do not require excessive cognitive load themselves, such as most driving, radio and other audio output is used to gather information or provide entertainment. Video is generally more entertaining, but is totally unsuitable for these situations. {{attachment:VoiceReadback-VoiceUI.png|Voice Readback can form an integral part of a complete voice UI for mobile devices. Readback is used to prompt for commands, and then confirms the user input, or declares how the system has interpreted the command. It will also read on-screen displays and options, to allow the user to select appropriate items without looking at the screen.}} == Variations == '''Voice Readback''' always works in broadly the same way, but what is being read varies: * '''Universal''' - The entire interface is read, to allow the device to be used without any view of the display. This is usually combined with the '''Voice Input''' pattern to create a complete voice UI, as an alternative to the conventional button (or touch) and screen UI native to the device. Even if used for only one section, action, or phrase, this same method is used for any readback of voice commands. * '''Elemental''' - An entire document, such as a PDF, email or web page is read until the user cancels the action or the entire document is read. * '''Selected''' - A selection the user has specified within any context -- for example by highlighting text in a web page -- is read in it's entirety. Voice output that is presented based on conditions, such as position or time, are discussed under the '''[[Voice Notifications]]''' pattern. |
Line 11: | Line 25: |
== Variations == Universal - whole interface, output side (replacing the screen) for a complete voice I/O. Even if used for only one phrase, this same method is used for any readback of voice commands. |
{{attachment:VoiceReadback-Selection.png|Selections made by the user can be read. The play control is usually contextual, and related to the selection such as the Pop-Up menu shown above.|align="right"}} |
Line 14: | Line 27: |
Elemental - Read a document, a web page, etc. Then ends. | == Interaction Details == '''Voice Readback''' can be turned on as a setting for the entire OS, or on an application basis. It will then be used automatically, whenever a change in the application is initiated. |
Line 16: | Line 30: |
Selected - Read a selection | Other input methods, such as keypads and '''[[Directional Controls]]''' will still function. Buttons will generally be needed to unlock or refresh a screen so that the current condition is read aloud. |
Line 18: | Line 32: |
Confirmation - In a voice I/O, reads back entry to confirm that this is what is intended. | Readback can also be initiated, for the Elemental and Selection variations, within the application or as a contextual control such as a '''[[Pop-Up]]''', menu, '''[[Annotation]]''' or other control. |
Line 20: | Line 34: |
Voice output that is presented based on conditions, such as position or time, are discussed under the '''[[Voice Reminders]]''' pattern. | Readback for single-use cases of UI control is the result of '''[[Voice Input]]'''. Initiation is discussed under that pattern. |
Line 23: | Line 37: |
== Interaction Details == initiate at the moment... set generally... in settings |
{{attachment:VoiceReadback-Elemental.png|When entire documents, long passages or even just marquee text, is being read the document will scroll to always have to current reading selection visible in the viewport. A cursor or highlight -- as shown above -- should be displayed to correspond to the word currently being read. An indicator of audio playback should be on the screen at all times, and a control should be provided to immediately mute or pause the audio.|align="right"}} |
Line 30: | Line 39: |
Audio should be played through the external speaker, or speakerphone. The last set in-call volume (or equivalent playback volume for non-phones) should be used. Whenever possible, detect the ambient noise level and adjust the volume accordingly, in order to make it audible. | Audio should be played by default through the external speaker, or speakerphone. The last set in-call volume (or equivalent playback volume for non-phones) should be used. Whenever possible, detect the ambient noise level and adjust the volume accordingly, in order to make it audible. |
Line 34: | Line 43: |
Content read must be identical to that printed on the screen. The condition that resulted in user employing '''Voice Readback''' may be temporary and transient; the user may switch between the screen and audio channels, or may even read along with the voice output. Even for users with a vision deficit, others may be accompanying them, who may also wish to use the device. | Content read must be identical to that printed on the screen. The condition that resulted in the user employing '''Voice Readback''' may be temporary and transient. Allow the user to switch between the screen and audio channels. They may even wish to read along with the voice output. Even for users with a vision deficit, others may be accompanying them, who may also wish to use the device. |
Line 36: | Line 45: |
There may be delays between phrases, or before the start of the audio readback. To inform the user that audio is about to commence, and to prepare them for the volume level, a subtle tone should be played immediately beforehand. | Also be sure that content scrolls as the audio gets to that part, so that the item being read back is in the viewport. |
Line 38: | Line 47: |
A similar tone should be used when '''Voice Readback''' has completed for a significant time, or for the selected setting, to confirm this condition to the user. Otherwise they may be left guessing due to just | There may be delays between phrases, or before the start of the audio readback. To inform the user that audio is about to commence, and to prepare them for the volume level, play a subtle tone immediately beforehand. Use a similar tone when '''Voice Readback''' has stopped for a significant time, or to communicate a selected ovum setting, to confirm this condition to the user. |
Line 42: | Line 53: |
Avoid mixing readback of commands and text. When the two must be used together, use delays, tones, changes in voice and clear syntax "You said..." to indicate the difference. | Avoid mixing readback of commands and text. When the two must be used together, use delays, tones, changes in voice and clear syntax (such as "You said...") to indicate the difference. The voice you select must be as understandable as possible. Text-to-voice translation of names, especially, can be difficult to understand or improperly pronounced. If quality is too low with the available hardware and software, do not implement the solution. Keep in mind that users may be wearing headsets. Some headsets will not accept all output, so if your application relies on '''Voice Readback,''' make sure the targeted devices support your application or service sending audio to all attached headsets or other audio devices. |
Line 45: | Line 60: |
---- Next: '''[[Voice Notifications]]''' ---- = Discuss & Add = Please do not change content above this line, as it's a perfect match with the printed book. Everything else you want to add goes down here. |
|
Line 46: | Line 69: |
If you want to add examples (and we occasionally do also) add them here. == Make a new section == Just like this. If, for example, you want to argue about the differences between, say, Tidwell's Vertical Stack, and our general concept of the List, then add a section to discuss. If we're successful, we'll get to make a new edition and will take all these discussions into account. |
Problem
You must allow certain classes of users, or any user in certain contexts, to consume content without reading the screen.
Practically all mobile devices have audio output of some sort, and it can be accessed by almost every application or website. There can be strict limits, such as devices which only output over headsets, or those which only send phone call audio over Bluetooth, that can limit the use of some tones.
Solution
Mobile devices must be able to read text displayed on the screen for the user, so it can be accessed and understood by those who cannot use the screen.
Due to mobiles being contextually employed, there are numerous instances in which the user may not be able to, not be allowed to, or may choose not to read the screen.
The user may well choose to use Voice Readback to allow themselves to use their hands and eyes for other purposes. While working, or performing hobbies which do not require excessive cognitive load themselves, such as most driving, radio and other audio output is used to gather information or provide entertainment. Video is generally more entertaining, but is totally unsuitable for these situations.
Variations
Voice Readback always works in broadly the same way, but what is being read varies:
Universal - The entire interface is read, to allow the device to be used without any view of the display. This is usually combined with the Voice Input pattern to create a complete voice UI, as an alternative to the conventional button (or touch) and screen UI native to the device. Even if used for only one section, action, or phrase, this same method is used for any readback of voice commands.
Elemental - An entire document, such as a PDF, email or web page is read until the user cancels the action or the entire document is read.
Selected - A selection the user has specified within any context -- for example by highlighting text in a web page -- is read in it's entirety.
Voice output that is presented based on conditions, such as position or time, are discussed under the Voice Notifications pattern.
Interaction Details
Voice Readback can be turned on as a setting for the entire OS, or on an application basis. It will then be used automatically, whenever a change in the application is initiated.
Other input methods, such as keypads and Directional Controls will still function. Buttons will generally be needed to unlock or refresh a screen so that the current condition is read aloud.
Readback can also be initiated, for the Elemental and Selection variations, within the application or as a contextual control such as a Pop-Up, menu, Annotation or other control.
Readback for single-use cases of UI control is the result of Voice Input. Initiation is discussed under that pattern.
Presentation Details
Audio should be played by default through the external speaker, or speakerphone. The last set in-call volume (or equivalent playback volume for non-phones) should be used. Whenever possible, detect the ambient noise level and adjust the volume accordingly, in order to make it audible.
When a headset is attached (either physically or by a link such as Bluetooth), the playback should default to this device, and use the last set in-call volume for this device.
Content read must be identical to that printed on the screen. The condition that resulted in the user employing Voice Readback may be temporary and transient. Allow the user to switch between the screen and audio channels. They may even wish to read along with the voice output. Even for users with a vision deficit, others may be accompanying them, who may also wish to use the device.
Also be sure that content scrolls as the audio gets to that part, so that the item being read back is in the viewport.
There may be delays between phrases, or before the start of the audio readback. To inform the user that audio is about to commence, and to prepare them for the volume level, play a subtle tone immediately beforehand.
Use a similar tone when Voice Readback has stopped for a significant time, or to communicate a selected ovum setting, to confirm this condition to the user.
Antipatterns
Avoid mixing readback of commands and text. When the two must be used together, use delays, tones, changes in voice and clear syntax (such as "You said...") to indicate the difference.
The voice you select must be as understandable as possible. Text-to-voice translation of names, especially, can be difficult to understand or improperly pronounced. If quality is too low with the available hardware and software, do not implement the solution.
Keep in mind that users may be wearing headsets. Some headsets will not accept all output, so if your application relies on Voice Readback, make sure the targeted devices support your application or service sending audio to all attached headsets or other audio devices.
Next: Voice Notifications
Discuss & Add
Please do not change content above this line, as it's a perfect match with the printed book. Everything else you want to add goes down here.
Examples
If you want to add examples (and we occasionally do also) add them here.
Make a new section
Just like this. If, for example, you want to argue about the differences between, say, Tidwell's Vertical Stack, and our general concept of the List, then add a section to discuss. If we're successful, we'll get to make a new edition and will take all these discussions into account.