Although speech is the most natural form of communication between humans, most people find using speech to communicate with machines anything but natural. Drawing from psychology, human-computer interaction, linguistics, and communication theory, Practical Speech User Interface Design provides a comprehensive yet concise survey of practical speech
Voice user interfaces (VUIs) are becoming all the rage today. But how do you build one that people can actually converse with? Whether you’re designing a mobile app, a toy, or a device such as a home assistant, this practical book guides you through basic VUI design principles, helps you choose the right speech recognition engine, and shows you how to measure your VUI’s performance and improve upon it. Author Cathy Pearl also takes product managers, UX designers, and VUI designers into advanced design topics that will help make your VUI not just functional, but great.Understand key VUI design concepts, including command-and-control and conversational systemsDecide if you should use an avatar or other visual representation with your VUIExplore speech recognition technology and its impact on your designTake your VUI above and beyond the basic exchange of informationLearn practical ways to test your VUI application with usersMonitor your app and learn how to quickly improve performanceGet real-world examples of VUIs for home assistants, smartwatches, and car systems
This book is a comprehensive and authoritative guide to voice user interface (VUI) design. The VUI is perhaps the most critical factor in the success of any automated speech recognition (ASR) system, determining whether the user experience will be satisfying or frustrating, or even whether the customer will remain one. This book describes a practical methodology for creating an effective VUI design. The methodology is scientifically based on principles in linguistics, psychology, and language technology, and is illustrated here by examples drawn from the authors' work at Nuance Communications, the market leader in ASR development and deployment. The book begins with an overview of VUI design issues and a description of the technology. The authors then introduce the major phases of their methodology. They first show how to specify requirements and make high-level design decisions during the definition phase. They next cover, in great detail, the design phase, with clear explanations and demonstrations of each design principle and its real-world applications. Finally, they examine problems unique to VUI design in system development, testing, and tuning. Key principles are illustrated with a running sample application. A companion Web site provides audio clips for each example: www.VUIDesign.org The cover photograph depicts the first ASR system, Radio Rex: a toy dog who sits in his house until the sound of his name calls him out. Produced in 1911, Rex was among the few commercial successes in earlier days of speech recognition. Voice User Interface Design reveals the design principles and practices that produce commercial success in an era when effective ASRs are not toys but competitive necessities.
Master the critical knowledge you need to design speech-enabled applications It's not just a far-fetched gizmo straight out of a sci-fi movie anymore. Speech interface technology, which allows a user to communicate with computers via voice instead of a keyboard or a mouse, is quickly becoming a main feature in new software. This straightforward guide provides traditional graphical user-interface designers, developers, usability engineers, and product managers with all the information they need to make a rapid transition in order to stay abreast of this monumental shift in technology. Weinschenk and Barker, two experts in state-of-the-art online communication, discuss the basics of speech interfaces and speech technology, hardware, and software. They clearly explain the interface design principles that are applied to S/GUI and AUI interfaces and describe the latest practices of leading experts. In addition to its in-depth look at speech technologies and the different types of user interfaces, this book: * Provides an overview of the field of human factors and defines the basic concepts of human computer interaction * Discusses the current state of speech technology applications * Explains the laws of human factors that apply to speech interfaces * Contains guidelines and examples for user control, human limitation, model integrity, accommodation, clear dialogue, and aesthetic integrity * Details the best practices in interface design and usability engineering * Explores the special issues involved in interface design for disabled persons Visit the companion web site at www.wiley.com/compbooks/weinschenk/ for a categorized resource list of speech, speech interface, and human-computer interaction books, articles, and links.
Bridging the gap between the needs of the technical engineer and cognitive researchers related to speech technology applications. Systematic approach focusing on the utility of speech related product design Designed to respond to the growing need for specific theories, tools and methods for design, testing and evaluating speech related human-system interfaces. Targeted at designers, engineers, and decision makers working in the area of speech technology research
Designing User Interfaces for an Aging Population: Towards Universal Design presents age-friendly design guidelines that are well-established, agreed-upon, research-based, actionable, and applicable across a variety of modern technology platforms. The book offers guidance for product engineers, designers, or students who want to produce technological products and online services that can be easily and successfully used by older adults and other populations. It presents typical age-related characteristics, addressing vision and visual design, hand-eye coordination and ergonomics, hearing and sound, speech and comprehension, navigation, focus, cognition, attention, learning, memory, content and writing, attitude and affect, and general accessibility. The authors explore characteristics of aging via realistic personas which demonstrate the impact of design decisions on actual users over age 55. - Presents the characteristics of older adults that can hinder use of technology - Provides guidelines for designing technology that can be used by older adults and younger people - Review real-world examples of designs that implement the guidelines and the designs that violate them
Auditory User Interfaces: Toward the Speaking Computer describes a speech-enabling approach that separates computation from the user interface and integrates speech into the human-computer interaction. The Auditory User Interface (AUI) works directly with the computational core of the application, the same as the Graphical User Interface. The author's approach is implemented in two large systems, ASTER - a computing system that produces high-quality interactive aural renderings of electronic documents - and Emacspeak - a fully-fledged speech interface to workstations, including fluent spoken access to the World Wide Web and many desktop applications. Using this approach, developers can design new high-quality AUIs. Auditory interfaces are presented using concrete examples that have been implemented on an electronic desktop. This aural desktop system enables applications to produce auditory output using the same information used for conventional visual output. Auditory User Interfaces: Toward the Speaking Computer is for the electrical and computer engineering professional in the field of computer/human interface design. It will also be of interest to academic and industrial researchers, and engineers designing and implementing computer systems that speak. Communication devices such as hand-held computers, smart telephones, talking web browsers, and others will need to incorporate speech-enabling interfaces to be effective.
From the voice on the phone, to the voice on the computer, to the voice from the toaster, speech user interfaces are coming into the mainstream and are here to stay forever. Soundly anchored in HCI, cognitive psychology, linguistics, and social psychology, this supremely practical book is loaded with examples, how-to advice, and design templates. Drawing widely on decades of research—in lexicography, conversation analysis, computational linguistics, and social psychology—author Randy Allen Harris outlines the principles of how people use language interactively, and illustrates every aspect of design work.In the first part of the book, Harris provides a thorough conceptual basis of language in all its relevant aspects, from speech sounds to conversational principles. The second part takes you patiently through the entire process of designing an interactive speech system: from team building to user profiles, to agent design, scripting, and evaluation. This book provides interaction designers with the knowledge and strategies to craft language-based applications the way users will expect them to behave.*Loaded with examples and practical synopses of the best practice. *An ideal combination of conceptual base, practical illustrations, and "how-to" advice—for design and for the entire design process.*Will bring novice voice designers fully up to speed, and give experienced designers a new understanding of the principles underlying human speech interaction, principles from which to improve voice interaction design.
As technology expands and evolves, one-dimensional, graphical user interface (GUI) design becomes increasingly limiting and simplistic. Designers must meet the challenge of developing new and creative interfaces that adapt to meet human needs and technological trends. HCI Beyond the GUI provides designers with this know how by exploring new ways to reach users that involve all of the human senses. Dr. Kortum gathers contributions from leading human factors designers to present a single reference for professionals, researchers, and students. - Explores the human factors involved in the design and implementation of the nontraditional interfaces, detailing design strategies, testing methodologies, and implementation techniques - Provides an invaluable resource for practitioners who design interfaces for children, gamers and users with accessibility needs - Offers extensive case studies, examples and design guidelines
How interactive voice-based technology can tap into the automatic and powerful responses all speech—whether from human or machine—evokes. Interfaces that talk and listen are populating computers, cars, call centers, and even home appliances and toys, but voice interfaces invariably frustrate rather than help. In Wired for Speech, Clifford Nass and Scott Brave reveal how interactive voice technologies can readily and effectively tap into the automatic responses all speech—whether from human or machine—evokes. Wired for Speech demonstrates that people are "voice-activated": we respond to voice technologies as we respond to actual people and behave as we would in any social situation. By leveraging this powerful finding, voice interfaces can truly emerge as the next frontier for efficient, user-friendly technology. Wired for Speech presents new theories and experiments and applies them to critical issues concerning how people interact with technology-based voices. It considers how people respond to a female voice in e-commerce (does stereotyping matter?), how a car's voice can promote safer driving (are "happy" cars better cars?), whether synthetic voices have personality and emotion (is sounding like a person always good?), whether an automated call center should apologize when it cannot understand a spoken request ("To Err is Interface; To Blame, Complex"), and much more. Nass and Brave's deep understanding of both social science and design, drawn from ten years of research at Nass's Stanford laboratory, produces results that often challenge conventional wisdom and common design practices. These insights will help designers and marketers build better interfaces, scientists construct better theories, and everyone gain better understandings of the future of the machines that speak with us.