Emacspeak --The Complete Audio Desktop

Clients connect to speech server.
Server provides speech services:
- Speak text.
- Set speech parameters.
- Stop, pause or resume speech.
Clients protected from device dependencies.

Speech servers are currently implemented in TCL

Core Speech Services

Core speech services provided by the Emacspeak platform

Speak a region of text.
Configure context-specific pronunciation and prosody.
Annotate text to produce audio formatted output.
Enhance auditory output with auditory icons.

Emacspeak core encourages code re-use throughout Emacspeak.

Architecture Overview

Architecture Of Emacspeak

Series of modular layers.
Low-level layers provide device-specific interfaces.
Core services are implemented on a device-independent layer.
Application-specific extensions rely on these core services.

Implementing Emacspeak

Lisp advice facility:

Extend code functionality without modifying original source.
Advice types:
- Before
- Around
- After
Advice fragments enhance and modify original behavior.

Speech-enables Emacs without modifying code base

Example: Speech-enable Function next-line

(defadvice next-line (after emacspeak pre act)
"Speak line that you just moved to."
(when (interactive-p)
(emacspeak-speak-line )))

Emacspeak: Current Status

Latest version: over 40,000 lines:

Core

7,000 lines

Speech-enables

Over 80 Emacs packages
Speech-enables all of Emacs 20.4
Speech-enables popular non-bundled extensions like VM, BBDB, and W3.

Speech-enabling extensions are a fraction of the size of the application being speech-enabled.

The User Experience

Succinct contextual speech feedback.
Auditory icons augment interaction.

User focuses on task at hand.

Demonstration --Editing

Simple editing, search and replace.
Completion and spell checking.
Syntax coloring using voice-lock mode.

Intuitive interface enables fluent interaction.

Demonstration --Browsing Information

Browsing the file system.
Reading and responding to email and news.
Browsing the WWW.

Window to digital information.

WWW --Speech Style Sheets

Voice properties,
Auditory icons,
Sound cues for document elements.

Generate richly formatted audio documents.

Demonstration --System Tasks

Managing processes.
Running a shell.
Running terminal based applications.

Speech-enabling Popular Linux Desktops

Making speech interaction a first-class citizen on Linux

Continue UNIX tradition of keeping the UI separable from the underlying computation engine.
Exploit modular architecture of Gnome and KDE.
Introduce speech services layer for both input and output.

Make speech-enabling Gnome and KDE clients a breeze.

Standardized Speech Servers And Services

Integrate speech services into the ORBs used by Gnome and KDE.

Standardized speech services to provide:

Customizable speech synthesis
Customizable auditory displays
Context-sensitive speech input

Speech-enabling Linux crucial for embedded appliance space.

Open Source Crucial To Success

Emacspeak would not be possible in the closed source world.

Sun	Mon	Tue	Wed	Thu	Fri	Sat
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31

Emacspeak --The Complete Audio Desktop

Table Of Contents

Introduction