Speech Processing for IP Networks

Name: Speech Processing for IP Networks
Brand: Wiley & Sons, Wiley
SKU: 12336005
Price: 72.90 EUR
Availability: OutOfStock

Media Resource Control Protocol (MRCP) (Sprache: Englisch)

Autor: David Burke

Schreiben Sie einen Kommentar zu "Speech Processing for IP Networks".

Speech Processing for IP Networks, David Burke

Merken

Mehr zum Inhalt Inhaltsverzeichnis Autorenporträt Video

Leider schon ausverkauft

versandkostenfrei

Bestellnummer: 12336005

Buch (Gebunden) 72.90 €

In den Warenkorb

Lastschrift, Kreditkarte, Paypal, Rechnung
Kostenlose Rücksendung
Ratenzahlung möglich

Produktdetails

Produktbeschreibung
Inhaltsverzeichnis
Autorenporträt
Biblio. Angaben / Produktdetails

Produktinformationen zu „Speech Processing for IP Networks “

Klappentext zu „Speech Processing for IP Networks “

Media Resource Control Protocol (MRCP) is a new IETF protocol, providing a key enabling technology that eases the integration of speech technologies into network equipment and accelerates their adoption resulting in exciting and compelling interactive services to be delivered over the telephone. MRCP leverages IP telephony and Web technologies such as SIP (Session Intiation Protocol), HTTP (Hypertext Transfer Protocol), and XML (Extensible Markup Language) to deliver an open standard, vendor-independent, and versatile interface to speech engines.

Speech Processing for IP Networks brings these technologies together into a single volume, giving the reader a solid technical understanding of the principles of MRCP, how it leverages other protocols and specifications for its operation, and how it is applied in modern IP-based telecommunication networks. Focusing on the MRCPv2 standard developed by the IETF SpeechSC Working Group, this book will also provide an overview of its precursor, MRCPv1.

Speech Processing for IP Networks:
* Gives a complete background on the technologies required by MRCP to function, including SIP, RTP (Real-time Transport Protocol), and HTTP.
* Covers relevant W3C data representation formats including Speech Synthesis Markup Language (SSML), Speech Recognition Grammar Specification (SRGS), Semantic Interpretation for Speech Recognition (SISR), and Pronunciation Lexicon Specification (PLS).
* Describes VoiceXML - the leading approach for programming cutting-edge speech applications and a key driver to the development of many of MRCP's features.
* Explains advanced topics such as VoiceXML and MRCP interworking.

This text will be an invaluable resource for technical managers, product managers, software developers, and technical marketing professionals working for network equipment manufacturers, speech engine vendors, and network operators. Advanced students on computer science and engineering courses will also find

... mehr

this to be an excellent guide to the topic.

... weniger

Inhaltsverzeichnis zu „Speech Processing for IP Networks “

PART I. BACKGROUND

1. Introduction

1.1 Introduction to Speech Applications

1.2 The MRCP Value Proposition

1.3 History of MRCP Standardisation

1.4 Summary

2. Basic Principles of Speech Processing

2.1 Human Speech Production

2.2 Speech Recognition

2.3 Speaker Verification and Identification

2.4 Speech Synthesis

2.5 Summary

3. Overview of MRCP

3.1 Architecture

3.2 Media Resource Types

3.3 Network Scenarios

3.4 Protocol Operation

3.5 Security

3.6 Summary

PART II. MEDIA AND CONTROL SESSIONS

4. Session Initiation Protocol

4.1 Introduction

4.2 Walkthrough Example

4.3 SIP URIs

4.4 Transport

4.5 Media Negotiation

4.6 SIP Servers

4.7 SIP Extensions

4.8 Security

4.9 Summary

5. Session Initiation in MRCP

5.1 Introduction

5.2 Initiating the Media Session

5.3 Initiating the Control Session

5.4 Session Initiation Examples

5.5 Locating Media Resource Servers

5.6 Security

5.7 Summary

6. The Media Session

6.1 Media Encoding

6.2 Media Transport

6.3 Security

6.4 Summary

7. The Control Session

7.1 Message Structure

7.2 Generic Methods

7.3 Generic Headers

7.4 Security

7.5 Summary

PART III. DATA REPRESENTATION FORMATS

8. Speech Synthesis Markup Language (SSML)

8.1 Introduction

8.2 Document Structure

8.3 Recorded Audio

8.4 Pronunciation

8.5 Prosody

8.6 Markers

8.7 Metadata

8.8 Summary

9. Speech Recognition Grammar Specification (SRGS)

9.1 Introduction

9.2 Document Structure

9.3 Rules, Tokens, and Sequences

9.4 Alternatives

9.5 Rule References

9.6 Repeats

9.7 DTMF Grammars

9.8 Semantic Interpretation

9.9 Summary

10. Natural Language Semantics Markup Language (NLSML)

10.1 Introduction

10.2 Document Structure

10.3 Speech Recognition Results

10.4 Voice Enrollment Results

10.5 Speaker Verification Results

10.6 Summary

11. Pronunciation Lexicon Specification

... mehr

(PLS)

11.1 Introduction

11.2 Document Structure

11.3 Lexical Entries

11.4 Abbreviations and Acronyms

11.5 Multiple Orthographies

11.6 Multiple Pronunciations

11.7 Summary

PART IV. MEDIA RESOURCES

12. Speech Synthesiser Resource

12.1 Overview

12.2 Methods

12.3 Events

12.4 Headers

12.5 Summary

13. Speech Recogniser Resource

13.1 Overview

13.2 Recognition Methods

13.3 Enrollment Methods

13.4 Events

13.5 Recognition Headers

13.6 Enrollment Headers

13.7 Summary

14. Recorder Resource

14.1 Overview

14.2 Methods

14.3 Events

14.4 Headers

14.5 Summary

15. Speaker Verification Resource

15.1 Overview

15.2 Methods

15.3 Events

15.4 Headers

15.5 Summary

PART V. PROGRAMMING SPEECH APPLICATIONS

16. Voice eXtensible Markup Language (VoiceXML)

16.1 Introduction

16.2 Document Structure

16.3 Dialogs

16.4 Media Playback

16.5 Media Recording

16.6 Speech and DTMF Recognition

16.7 Flow Control

16.8 Resource Fetching

16.9 Call Transfer

16.10 Summary

17. VoiceXML and MRCP Interworking

17.1 Introduction

17.2 Interworking Fundamentals

17.3 Application Example

17.4 Summary

Appendix A. MRCP Version 1

A.1 Overview

A.2 Session Management and Message Transport

A.3 General Protocol Details

A.4 Speech Synthesiser Resource

A.5 Speech Recogniser Resource

Appendix B. XML Primer

B.1 Background

B.2 Basic Concepts

B.3 Namespaces

B.4 Document Schemas

Appendix C. HTTP Primer

C.1 Background

C.2 Basic Concepts

C.3 Caching

C.4 Cookies

C.5 Security

References

Index

Acronyms

... weniger

Autoren-Porträt von David Burke

David Burke is Chief Technology Officer and co-founder of Voxpilot Ltd, UK. David led Voxpilot to its current position as a leader in VoiceXML interactive services platform technology. His management duties at Voxpilot include executive management and counsel, product vision, direction and management, responsibility for all R&D activities including budgeting, engineering team selection and mentoring, and architecture and design.

He is also member of the World Wide Web Consortium (W3C) Voice Browser Working Group and of the Internet Engineering Task Force (IETF) Speech SC Working Group.

Bibliographische Angaben

Autor: David Burke
2007, 1. Auflage, 360 Seiten, Maße: 24,4 cm, Gebunden, Englisch
Verlag: Wiley & Sons
ISBN-10: 0470028343
ISBN-13: 9780470028346

Sprache:

Englisch

Kommentar zu "Speech Processing for IP Networks"

Schreiben Sie einen Kommentar zu "Speech Processing for IP Networks".

Wie funktionieren Bewertungen?

0 Gebrauchte Artikel zu „Speech Processing for IP Networks“

Zustand	Preis	Porto	Zahlung	Verkäufer	Rating