Book description
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all-inclusive approach to speech processing, the editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. This book presents a comprehensive introduction to research problems and solutions, both from a theoretical as well as a practical perspective, and highlights technology that incorporates the increasing necessity for multilingual applications in our global community.Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces.
Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives.
* State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa
* The only comprehensive introduction to multilingual speech processing currently available
* Detailed presentation of technological advances integral to security, financial, cellular and commercial applications
Table of contents
- Front cover
- Title page
- Copyright
- Table of contents
- List of Figures
- List of Tables
- front matter
- Contributor Biographies
- Foreword
- body
- Chapter 1 Introduction
- Chapter 2 Language Characteristics
-
Chapter 3 Linguistic Data Resources
- 3.1 Demands and Challenges of Multilingual Data-Collection Efforts
- 3.2 International Efforts and Cooperation
- 3.3 Data Collection Efforts in the United States (1/3)
- 3.3 Data Collection Efforts in the United States (2/3)
- 3.3 Data Collection Efforts in the United States (3/3)
- 3.4 Data Collection Efforts in Europe (1/2)
- 3.4 Data Collection Efforts in Europe (2/2)
- 3.5 Overview of Existing Language Resources in Europe (1/2)
- 3.5 Overview of Existing Language Resources in Europe (2/2)
-
Chapter 4 Multilingual Acoustic Modeling
- 4.1 Introduction
- 4.2 Problems and Challenges (1/3)
- 4.2 Problems and Challenges (2/3)
- 4.2 Problems and Challenges (3/3)
- 4.3 Language Independent Sound Inventories and Representations (1/3)
- 4.3 Language Independent Sound Inventories and Representations (2/3)
- 4.3 Language Independent Sound Inventories and Representations (3/3)
- 4.4 Acoustic Model Combination (1/4)
- 4.4 Acoustic Model Combination (2/4)
- 4.4 Acoustic Model Combination (3/4)
- 4.4 Acoustic Model Combination (4/4)
- 4.5 Insights and Open Problems
-
Chapter 5 Multilingual Dictionaries
- 5.1 Introduction
- 5.2 Multilingual Dictionaries
- 5.3 What Is aWord? (1/3)
- 5.3 What Is aWord? (2/3)
- 5.3 What Is aWord? (3/3)
- 5.4 Vocabulary Selection (1/2)
- 5.4 Vocabulary Selection (2/2)
- 5.5 How to Generate Pronunciations (1/4)
- 5.5 How to Generate Pronunciations (2/4)
- 5.5 How to Generate Pronunciations (3/4)
- 5.5 How to Generate Pronunciations (4/4)
- 5.6 Discussion
-
Chapter 6 Multilingual Language Modeling
- 6.1 Statistical Language Modeling
- 6.2 Model Estimation for New Domains and Speaking Styles
- 6.3 Crosslingual Comparisons: A Language Modeling Perspective (1/4)
- 6.3 Crosslingual Comparisons: A Language Modeling Perspective (2/4)
- 6.3 Crosslingual Comparisons: A Language Modeling Perspective (3/4)
- 6.3 Crosslingual Comparisons: A Language Modeling Perspective (4/4)
- 6.4 Crosslinguistic Bootstrapping for Language Modeling (1/2)
- 6.4 Crosslinguistic Bootstrapping for Language Modeling (2/2)
- 6.5 Language Models for Truly Multilingual Speech Recognition
- 6.6 Discussion and Concluding Remarks
- Chapter 7 Multilingual Speech Synthesis
-
Chapter 8 Automatic Language Identification
- 8.1 Introduction
- 8.2 Human Language Identification
- 8.3 Databases and Evaluation Methods
- 8.4 The Probabilistic LID Framework
- 8.5 Acoustic Approaches (1/2)
- 8.5 Acoustic Approaches (2/2)
- 8.6 Phonotactic Modeling (1/3)
- 8.6 Phonotactic Modeling (2/3)
- 8.6 Phonotactic Modeling (3/3)
- 8.7 Prosodic LID
- 8.8 LVCSR-Based LID
- 8.9 Trends and Open Problems in LID
-
Chapter 9 Other Challenges: Non-native Speech, Dialects, Accents, and Local Interfaces
- 9.1 Introduction
- 9.2 Characteristics of Non-native Speech
- 9.3 Corpus Analysis (1/2)
- 9.3 Corpus Analysis (2/2)
- 9.4 Acoustic Modeling Approaches for Non-native Speech
- 9.5 Adapting to Non-native Accents in ASR (1/2)
- 9.5 Adapting to Non-native Accents in ASR (2/2)
- 9.6 Combining Speaker and Pronunciation Adaptation
- 9.7 Cross-Dialect Recognition of Native Dialects
- 9.8 Applications (1/2)
- 9.8 Applications (2/2)
- 9.9 Other Factors in Localizing Speech-Based Interfaces (1/2)
- 9.9 Other Factors in Localizing Speech-Based Interfaces (2/2)
- 9.10 Summary
-
Chapter 10 Speech-to-Speech Translation
- 10.1 Introduction
- 10.2 Statistical and Interlingua-Based Speech Translation Approaches (1/5)
- 10.2 Statistical and Interlingua-Based Speech Translation Approaches (2/5)
- 10.2 Statistical and Interlingua-Based Speech Translation Approaches (3/5)
- 10.2 Statistical and Interlingua-Based Speech Translation Approaches (4/5)
- 10.2 Statistical and Interlingua-Based Speech Translation Approaches (5/5)
- 10.3 Coupling Speech Recognition and Translation (1/2)
- 10.3 Coupling Speech Recognition and Translation (2/2)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (1/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (2/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (3/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (4/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (5/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (6/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (7/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (8/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (9/10)
- 10.4 Portable Speech-to-Speech Translation: The ATR System (10/10)
- 10.5 Conclusion
-
Chapter 11 Multilingual Spoken Dialog Systems
- 11.1 Introduction
- 11.2 PreviousWork
- 11.3 Overview of the ISIS System (1/2)
- 11.3 Overview of the ISIS System (2/2)
- 11.4 Adaptivity to Knowledge Scope Expansion (1/2)
- 11.4 Adaptivity to Knowledge Scope Expansion (2/2)
- 11.5 Delegation to Software Agents
- 11.6 Interruptions and Multithreaded Dialogs (1/2)
- 11.6 Interruptions and Multithreaded Dialogs (2/2)
- 11.7 Empirical Observations on User Interaction with ISIS
- 11.8 Implementation of Multilingual SDS in VXML (1/2)
- 11.8 Implementation of Multilingual SDS in VXML (2/2)
- 11.9 Summary and Conclusions (1/2)
- 11.9 Summary and Conclusions (2/2)
- Bibliography (1/9)
- Bibliography (2/9)
- Bibliography (3/9)
- Bibliography (4/9)
- Bibliography (5/9)
- Bibliography (6/9)
- Bibliography (7/9)
- Bibliography (8/9)
- Bibliography (9/9)
- Index (1/4)
- Index (2/4)
- Index (3/4)
- Index (4/4)
Product information
- Title: Multilingual Speech Processing
- Author(s):
- Release date: April 2006
- Publisher(s): Academic Press
- ISBN: 9780080457628
You might also like
book
Audio Source Separation and Speech Enhancement
Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim …
book
Robust Automatic Speech Recognition
Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech …
book
Computer Vision Technology in the Food and Beverage Industries
The use of computer vision systems to control manufacturing processes and product quality has become increasingly …
book
Selective Visual Attention: Computational Models and Applications
Visual attention is a relatively new area of study combining a number of disciplines: artificial neural …