Multilingual Speech Processing

Book description

Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all-inclusive approach to speech processing, the editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. This book presents a comprehensive introduction to research problems and solutions, both from a theoretical as well as a practical perspective, and highlights technology that incorporates the increasing necessity for multilingual applications in our global community.

Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces.

Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives.

* State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa
* The only comprehensive introduction to multilingual speech processing currently available
* Detailed presentation of technological advances integral to security, financial, cellular and commercial applications

Table of contents

  1. Front cover
  2. Title page
  3. Copyright
  4. Table of contents
  5. List of Figures
  6. List of Tables
  7. front matter
  8. Contributor Biographies
  9. Foreword
  10. body
  11. Chapter 1 Introduction
  12. Chapter 2 Language Characteristics
    1. 2.1 Languages and Dialects
    2. 2.2 Linguistic Description and Classification (1/3)
    3. 2.2 Linguistic Description and Classification (2/3)
    4. 2.2 Linguistic Description and Classification (3/3)
    5. 2.3 Language in Context
    6. 2.4 Writing Systems (1/2)
    7. 2.4 Writing Systems (2/2)
    8. 2.5 Languages and Speech Technology
  13. Chapter 3 Linguistic Data Resources
    1. 3.1 Demands and Challenges of Multilingual Data-Collection Efforts
    2. 3.2 International Efforts and Cooperation
    3. 3.3 Data Collection Efforts in the United States (1/3)
    4. 3.3 Data Collection Efforts in the United States (2/3)
    5. 3.3 Data Collection Efforts in the United States (3/3)
    6. 3.4 Data Collection Efforts in Europe (1/2)
    7. 3.4 Data Collection Efforts in Europe (2/2)
    8. 3.5 Overview of Existing Language Resources in Europe (1/2)
    9. 3.5 Overview of Existing Language Resources in Europe (2/2)
  14. Chapter 4 Multilingual Acoustic Modeling
    1. 4.1 Introduction
    2. 4.2 Problems and Challenges (1/3)
    3. 4.2 Problems and Challenges (2/3)
    4. 4.2 Problems and Challenges (3/3)
    5. 4.3 Language Independent Sound Inventories and Representations (1/3)
    6. 4.3 Language Independent Sound Inventories and Representations (2/3)
    7. 4.3 Language Independent Sound Inventories and Representations (3/3)
    8. 4.4 Acoustic Model Combination (1/4)
    9. 4.4 Acoustic Model Combination (2/4)
    10. 4.4 Acoustic Model Combination (3/4)
    11. 4.4 Acoustic Model Combination (4/4)
    12. 4.5 Insights and Open Problems
  15. Chapter 5 Multilingual Dictionaries
    1. 5.1 Introduction
    2. 5.2 Multilingual Dictionaries
    3. 5.3 What Is aWord? (1/3)
    4. 5.3 What Is aWord? (2/3)
    5. 5.3 What Is aWord? (3/3)
    6. 5.4 Vocabulary Selection (1/2)
    7. 5.4 Vocabulary Selection (2/2)
    8. 5.5 How to Generate Pronunciations (1/4)
    9. 5.5 How to Generate Pronunciations (2/4)
    10. 5.5 How to Generate Pronunciations (3/4)
    11. 5.5 How to Generate Pronunciations (4/4)
    12. 5.6 Discussion
  16. Chapter 6 Multilingual Language Modeling
    1. 6.1 Statistical Language Modeling
    2. 6.2 Model Estimation for New Domains and Speaking Styles
    3. 6.3 Crosslingual Comparisons: A Language Modeling Perspective (1/4)
    4. 6.3 Crosslingual Comparisons: A Language Modeling Perspective (2/4)
    5. 6.3 Crosslingual Comparisons: A Language Modeling Perspective (3/4)
    6. 6.3 Crosslingual Comparisons: A Language Modeling Perspective (4/4)
    7. 6.4 Crosslinguistic Bootstrapping for Language Modeling (1/2)
    8. 6.4 Crosslinguistic Bootstrapping for Language Modeling (2/2)
    9. 6.5 Language Models for Truly Multilingual Speech Recognition
    10. 6.6 Discussion and Concluding Remarks
  17. Chapter 7 Multilingual Speech Synthesis
    1. 7.1 Background
    2. 7.2 Building Voices in New Languages
    3. 7.3 Database Design
    4. 7.4 Prosodic Modeling
    5. 7.5 Lexicon Building (1/3)
    6. 7.5 Lexicon Building (2/3)
    7. 7.5 Lexicon Building (3/3)
    8. 7.6 Non-native Spoken Output
    9. 7.7 Summary
  18. Chapter 8 Automatic Language Identification
    1. 8.1 Introduction
    2. 8.2 Human Language Identification
    3. 8.3 Databases and Evaluation Methods
    4. 8.4 The Probabilistic LID Framework
    5. 8.5 Acoustic Approaches (1/2)
    6. 8.5 Acoustic Approaches (2/2)
    7. 8.6 Phonotactic Modeling (1/3)
    8. 8.6 Phonotactic Modeling (2/3)
    9. 8.6 Phonotactic Modeling (3/3)
    10. 8.7 Prosodic LID
    11. 8.8 LVCSR-Based LID
    12. 8.9 Trends and Open Problems in LID
  19. Chapter 9 Other Challenges: Non-native Speech, Dialects, Accents, and Local Interfaces
    1. 9.1 Introduction
    2. 9.2 Characteristics of Non-native Speech
    3. 9.3 Corpus Analysis (1/2)
    4. 9.3 Corpus Analysis (2/2)
    5. 9.4 Acoustic Modeling Approaches for Non-native Speech
    6. 9.5 Adapting to Non-native Accents in ASR (1/2)
    7. 9.5 Adapting to Non-native Accents in ASR (2/2)
    8. 9.6 Combining Speaker and Pronunciation Adaptation
    9. 9.7 Cross-Dialect Recognition of Native Dialects
    10. 9.8 Applications (1/2)
    11. 9.8 Applications (2/2)
    12. 9.9 Other Factors in Localizing Speech-Based Interfaces (1/2)
    13. 9.9 Other Factors in Localizing Speech-Based Interfaces (2/2)
    14. 9.10 Summary
  20. Chapter 10 Speech-to-Speech Translation
    1. 10.1 Introduction
    2. 10.2 Statistical and Interlingua-Based Speech Translation Approaches (1/5)
    3. 10.2 Statistical and Interlingua-Based Speech Translation Approaches (2/5)
    4. 10.2 Statistical and Interlingua-Based Speech Translation Approaches (3/5)
    5. 10.2 Statistical and Interlingua-Based Speech Translation Approaches (4/5)
    6. 10.2 Statistical and Interlingua-Based Speech Translation Approaches (5/5)
    7. 10.3 Coupling Speech Recognition and Translation (1/2)
    8. 10.3 Coupling Speech Recognition and Translation (2/2)
    9. 10.4 Portable Speech-to-Speech Translation: The ATR System (1/10)
    10. 10.4 Portable Speech-to-Speech Translation: The ATR System (2/10)
    11. 10.4 Portable Speech-to-Speech Translation: The ATR System (3/10)
    12. 10.4 Portable Speech-to-Speech Translation: The ATR System (4/10)
    13. 10.4 Portable Speech-to-Speech Translation: The ATR System (5/10)
    14. 10.4 Portable Speech-to-Speech Translation: The ATR System (6/10)
    15. 10.4 Portable Speech-to-Speech Translation: The ATR System (7/10)
    16. 10.4 Portable Speech-to-Speech Translation: The ATR System (8/10)
    17. 10.4 Portable Speech-to-Speech Translation: The ATR System (9/10)
    18. 10.4 Portable Speech-to-Speech Translation: The ATR System (10/10)
    19. 10.5 Conclusion
  21. Chapter 11 Multilingual Spoken Dialog Systems
    1. 11.1 Introduction
    2. 11.2 PreviousWork
    3. 11.3 Overview of the ISIS System (1/2)
    4. 11.3 Overview of the ISIS System (2/2)
    5. 11.4 Adaptivity to Knowledge Scope Expansion (1/2)
    6. 11.4 Adaptivity to Knowledge Scope Expansion (2/2)
    7. 11.5 Delegation to Software Agents
    8. 11.6 Interruptions and Multithreaded Dialogs (1/2)
    9. 11.6 Interruptions and Multithreaded Dialogs (2/2)
    10. 11.7 Empirical Observations on User Interaction with ISIS
    11. 11.8 Implementation of Multilingual SDS in VXML (1/2)
    12. 11.8 Implementation of Multilingual SDS in VXML (2/2)
    13. 11.9 Summary and Conclusions (1/2)
    14. 11.9 Summary and Conclusions (2/2)
  22. Bibliography (1/9)
  23. Bibliography (2/9)
  24. Bibliography (3/9)
  25. Bibliography (4/9)
  26. Bibliography (5/9)
  27. Bibliography (6/9)
  28. Bibliography (7/9)
  29. Bibliography (8/9)
  30. Bibliography (9/9)
  31. Index (1/4)
  32. Index (2/4)
  33. Index (3/4)
  34. Index (4/4)

Product information

  • Title: Multilingual Speech Processing
  • Author(s): Tanja Schultz, Katrin Kirchhoff
  • Release date: April 2006
  • Publisher(s): Academic Press
  • ISBN: 9780080457628