Intelligent Speech Signal Processing

Name: Intelligent Speech Signal Processing
Author: Nilanjan Dey
ISBN: 9780128181317

March 2019

Intermediate to advanced

209 pages

6h 20m

English

Academic Press

Cover image
Title page
Table of Contents
Copyright
Contributors
About the Editor
Preface
Chapter 1: Speech Processing in Healthcare: Can We Integrate?
Abstract
Chapter 2: End-to-End Acoustic Modeling Using Convolutional Neural Networks
Abstract
2.1 Introduction
2.2 Related Work
2.3 Various Architecture of ASR
2.4 Convolutional Neural Networks
2.5 CNN-Based End-to-End Approach
2.6 Experiments and Their Results
2.7 Conclusion
Chapter 3: A Real-Time DSP-Based System for Voice Activity Detection and Background Noise Reduction
Abstract
3.1 Introduction
3.2 Microchip dsPIC33 Digital Signal Controller
3.3 High Pass Filter
3.4 Fast Fourier Transform
3.5 Channel Energy Computation
3.6 Channel SNR Computation
3.7 VAD Decision
3.8 VAD Hangover
3.9 Computation of Scaling Factor
3.10 Scaling of Frequency Channels
3.11 Inverse Fourier Transform
3.12 Application Programming Interface
3.13 Resource Requirements
3.14 Microchip PIC Programmer
3.15 Audio Components
3.16 VAD and Background Noise Reduction Techniques
3.17 Results and Discussion
3.18 Conclusion and Discussion
Chapter 4: Disambiguating Conflicting Classification Results in AVSR
Abstract
4.1 Introduction
4.2 Detection of Conflicting Classes
4.3 Complementary Models for Classification
4.4 Proposed Cascade of Classifiers
4.5 Audio-Visual Databases
4.6 Experimental Results
4.7 Conclusions
Chapter 5: A Deep Dive Into Deep Learning Techniques for Solving Spoken Language Identification Problems
Abstract
5.1 Introduction
5.2 Spoken Language Identification
5.3 Cues for Spoken Language Identification
5.4 Stages in Spoken Language Identification
5.5 Deep Learning
5.6 Artificial and Deep Neural Network
5.7 Comparison of Spoken LID System Implementations with Deep Learning Techniques
5.8 Discussion
5.9 Conclusion
Chapter 6: Voice Activity Detection-Based Home Automation System for People With Special Needs
Abstract
6.1 Introduction
6.2 Conceptual Design of the System
6.3 System Implementation
6.4 Significance/Contribution
6.5 Conclusion
Chapter 7: Speech Summarization for Tamil Language
Abstract
7.1 Introduction
7.2 Extractive Summarization
7.3 Abstractive Summarization
7.4 Need for Speech Summarization
7.5 Issues in the Summarization of a Spoken Document
7.6 Tamil Language
7.7 System Design for Summarization of Speech Data in Tamil Language
7.8 Evaluation Metrics
7.9 Speech Corpora for Tamil Language
7.10 Conclusion
Chapter 8: Classifying Recurrent Dynamics on Emotional Speech Signals
Abstract
8.1 Introduction
8.2 Data Collection and Processing
8.3 Research Methodology
8.4 Numerical Experiments and Results
8.5 Conclusion
Chapter 9: Intelligent Speech Processing in the Time-Frequency Domain
Abstract
9.1 Wavelet Packet Decomposition
9.2 Empirical Mode Decomposition
9.3 Variational Mode Decomposition
9.4 Synchrosqueezing Wavelet Transform: EMD Like a Tool
9.5 Applications of the Decomposition Technique
9.6 Conclusion
Chapter 10: A Framework for Artificially Intelligent Customized Voice Response System Design using Speech Synthesis Markup Language
Abstract
10.1 Introduction
10.2 Literature Survey
10.3 AWS IoT
10.4 Amazon Voice Service (AVS)
10.5 AWS Lambda
10.6 Message Queuing Telemetry Transport (MQTT)
10.7 Proposed Architecture
10.8 Conclusion
Index

Content preview from Intelligent Speech Signal Processing

Chapter 4

Disambiguating Conflicting Classification Results in AVSR

Gonzalo D. Sad; Lucas D. Terissi; Juan C. Gómez
Laboratory for System Dynamics and Signal Processing, Universidad Nacional de Rosario, CIFASIS-CONICET, Rosario, Argentina

Abstract

This chapter proposes a novel scheme for disambiguating conflicting classification results in audio-visual speech recognition (AVSR) applications. The strategy can be implemented with either generative or discriminative models, and it applies equally to audio, visual, or audio-visual input. The proposed training procedure introduces the concept of complementary models. A complementary model to a particular class j refers to a model ...
