Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments

1st Edition - September 4, 2024
Latest edition
Author: Xiao-Lei Zhang
Language: English

Description

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing and its applications. The book begins with the basics of deep learning and common deep network models, followed by front-end algorithms for deep learning-based speech denoising, speech detection, single-channel speech enhancement, multi-channel speech enhancement, and multi-speaker speech separation, and then the applications of deep learning-based speech denoising in speaker verification and speech recognition.

Key features

Provides a comprehensive introduction to the development of deep learning-based robust speech processing
Covers speech detection, speech enhancement, dereverberation, multi-speaker speech separation, robust speaker verification, and robust speech recognition
Begins with a historical overview, then covers methods that demonstrate outstanding performance in practical applications

Readership

Senior undergraduate students, graduate students, and professionals with a solid foundation in speech signal processing and machine learning who are engaged in intelligent speech processing

1. Introduction

2. Fundamentals of Deep Learning

3. Voice Activity Detection

4. Single-Channel Speech Enhancement

5. Multi-Channel Speech Enhancement

6. Multi-Speaker Speech Separation

7. Speaker Recognition

8. Speech Recognition

Product details

Edition: 1
Published: September 4, 2024
Language: English

About the author

Xiao-Lei Zhang

Xiao-Lei Zhang received his Ph.D. degree with honors from Tsinghua University, China, in 2012. He was a postdoctoral researcher with the Department of Electronic Engineering at Tsinghua University from 2012 to 2014. He was a visiting scholar at The Ohio State University, USA, from 2013 to 2014 and a postdoctoral researcher with the Department of Computer Science and Engineering, The Ohio State University, from 2014 to 2016. Since 2016, he has been a full professor at Northwestern Polytechnical University, Xi'an, China.

His research interests include speech processing, machine learning, statistical signal processing, and artificial intelligence.

Affiliations and expertise

Northwestern Polytechnical University, Xi'an, China
