Speech and Audio Processing, As per AICTE
ISBN: 9788126540389
For more information write to us at: acadmktg@wiley.com

Description
Speech and Audio Processing is a textbook for the final-year undergraduate Speech Processing course and for postgraduate students in the ECE, CS, and IT streams. It aims to explain the basic concepts clearly and simply. It begins with the human speech production mechanism and then covers the fundamental parameters of speech, including pitch frequency, formants, spectral features such as the log spectrum and 3-D spectrogram, cepstral features, MFCCs, linear prediction coefficients, and transform-domain parameters, as well as template matching techniques.
Preface
Chapter 1 Fundamentals of Speech
1.1 The Human Speech Production Mechanism
1.2 LTI Model for Speech Production
1.3 Nature of the Speech Signal
1.4 Linear Time-Varying Model
1.5 Phonetics
1.6 Types of Speech
1.7 Voiced and Unvoiced Decision Making
1.8 Audio File Formats: Nature of the WAV File
Chapter 2 Parameters of Speech: Pitch and Formants
2.1 Fundamental Frequency or Pitch Frequency
2.2 Parallel Processing Approach for Calculation of Pitch Frequency
2.3 Pitch Period Measurement Using Spectral Domain
2.4 Cepstral Domain
2.5 Formants and Their Relation With LPC
2.6 Evaluation of Formants Using Cepstrum
2.7 Evaluation of Formants Using Log Spectrum
2.8 Evaluation of Formants Using Power Spectral Density Estimate
2.9 Estimation of Formants: Other Methods
Chapter 3 Spectral Parameters of Speech
3.1 Homomorphic Processing
3.2 Cepstral Analysis of Speech: Cepstral Coefficients
3.3 The Auditory System as a Filter Bank
3.4 Mel Frequency Cepstral Coefficients (MFCCs)
3.5 Perceptual Linear Prediction (PLP)
3.6 Log Frequency Power Coefficients (LFPCs)
3.7 RelAtive SpecTrAl Perceptual Linear Prediction (Rasta-PLP): Strategies for Robustness
3.8 Short-Time Spectral Analysis of Speech: Short-Time Fourier Transform (STFT)
3.9 Wavelet Transform Analysis of Speech
Chapter 4 Linear Prediction of Speech
4.1 Lattice Structure Realization
4.2 Forward Linear Prediction
4.3 Autocorrelation Method
4.4 Covariance Method
4.5 Lattice Methods
4.6 Selection of Order of the Predictor
4.7 Line Spectral Frequencies/Line Spectral Pair Frequencies
Chapter 5 Speech Quantization and Coding
5.1 Uniform and Non-Uniform Quantizers and Coder
5.2 Companded Quantizer
5.3 Uniform Quantization of Non-uniform Sources: Adaptive Quantizers
5.4 Waveform Coding of Speech
5.5 Comparison of Different Waveform Coding Techniques
5.6 Parametric Speech Coding Techniques
5.7 Sinusoidal Speech Coding Techniques
5.8 Mixed Excitation Linear Prediction Coder
5.9 Multi-Mode Speech Coding (Hybrid Coder)
5.10 Transform Domain Coding of Speech
Chapter 6 Speech Processing Applications
6.1 Speech Recognition Systems
6.2 Architecture of a Large Vocabulary Continuous Speech Recognition System
6.3 Deterministic Sequence Recognition for ASR
6.4 Statistical Sequence Recognition for ASR
6.5 Statistical Pattern Recognition and Parameter Estimation
6.6 VQ-HMM-Based Speech Recognition
6.7 Discriminant Acoustic Probability Estimation
6.8 Word Spotting/Keyword Spotting
6.9 Speech Recognition and Understanding
6.10 Speaker Recognition
6.11 Distortion Measures: Mathematical and Perceptual
6.12 Speech Enhancement
6.13 Adaptive Echo Cancellation
Chapter 7 Speech Synthesis
7.1 A Text-to-Speech System
7.2 Synthesizer Technologies
7.3 Speech Synthesis Using Other Methods
7.4 Speech Transformations
7.5 Emotion Recognition from Speech
7.6 Watermarking for Authentication of a Speech/Music Signal
Chapter 8 Basics of Musical Instruments and Music Synthesis
8.1 Indian Musical Instruments
8.2 Features Used for Classification
8.3 Music Synthesis
8.4 Musical Instrument Digital Interface (MIDI)
8.5 Streaming Audio
8.6 Piano Note Synthesis Using LPC and WT
8.7 Audio Standards
Summary
Key Terms
Multiple Choice Questions
Review Questions
Problems (Write MATLAB Programs)
Suggested Projects (Write MATLAB Programs)
Answers
Frequently Asked Short Questions With Answers
Frequently Asked Long Questions With Pointers
Bibliography
Index