Speech Signal Processing
|Title|STFT Phase Improvement for Single Channel Speech Enhancement|
|Authors|Martin Krawczyk, Timo Gerkmann|
|Conference|Int. Workshop Acoust. Signal Enhancement (IWAENC)|
|Place|Aachen, Germany|
In state-of-the-art single-channel speech enhancement algorithms based on the short-time Fourier transform (STFT), only the amplitude of the noisy speech signal is improved, while its phase is left unchanged. It is commonly assumed that the noisy phase is the best available estimate of the clean phase. While using the noisy phase is indeed optimal under certain statistical assumptions, in this paper we show that blindly improving the noisy phase is possible when these potentially limiting assumptions are dropped. Without modifying the amplitude, the proposed algorithm leads to frequency-weighted SNR improvements of up to 1.8 dB. Further, the presented phase enhancement scheme is real-time capable and can be combined with any off-the-shelf STFT-based amplitude estimator.
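To make the role of the phase concrete, the following minimal Python sketch recombines an STFT amplitude with a phase for a single frame. It does not reproduce the blind phase estimator proposed in the paper. As a stand-in it uses the clean ("oracle") phase, which is not available in practice. The test signal, the noise level, the frame length `N`, and the helper names `dft`, `recombine`, and `squared_error` are all assumptions made for illustration.

```python
import cmath
import math
import random


def dft(frame):
    """Naive O(N^2) DFT of one frame; adequate for a short illustration."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]


def recombine(amplitudes, phases):
    """Build complex spectral coefficients from amplitudes and phases."""
    return [a * cmath.exp(1j * p) for a, p in zip(amplitudes, phases)]


def squared_error(estimate, reference):
    """Summed squared distance between two complex spectra."""
    return sum(abs(e - r) ** 2 for e, r in zip(estimate, reference))


# Hypothetical test frame: a sinusoid in white Gaussian noise (assumed values).
N = 64
rng = random.Random(0)
window = [0.5 - 0.5 * math.cos(2 * math.pi * t / N) for t in range(N)]  # Hann
clean = [math.sin(2 * math.pi * 5.3 * t / N) for t in range(N)]
noisy = [s + 0.5 * rng.gauss(0.0, 1.0) for s in clean]

S = dft([w * s for w, s in zip(window, clean)])  # clean spectrum
Y = dft([w * y for w, y in zip(window, noisy)])  # noisy spectrum

noisy_amp = [abs(y) for y in Y]
noisy_phase = [cmath.phase(y) for y in Y]
clean_phase = [cmath.phase(s) for s in S]  # oracle, unavailable in practice

# Conventional choice: keep the noisy phase (here the amplitude is untouched,
# so this simply reproduces the noisy spectrum Y).
conventional = recombine(noisy_amp, noisy_phase)

# Phase-only change: same noisy amplitude, but with the oracle clean phase.
oracle_phase = recombine(noisy_amp, clean_phase)

err_noisy_phase = squared_error(conventional, S)
err_clean_phase = squared_error(oracle_phase, S)
print(f"error with noisy phase:  {err_noisy_phase:.3f}")
print(f"error with oracle phase: {err_clean_phase:.3f}")
```

With the amplitude left unchanged, the oracle-phase spectrum can never be farther from the clean spectrum than the noisy one, since in every bin ||Y| - |S|| <= |Y - S| by the reverse triangle inequality. This only shows that a better phase helps in principle. How closely a blind estimate can approach the oracle is the subject of the paper and is not addressed by this sketch.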
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.