Motivation for launching the project by the client: denoise systems, including those based on neural networks, are actively used in various services for audio and video communication. Most of these systems do a good job of suppressing noise mainly in situations with a high desired signal level and low noise. The goal was to build a real-time system capable of removing noise from an audio recording within the specified limits.
What we had initially:
Project goals: Improving the quality of noise reduction models in case of extremely low SNR (signal-to-noise ratio).
MIL Team's solution: improvement of existing solutions and creation of our own models showing high gains in terms of generally accepted metrics for assessing the quality of audio recordings (PESQ, SDR) and speech recognition error (WER) for audio recordings with a high level of noise compared to speech (SNR from -10).
Tools for building the model: