What you're describing is more or less why noise suppression algorithms in general cannot really improve the intelligibility of speech. Unless they're given extra cues (like with a microphone array), there's nothing they can do in real time that will beat what the brain is capable of with "delayed decision" (sometimes you'll only understand a word 1-2 seconds after it's spoken). So the goal of noise suppression is really just making the speech less annoying when the SNR is high enough not to affect intelligibility.
That being said, I still have control over the tradeoffs the algorithm makes by changing the loss function, i.e. how different kinds of mistakes are penalized.
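To make that concrete, here's a toy sketch of the kind of knob I mean (illustrative only, not the actual RNNoise objective, and the weights are made up): a per-band loss that penalizes gains that cut into speech more heavily than gains that leave some noise behind.

    import numpy as np

    def weighted_gain_loss(predicted_gain, ideal_gain,
                           speech_penalty=4.0, noise_penalty=1.0):
        """Toy per-band loss, for illustration only.
        Gains below the ideal value attenuate speech (penalized harder);
        gains above it leave residual noise (penalized less)."""
        err = predicted_gain - ideal_gain
        weights = np.where(err < 0, speech_penalty, noise_penalty)
        return np.mean(weights * err ** 2)

    ideal = np.array([0.9, 0.8, 0.1])
    print(weighted_gain_loss(np.array([0.6, 0.8, 0.1]), ideal))  # over-suppression: costly
    print(weighted_gain_loss(np.array([0.9, 0.8, 0.4]), ideal))  # residual noise: cheaper

Change the two penalty weights and you get a different tradeoff between muffled speech and leftover noise.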
The RNNoise suppression is less appealing to my ear than the Speex suppression... But:
- the approach is pretty cool!
- as mentioned in the article, it might be very useful when applied to multiple speakers (conferencing)
- it might be very interesting for speech recognition software
Also, as a sound guy, when I have a noisy signal I sometimes suppress the noise a bit too heavily, and then mask the artifacts with some background music. I will definitely try that with the RNNoise suppression!
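For the curious, the masking trick is just a quiet mix under the denoised track; a rough sketch (the -18 dB level is arbitrary, adjust by ear):

    import numpy as np

    def mask_with_music(denoised, music, music_level_db=-18.0):
        """Mix low-level background music under a denoised track to hide
        suppression artifacts. The level here is arbitrary."""
        n = min(len(denoised), len(music))
        gain = 10.0 ** (music_level_db / 20.0)
        return denoised[:n] + gain * music[:n]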
I will say, using Krisp, it has the same problem that basically all of these 'AI'-based noise cancellers seem to exhibit: sound quality deteriorates when outside noise is suppressed, and people sometimes fall below the detection threshold and get completely cut out while talking in some scenarios.
It's still better than food noises, but I have noticed that as a disadvantage.
Personally, I can't filter out background noise properly.
This means I can understand a conversation _much_ more clearly if I'm wearing active noise cancelling headphones. Yes, it makes _you_ quieter, but it also means I'm not trying to pick out your speech from complicated background noises.
Noise cancelling doesn't work with speech that well. It's way better for steady background noise such as on a small plane if you're the pilot or on a large plane if you're a passenger.
NVIDIA has ML-based noise suppression functionality in the form of RTX Voice.
There is also Krisp.ai, a similar product for noise canceling; they have written up an overview of the difficulties involved on the NVIDIA Developer Blog, interestingly enough (it seems they were called 2hz.ai back then):
The point is that the noise the cancellation adds plus the noise it fails to remove ends up smaller than the noise you'd hear without cancellation, unless you are in an already-optimized environment.
I wonder whether there will be progress in higher-frequency cancelling, given that there's an engineering reason for limiting cancellation to lower frequencies. Current ANC makes outside speech quieter yet more intelligible, and that very much increases distraction for me.
Noise cancellation is designed to cut out the ambient noise - which is mostly white - of which there is a lot in a big city. It does this with phase cancellation of what is probably a fairly predictable waveform. Human voices, I imagine, are not that predictable. Maybe advances in machine learning and processing will be able to provide better cancellation techniques.
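To put rough numbers on why the predictable, low-frequency part is the easy part (a toy sketch, not a model of any real headset, and the 50 µs timing error is an assumption): even a small error in the anti-phase signal wrecks cancellation as the frequency goes up.

    import numpy as np

    fs = 48_000
    t = np.arange(fs) / fs          # one second of signal
    delay_s = 50e-6                 # assumed timing error between noise and anti-noise

    for f in (100, 300, 1000, 3000):
        noise = np.sin(2 * np.pi * f * t)
        anti = -np.sin(2 * np.pi * f * (t - delay_s))   # inverted, slightly late
        residual = noise + anti
        reduction_db = 20 * np.log10(np.std(residual) / np.std(noise))
        print(f"{f:>5} Hz: {reduction_db:6.1f} dB")

With that fixed timing error you get roughly -30 dB of residual at 100 Hz but barely -1 dB at 3 kHz, which is about where conversational speech lives.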
I'll bet most earbuds with active noise reduction and equalization could be tweaked to provide augmentation similar to a hearing aid. The current algorithms for active noise reduction are designed to suppress external sounds -- all they have to do is invert the logic and amplify external sounds. Just add EQ to make my wife's voice come in better.
a) Noise reduction usually requires a powered component, or some kind of neck brace to contain the battery required for active noise cancelling. This adds expense and weight.
b) Sometimes it's good to be aware of your surroundings! When I'm listening to music, noise is usually sufficiently blocked by the sheer volume of the track, and during podcasts and audiobooks I'm also not bothered by the sounds of the streets.
If anything, it's good to hear a siren, or the ramblings of a nearby crazy person in order to avoid them.
The only time I find noise cancellation useful is on airplanes.
If active noise canceling seems to lower conversation volume, it is because you've convinced yourself it should. No DSP located on your ear can analyze and cancel an unpredictable signal like conversation before it reaches your ear. It can be effective against drone sounds like motors, rushing air, etc., because the same cancellation signal works now as it did 100 ms ago. This is not true of human speech.
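A crude way to see the "100 ms ago" point (toy synthetic signals, not measurements): a steady drone is still almost perfectly correlated with itself 100 ms later, while a speech-like non-stationary signal is not.

    import numpy as np

    rng = np.random.default_rng(0)
    fs = 16_000
    t = np.arange(2 * fs) / fs
    lag = int(0.100 * fs)                      # 100 ms

    drone = np.sin(2 * np.pi * 120 * t)        # steady motor-like hum
    # Very rough stand-in for speech: noise whose envelope changes every ~50 ms
    envelope = np.repeat(rng.random(len(t) // 800 + 1), 800)[:len(t)]
    speechy = envelope * rng.standard_normal(len(t))

    def corr_at_lag(x, lag):
        return np.corrcoef(x[lag:], x[:-lag])[0, 1]

    print("drone:  ", round(corr_at_lag(drone, lag), 3))    # ~1: predictable
    print("speechy:", round(corr_at_lag(speechy, lag), 3))  # ~0: not predictable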
Opt for a pair of well-fitting in-ears. If you have the money, see an audiologist for a custom fit. With high-quality earbuds and a good seal, you can play music at a very low level and still 1) hear all its detail, and 2) not perceive outside sounds.
That's true, of course. But it's much harder to actively cancel higher frequencies. This is why noise cancelling works brilliantly on an airplane (relatively low frequency background noise) but it does almost nothing to filter out the sounds of conversations around you.
Sorry. I should have specified that I was talking specifically about background noise cancellation for voice input. I haven't seen any other form of this tech in action, so I can't comment on how good it is.
While the noise cancellation is active it will attempt to neutralize (destructively interfere with) sounds from the outside, including those generated by your speaker. You could indeed engage it adversarially with something like a spontaneous phase shift (so the interference becomes constructive, making the resulting signal louder) or by generating a frequency the ANC can't compensate for.
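In an idealized toy model (a sketch that assumes the canceller keeps emitting the now-stale anti-phase signal, not how any real headset behaves), the phase flip roughly doubles the level at the ear:

    import numpy as np

    fs = 48_000
    t = np.arange(fs // 10) / fs                # 100 ms
    f = 440.0

    outside = np.sin(2 * np.pi * f * t)          # what the external speaker plays
    anti = -outside                              # idealized ANC output, locked to it

    # Normal operation: destructive interference, near silence at the ear.
    print("cancelled RMS:", np.std(outside + anti).round(3))

    # Adversarial move: the speaker suddenly flips phase; the stale anti-noise
    # now adds constructively, ~2x louder than the original.
    flipped = -outside
    print("flipped   RMS:", np.std(flipped + anti).round(3))
    print("original  RMS:", np.std(outside).round(3))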